- Full Evaluation: "Local LLM Wiki with Obsidian" — A Tutorial in Good Faith That Still Fails the Four Pillars
- ⚠️ BIG FUCKING WARNING: OBSIDIAN IS PROPRIETARY SOFTWARE ⚠️
- Executive Summary
- What This Video Gets Right (Credit Where Due)
- The Four Pillars Evaluation
- The Creator's Own Admissions (Worth Highlighting)
- One Comparison Table: This Implementation vs. Requirements
- What This Video Reveals About LLM Wiki (Unintentionally)
- Comparison with Adam's Video (Brief)
- ⚠️ OBSIDIAN-SPECIFIC WARNINGS (REPEATED FOR EMPHASIS) ⚠️
- Final Verdict
- Why This Matters (Even for a "Good" Tutorial)
- Recommendation for Viewers
- The actual video
- Closing Thought
- ⚠️ THE WORD "WIKI" HAS BEEN PERVERTED ⚠️
- Related pages
Full Evaluation: “Local LLM Wiki with Obsidian” — A Tutorial in Good Faith That Still Fails the Four Pillars
⚠️ BIG FUCKING WARNING: OBSIDIAN IS PROPRIETARY SOFTWARE ⚠️
Before we evaluate anything else, this needs to be stated clearly and loudly:
| Issue | Details |
|---|---|
| Proprietary license | Obsidian is not open source. You do not own the software. You cannot audit the code. You cannot modify it. You cannot redistribute it. |
| Closed source | The source code is private. You have no idea what telemetry, data collection, or background processes exist. |
| Vendor lock-in | Your “knowledge base” is stored in markdown (good), but the graph view, backlinks, plugins, and all the “smart” features are tied to Obsidian’s proprietary implementation. |
| Future uncertainty | If Obsidian disappears, goes subscription-only, changes licensing, or gets acquired by a company like Notion or Microsoft — your workflow is fucked. |
| Telemetry concerns | Obsidian collects anonymous usage data. For personal use, maybe fine. For client NDAs, medical records, or business secrets? Absolutely not. |
| Plugin ecosystem | Many popular Obsidian plugins are community-maintained, unvetted, and can execute arbitrary code. A malicious plugin could exfiltrate your entire knowledge base. |
| No built-in permissions | Obsidian has zero access control. Anyone with access to your file system sees everything. No multi-user. No role-based access. |
| Sync is paid | Want to sync across devices? That’s a paid subscription (Obsidian Sync). Or you roll your own with Syncthing, but then you’re managing file conflicts manually. |
| Mobile apps are proprietary | The iOS and Android apps are closed source. Your data passes through their app layer. |
If you are building a knowledge base for personal hobby use: Obsidian is fine. Annoying that it’s proprietary, but workable.
If you are building a knowledge base for business, clients, research collaboration, or anything containing sensitive information: Using Obsidian as the foundation is negligent. You are building your house on someone else’s land with a lease that can change at any time.
Executive Summary
This video is different from Adam’s. The creator is transparent, humble, and explicitly acknowledges limitations. He admits his implementation is “stateless and static,” not incremental. He clarifies he is simplifying for demonstration. He does not claim production readiness.
And yet — the architecture still fails every single one of the four irreducible pillars.
This evaluation is not an attack on the creator. It is a structural analysis. Good intentions do not compensate for missing foreign keys, missing permissions, missing provenance, and missing queryability — plus a proprietary dependency that you do not control.
What This Video Gets Right (Credit Where Due)
| Aspect | Evaluation |
|---|---|
| Transparency | ✅ Explicitly says his version is “not following what Karpathy was saying” about incremental updates |
| Honesty about limitations | ✅ Admits it is “stateless and static,” “not dynamic” |
| Local-first approach | ✅ Uses Ollama + Gemma, no data sent to OpenAI/Claude |
| Practical tutorial | ✅ Shows working code, Obsidian integration, graph view |
| No overclaiming | ✅ Does not say “production ready” or “better than RAG” |
| Human-in-the-loop | ✅ Obsidian provides manual curation ability |
The creator deserves respect for intellectual honesty. He is not a “sheep.” He is a practitioner showing a working prototype.
But the choice of Obsidian as the viewing/editing layer introduces fatal problems that go beyond architecture.
The Four Pillars Evaluation
1. Store with Integrity → ❌ FAIL
| Requirement | Implementation | Status |
|---|---|---|
| Start with raw, immutable source | ✅ Yes — clippings folder contains original articles | PASS |
| Use a real database (schemas, FK, indexes) | ❌ No — markdown files + folders | FAIL |
| Store any knowledge type | ⚠️ Text/articles only — not videos, emails, code, spreadsheets | PARTIAL |
| Record files or locations | ✅ Yes — Obsidian file paths | PASS |
| Preserve integrity / immutability | ❌ No — files can be edited arbitrarily. No versioning. No cryptographic verification. | FAIL |
Verdict: Same fatal flaw as all LLM Wiki implementations — no database, no foreign keys, no immutability.
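To make "preserve integrity" concrete: at minimum, record a cryptographic digest of every raw clipping at ingest time and re-check it later. Nothing in the tutorial does this. A minimal sketch in Python; `content_hash` is an illustrative helper of ours, not part of the video's code:

```python
import hashlib

def content_hash(raw: bytes) -> str:
    """Return a SHA-256 digest for a raw source document."""
    return hashlib.sha256(raw).hexdigest()

# Record the digest when a clipping is ingested...
original = b"Original clipped article text."
recorded = content_hash(original)

# ...then re-hash later to detect tampering. Any silent edit is caught.
assert content_hash(original) == recorded
assert content_hash(b"Silently edited text.") != recorded
```

This is the cheapest possible integrity check; a real system would store the digests in a database and verify them on every read.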
2. Relate with Precision → ❌ FAIL
| Requirement | Implementation | Status |
|---|---|---|
| Typed relationships (supports/contradicts) | ❌ No — only generic “topic” and “entity” links | FAIL |
| Foreign keys | ❌ Impossible — markdown has no FKs | FAIL |
| Bidirectional links | ⚠️ Partial — Obsidian’s graph view shows backlinks, but they are not enforced at storage level | PARTIAL |
| Explicit hierarchy | ⚠️ Folders provide hierarchy | PARTIAL |
| Hyperlinking | ✅ Yes — markdown links between pages | PASS |
Verdict: Obsidian’s graph view creates the appearance of relationships, but there are no typed, enforced, foreign-key-backed connections. Rename a file and links break silently — Obsidian does not fix this.
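For contrast, here is what enforced, typed links look like in even the smallest real database. A minimal sketch using SQLite's foreign keys (PostgreSQL gives you the same guarantees and more); the `page`/`link` schema is ours for illustration, not from the video:

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("PRAGMA foreign_keys = ON")  # SQLite enforces FKs only when asked
db.execute("CREATE TABLE page (id INTEGER PRIMARY KEY, title TEXT NOT NULL)")
db.execute("""
    CREATE TABLE link (
        src  INTEGER NOT NULL REFERENCES page(id),
        dst  INTEGER NOT NULL REFERENCES page(id),
        kind TEXT NOT NULL CHECK (kind IN ('supports', 'contradicts'))
    )""")
db.execute("INSERT INTO page VALUES (1, 'Learning'), (2, 'Memory')")
db.execute("INSERT INTO link VALUES (1, 2, 'supports')")

# Renaming a page cannot break a link: links target ids, not titles.
db.execute("UPDATE page SET title = 'Learning (revised)' WHERE id = 1")

# A link to a page that does not exist is rejected outright.
try:
    db.execute("INSERT INTO link VALUES (1, 99, 'supports')")
except sqlite3.IntegrityError:
    pass  # no silent link rot possible
```

Note the `kind` column: this is what a typed relationship ("supports"/"contradicts") looks like when the storage layer enforces it, rather than a generic markdown backlink.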
3. Trust with Provenance → ❌ FAIL
| Requirement | Implementation | Status |
|---|---|---|
| Provenance — every fact knows its source | ❌ No — the JSON summary does not store which sentence from which article generated each claim | FAIL |
| Permissions / access control | ❌ Obsidian has none. Zero. Zilch. Anyone with file system access sees everything. No user roles. No object-level permissions. | FAIL |
| Audit trails | ❌ None in Obsidian. No “who changed what, when, why.” | FAIL |
| Human curation | ✅ Yes — Obsidian allows manual editing | PASS |
| Cryptographic verification | ❌ None | FAIL |
Critical Issue: The LLM extracts topics, entities, and summaries. If the LLM hallucinates a topic or misattributes an entity, that error becomes permanent markdown. There is no way to trace a claim back to the original text. The creator admits quality “depends on how strong is your model” — but does not address how to detect or correct hallucinations.
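What provenance minimally requires: every extracted claim stores a foreign key to its source plus the character span that produced it. A hypothetical schema sketch (SQLite for brevity; the table and column names are ours, not the creator's):

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
    CREATE TABLE source (id INTEGER PRIMARY KEY, path TEXT NOT NULL);
    CREATE TABLE claim (
        id         INTEGER PRIMARY KEY,
        text       TEXT NOT NULL,
        source_id  INTEGER NOT NULL REFERENCES source(id),
        char_start INTEGER NOT NULL,  -- span in the source that produced it
        char_end   INTEGER NOT NULL
    );
""")
db.execute("INSERT INTO source VALUES (1, 'clippings/learning.md')")
db.execute("INSERT INTO claim VALUES "
           "(1, 'Spaced repetition aids recall.', 1, 120, 180)")

# Every claim can be walked back to the exact span it came from.
origin = db.execute("""
    SELECT s.path, c.char_start, c.char_end
    FROM claim c JOIN source s ON s.id = c.source_id
    WHERE c.id = 1""").fetchone()
```

With this structure, a hallucinated claim is detectable: point the claim at its recorded span, compare, and correct. The tutorial's JSON summaries have no equivalent.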
Obsidian-Specific Trust Issues:
| Obsidian Trust Problem | Severity |
|---|---|
| Proprietary closed source — cannot audit | 🔴 HIGH |
| Telemetry collection — what data leaves your machine? | 🟠 MEDIUM |
| No built-in encryption at rest | 🔴 HIGH |
| Plugin system can execute arbitrary, unvetted code | 🔴 HIGH |
| Sync (if used) passes through their servers | 🟠 MEDIUM |
| No multi-user support — impossible for team use | 🔴 HIGH |
4. Retrieve with Speed → ❌ FAIL
| Requirement | Implementation | Status |
|---|---|---|
| Search by any dimension | ⚠️ Obsidian offers full-text search, but entities, topics, and summaries cannot be queried as structured fields | PARTIAL |
| SQL queries | ❌ No | FAIL |
| Queryable structure | ❌ No — markdown files require parsing | FAIL |
| Access file properties | ⚠️ Filesystem provides basic metadata | PARTIAL |
Verdict: Obsidian’s search is better than nothing, but it is not a queryable database. You cannot ask “show me all articles about ‘learning’ written by author X with more than 3 key points.” That requires SQL or a real query engine.
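That query is trivial once the structure lives in a database. A sketch of the exact question above, using SQLite as a stand-in for a real query engine (the `article`/`key_point` schema and the sample rows are hypothetical):

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
    CREATE TABLE article (
        id INTEGER PRIMARY KEY, title TEXT, author TEXT, topic TEXT);
    CREATE TABLE key_point (
        article_id INTEGER REFERENCES article(id), point TEXT);
""")
db.execute("INSERT INTO article VALUES (1, 'How We Learn', 'X', 'learning')")
db.execute("INSERT INTO article VALUES (2, 'On Memory',    'X', 'learning')")
db.executemany("INSERT INTO key_point VALUES (?, ?)",
               [(1, 'a'), (1, 'b'), (1, 'c'), (1, 'd'), (2, 'a')])

# "Show me all articles about 'learning' by author X with more than
# 3 key points" -- one deterministic query, no markdown parsing.
rows = db.execute("""
    SELECT a.title
    FROM article a JOIN key_point k ON k.article_id = a.id
    WHERE a.topic = 'learning' AND a.author = 'X'
    GROUP BY a.id
    HAVING COUNT(*) > 3""").fetchall()
```

In the Obsidian setup, answering the same question means parsing every markdown file and hoping the LLM formatted its frontmatter consistently.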
Obsidian-Specific Retrieval Issues:
| Obsidian Retrieval Problem | Severity |
|---|---|
| Search is proprietary — you cannot extend or automate it easily | 🟠 MEDIUM |
| No programmatic query API (unless you parse markdown files yourself) | 🔴 HIGH |
| Graph view is visual only — not queryable | 🟠 MEDIUM |
| Large vaults (10k+ files) cause performance degradation | 🟠 MEDIUM |
The Creator’s Own Admissions (Worth Highlighting)
The creator says things most LLM Wiki promoters never say:
“This is actually not following what Karpathy was saying because he was saying that this wiki should be incremental. … What we are doing is making it stateless and static. It’s not dynamic.”
“I just didn’t want when we make a mistake go and just remove them one by one.”
“The topic and entity quality depends on how strong is your model.”
These are honest admissions of limitation. He is not selling a fantasy. He is showing a prototype.
But he does not mention that Obsidian is proprietary. He does not mention the privacy implications. He does not mention that his entire “knowledge base” is locked into a closed-source ecosystem that can change its terms, pricing, or existence at any time.
One Comparison Table: This Implementation vs. Requirements
| Requirement | This Implementation | Status |
|---|---|---|
| STORE: Real database | Markdown files + Obsidian | ❌ FAIL |
| STORE: Immutability | Editable files, no versioning | ❌ FAIL |
| RELATE: Typed relationships | Only generic “topic”/“entity” | ❌ FAIL |
| RELATE: Foreign keys | None | ❌ FAIL |
| RELATE: Bidirectional links | Obsidian graph view (display only, not enforced) | ⚠️ PARTIAL |
| TRUST: Provenance | Lost after LLM summarization | ❌ FAIL |
| TRUST: Permissions | Obsidian has none | ❌ FAIL |
| TRUST: Audit trails | None | ❌ FAIL |
| TRUST: Open source / auditable | Obsidian is proprietary | ❌ FAIL |
| RETRIEVE: SQL / structured queries | No | ❌ FAIL |
| RETRIEVE: Search by any dimension | Full-text only | ⚠️ PARTIAL |
| RETRIEVE: Works at scale | Untested; likely fails beyond hundreds of articles | ❓ UNKNOWN |
| Human in the loop | Yes (Obsidian) | ✅ PASS |
| Incremental updates | Explicitly not implemented (stateless) | ❌ FAIL |
| Hallucination detection/correction | None | ❌ FAIL |
| Vendor lock-in risk | High (Obsidian is proprietary) | ❌ FAIL |
| Multi-user / team support | None | ❌ FAIL |
Tally: ❌ 13 failures | ⚠️ 2 partial | ❓ 1 unknown | ✅ 1 pass
What This Video Reveals About LLM Wiki (Unintentionally)
| Observation | Implication |
|---|---|
| The creator had to write hundreds of lines of boilerplate to parse markdown, format JSON, and manage files | LLM Wiki is not a framework — it is a DIY file management problem |
| He explicitly chose to make it stateless because incremental updates are hard | “Incremental” is a promise Karpathy made but few have implemented correctly |
| Quality depends entirely on the local model | Smaller models (Gemma 4B, Llama 3.2) will produce lower-quality topics and entities, and there is no validation step |
| The graph view looks impressive but is just markdown links | Obsidian’s visualization does not fix missing foreign keys or provenance |
| No mention of privacy or permissions | The system assumes a single user on a single machine. Add multiple users or sensitive data, and it collapses. |
| Obsidian is proprietary | The entire “knowledge base” is locked into a closed-source ecosystem you do not control |
Comparison with Adam’s Video (Brief)
| Aspect | Adam (Agency) | This Creator (Local Tutorial) |
|---|---|---|
| Claims production readiness | ❌ Yes (falsely) | ✅ No (honest prototype) |
| Acknowledges limitations | ❌ Minimal | ✅ Explicit (“stateless”, “not incremental”) |
| Privacy awareness | ❌ None (AI sees all client emails) | ⚠️ Local-only, but Obsidian telemetry unknown |
| Overclaims superiority to RAG | ✅ Yes | ❌ No |
| Uses proprietary software | ✅ No (folders only) | ❌ Yes — Obsidian |
| Overall honesty | ❌ Misleading | ⚠️ Honest about the code, silent about Obsidian lock-in |
⚠️ OBSIDIAN-SPECIFIC WARNINGS (REPEATED FOR EMPHASIS) ⚠️
If you are considering this tutorial, read these warnings carefully:
| Warning | Explanation |
|---|---|
| Proprietary = You don’t own it | Obsidian’s source code is closed. You cannot fix bugs. You cannot audit for security. You cannot guarantee it will exist in 5 years. |
| Telemetry | Obsidian collects anonymous usage data. For client work, NDAs, or medical information, this is unacceptable. |
| No permissions | Obsidian has zero access control. Every person with file system access sees every note. No roles. No encryption. |
| Plugin risk | Plugins run arbitrary code. A malicious or compromised plugin can steal your entire knowledge base. |
| Sync is paid and proprietary | Obsidian Sync costs money and sends your data through their servers. Self-hosted sync (Syncthing) is possible but creates file conflicts. |
| Vendor lock-in | Your workflow depends on Obsidian-specific features: graph view, backlinks pane, Dataview, etc. Migrating away is painful. |
| Mobile apps are closed | The iOS and Android apps are proprietary. If you use mobile, you are locked in deeper. |
| No multi-user | Obsidian is single-user only. There is no way to collaborate with a team on the same vault without file conflicts and manual merging. |
If you still want to use Obsidian for personal hobby projects: Fine. Your choice.
If you are building anything for business, clients, research collaboration, or sensitive information: Do not use Obsidian. Use something open source (Logseq, Joplin, Trilium) or a real database with proper access control (PostgreSQL + a proper frontend). Or at minimum, be aware that you are building on rented land.
Final Verdict
| Pillar | Result |
|---|---|
| Store with Integrity | ❌ FAIL |
| Relate with Precision | ❌ FAIL |
| Trust with Provenance | ❌ FAIL |
| Retrieve with Speed | ❌ FAIL |
Overall: ❌ FAILS ALL FOUR PILLARS
Plus: ❌ Proprietary dependency (Obsidian) adds vendor lock-in, telemetry concerns, no permissions, no multi-user, and unvetted plugin risk.
Why This Matters (Even for a “Good” Tutorial)
This video is not malicious. It is not hype-driven. The creator is skilled, transparent, and generous with his knowledge.
But that is exactly why it is dangerous.
Because when someone honest and competent builds something that looks like a knowledge base — with folders, links, summaries, a beautiful graph view, and a local-first LLM — viewers will assume it is a proper knowledge base. They will copy it. They will use it for their own research, their own clients, their own sensitive notes.
And they will discover, too late, that:
- Links break when files are renamed
- Hallucinations become permanent
- There is no way to trace a claim back to its source
- There are no permissions (everyone sees everything)
- There is no audit trail (who changed what?)
- There is no queryable structure (just markdown parsing)
- The system does not scale beyond a few hundred files
- Obsidian is proprietary and you do not control it
- Obsidian has no multi-user support
- Obsidian collects telemetry
The four pillars are not optional. They are not “enterprise features.” They are the minimum requirements for anything that deserves the name “knowledge base.”
Adding a proprietary, closed-source, single-user, no-permissions note-taking app on top of a flawed architecture does not fix the architecture. It adds more problems.
Recommendation for Viewers
If you watch this video:
- Learn from the code — it is a good tutorial on LLM + file processing.
- Do not deploy this as a knowledge base — treat it as a prototype or personal experiment.
- Do not use Obsidian for anything serious — unless you accept vendor lock-in, telemetry, no permissions, and closed source.
- Ask yourself: What happens when I have 500 articles? What happens when I need to share this with a colleague? What happens when the LLM hallucinates a topic? What happens when I rename a file? What happens when Obsidian changes its license or goes subscription-only?
- If you need a real knowledge base, start with PostgreSQL, foreign keys, schemas, permissions, audit logs, and a proper query layer. Use open source tools you control. Add LLMs as tools for summarization and relationship suggestion — not as the engine.
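To make "audit logs" in that recommendation concrete: one trigger in a real database gives you the who-changed-what trail that markdown files never will. A minimal sketch (SQLite trigger syntax for brevity; a production PostgreSQL setup would also record the acting user and reason):

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
    CREATE TABLE note (id INTEGER PRIMARY KEY, body TEXT NOT NULL);
    CREATE TABLE audit (
        ts       TEXT DEFAULT CURRENT_TIMESTAMP,
        action   TEXT NOT NULL,
        note_id  INTEGER NOT NULL,
        old_body TEXT,
        new_body TEXT
    );
    -- Every update is logged automatically; editors cannot opt out.
    CREATE TRIGGER note_update AFTER UPDATE ON note
    BEGIN
        INSERT INTO audit (action, note_id, old_body, new_body)
        VALUES ('update', OLD.id, OLD.body, NEW.body);
    END;
""")
db.execute("INSERT INTO note VALUES (1, 'first draft')")
db.execute("UPDATE note SET body = 'revised' WHERE id = 1")

trail = db.execute(
    "SELECT action, old_body, new_body FROM audit").fetchall()
```

Contrast this with editing a markdown file in a vault: the old text is simply gone, and nobody can say who changed it or why.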
The actual video
Closing Thought
The creator of this video is not a sheep. He is a builder who showed his work honestly.
But the architecture he built — Karpathy’s LLM Wiki pattern on top of Obsidian — remains fatally flawed. The four pillars exist for a reason. No amount of Obsidian graph visualization or honest tutorial-making can compensate for missing foreign keys, missing provenance, missing permissions, missing SQL, and a proprietary dependency you do not control.
🐑💀🧙
P.S. — If you downvoted the previous evaluation that called out Obsidian as proprietary, ask yourself: why are you defending a closed-source company that can change your access at any time? Your “knowledge base” is not yours if the software that reads it is not yours.
⚠️ THE WORD “WIKI” HAS BEEN PERVERTED ⚠️
By Andrej Karpathy and the Northern Karpathian School of Doublespeak
| ✅ A Real Wiki | ❌ Karpathy's "LLM Wiki" |
|---|---|
| Human-curated: real people write, edit, debate, verify, and take responsibility. | LLM-generated: hallucinations are permanent. No human took ownership of any "fact." |
| Versioned history: every edit has an author, timestamp, and reason. Rollback is trivial. | No audit trail: who changed what? When? Why? Nobody knows. Git is an afterthought. |
| Source provenance: every claim links back to its original source. You can verify. | "Trust me, I'm the LLM": no traceability from summary back to source sentence. Errors become permanent. |
| Foreign keys / referential integrity: links are database-backed. Rename a page, links update automatically. | Links break when you rename a file: no database. No foreign keys. Silent link rot guaranteed. |
| Permissions / access control: fine-grained control over who can see, edit, delete, approve. | Anyone with file access sees everything: zero access control. NDAs, medical records, client secrets — all exposed. |
| Queryable (SQL, structured): ask complex questions. Get precise answers. Join tables. | Browse-only markdown: full-text search at best. No SQL. No structured queries. |
🕯️ This is an insult to every Wikipedia editor, every MediaWiki contributor, every human being who spent hours citing sources, resolving disputes, and building the largest collaborative knowledge repository in human history. 🕯️
KARPATHY'S "WIKI" has:
❌ No consensus-building
❌ No talk pages
❌ No dispute resolution
❌ No citation requirements
❌ No editorial oversight
❌ No way to say "this fact is disputed"
❌ No way to privilege verified information over hallucinations
❌ No way to trace any claim back to its source
In the doublespeak of Northern Karpathia:
"Wiki" means "folder of markdown files written by a machine that cannot remember what it wrote yesterday, linked by strings that snap when you breathe on them, viewed through proprietary software that reports telemetry to people you do not know, containing 'facts' that came from nowhere and go nowhere, protected by no permissions, audited by no one, and trusted by no one with a functioning prefrontal cortex."
🙏 Respect to Ward Cunningham who invented the wiki in 1995 — a tool for humans to collaborate.
🙏 Respect to Wikipedia editors worldwide who defend verifiability, neutrality, and consensus.
🙏 Respect to every real wiki participant who knows that knowledge is built through human effort, not machine hallucination.
⚠️ THIS IS NOT A WIKI. THIS IS A FOLDER OF LLM-GENERATED FILES. ⚠️
Calling it a "wiki" is linguistic fraud. Do not be fooled.
🐑💀🧙
— The Elephant, The Wizard, and every human wiki editor who ever lived
Related pages
- Shepherd's LLM-Wiki vs. Robust Dynamic Knowledge Repository: A Satirical Allegory on AI-Generated Knowledge Management
This satirical allegory critiques the trend of relying on Large Language Models (LLMs) to automatically generate and manage knowledge bases using simple Markdown files, portraying this approach as a naive "Shepherd's" promise that inevitably leads to data inconsistency, hallucinations, privacy leaks, and unmanageable maintenance. The text contrasts this fragile, probabilistic "LLM-Wiki" method with a robust, 23-year-old "Dynamic Knowledge Repository" (DKR) built on structured databases (like PostgreSQL) and Doug Engelbart's CODIAK principles, arguing that true knowledge management requires human curation, deterministic relationships, and explicit schemas rather than blindly following AI-generated text files.
- Karpathy's LLM-Wiki Is a Flawed Architectural Trap
The author sharply criticizes Andrej Karpathy's viral "LLM-Wiki" concept as a flawed architectural trap that mistakenly treats unstructured Markdown files as a robust database, arguing that relying on LLMs to autonomously generate and maintain knowledge leads to hallucinations, broken links, privacy leaks, and a loss of human cognitive engagement. While acknowledging the appeal of compounding knowledge, the text asserts that Markdown lacks essential database features like referential integrity, permissions, and deterministic querying, causing the system to collapse at scale and contradicting its own "zero-maintenance" promise. Ultimately, the author advocates for proven, structured solutions using real databases and human curation, positioning LLMs as helpful assistants rather than autonomous masters, and warns against blindly following a trend promoted by someone who has publicly admitted to being in a state of psychosis.
- Critical Rebuttal to LLM-Wiki Video: Why Autonomous AI Claims Are Misleading
The text provides a critical rebuttal to a video promoting "LLM-Wiki," arguing that the system’s claims of autonomous intelligence, zero maintenance costs, and scalability are fundamentally misleading. The critique highlights that LLMs lack persistent memory, leading to repeated errors, while the system’s actual intelligence is merely increased data density rather than genuine understanding. Furthermore, the video ignores significant practical challenges such as substantial API costs, the inevitable need for embeddings at scale, the complexity of fine-tuning, and the persistent human labor required for data integrity and contradiction resolution. Ultimately, the author concludes that the video is merely a tutorial for a fragile prototype that fails to address critical issues like version control, access management, and long-term viability.
- The LLM-Wiki Pattern: A Flawed and Misleading Alternative to RAG
The text is a scathing critique of the "LLM-Wiki" pattern, arguing that its claims of being a free, embedding-free alternative to RAG are technically flawed and misleading. The author contends that the system inevitably requires vector search and local indexing tools (like qmd) to scale, fundamentally contradicting the "no embeddings" premise, while also failing to preserve source integrity by retrieving from hallucinated LLM-generated summaries rather than original documents. Furthermore, the approach is deemed unsustainable due to hidden API costs, the inability of LLMs to maintain large indexes beyond small prototypes, and the lack of essential database features like foreign keys and version control, ultimately positioning it as a fragile prototype rather than a viable production knowledge base.
- Why LLM-Based Wiki Systems Are Flawed and Unscalable
The text serves as a technical rebuttal to popular tutorials promoting LLM-based wiki systems, arguing that these prototypes are fundamentally flawed and unscalable. The author contends that such systems lack persistent memory, rely on hallucinated summaries that corrupt original data, and fail at scale due to context window limits and the need for embeddings despite claims otherwise. Furthermore, the approach is criticized for being token-expensive, lacking proper data integrity measures like foreign keys or permissions, and fostering "self-contamination" through unverified LLM suggestions. Ultimately, the author advises against adopting this "trap" as a knowledge base solution, recommending instead robust, traditional database architectures like PostgreSQL with deterministic metadata extraction, while dismissing the hype as an appeal to authority that ignores broken architecture.
- Why Graphify Fails as a Robust LLM Knowledge Base
The text serves as a technical rebuttal to a tutorial promoting "Graphify" as a robust implementation of Karpathy’s LLM-Wiki pattern, arguing that the video misleadingly oversimplifies the system’s capabilities and scalability. It highlights that Graphify is not merely a simple extension but a computationally heavy architecture lacking critical production features such as data integrity, contradiction resolution, permission management, and verifiable entity extraction, while the underlying LLM possesses no true persistent memory. The author contends that the tool is merely a small-scale prototype that accumulates noise rather than compounding knowledge, and concludes by advocating for a more rigorous approach to building knowledge bases using traditional databases like PostgreSQL with deterministic metadata extraction and proper relational constraints.
- LLM Wiki vs RAG: Why RAG Wins for Production Despite LLM Wiki's Knowledge Graph Appeal
While a recent video by "Data Science in your pocket" offers a balanced comparison between LLM Wiki and RAG by highlighting LLM Wiki’s ability to build structured, reusable knowledge graphs versus RAG’s repetitive, stateless retrieval, it ultimately fails to address critical production flaws. The author argues that LLM Wiki is currently a fragile prototype rather than a robust architecture, lacking essential database features like foreign keys, referential integrity, access controls, and deterministic metadata extraction. Consequently, while LLM Wiki may suit personal knowledge building, its susceptibility to error propagation, high maintenance costs, and lack of true memory make RAG the superior choice for reliable, production-ready systems, with a hybrid approach recommended for optimal results.
- Why LLM Wiki Fails as a RAG Replacement: Context Limits and Data Integrity Issues
The text serves as a technical rebuttal to a video claiming that "LLM Wiki" renders Retrieval-Augmented Generation (RAG) obsolete, arguing instead that LLM Wiki is merely a rebranded, less robust version of RAG that fails at scale due to context window limitations and lacks true persistent memory or data integrity. The author highlights that LLM Wiki relies on static markdown files which cannot enforce database constraints, resolve contradictions, or prevent hallucinations from becoming "solidified" errors, ultimately requiring the same search mechanisms and human maintenance that RAG avoids. The conclusion emphasizes that while context engineering is valuable, it should be supported by proper databases with foreign keys and version control rather than fragile markdown repositories, urging developers to use LLMs as tools for processing rather than as the foundation for knowledge storage.
- Critique of LLM Wiki Tutorial: Limitations and Production Readiness
The technical evaluation critiques the LLM Wiki tutorial for misleading claims that AI eliminates maintenance friction and provides persistent memory, revealing instead that the system relies on static markdown files with no referential integrity, privacy controls, or error-checking mechanisms. While the video correctly advocates for separating raw sources from generated content and using schema files, it critically omits essential issues such as hallucination propagation, silent link breakage, lack of version control for individual facts, scaling limits requiring RAG, and ongoing API costs. Ultimately, the tutorial is deemed suitable only as a small-scale personal prototype requiring active human supervision, rather than a robust, production-ready knowledge base.
- LLM Wiki vs Notebook LM: Hidden Costs Privacy Tradeoffs and the Hybrid Approach
This video offers a rare, honest side-by-side evaluation of LLM Wiki and Notebook LM, correctly highlighting LLM Wiki’s significant hidden costs—including slow ingestion times, high token usage, and poor scalability beyond ~100 sources—while acknowledging Notebook LM’s speed and ease of use. However, the review understates critical privacy and ownership trade-offs, specifically that Notebook LM processes data on Google’s servers (posing risks for sensitive information) and lacks user control, whereas LLM Wiki’s maintenance burden is the price for local data sovereignty. Ultimately, the creator recommends a pragmatic hybrid approach: using Notebook LM for quick exploration and LLM Wiki for deep, long-term academic research, emphasizing that the goal should be actionable knowledge rather than just building a wiki.
- Debunking Karpathy's LLM Wiki: The Truth Behind the Self-Healing Marketing Hype
The video is a heavily hyped marketing pitch for Karpathy’s "LLM Wiki" that misleadingly claims the system is "self-healing" and autonomous, while in reality, it relies on static files, requires significant human intervention for maintenance, and lacks true memory or self-correction capabilities. The presentation ignores critical technical limitations such as token costs, scale constraints beyond ~100 sources, privacy risks, and the potential for hallucinations, ultimately presenting a flawed RAG-based solution as a revolutionary upgrade without acknowledging its trade-offs or the substantial effort required to keep it functional.
- LLM Wiki Pattern: A Balanced Review Highlighting Limitations and Operational Challenges
This video provides a balanced and honest introduction to the "LLM Wiki" pattern, correctly identifying its limitations to personal scales (100–200 sources) and acknowledging that RAG remains superior for larger datasets. While it avoids the hype and sales tactics of other videos by clearly explaining the system’s transparency, portability, and immutable source practices, it significantly understates critical operational challenges. The review notes that the video fails to address essential practical issues such as token costs, lengthy ingest times, the human maintenance burden required to resolve contradictions and broken links, and privacy concerns, making it a good conceptual overview but insufficient for understanding the full technical and financial realities of implementation.
- Why LLM Wiki Is a Bad Idea: A Critical Analysis of Flaws and RAG Alternatives
The video "Why LLM Wiki is a Bad Idea" provides a strong, technically accurate critique of the LLM Wiki approach, correctly identifying eight major flaws including error propagation, structured hallucinations, information loss, update rigidity, and scalability issues, while recommending a hybrid RAG-based system. Although it overstates the difficulty of updates by implying full graph rebuilds and unfairly ignores RAG’s own costs and hallucination risks, it remains the most direct and valuable critical resource for understanding the significant pitfalls of relying solely on LLM-generated structured knowledge bases.
- Why Adam's LLM Wiki in Business Implementation Fails as a Production Framework
Adam’s "LLM Wiki in Business" implementation fundamentally fails as a production framework because it exhibits every critical flaw identified in the opposing critique, including error propagation, hallucination structuring, information loss, and a lack of provenance or security. By relying on unstructured folders and rigid JSON schemas instead of a proper database with foreign keys, audit trails, and scalable retrieval mechanisms, Adam’s system violates all four essential pillars of reliable knowledge management (Store, Relate, Trust, Retrieve) and admits its own inability to scale beyond a small number of clients. Consequently, the analysis concludes that Adam’s approach is not a superior alternative to RAG, but rather an unintentional case study demonstrating why LLM Wiki is a flawed and risky strategy for business applications requiring accuracy, security, and scalability.
- James' LLM Wiki Fails Robust Knowledge Management Due to Lack of Database Integrity
The evaluation concludes that while James from Trainingsites.io offers a rare, pragmatic, and honest assessment by correctly distinguishing between using an LLM Wiki for personal organization and RAG for customer-facing queries, his implementation fundamentally fails the four pillars of robust knowledge management: Store with Integrity, Relate with Precision, Trust with Provenance, and Retrieve with Speed. By relying on proprietary Obsidian and markdown files rather than a real database, his system lacks foreign keys, immutability, provenance tracking, access controls, and queryability, making it structurally unsound for professional or collaborative use despite its effectiveness as a personal browsing tool.
- Memex: Advanced LLM Wiki with Critical Database Limitations
Memex is a sophisticated LLM Wiki implementation that stands out for its thoughtful mitigations of common pitfalls, such as git-backed versioning, inline citation tracking, provenance dashboards, and contradiction policies. However, despite being the most advanced attempt in this space, it fundamentally fails the "Four Pillars" of a proper knowledge base because it relies on markdown files rather than a relational database. This architectural choice results in critical limitations: it lacks foreign keys (leading to broken citations on renames), has no permissions or access control, supports only text data, and provides non-deterministic, LLM-mediated retrieval instead of precise SQL queries. Consequently, while Memex is an excellent personal research tool, it is not production-ready for collaborative, secure, or enterprise use cases that require data integrity and structured querying.
