Critical Evaluation of Local LLM Wiki with Obsidian: Fundamental Flaws and Business Unsuitability


Full Evaluation: “Local LLM Wiki with Obsidian” — A Tutorial in Good Faith That Still Fails the Four Pillars

⚠️ BIG FUCKING WARNING: OBSIDIAN IS PROPRIETARY SOFTWARE ⚠️

Before we evaluate anything else, this needs to be stated clearly and loudly:

Issue Details
Proprietary license Obsidian is not open source. You do not own the software. You cannot audit the code. You cannot modify it. You cannot redistribute it.
Closed source The source code is private. You have no idea what telemetry, data collection, or background processes exist.
Vendor lock-in Your “knowledge base” is stored in markdown (good), but the graph view, backlinks, plugins, and all the “smart” features are tied to Obsidian’s proprietary implementation.
Future uncertainty If Obsidian disappears, goes subscription-only, changes licensing, or gets acquired by a company like Notion or Microsoft — your workflow is fucked.
Telemetry concerns Obsidian collects anonymous usage data. For personal use, maybe fine. For client NDAs, medical records, or business secrets? Absolutely not.
Plugin ecosystem Many popular Obsidian plugins are community-maintained, unvetted, and can execute arbitrary code. A malicious plugin could exfiltrate your entire knowledge base.
No built-in permissions Obsidian has zero access control. Anyone with access to your file system sees everything. No multi-user. No role-based access.
Sync is paid Want to sync across devices? That’s a paid subscription (Obsidian Sync). Or you roll your own with Syncthing, but then you’re managing file conflicts manually.
Mobile apps are proprietary The iOS and Android apps are closed source. Your data passes through their app layer.

If you are building a knowledge base for personal hobby use: Obsidian is fine. Annoying that it’s proprietary, but workable.

If you are building a knowledge base for business, clients, research collaboration, or anything containing sensitive information: Using Obsidian as the foundation is negligent. You are building your house on someone else’s land with a lease that can change at any time.

https://gnu.support/images/2026/04/2026-04-23/800/obsidian-is-proprietary.webp


Executive Summary

This video is different from Adam’s. The creator is transparent, humble, and explicitly acknowledges limitations. He admits his implementation is “stateless and static,” not incremental. He clarifies he is simplifying for demonstration. He does not claim production readiness.

And yet — the architecture still fails every single one of the four irreducible pillars.

This evaluation is not an attack on the creator. It is a structural analysis. Good intentions do not compensate for missing foreign keys, missing permissions, missing provenance, and missing queryability — plus a proprietary dependency that you do not control.


What This Video Gets Right (Credit Where Due)

Aspect Evaluation
Transparency ✅ Explicitly says his version is “not following what Karpathy was saying” about incremental updates
Honesty about limitations ✅ Admits it is “stateless and static,” “not dynamic”
Local-first approach ✅ Uses Ollama + Gemma, no data sent to OpenAI/Claude
Practical tutorial ✅ Shows working code, Obsidian integration, graph view
No overclaiming ✅ Does not say “production ready” or “better than RAG”
Human-in-the-loop ✅ Obsidian provides manual curation ability

The creator deserves respect for intellectual honesty. He is not a “sheep.” He is a practitioner showing a working prototype.

But the choice of Obsidian as the viewing/editing layer introduces fatal problems that go beyond architecture.


The Four Pillars Evaluation

1. Store with Integrity → ❌ FAIL

Requirement Implementation Status
Start with raw, immutable source ✅ Yes — clippings folder contains original articles PASS
Use a real database (schemas, FK, indexes) ❌ No — markdown files + folders FAIL
Store any knowledge type ⚠️ Text/articles only — not videos, emails, code, spreadsheets PARTIAL
Record files or locations ✅ Yes — Obsidian file paths PASS
Preserve integrity / immutability ❌ No — files can be edited arbitrarily. No versioning. No cryptographic verification. FAIL

Verdict: Same fatal flaw as all LLM Wiki implementations — no database, no foreign keys, no immutability.


2. Relate with Precision → ❌ FAIL

Requirement Implementation Status
Typed relationships (supports/contradicts) ❌ No — only generic “topic” and “entity” links FAIL
Foreign keys ❌ Impossible — markdown has no FKs FAIL
Bidirectional links ⚠️ Partial — Obsidian’s graph view shows backlinks, but they are not enforced at storage level PARTIAL
Explicit hierarchy ⚠️ Folders provide hierarchy PARTIAL
Hyperlinking ✅ Yes — markdown links between pages PASS

Verdict: Obsidian’s graph view creates the appearance of relationships, but there are no typed, enforced, foreign-key-backed connections. Rename a file and links break silently — Obsidian does not fix this.


3. Trust with Provenance → ❌ FAIL

Requirement Implementation Status
Provenance — every fact knows its source ❌ No — the JSON summary does not store which sentence from which article generated each claim FAIL
Permissions / access control Obsidian has none. Zero. Zilch. Anyone with file system access sees everything. No user roles. No object-level permissions. FAIL
Audit trails ❌ None in Obsidian. No “who changed what, when, why.” FAIL
Human curation ✅ Yes — Obsidian allows manual editing PASS
Cryptographic verification ❌ None FAIL

Critical Issue: The LLM extracts topics, entities, and summaries. If the LLM hallucinates a topic or misattributes an entity, that error becomes permanent markdown. There is no way to trace a claim back to the original text. The creator admits quality “depends on how strong is your model” — but does not address how to detect or correct hallucinations.

Obsidian-Specific Trust Issues:

Obsidian Trust Problem Severity
Proprietary closed source — cannot audit 🔴 HIGH
Telemetry collection — what data leaves your machine? 🟠 MEDIUM
No built-in encryption at rest 🔴 HIGH
Plugin system can execute arbitrary, unvetted code 🔴 HIGH
Sync (if used) passes through their servers 🟠 MEDIUM
No multi-user support — impossible for team use 🔴 HIGH

4. Retrieve with Speed → ❌ FAIL

Requirement Implementation Status
Search by any dimension ⚠️ Obsidian has full-text search, but not by entities, topics, summaries in a queryable way PARTIAL
SQL queries ❌ No FAIL
Queryable structure ❌ No — markdown files require parsing FAIL
Access file properties ⚠️ Filesystem provides basic metadata PARTIAL

Verdict: Obsidian’s search is better than nothing, but it is not a queryable database. You cannot ask “show me all articles about ‘learning’ written by author X with more than 3 key points.” That requires SQL or a real query engine.

Obsidian-Specific Retrieval Issues:

Obsidian Retrieval Problem Severity
Search is proprietary — you cannot extend or automate it easily 🟠 MEDIUM
No programmatic query API (unless you parse markdown files yourself) 🔴 HIGH
Graph view is visual only — not queryable 🟠 MEDIUM
Large vaults (10k+ files) cause performance degradation 🟠 MEDIUM

The Creator’s Own Admissions (Worth Highlighting)

The creator says things most LLM Wiki promoters never say:

“This is actually not following what Karpathy was saying because he was saying that this wiki should be incremental. … What we are doing is making it stateless and static. It’s not dynamic.”

“I just didn’t want when we make a mistake go and just remove them one by one.”

“The topic and entity quality depends on how strong is your model.”

These are honest admissions of limitation. He is not selling a fantasy. He is showing a prototype.

But he does not mention that Obsidian is proprietary. He does not mention the privacy implications. He does not mention that his entire “knowledge base” is locked into a closed-source ecosystem that can change its terms, pricing, or existence at any time.


One Comparison Table: This Implementation vs. Requirements

Requirement This Implementation Status
STORE: Real database Markdown files + Obsidian ❌ FAIL
STORE: Immutability Editable files, no versioning ❌ FAIL
RELATE: Typed relationships Only generic “topic”/“entity” ❌ FAIL
RELATE: Foreign keys None ❌ FAIL
RELATE: Bidirectional links ⚠️ Obsidian graph view (display only, not enforced) PARTIAL
TRUST: Provenance Lost after LLM summarization ❌ FAIL
TRUST: Permissions Obsidian has none FAIL
TRUST: Audit trails None ❌ FAIL
TRUST: Open source / auditable Obsidian is proprietary FAIL
RETRIEVE: SQL / structured queries No ❌ FAIL
RETRIEVE: Search by any dimension ⚠️ Full-text only PARTIAL
RETRIEVE: Works at scale Untested — likely fails beyond hundreds of articles ⚠️ UNKNOWN
Human in the loop ✅ Yes (Obsidian) PASS
Incremental updates ❌ Explicitly not implemented (stateless) FAIL
Hallucination detection/correction ❌ None FAIL
Vendor lock-in risk High — Obsidian proprietary FAIL
Multi-user / team support None FAIL

Tally:13 failures | ⚠️ 3 partial | ✅ 1 pass


What This Video Reveals About LLM Wiki (Unintentionally)

Observation Implication
The creator had to write hundreds of lines of boilerplate to parse markdown, format JSON, and manage files LLM Wiki is not a framework — it is a DIY file management problem
He explicitly chose to make it stateless because incremental updates are hard “Incremental” is a promise Karpathy made but few have implemented correctly
Quality depends entirely on the local model Smaller models (Gemma 4B, Llama 3.2) will produce lower-quality topics and entities, and there is no validation step
The graph view looks impressive but is just markdown links Obsidian’s visualization does not fix missing foreign keys or provenance
No mention of privacy or permissions The system assumes a single user on a single machine. Add multiple users or sensitive data, and it collapses.
Obsidian is proprietary The entire “knowledge base” is locked into a closed-source ecosystem you do not control

Comparison with Adam’s Video (Brief)

Aspect Adam (Agency) This Creator (Local Tutorial)
Claims production readiness ❌ Yes (falsely) ✅ No (honest prototype)
Acknowledges limitations ❌ Minimal ✅ Explicit (“stateless”, “not incremental”)
Privacy awareness ❌ None (AI sees all client emails) ⚠️ Local-only, but Obsidian telemetry unknown
Overclaims superiority to RAG ✅ Yes ❌ No
Uses proprietary software ❌ N/A (folders only) Yes — Obsidian
Overall honesty ❌ Misleading ✅ Refreshingly honest about code, silent about Obsidian lock-in

⚠️ OBSIDIAN-SPECIFIC WARNINGS (REPEATED FOR EMPHASIS) ⚠️

If you are considering this tutorial, read these warnings carefully:

Warning Explanation
Proprietary = You don’t own it Obsidian’s source code is closed. You cannot fix bugs. You cannot audit for security. You cannot guarantee it will exist in 5 years.
Telemetry Obsidian collects anonymous usage data. For client work, NDAs, or medical information, this is unacceptable.
No permissions Obsidian has zero access control. Every person with file system access sees every note. No roles. No encryption.
Plugin risk Plugins run arbitrary code. A malicious or compromised plugin can steal your entire knowledge base.
Sync is paid and proprietary Obsidian Sync costs money and sends your data through their servers. Self-hosted sync (Syncthing) is possible but creates file conflicts.
Vendor lock-in Your workflow depends on Obsidian-specific features: graph view, backlinks pane, Dataview, etc. Migrating away is painful.
Mobile apps are closed The iOS and Android apps are proprietary. If you use mobile, you are locked in deeper.
No multi-user Obsidian is single-user only. There is no way to collaborate with a team on the same vault without file conflicts and manual merging.

If you still want to use Obsidian for personal hobby projects: Fine. Your choice.

If you are building anything for business, clients, research collaboration, or sensitive information: Do not use Obsidian. Use something open source (Logseq, Joplin, Trilium) or a real database with proper access control (PostgreSQL + a proper frontend). Or at minimum, be aware that you are building on rented land.


Final Verdict

Pillar Result
Store with Integrity ❌ FAIL
Relate with Precision ❌ FAIL
Trust with Provenance ❌ FAIL
Retrieve with Speed ❌ FAIL

Overall:FAILS ALL FOUR PILLARS

Plus:Proprietary dependency (Obsidian) adds vendor lock-in, telemetry concerns, no permissions, no multi-user, and unvetted plugin risk.


Why This Matters (Even for a “Good” Tutorial)

This video is not malicious. It is not hype-driven. The creator is skilled, transparent, and generous with his knowledge.

But that is exactly why it is dangerous.

Because when someone honest and competent builds something that looks like a knowledge base — with folders, links, summaries, a beautiful graph view, and a local-first LLM — viewers will assume it is a proper knowledge base. They will copy it. They will use it for their own research, their own clients, their own sensitive notes.

And they will discover, too late, that:

The four pillars are not optional. They are not “enterprise features.” They are the minimum requirements for anything that deserves the name “knowledge base.”

Adding a proprietary, closed-source, single-user, no-permissions note-taking app on top of a flawed architecture does not fix the architecture. It adds more problems.


Recommendation for Viewers

If you watch this video:

  1. Learn from the code — it is a good tutorial on LLM + file processing.
  2. Do not deploy this as a knowledge base — treat it as a prototype or personal experiment.
  3. Do not use Obsidian for anything serious — unless you accept vendor lock-in, telemetry, no permissions, and closed source.
  4. Ask yourself: What happens when I have 500 articles? What happens when I need to share this with a colleague? What happens when the LLM hallucinates a topic? What happens when I rename a file? What happens when Obsidian changes its license or goes subscription-only?
  5. If you need a real knowledge base, start with PostgreSQL, foreign keys, schemas, permissions, audit logs, and a proper query layer. Use open source tools you control. Add LLMs as tools for summarization and relationship suggestion — not as the engine.

The actual video

Closing Thought

The creator of this video is not a sheep. He is a builder who showed his work honestly.

But the architecture he built — Karpathy’s LLM Wiki pattern on top of Obsidian — remains fatally flawed. The four pillars exist for a reason. No amount of Obsidian graph visualization or honest tutorial-making can compensate for missing foreign keys, missing provenance, missing permissions, missing SQL, and a proprietary dependency you do not control.

🐑💀🧙

P.S. — If you downvoted the previous evaluation that called out Obsidian as proprietary, ask yourself: why are you defending a closed-source company that can change your access at any time? Your “knowledge base” is not yours if the software that reads it is not yours.

⚠️ THE WORD “WIKI” HAS BEEN PERVERTED ⚠️

⚠️ ARCHITECTURAL CRIME SCENE ⚠️

⚠️ THE WORD "WIKI" HAS BEEN PERVERTED ⚠️

By Andrej Karpathy and the Northern Karpathian School of Doublespeak

✅ A REAL WIKI — Honoring Ward Cunningham, Wikipedia, and every human curator worldwide
❌ KARPATHY'S "LLM WIKI" — An insult to the very concept
Human-curated
Real people write, edit, debate, verify, and take responsibility.
LLM-generated
Hallucinations are permanent. No human took ownership of any "fact."
Versioned history
Every edit has author, timestamp, reason. Rollback is trivial.
No audit trail
Who changed what? When? Why? Nobody knows. Git is an afterthought.
Source provenance
Every claim links back to its original source. You can verify.
"Trust me, I'm the LLM"
No traceability from summary back to source sentence. Errors become permanent.
Foreign keys / referential integrity
Links are database-backed. Rename a page, links update automatically.
Links break when you rename a file
No database. No foreign keys. Silent link rot guaranteed.
Permissions / access control
Fine-grained control: who can see, edit, delete, approve.
Anyone with file access sees everything
Zero access control. NDAs, medical records, client secrets — all exposed.
Queryable (SQL, structured)
Ask complex questions. Get precise answers. Join tables.
Browse-only markdown
Full-text search at best. No SQL. No structured queries.

🕯️ This is an insult to every Wikipedia editor, every MediaWiki contributor, every human being who spent hours citing sources, resolving disputes, and building the largest collaborative knowledge repository in human history. 🕯️

KARPATHY'S "WIKI" has:
❌ No consensus-building
❌ No talk pages
❌ No dispute resolution
❌ No citation requirements
❌ No editorial oversight
❌ No way to say "this fact is disputed"
❌ No way to privilege verified information over hallucinations
❌ No way to trace any claim back to its source

In the doublespeak of Northern Karpathia:

"Wiki" means "folder of markdown files written by a machine that cannot remember what it wrote yesterday, linked by strings that snap when you breathe on them, viewed through proprietary software that reports telemetry to people you do not know, containing 'facts' that came from nowhere and go nowhere, protected by no permissions, audited by no one, and trusted by no one with a functioning prefrontal cortex."

🙏 Respect to Ward Cunningham who invented the wiki in 1995 — a tool for humans to collaborate.
🙏 Respect to Wikipedia editors worldwide who defend verifiability, neutrality, and consensus.
🙏 Respect to every real wiki participant who knows that knowledge is built through human effort, not machine hallucination.

⚠️ THIS IS NOT A WIKI. THIS IS A FOLDER OF LLM-GENERATED FILES. ⚠️

Calling it a "wiki" is linguistic fraud. Do not be fooled.

🐑💀🧙

— The Elephant, The Wizard, and every human wiki editor who ever lived

Related pages