Knowledge Organization in Obsidian

TL;DR

Treat knowledge organization as an adaptable system, not a universal “perfect” file structure that works for every person and every vault.

Briefing Cornell Notes

Briefing

Knowledge organization in Obsidian is best treated as a practical, adaptable system—not a one-time “perfect” setup. The core message is that personal knowledge management should borrow proven ideas from library and archival practice (classification, cataloging, and controlled access points), then reshape them to fit how a specific person works. The payoff is less time spent reinventing structure and more reliable retrieval when notes multiply.

The talk begins by rejecting the idea that any single file structure or tagging scheme will solve everyone’s problems. Instead, it offers a rapid history of how Western institutions organized knowledge—starting with Mesopotamian tablets labeled with descriptors, moving through ancient Greek and Roman library practices that used prefatory scrolls and author-related metadata, and then into periods where organization often reflected the “whims” of whoever controlled collections. After the printing press, the information flood triggered new classification efforts, including Gabriel Naudé’s seven-category library scheme (theology, medicine, juris law, history, philosophy, mathematics, and amenities), designed partly to make shelves look orderly.

Modern library systems emerged as collections grew and became too large for ad hoc methods. In the United States, federal investment after the burning of Washington’s library led to the Jefferson collection being acquired without a ready sorting system, prompting improvised approaches. Later, major classification frameworks took hold: Dewey Decimal Classification (still widely used in public libraries), the Library of Congress Classification (built from Cutter Expansive Classification and now the most extensive hierarchy), and Universal Decimal Classification by Paul Otlet and Henri La Fontaine—described as faceted and analytical-synthetic, enabling database-style queries using number strings (for example, filtering by subject and time range).

From that history, the talk draws a key distinction: classification answers “where things go,” while cataloging answers “what things are.” In personal note systems, “classification” maps to folder structures and shelf-like placement, while “cataloging” maps to the internal structure of notes—titles, metadata, and linked content that make retrieval possible. The speaker emphasizes that humans naturally remember locations (a “memory palace” effect), which is why folder trees and consistent placement work.

The practical guidance then shifts to how to implement these ideas in Obsidian. Folder structures can vary by project: one part of a vault might use a decimal-style scheme, while another uses a different hierarchy better suited to the content. The talk also recommends reusing established classification systems rather than building from scratch, with specific examples like Cutter and Library of Congress subject headings (including “see also” and narrower terms). Finally, it challenges simplistic tag-first thinking. Tags are compared to uncontrolled reader indexing—useful but prone to inconsistency and misspellings—so the talk argues for more structured retrieval via linking and “access points” (library-style entry terms). As a creative twist, it discusses using emojis as semi-structured status or note-type markers, noting that emojis carry real semantics and could be treated as a more constrained alternative to free-form tags.

Cornell Notes

The talk argues that personal knowledge organization in Obsidian should borrow from library science while staying flexible. It distinguishes classification (“where items go,” often implemented as folder trees) from cataloging (“what items are,” implemented as note titles/metadata and internal structure). A quick history of library systems—from ancient labeled tablets to Dewey, Library of Congress, and Universal Decimal—shows why large collections need structured retrieval. The practical takeaway is to reuse proven classification ideas (or parts of them) rather than inventing a whole new system, and to rely more on linking and controlled access points than on free-form tags. Emojis are presented as a possible constrained alternative for note status, since they have built-in meaning.

Why does the talk separate “classification” from “cataloging,” and how does that translate to Obsidian?

Classification is treated as the placement system—what shelf/folder a thing belongs in—because humans can remember locations. In Obsidian terms, that maps to folder structure (and sometimes consistent naming) so notes feel like items on a shelf. Cataloging is treated as the description system—what a thing is—so retrieval works even when you don’t remember where it was placed. In Obsidian terms, cataloging maps to the internal note structure: titles, metadata, and the way notes are organized and linked inside the note rather than only where they sit in the vault.

What historical problem drove the shift from ad hoc organization to formal classification systems?

As collections grew—especially after the printing press—people faced “too much to read” and “too much to know,” making haphazard organization impractical. The talk frames major systems as responses to scale: Dewey Decimal Classification for broad public use, Library of Congress Classification for large academic/national collections, and Universal Decimal Classification for faceted, query-like retrieval using number strings.

How do Dewey, Library of Congress, and Universal Decimal differ in the way they support retrieval?

Dewey Decimal Classification organizes materials into a numeric hierarchy and remains common in public libraries. Library of Congress Classification is described as extremely large and hierarchical, with subject headings that can guide navigation through broad-to-narrow terms. Universal Decimal Classification is presented as analytical-synthetic and faceted: a string of numbers can represent multiple attributes at once, enabling database-style filtering (e.g., subject plus time range).

Why does the talk warn against relying on free-form tags as the main organizing strategy?

Tags are portrayed as historically tempting because they let readers label content, but they become unreliable at scale. The talk points to real-world tagging systems (like Goodreads) where misspellings and inconsistent wording create messy, hard-to-remember categories. Without structure, tags lose their retrieval power as the number of tags grows.

What does “access point” mean in this context, and why is it useful?

An access point is a controlled entry term that helps users find materials by common search intents—such as searching by author name, a recommended title, or a topic phrase. The talk contrasts this with uncontrolled tags by arguing that linking and structured entry terms make retrieval more dependable, similar to how library catalogs work with standardized headings.

How does the talk use emojis to rethink tagging?

Emojis are treated as semi-structured markers rather than arbitrary labels. The talk describes mapping emoji colors/shapes to note categories and status levels (for example, using a pink-based scheme for non-research notes and a yellow-based scheme for research notes). The key claim is that emojis have real semantics and constraints (they’re not just meaningless characters), which could reduce the chaos of free-form tagging.

Review Questions

In your own Obsidian workflow, what would you treat as “classification” (placement) versus “cataloging” (description)?
Which of the three major library classification approaches (Dewey, Library of Congress, Universal Decimal) best matches how you search for notes, and why?
What retrieval problems can free-form tags create as your vault grows, and what alternative does the talk recommend?

Key Points

1
Treat knowledge organization as an adaptable system, not a universal “perfect” file structure that works for every person and every vault.
2
Use classification for placement (folder trees) and cataloging for description (note titles/metadata and internal structure) to improve retrieval.
3
Borrow from established library classification systems instead of reinventing a complete scheme from scratch.
4
Folder structures can vary by project; different hierarchies can coexist within one vault when they fit the content.
5
Free-form tags tend to degrade into inconsistent, misspelled, hard-to-remember categories as they scale.
6
Linking notes and using controlled access points can provide more reliable retrieval than uncontrolled tagging.
7
Emojis can be used as constrained, semi-structured status markers because they carry meaning rather than functioning as arbitrary labels.

Highlights

Classification answers “where,” cataloging answers “what”—a split that maps cleanly onto Obsidian folders versus note-internal metadata and structure.

Universal Decimal Classification is presented as faceted and analytical-synthetic, enabling multi-attribute queries through number strings.

Free-form tags are criticized for inconsistency at scale, while library-style access points and linking are framed as sturdier retrieval mechanisms.

Folder trees can be project-specific, letting different classification approaches coexist inside one personal vault.

Topics

Knowledge Organization
Library Classification
Cataloging vs Classification
Tags vs Linking
Obsidian Folder Structure

Mentioned

Paul Otlet
Henri La Fontaine
Gabriel Naudé
Thomas Jefferson
Dewey Decimo