Get AI summaries of any video or article — Sign up free
Taking notes from YouTube videos in Obsidian thumbnail

Taking notes from YouTube videos in Obsidian

Nicole van der Hoeven·
5 min read

Based on Nicole van der Hoeven's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Start by choosing a capture strategy—manual notes, embedded playback, timestamp buttons, auto-pausing note capture, or caption-based highlights—because each affects how accurately you can return to moments later.

Briefing

Turning YouTube learning into usable notes inside Obsidian hinges on one practical problem: getting reliable timestamps or highlights from a watched video into a knowledge system without losing your place. The workflow options range from quick-and-simple embeds to purpose-built plugins, a browser extension that pauses playback for note-taking, and a newer approach that sidesteps timestamps entirely by using YouTube captions.

The simplest method requires no plugins: split the screen between Obsidian and YouTube, then manually create notes while watching. A more “integrated” version pastes YouTube’s embed code into Obsidian so the video plays inside the vault. That integration is convenient for keeping focus, but the embedded player is limited—scrubbing works, yet accidentally clicking back into the code can reset playback position, making it less reliable for precise note-taking.

For a more robust in-Obsidian workflow, the media extended plugin adds a dedicated media player experience. It can embed and play the video without relying on an iframe, and it supports timestamping from the player. A hotkey (control T in the walkthrough) inserts timestamps into the note as the video runs, producing a trail of timestamped snippets. However, clicking those timestamps may open the video in an external browser rather than staying within Obsidian, and timestamp accuracy can drift—one example lands back near the beginning instead of the intended 56-minute mark.

Another Obsidian-first option, timestamp notes (via a plugin), also timestamps while watching, but it renders timestamps as clickable buttons inside the sidebar. That makes it easier to jump around within the same Obsidian workspace. The tradeoff is that the notes are still tied to the video player’s behavior, and the workflow shown doesn’t solve the “pause, write, resume” friction.

That gap is addressed by the Chrome extension e-note. As the video plays, typing triggers an automatic pause, giving time to capture notes without losing context. After finishing, e-note provides an “open management” view with screenshots and timestamped notes; exporting as Markdown sends the material into an Obsidian vault. The downside is that navigation back to exact moments happens in the browser, not directly inside Obsidian, and the export is manual rather than one-click to a reading system.

The most recent and arguably most scalable approach is Readwise Reader, now in public beta with a YouTube integration. Instead of timestamping, it uses YouTube’s automatic captions: as the video plays, caption text highlights track the spoken content. Notes and highlights are then synced into Obsidian via an official Readwise plugin, bringing metadata and searchable text. The limitation is clear—highlights are text-based, not tied to exact playback timestamps.

Readwise Reader also adds an AI layer through Ghostreader, which can summarize caption-derived highlights and generate prompts or questions. In practice, it produces useful summaries and thought-provoking questions, though some tasks (like generating Q&A pairs) can fail depending on the prompt and available highlights.

In the end, the choice depends on where the user is watching and what they need most: timestamped, in-sidebar navigation (timestamp notes) on a laptop; “watch now, organize later” with searchable text (Readwise Reader) on mobile; and e-note when pausing for richer notes matters. The broader goal remains consistent: convert YouTube’s fast, often distracting consumption into structured, retrievable knowledge inside Obsidian.

Cornell Notes

Reliable note-taking from YouTube into Obsidian starts with choosing how to capture context: timestamps, captions, or both. For in-Obsidian playback, a simple iframe embed works but can be fragile when playback position resets. The media extended plugin improves robustness and supports timestamp hotkeys, though timestamp accuracy and navigation behavior can be inconsistent. Timestamp notes keeps timestamp entries as clickable buttons in the Obsidian sidebar, making it easier to scrub after the fact. For a different workflow, e-note pauses playback automatically when typing and exports Markdown to Obsidian, but navigation back to moments occurs in the browser. Readwise Reader avoids timestamps by using YouTube captions, syncing text highlights and metadata into Obsidian and enabling AI features like Ghostreader summaries and question generation.

Why do iframe-based YouTube embeds in Obsidian feel unreliable for note-taking?

The embedded player is limited and can be easy to disrupt. If someone clicks back into the pasted code accidentally, playback position can reset, which breaks the “note at the exact moment” workflow. It’s workable for watching and scrubbing, but not ideal when precise time-based capture matters.

How does the media extended plugin improve the “watch and timestamp” workflow inside Obsidian?

Media extended plays the video in a more robust way than an iframe and supports timestamping from the player. A hotkey (control T in the walkthrough) inserts the timestamp into the note while the video is running. That produces timestamped notes without needing to manually manage an iframe, though clicking timestamps may open the video in a browser and timestamp placement can sometimes be off.

What makes timestamp notes especially convenient after you’ve captured highlights?

Timestamp notes turns timestamps into clickable buttons inside Obsidian. Clicking a timestamp jumps to the corresponding moment while staying in the sidebar context, which makes it fast to scrub through the video based on your notes rather than hunting manually.

What problem does e-note solve that the in-Obsidian timestamp workflows don’t?

e-note automatically pauses the YouTube video when the user starts typing. That gives time to write fuller notes without losing the current moment, reducing the back-and-forth between pausing and resuming that can be awkward with other approaches.

How does Readwise Reader handle YouTube notes differently from timestamp-based plugins?

Readwise Reader relies on YouTube’s automatic captions instead of timestamping playback moments. As the video plays, caption text highlights track the spoken words. Notes and highlights sync into Obsidian with Readwise metadata, but the highlights are text-based rather than tied to exact playback timestamps.

What does Ghostreader add on top of caption-based highlights in Readwise Reader?

Ghostreader can generate AI outputs such as summaries and thought-provoking questions based on the highlighted caption text. In the walkthrough, it produced a summary for a Playwright-related video and generated legitimate questions (e.g., advantages, challenges, and available resources). Some tasks like generating Q&A pairs can error out depending on the prompt and highlight set.

Review Questions

  1. Which workflow best preserves exact playback moments inside Obsidian, and what are its limitations?
  2. Compare how media extended and timestamp notes handle timestamp insertion and navigation after capturing notes.
  3. Why might a caption-based approach like Readwise Reader be preferable even without exact timestamps?

Key Points

  1. 1

    Start by choosing a capture strategy—manual notes, embedded playback, timestamp buttons, auto-pausing note capture, or caption-based highlights—because each affects how accurately you can return to moments later.

  2. 2

    Use iframe embeds only for lightweight note-taking; accidental interaction with the code can reset playback position.

  3. 3

    media extended supports timestamp hotkeys while playing in Obsidian, but timestamp accuracy and where links open can be inconsistent.

  4. 4

    timestamp notes makes timestamps clickable buttons in the Obsidian sidebar, improving post-session scrubbing through your notes.

  5. 5

    e-note’s auto-pause-on-typing reduces the friction of writing while watching, and it exports Markdown into an Obsidian vault.

  6. 6

    Readwise Reader’s YouTube integration uses automatic captions, syncing searchable text and metadata into Obsidian without relying on playback timestamps.

  7. 7

    Ghostreader can turn caption highlights into summaries and targeted questions, but some AI tasks may fail depending on the prompt and highlight quality.

Highlights

Iframe embeds in Obsidian can be fragile: clicking back into the code can reset playback position, undermining precise timestamp note-taking.
media extended timestamps with a hotkey (control T) while playing, but timestamp navigation may open in a browser and timestamps can drift.
timestamp notes turns timestamps into clickable buttons in the Obsidian sidebar, making it easy to jump around after the fact.
e-note pauses the video automatically when typing, then exports timestamped notes to Obsidian as Markdown.
Readwise Reader syncs caption-based highlights into Obsidian and adds Ghostreader AI summaries and questions—without exact playback timestamps.

Topics