How To Highlight Physical Books in Readwise

TL;DR

Photographing one page at a time improves OCR reliability when importing physical-book highlights into Readwise.

Briefing Cornell Notes

Briefing

Readwise can turn physical-book highlighting into searchable, spaced-repetition notes—but doing it well depends on how the highlights are captured and tagged. The core workflow is to photograph a single page (not two tightly packed pages) so OCR can reliably extract the highlighted sentence, then save that passage as a Readwise highlight tied to the correct book. Once the text lands in Readwise, it becomes part of the app’s daily review system, which boosts recall and retention compared with manually flipping back through a physical copy.

The process starts with a practical problem: physical highlights are easy to mark but hard to review quickly. In the example, a hockey analytics book from 2020 had yellow highlighter notes, yet revisiting them required page-by-page searching—too slow to be useful. The workaround was earlier manual journaling with page numbers and thoughts, but the goal now is to migrate those insights into Readwise so they show up in reminders and benefit from space repetition.

In Readwise, highlights can be added either by typing the passage or by using the “photo” route. The photo route relies on OCR, so the transcript emphasizes a key capture rule: when photographing, include only one page per image when pages are close together. If two pages appear in the same photo, OCR may struggle and misread where the text belongs, even if it eventually guesses a page number. After taking a photo, the user selects the extracted sentence, saves it as a highlight, and optionally adds a note such as the page number or a brief interpretation.

Organizing notes inside Readwise also matters. The transcript demonstrates adding a title to structure the imported content: by using the tag “.h1” (and suggesting “h2.h” for lower-level headings), Readwise can treat that line as a header. In the browser view, the header then appears as a section label—effectively creating a table-of-contents-like structure for the book’s highlights.

There’s a tradeoff. OCR can be faster for long passages, but it can introduce typos and formatting issues (including spacing and posture-related alignment problems). Typing may be slower but can be more accurate. The takeaway is not a single “best” method; instead, it’s a repeatable approach—photo-based OCR for efficiency, manual typing for precision—so readers can test which workflow fits their reading habits and tolerance for cleanup.

Cornell Notes

The transcript lays out a workflow for importing physical-book highlights into Readwise using OCR and optional manual typing. To make OCR reliable, it recommends photographing one page at a time—especially when pages are close together—then selecting the extracted sentence and saving it as a highlight tied to the correct book. It also shows how to create structure in Readwise by adding headers using the “.h1” tag (and notes that “h2.h” can be used for subtitles). The main decision point is speed versus accuracy: OCR can be quicker for long passages but may produce typos or formatting glitches, while typing is slower but more precise.

Why does importing physical-book highlights into Readwise matter for review habits?

Physical highlights are easy to mark but hard to revisit quickly because they require flipping through pages. Readwise turns those highlighted passages into items that can be reviewed via spaced repetition, improving recall and retention. Instead of searching page-by-page, highlights become part of a daily review workflow (with a configurable number of highlights per day).

What capture rule improves OCR accuracy when photographing book pages?

Photograph only one page per image. When two pages are included—especially if they’re close together—OCR can mis-handle the layout and extraction. The transcript notes that OCR may still guess a page number, but single-page photos are “much better” for getting the intended sentence cleanly.

What are the two main ways to add a highlight in Readwise from a physical book?

One method is typing the passage manually (“via text”). The other is adding via photo, which uses OCR to extract the highlighted text from the image. After OCR extraction, the user selects the sentence and saves it as a highlight.

How can imported highlights be organized into sections inside Readwise?

By adding header tags in the note/title field. The transcript demonstrates using “.h1” so Readwise treats the line as a header; in the browser view, that header appears as a section label. It also suggests using “h2.h” for subtitles, which can create a more table-of-contents-like structure.

When might manual typing be preferable to OCR?

Typing can be preferable when OCR errors would be costly. The transcript points to potential issues with OCR: typos, spacing problems, and formatting misalignment. For long passages, OCR may still be faster, but the user may need to weigh speed against cleanup time.

Review Questions

What specific photo-taking adjustment helps OCR avoid misreading when pages are close together?
How does the “.h1” tag change how content appears in Readwise’s browser view?
What tradeoffs does the transcript identify between OCR-based importing and manual typing?

Key Points

1
Photographing one page at a time improves OCR reliability when importing physical-book highlights into Readwise.
2
Use Readwise’s “add highlight” flow to save extracted sentences as highlights tied to the correct book.
3
After OCR extraction, verify and select the intended sentence before saving to avoid incorrect imports.
4
Add structural headings using the “.h1” tag so sections appear as headers in Readwise (with “h2.h” suggested for subtitles).
5
OCR-based importing can be faster for long passages, but it may introduce typos or formatting/spacing issues.
6
Manual typing is slower but can reduce OCR mistakes and preserve formatting accuracy.
7
The most effective workflow depends on whether speed or precision matters more for the reader’s review process.

Highlights

The fastest path to importing physical highlights is photo-based OCR—but only if each image contains a single page.

Adding “.h1” turns a line into a header inside Readwise, creating readable sections for imported highlights.

OCR can mis-handle formatting and introduce typos, so long passages may require a speed-versus-accuracy decision.

Single-page photos reduce OCR confusion when book pages are tightly spaced.

Topics

Readwise Highlights
OCR Import
Spaced Repetition
Header Tagging
.h1 Tag
Physical Books

Mentioned

Readwise