Debunking Visual Myths: What science says and how to leverage it for better PKM

TL;DR

Only the ~13-millisecond image-processing claim is tied to a specific, repeatable research paper; many other popular percentages lack rigorous support.

Briefing Cornell Notes

Briefing

A cluster of popular “visual superiority” claims—like people processing images tens of thousands of times faster than text or remembering most information from what they see—doesn’t hold up under scrutiny. The only widely cited number in the set that traces cleanly to repeatable research is a specific timing estimate: it takes about 13 milliseconds for the brain to process an image. Other headline percentages (including “90% visual” and “80% remembered from seeing”) trace back to older training-industry charts and vague secondary sources, with the trail going cold and no clear experimental basis.

That debunking matters because it changes how people should design knowledge systems. Even if visuals aren’t inherently “better” by default, they can still be powerful when used with intent—especially in PKM (personal knowledge management) workflows that rely on retrieval, understanding, and synthesis. The takeaway isn’t “ignore visuals,” but “stop treating visuals as magic backed by dubious statistics.”

The practical pivot comes through an article on the syntactic theory of visual communication by Paul Martin Lester. The argument starts with a cultural mismatch: education often treats pictures as secondary to text, even though modern life is increasingly visually mediated. From there, Lester challenges the idea that pictures are simply “nonverbal” by proposing that images can carry structured meaning—though not in the same sequential, word-order way as language.

Lester’s core distinction is between pictures as presentational (holistic, taken in at once) versus text as discursive (narrative, sequential). He also argues that pictures lack an established “visual alphabet” and recognized syntax in the strict linguistic sense—yet they still rely on conventions, component recognition, and semiotic rules. References in the discussion point to how humans recognize objects from parts (via “geons,” basic shape-like components) and how meaning emerges when signs are combined and interpreted by viewers.

A key semiotics thread ties visuals to a broader communication model: a single sign doesn’t convey meaning by itself; meaning depends on other signs. In practice, that’s why a newspaper layout works when the front-page photo, caption, headline, and story form a coordinated reading path. The synthesis of words and images is described as complementary—text can “load” an image with cultural and moral context, while images can communicate ideas that would take many words.

From Lester’s syntactic theory, the discussion distills six PKM-oriented takeaways. Visuals and text should reinforce each other through deliberate design and tight alignment between titles, captions, and diagrams. Images function as compression: they can condense complex concepts into “atomic” units that later get rearranged into new narratives. Verbalization—turning visuals into captions or summary sentences—strengthens recall. Finally, illustration is framed as a method for converting abstract ideas into concrete representations, and the overall recommendation is to invest in learning illustration as a high-leverage skill rather than relying on text-only study habits.

Cornell Notes

Popular claims that visuals are processed dramatically faster than text and that people remember most information from seeing it lack solid research support. The one timing figure that does map to repeatable work is an image-processing estimate of about 13 milliseconds; other percentages trace to older charts and secondary sources without clear experimental backing. The discussion then uses Paul Martin Lester’s syntactic theory of visual communication to argue that pictures can carry structured meaning, even if they don’t function like text with word order. For PKM, the practical move is deliberate design: align visuals with captions and titles, use images as compression units for synthesis, and verbalize what the visual means to strengthen recall. The result is a workflow where words and images complement each other rather than compete.

Which “visual superiority” statistics are credible, and which look like folk wisdom?

The timing claim that “it takes 13 milliseconds for the brain to process an image” is supported by a specific 2013 research paper with an experiment that measured the figure. By contrast, claims like “90% of information processed is visual,” “images are processed 60,000 times faster than text,” and “people remember 80% of what they see” don’t show clear, traceable experimental evidence. The “10/20/80” style memory percentages are traced to older training-industry charts and earlier sources (including the Cone of Experience and related pyramid-like diagrams), but the trail doesn’t reliably connect to rigorous studies with matching numbers.

Why does the discussion insist that visuals still matter even if the big percentages are wrong?

Even without proof that visuals dominate cognition by default, visuals can still improve understanding and recall when they’re designed to work with memory. The argument emphasizes that visuals are useful because they compress complex ideas into a form that can be grasped quickly, and because pairing images with verbal labels (captions, titles, summary sentences) turns that compressed representation into something retrievable. The practical stance is: visuals aren’t magic, but they’re a high-leverage tool when used deliberately.

What does “presentational vs discursive” mean for how pictures and text are processed?

Pictures are described as presentational: they’re taken in holistically, like viewing a shop window at once, without needing sequential “reading” of components. Text is described as discursive: it’s narrative and sequential, requiring a step-by-step structure to follow meaning. This difference helps explain why images and text can complement each other—images provide an overall structure at a glance, while text supplies sequential meaning and context.

How does semiotics explain why a picture alone may not communicate meaning?

Semiotics treats meaning as emerging from signs in combination. A single sign “means nothing” on its own; meaning depends on how signs relate and how viewers interpret them. The discussion uses a newspaper-reading example: people typically look at the front-page photograph first, then read the caption, then the headline, and only then the story—showing that coordinated sign systems (photo + caption + headline + text) create understanding more effectively than any one element by itself.

What are the six practical PKM takeaways for using visuals effectively?

The takeaways are: (1) use visuals and text in a mutually reinforcing way, with deliberate design and alignment; (2) don’t treat visuals as decoration—create “concept visuals” that communicate the idea; (3) use images as compression units to synthesize and rearrange ideas at a meta level; (4) add verbal summaries (captions/titles/summary sentences) to strengthen recall; (5) convert abstract ideas into concrete representations through illustration; and (6) sharpen tools by investing time in illustration, framed as higher value than spending only on writing courses.

What role does “verbalization” play in making visuals stick?

Verbalization is presented as essential for recall. The workflow described involves attaching summary sentences to visuals—such as writing a short explanation on the back of an image or using slide notes in PowerPoint—so the visual and the verbal meaning become one integrated unit of information. The underlying claim is that images become more usable when they’re translated into words in the mind.

Review Questions

Which of the “visual superiority” claims has a clear research trail, and what is the specific figure?
How do presentational and discursive systems differ, and why does that matter for designing notes?
What does “compression” mean in the context of using atomic illustrations for PKM synthesis?

Key Points

1
Only the ~13-millisecond image-processing claim is tied to a specific, repeatable research paper; many other popular percentages lack rigorous support.
2
Treat visual claims as design guidance, not as proof that visuals automatically outperform text.
3
Use deliberate alignment between visuals and their verbal elements (titles, captions, and summary sentences) to improve coherence and recall.
4
Build “atomic” illustrations of ideas so they can be rearranged into new narratives for higher-level synthesis.
5
Pair images with verbalization to turn a visual into an accessible memory cue rather than a standalone decoration.
6
Illustration is framed as a practical method for converting abstract concepts into concrete representations that are easier to remember.
7
Investing in illustration skills can be a high-leverage alternative to relying solely on writing-focused learning.

Highlights

The only cleanly supported statistic in the “visual superiority” set is that image processing can be measured at about 13 milliseconds; the rest of the big percentages don’t trace to solid experiments.

A picture’s meaning depends on context and other signs—like how a newspaper photo, caption, headline, and story form a coordinated reading sequence.

Images can act as “compression” units: once ideas are condensed into visuals, they can be shuffled to create new, higher-level summaries.

Verbalizing visuals—through captions or summary sentences—turns diagrams into stronger recall anchors than images alone.

Topics

Visual Myths
PKM
Visual Communication
Semiotics
Concept Diagrams

Mentioned

Paul Martin Lester
Edgar Dale
Susan Z. (Zag)
Elon Musk
David Hurley
Hilia (Hilia the boore)
Matic Pross
Trifler
Susan Z. (Zag)
PKM