How CODES become THEMES - Questions that YOU should ask yourself when developing themes

TL;DR

Themes are higher-level structures built to answer research questions, not outputs determined by statistical rules.

Briefing Cornell Notes

Briefing

Turning codes into themes is less a mechanical step and more a judgment call: the themes that emerge must fit the study’s research questions and be communicable to readers. In qualitative analysis, there’s no statistical test that declares one set of themes “correct.” Instead, the responsibility—and the anxiety—comes from deciding what counts as a theme based on the researcher’s aims, perspective, and interpretation.

A key starting point is understanding what themes are for. Codes function as a way of organizing what the data says, but themes are the higher-level structure that turns those coded extracts into an answer to the research questions. That means the same dataset can produce different thematic frameworks depending on what the study is trying to find out. The transcript illustrates this with migrants: one researcher uses the data to study migrants’ self-esteem—how it’s defined, constructed, and shaped by factors and barriers—so work-related negative experiences (like failed communication) can roll up into a broader theme such as “factors affecting self-esteem,” with those codes acting as sub-themes. Another researcher uses the same extracts to understand why migrants change jobs frequently, so the same coded material gets reorganized into a different theme structure—such as “reasons for leaving work”—where “failed communication” and related issues (like difficulty forming social ties at work) become distinct reasons. The takeaway is direct: data does not dictate themes; research questions and judgment do.

Once coding is done and the researcher is staring at a pile of codes, the transcript emphasizes a set of practical questions for getting unstuck. The first is diagnostic: what is the data really telling about the research questions? This requires stepping back from overwhelm and asking what story the coded material supports—what has been learned, and which parts are relevant to the study’s aims. Even if participants go off topic, that can be a sign of rapport and comfort; the task then becomes selecting what is relevant and defensible.

The second question shifts from discovery to communication: how will the findings be explained to others? The transcript recommends a mental rehearsal—imagining speaking to a family member, a classroom, or at a conference—and compressing the answer into a one- or two-minute summary. That exercise naturally pushes the mind toward themes because summarizing tends to group information into general patterns and topics.

After that, the researcher returns to the data and tests whether the codes can be organized into the categories that emerged during the communication exercise. The guidance is not to force categories without evidence; themes should be supported by the extracts. Finally, there’s a clarity check: if someone else read the thematic framework without the researcher’s verbal help, would it still make sense and tell a coherent story? If the framework can stand on its own, it meets the transcript’s primary criterion for a good thematic framework.

Cornell Notes

Themes are the higher-level categories that convert coded extracts into an answer to the study’s research questions. Because qualitative research lacks statistical “theme tests,” the themes that emerge depend on the researcher’s judgment and study aims, even when the same dataset is used. A practical way to get unstuck is to ask (1) what the data is really telling about the research questions and (2) how the findings would be communicated to others in a short, clear summary. Once those answers are clear, the researcher checks whether codes can be organized into that structure without forcing unsupported categories. The final test is whether the thematic framework would be understandable to a neutral reader without extra explanation.

Why can two researchers produce different themes from the same dataset?

Themes are built to answer different research questions. The transcript’s migrants example shows this: one study targets migrants’ self-esteem (definition, construction, barriers), so work-related negative experiences can become sub-themes under a broader theme like “factors affecting self-esteem.” Another study targets why migrants change jobs frequently, so the same extracts get reorganized into a different theme structure—such as “reasons for leaving work”—where issues like failed communication and difficulty forming social ties at work become specific reasons. The dataset provides material; the research aims and judgment determine how it is organized.

What is the first question to ask when codes feel overwhelming?

The first question is what the data is really telling about the research questions. The transcript stresses stepping back from the volume of coded material and asking what story the codes support: what was learned, which parts are relevant, and how the coded extracts contribute to answering the study’s aims. It also notes that participants going off topic can indicate comfort and rapport, but relevance still needs to be judged for the thematic framework.

How does imagining a short explanation to others help theme development?

The transcript recommends a mental exercise: picture explaining the findings to someone (a family member, a classroom, or a conference audience) in one or two minutes. As the researcher tries to summarize, the mind naturally groups information into general patterns and topics—essentially the shape of themes. This helps translate coded evidence into a communicable structure rather than staying stuck in the mechanics of coding.

What should guide whether codes get organized into a thematic framework?

Organization should follow evidence. After the researcher identifies what they learned and how they would communicate it, they return to the data to see whether codes can fit into that structure. The transcript warns against forcing categories when the data doesn’t support them. If there is supporting evidence in the extracts, the researcher is free to organize as needed to answer the research questions clearly.

What is the transcript’s criterion for judging whether a thematic framework is “good”?

The key criterion is clarity for a neutral reader. After organizing codes into themes, the researcher asks: if they weren’t there to explain it, would the thematic framework still make sense and tell a coherent story of the data? If it is clear and understandable on its own, it meets the standard described.

Review Questions

When you feel stuck, how would you answer the question “What is the data really telling me about my research questions?” using examples from your coded extracts?
What would you say in a one- or two-minute summary of your findings, and which parts of that summary naturally become candidate themes?
After organizing codes into themes, what specific check would you perform to ensure a neutral reader could understand the thematic framework without additional explanation?

Key Points

1
Themes are higher-level structures built to answer research questions, not outputs determined by statistical rules.
2
The same dataset can yield different thematic frameworks because research aims and judgment shape theme development.
3
When overwhelmed by codes, start by asking what the coded material actually reveals about the research questions.
4
Use a communication rehearsal (e.g., a one- or two-minute explanation to an imagined audience) to surface natural theme groupings.
5
Organize codes into themes only when the extracts provide evidence; avoid forcing unsupported categories.
6
Test the thematic framework’s quality by checking whether it would still tell a coherent story to a neutral reader without extra guidance.

Highlights

Qualitative themes depend on researcher judgment and study aims—there’s no equation that declares which themes are “right.”

The same migrants dataset can produce different themes depending on whether the study targets self-esteem or job-changing behavior.

A one- or two-minute imagined explanation to an audience helps convert coded evidence into theme-shaped summaries.

Thematic frameworks should be understandable on their own; clarity for a neutral reader is the final test.

Topics

Thematic Development
Codes to Themes
Research Questions
Qualitative Judgment
Communicating Findings

Mentioned

Dr Kriukow