How CODES become THEMES - Questions that YOU should ask yourself when developing themes
Based on Qualitative Researcher Dr Kriukow's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Themes are higher-level structures built to answer research questions, not outputs determined by statistical rules.
Briefing
Turning codes into themes is less a mechanical step and more a judgment call: the themes that emerge must fit the study’s research questions and be communicable to readers. In qualitative analysis, there’s no statistical test that declares one set of themes “correct.” Instead, the responsibility—and the anxiety—comes from deciding what counts as a theme based on the researcher’s aims, perspective, and interpretation.
A key starting point is understanding what themes are for. Codes function as a way of organizing what the data says, but themes are the higher-level structure that turns those coded extracts into an answer to the research questions. That means the same dataset can produce different thematic frameworks depending on what the study is trying to find out. The transcript illustrates this with migrants: one researcher uses the data to study migrants’ self-esteem—how it’s defined, constructed, and shaped by factors and barriers—so work-related negative experiences (like failed communication) can roll up into a broader theme such as “factors affecting self-esteem,” with those codes acting as sub-themes. Another researcher uses the same extracts to understand why migrants change jobs frequently, so the same coded material gets reorganized into a different theme structure—such as “reasons for leaving work”—where “failed communication” and related issues (like difficulty forming social ties at work) become distinct reasons. The takeaway is direct: data does not dictate themes; research questions and judgment do.
Once coding is done and the researcher is staring at a pile of codes, the transcript emphasizes a set of practical questions for getting unstuck. The first is diagnostic: what is the data really telling about the research questions? This requires stepping back from overwhelm and asking what story the coded material supports—what has been learned, and which parts are relevant to the study’s aims. Even if participants go off topic, that can be a sign of rapport and comfort; the task then becomes selecting what is relevant and defensible.
The second question shifts from discovery to communication: how will the findings be explained to others? The transcript recommends a mental rehearsal—imagining speaking to a family member, a classroom, or at a conference—and compressing the answer into a one- or two-minute summary. That exercise naturally pushes the mind toward themes because summarizing tends to group information into general patterns and topics.
After that, the researcher returns to the data and tests whether the codes can be organized into the categories that emerged during the communication exercise. The guidance is not to force categories without evidence; themes should be supported by the extracts. Finally, there’s a clarity check: if someone else read the thematic framework without the researcher’s verbal help, would it still make sense and tell a coherent story? If the framework can stand on its own, it meets the transcript’s primary criterion for a good thematic framework.
Cornell Notes
Themes are the higher-level categories that convert coded extracts into an answer to the study’s research questions. Because qualitative research lacks statistical “theme tests,” the themes that emerge depend on the researcher’s judgment and study aims, even when the same dataset is used. A practical way to get unstuck is to ask (1) what the data is really telling about the research questions and (2) how the findings would be communicated to others in a short, clear summary. Once those answers are clear, the researcher checks whether codes can be organized into that structure without forcing unsupported categories. The final test is whether the thematic framework would be understandable to a neutral reader without extra explanation.
Why can two researchers produce different themes from the same dataset?
What is the first question to ask when codes feel overwhelming?
How does imagining a short explanation to others help theme development?
What should guide whether codes get organized into a thematic framework?
What is the transcript’s criterion for judging whether a thematic framework is “good”?
Review Questions
- When you feel stuck, how would you answer the question “What is the data really telling me about my research questions?” using examples from your coded extracts?
- What would you say in a one- or two-minute summary of your findings, and which parts of that summary naturally become candidate themes?
- After organizing codes into themes, what specific check would you perform to ensure a neutral reader could understand the thematic framework without additional explanation?
Key Points
- 1
Themes are higher-level structures built to answer research questions, not outputs determined by statistical rules.
- 2
The same dataset can yield different thematic frameworks because research aims and judgment shape theme development.
- 3
When overwhelmed by codes, start by asking what the coded material actually reveals about the research questions.
- 4
Use a communication rehearsal (e.g., a one- or two-minute explanation to an imagined audience) to surface natural theme groupings.
- 5
Organize codes into themes only when the extracts provide evidence; avoid forcing unsupported categories.
- 6
Test the thematic framework’s quality by checking whether it would still tell a coherent story to a neutral reader without extra guidance.