Get AI summaries of any video or article — Sign up free
How to Really do Braun and Clarke's 6 step Thematic Analysis (explained in 3 steps) thumbnail

How to Really do Braun and Clarke's 6 step Thematic Analysis (explained in 3 steps)

5 min read

Based on Qualitative Researcher Dr Kriukow's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Thematic analysis produces themes that function like chapter headings, summarizing the dataset’s story and answering research questions.

Briefing

The core takeaway is that Braun and Clarke’s thematic analysis becomes manageable once it’s treated as a structured pipeline for turning messy qualitative data into credible “themes”—the chapter-like headings that answer research questions. The process starts with coding to break transcripts into smaller, meaningful units, then moves to organizing those codes into focused groups, and finally builds themes by using those organized codes to craft clear, defensible findings.

The method begins with defining what “themes” actually are. Themes are not vague impressions; they function like narrative chapters that summarize the story emerging from the data. Within each theme sits the supporting description—often grounded in specific excerpts—showing how that theme helps answer the study’s research questions. Because humans can’t reliably hold all transcript detail in mind, thematic analysis requires rigor: it must demonstrate that themes come from the data rather than from expectations or assumptions.

Coding is the entry point. Codes are short tags—small concepts or summaries—attached to parts of the text as the researcher reads. For example, a participant describing difficulty at work might receive a code like “struggling at work,” while a description of emotional impact could be tagged as “feeling depressed.” Coding reduces the amount of raw text to manageable units while also building understanding of what the dataset contains. The transcript is processed systematically, often one transcript at a time, until the researcher has a substantial set of codes.

As codes accumulate, the next step is focus coding (a term borrowed from grounded theory practice, though it can be used outside that tradition). The goal is to clean up the initial chaos: instead of a long, random list of tags, codes are reorganized into groups that make sense—such as grouping emotion-related codes under an “emotions” heading or clustering coping-strategy codes together. Naming and grouping aren’t about chasing a single “correct” label; they’re about creating a structured path toward themes that can be justified as a transparent analytic procedure.

The final step is developing themes. This stage is described as a continuation of earlier work: the researcher keeps building understanding and effectively constructs a table of contents for the dataset. Themes emerge by asking what research-question prompts the codes and groups are already answering. For instance, if many codes relate to mental health impacts in a study about nurses during the COVID-19 pandemic, “affected mental health” can become a theme; if additional codes cluster around self-care and coping behaviors, “coping strategies” may become another theme. The emphasis is on using the organized codes as a tool for exploring the data further and for selecting what to communicate.

Finally, the approach is reconciled with Braun and Clarke’s six-step framework. Two of Braun and Clarke’s steps are treated as not central to theme construction (familiarization at the start, and writing the report at the end). That leaves four theme-focused steps, which can be translated into the three-step lens: the “searching for themes” work aligns with focus coding/organizing codes, while naming, defining, and refining themes happen as an iterative process rather than a strict linear sequence. The practical benefit is reduced anxiety: instead of treating stages as separate gates, the work is understood as overlapping steps that can be mapped back to Braun and Clarke when academic referencing is required.

Cornell Notes

Thematic analysis turns qualitative transcripts into “themes,” which function like chapter headings that tell the story of the data and answer research questions. The process starts with coding: attaching short tags (e.g., “struggling at work,” “feeling depressed”) to meaningful parts of the text to reduce complexity and build understanding. Next comes focus coding, where the many codes are organized into sensible groups (such as “emotions” or “coping strategies”) to make the dataset coherent and defensible. Finally, themes are developed by using those grouped codes to decide what headings best capture the dataset’s main story. This three-step framing matches Braun and Clarke’s six-step approach, but with less overwhelm because theme work is treated as iterative rather than strictly sequential.

What counts as a “theme” in thematic analysis, and why can’t researchers just pick the most obvious ideas?

A theme is a broad, chapter-like topic that captures the story the dataset tells in relation to the research questions. It’s not a hunch or a favorite takeaway; it must be grounded in the data. Because researchers can’t reliably track all details across multiple transcripts, the method requires structured steps (coding and organizing) so themes reflect patterns in the text rather than expectations.

How does coding make qualitative data more workable?

Coding begins by reading through transcripts and assigning short tags—small concepts or summaries—to relevant text segments. Examples include tagging a participant’s difficulty at work as “struggling at work” and tagging emotional impact as “feeling depressed.” These codes reduce the volume of raw text into manageable units while preserving meaning, making it easier to later trace how themes are built.

What is focus coding, and what problem does it solve?

Focus coding is the step where the researcher reorganizes a long list of initial codes into structured groups. After initial coding, tags can feel random and overwhelming because they cover everything participants mention (emotions, challenges, coping, family relationships, and more). Focus coding “cleans the room” by grouping related codes—such as placing emotion codes under an “emotions” heading—so patterns become visible and themes can be developed credibly.

How do themes actually get created from grouped codes?

Theme development uses the organized code groups as a decision tool. Researchers ask what research-question prompts the codes already answer. For example, in a study about nurses during COVID-19, many codes about mental health impacts could justify a theme like “affected mental health,” while clusters about self-care and coping could support a theme like “coping strategies.” Themes then become the final headings, supported by the underlying codes and relevant excerpts.

How does a three-stage approach map onto Braun and Clarke’s six steps?

The mapping works by treating familiarization and report writing as outside the core theme-construction workflow. That leaves theme-focused work that can be translated into three stages: (1) coding, (2) organizing codes (aligned with the work of searching for themes), and (3) developing themes through naming/defining/refining. The key practical point is that Braun and Clarke’s steps can feel like strict progression, but in practice naming and defining overlap with searching, so the three-stage framing reduces anxiety while still matching the underlying logic.

Review Questions

  1. When coding, what criteria should guide which parts of the transcript receive a tag, and how do those tags help later theme development?
  2. How would you distinguish between a code group (focus coding) and a final theme (theme development) in your own analysis?
  3. Why does the process treat naming/defining themes as iterative rather than strictly sequential, and how does that affect how you judge “completion”?

Key Points

  1. 1

    Thematic analysis produces themes that function like chapter headings, summarizing the dataset’s story and answering research questions.

  2. 2

    Coding is the starting mechanism: attach short, meaningful tags to transcript segments to reduce complexity while preserving meaning.

  3. 3

    Focus coding reorganizes many initial codes into coherent groups (e.g., “emotions,” “coping strategies”) to prevent an overwhelming, random code list.

  4. 4

    Theme development uses the organized code groups to decide what final headings best capture patterns relevant to the research questions.

  5. 5

    Braun and Clarke’s six-step framework can be translated into a three-stage workflow by treating familiarization and report writing as separate from theme construction.

  6. 6

    Naming and defining themes often happen iteratively alongside searching for patterns, so “progression” between steps should be understood as overlapping rather than strictly linear.

Highlights

Themes are treated as chapter-like units that must be grounded in the data, not selected based on researcher assumptions.
Coding turns long transcripts into manageable tags such as “struggling at work” or “feeling depressed,” enabling systematic pattern building.
Focus coding “cleans the room” by grouping a long list of codes into structured categories so themes can emerge.
The three-stage framing aligns with Braun and Clarke’s six steps while reducing student anxiety by emphasizing overlap between theme-searching and theme-defining.
Theme development is driven by research-question prompts: the codes already point to what headings should communicate.

Topics

Mentioned