Get AI summaries of any video or article — Sign up free
Qualitative data analysis - Developing THEMES from CODES | "From Codes to Themes" episode 3 thumbnail

Qualitative data analysis - Developing THEMES from CODES | "From Codes to Themes" episode 3

5 min read

Based on Qualitative Researcher Dr Kriukow's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Build themes after focus coding, but treat theme development as a manual, interpretive step that must fit the story the data supports.

Briefing

Theme development starts with a practical decision: keep coding detailed enough to notice when “good” and “bad” experiences actually reflect different underlying drivers. Using a hypothetical dataset about job retention among chefs and actors, the process begins after stage-two coding (focus codes) that groups initial detailed codes into broader buckets. From there, the work shifts into building themes—an explicitly manual, interpretive step where the analyst must be confident the emerging structure tells a coherent story about what shapes job satisfaction, which in turn is treated as closely connected to retention.

Rather than naming themes as “job retention factors,” the analysis pivots to job satisfaction because the dataset rarely frames answers in terms of leaving or staying. Instead, the material centers on whether people enjoy their work and why. That logic produces two main themes: factors positively influencing job satisfaction and factors negatively affecting job satisfaction. “Good things” and “bad things” remain temporary labels from the coding stage; they get replaced with wording that better matches the data and the study’s likely implications.

A key turning point comes from repeated reflection during coding: some experiences look like mixtures of workplace conditions and individual dispositions. For example, “work environment providing opportunity to take risks” differs from “willingness to take risks,” even though both relate to learning and growth. Because the coding was granular, the analyst can separate these overlapping ideas into distinct subthemes rather than collapsing them into vague categories like “training” or “self-development,” which would risk losing important nuance.

With the two main themes set, the analyst organizes subthemes into two types of factors: external (workplace/environment conditions) and personal internal (individual attitudes, traits, and readiness). Under the positive theme, internal factors include being passionate and dedicated, being willing to learn and develop, turning challenges into advantages, taking criticism on board, appreciating small things, and maintaining resilience and optimism. External factors include autonomy and control at work, access to learning and training opportunities, meaningful or emotionally engaging work, and the presence of like-minded people. The process also involves “cleaning” the wording so subthemes are consistent in grammar and form, and then aggregating coding counts (in NVivo terms) to check how many subthemes exist within each category and theme.

The negative theme follows a similar pattern, though it is treated as a smaller, secondary byproduct. Negative codes are consolidated and reworded into clearer subthemes such as boring work leading to burnout, meaningless or not engaging work, lack of confidence, and work that feels stale or unchanging. Some external conditions are also represented, including insufficient autonomy. Other items are reframed to generalize beyond a single quote—such as “not liking to be the center of attention,” which is recast as “work not matching personality.” Finally, overlapping ideas like “doing the job for publicity” are merged into broader notions of work lacking authenticity, which maps back to meaning and engagement.

Overall, theme development here is less about forcing a framework and more about iteratively translating detailed codes into a defensible thematic structure that fits what participants actually said—while keeping an eye on how those themes could inform practical guidance for improving retention through job satisfaction.

Cornell Notes

The analysis builds themes from coded qualitative data by translating detailed “good” and “bad” experiences into two main thematic categories tied to job satisfaction. Because the dataset rarely mentions leaving or staying directly, the themes are framed as factors positively influencing job satisfaction and factors negatively affecting job satisfaction, not “job retention factors.” A major method step is separating workplace-driven influences (external factors) from individual dispositions (personal internal factors), enabled by detailed coding that preserves nuance. After organizing subthemes, the analyst cleans wording for consistency and uses aggregation checks to confirm the structure. The result is a thematic framework that can support practical implications for improving retention via satisfaction.

Why does the analysis avoid naming themes as “factors influencing job retention,” even though the study’s topic is retention?

The dataset does not typically include statements about leaving or staying. Instead, it focuses on enjoyment of the job and reasons behind satisfaction or dissatisfaction. Because the evidence rarely maps directly onto “retention,” the themes are reframed as job satisfaction: factors positively influencing job satisfaction and factors negatively affecting job satisfaction. This keeps the themes close enough to retention to support implications, while matching what participants actually said.

How does the analyst decide to split overlapping ideas into external versus personal internal factors?

Repeated reflection during coding reveals that some experiences combine workplace conditions with personal readiness. For instance, “work environment providing opportunity to take risks” is treated as external, while “willingness to take risks” is treated as personal internal. Detailed coding makes these distinctions visible; without that granularity, the analyst would likely collapse them into broad buckets like “training” or “self-development,” losing key nuance.

What kinds of subthemes appear under the positive job satisfaction theme?

Internal/personal internal subthemes include being passionate and dedicated, being willing to learn and develop, turning challenges into advantages, taking criticism on board, appreciating small things, and maintaining resilience and optimism. External subthemes include autonomy and control at work, learning and training opportunities, meaningful or emotionally engaging work, and like-minded people in the workplace.

How are negative experiences turned into a coherent second theme?

Negative codes are consolidated and reworded into subthemes such as boring work leading to burnout, meaningless or not engaging work, lack of confidence, and work that feels stale or unchanging. Some external conditions also appear, including not enough autonomy. Specific statements are generalized when needed—for example, “not liking to be the center of attention” becomes “work not matching personality.”

What does “cleaning” subthemes mean in this workflow?

After moving codes into external versus personal internal categories within each main theme, the analyst revises the subtheme labels for consistent wording and grammar. The goal is to make subthemes sound coherent and parallel (e.g., consistent phrasing like “being able to…” or “work not matching…”), rather than leaving them as rough fragments from earlier coding.

Review Questions

  1. How would you justify reframing a retention-focused research question into job satisfaction themes based on what the dataset actually contains?
  2. What evidence from coding would convince you that an item should be treated as external versus personal internal?
  3. When a negative code seems too specific to one participant’s quote, what criteria should guide whether to generalize it into a broader subtheme?

Key Points

  1. 1

    Build themes after focus coding, but treat theme development as a manual, interpretive step that must fit the story the data supports.

  2. 2

    If the dataset rarely mentions leaving or staying, frame themes around job satisfaction rather than forcing “job retention factors.”

  3. 3

    Keep detailed codes long enough to detect when workplace conditions and personal dispositions are distinct even if they overlap conceptually.

  4. 4

    Separate subthemes into external factors (workplace/environment conditions) and personal internal factors (attitudes, traits, readiness) to preserve nuance.

  5. 5

    “Clean” subtheme wording for consistent grammar and parallel structure so the thematic framework reads coherently.

  6. 6

    Use aggregation checks to confirm how many subthemes exist within each theme and category, supporting a structured final output.

  7. 7

    Generalize overly specific negative statements carefully (e.g., recast a single dislike into a broader “work not matching personality” idea) when it improves transferability.

Highlights

The analysis reframes retention into job satisfaction because the dataset emphasizes enjoyment and struggle, not explicit decisions to leave or stay.
Granular coding enables a crucial split: workplace opportunities (external) versus individual willingness (personal internal) even when both relate to risk-taking and growth.
Two main themes emerge—positively influencing job satisfaction and negatively affecting job satisfaction—each organized into external and personal internal subthemes.
Negative experiences are consolidated into clearer constructs like burnout from boring work, meaning loss, low confidence, and stale conditions, with specific quotes generalized when appropriate.

Topics

Mentioned