Qualitative data analysis - Developing THEMES from CODES | "From Codes to Themes" episode 3
Based on Qualitative Researcher Dr Kriukow's video on YouTube.
Build themes after focus coding, but treat theme development as a manual, interpretive step that must fit the story the data supports.
Briefing
Theme development starts with a practical decision: keep coding detailed enough to notice when “good” and “bad” experiences actually reflect different underlying drivers. Using a hypothetical dataset about job retention among chefs and actors, the process begins after stage-two coding (focus codes), which groups the initial detailed codes into broader buckets. From there, the work shifts into building themes, an explicitly manual, interpretive step in which the analyst must be confident that the emerging structure tells a coherent story about what shapes job satisfaction, which is in turn treated as closely connected to retention.
Rather than naming themes as “job retention factors,” the analysis pivots to job satisfaction because the dataset rarely frames answers in terms of leaving or staying. Instead, the material centers on whether people enjoy their work and why. That logic produces two main themes: factors positively influencing job satisfaction and factors negatively affecting job satisfaction. “Good things” and “bad things” remain temporary labels from the coding stage; they get replaced with wording that better matches the data and the study’s likely implications.
A key turning point comes from repeated reflection during coding: some experiences look like mixtures of workplace conditions and individual dispositions. For example, “work environment providing opportunity to take risks” differs from “willingness to take risks,” even though both relate to learning and growth. Because the coding was granular, the analyst can separate these overlapping ideas into distinct subthemes rather than collapsing them into vague categories like “training” or “self-development,” which would risk losing important nuance.
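The distinction above can be sketched as a small data structure. This is only an illustration, not the analyst's actual tooling: the `Code` class, its `locus` field, and `split_by_locus` are hypothetical names, while the two code labels come from the example in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Code:
    """One detailed code. `locus` is an assumed field marking where the driver sits."""
    label: str
    locus: str  # "external" (workplace condition) or "internal" (personal disposition)


# Labels taken from the example above; a coarse label such as
# "self-development" would have merged these two into one bucket.
codes = [
    Code("work environment providing opportunity to take risks", "external"),
    Code("willingness to take risks", "internal"),
]


def split_by_locus(codes):
    """Group code labels by whether they describe the workplace or the person."""
    grouped = {"external": [], "internal": []}
    for code in codes:
        grouped[code.locus].append(code.label)
    return grouped


grouped = split_by_locus(codes)
```

Keeping the locus on each code, rather than on a broad category, is what lets the two risk-related codes end up in different subthemes later.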
With the two main themes set, the analyst sorts subthemes into two types of factors: external (workplace and environment conditions) and personal, or internal (individual attitudes, traits, and readiness). Under the positive theme, internal factors include being passionate and dedicated, being willing to learn and develop, turning challenges into advantages, taking criticism on board, appreciating small things, and maintaining resilience and optimism. External factors include autonomy and control at work, access to learning and training opportunities, meaningful or emotionally engaging work, and the presence of like-minded people. The process also involves “cleaning” the wording so that subthemes are consistent in grammar and form, and then aggregating coding counts (in NVivo terms) to check how many subthemes exist within each category and theme.
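As a rough sketch of that final check, the positive theme can be written out as a nested structure and its subthemes counted per category. The subtheme names are those listed above; the dictionary layout and the `count_subthemes` helper are assumptions for illustration and do not reproduce NVivo's own aggregation feature.

```python
# Positive theme: category -> subthemes (names from the text; layout is assumed).
positive_theme = {
    "personal (internal) factors": [
        "being passionate and dedicated",
        "being willing to learn and develop",
        "turning challenges into advantages",
        "taking criticism on board",
        "appreciating small things",
        "maintaining resilience and optimism",
    ],
    "external factors": [
        "autonomy and control at work",
        "access to learning and training opportunities",
        "meaningful or emotionally engaging work",
        "presence of like-minded people",
    ],
}


def count_subthemes(theme):
    """Return (subtheme count per category, total subtheme count) for one theme."""
    per_category = {category: len(subthemes) for category, subthemes in theme.items()}
    return per_category, sum(per_category.values())


per_category, total = count_subthemes(positive_theme)
print(per_category, total)
```

The same helper could be applied to the negative theme once its subthemes are settled, which makes it easy to see at a glance how lopsided the two themes are.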
The negative theme follows a similar pattern, though it is treated as a smaller, secondary byproduct. Negative codes are consolidated and reworded into clearer subthemes such as boring work leading to burnout, meaningless or unengaging work, lack of confidence, and work that feels stale or unchanging. Some external conditions are also represented, including insufficient autonomy. Other items are reframed to generalize beyond a single quote: “not liking to be the center of attention,” for example, is recast as “work not matching personality.” Finally, overlapping ideas like “doing the job for publicity” are merged into the broader notion of work lacking authenticity, which maps back to meaning and engagement.
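The rewording and merging described above amounts to a mapping from raw code labels to subtheme labels. A minimal sketch, assuming a plain dictionary as the recode map: the two mappings come from the examples in the text, while `RECODE` and `consolidate` are hypothetical names.

```python
from collections import defaultdict

# Raw code label -> reworded subtheme (both pairs taken from the text above).
RECODE = {
    "not liking to be the center of attention": "work not matching personality",
    "doing the job for publicity": "work lacking authenticity",
}


def consolidate(raw_codes, recode=RECODE):
    """Group raw code labels under their subtheme; unmapped codes keep their own label."""
    grouped = defaultdict(list)
    for code in raw_codes:
        grouped[recode.get(code, code)].append(code)
    return dict(grouped)


raw_codes = [
    "not liking to be the center of attention",
    "doing the job for publicity",
    "lack of confidence",
]
subthemes = consolidate(raw_codes)
```

Because several raw labels may point at the same subtheme, the map also records which original codes were folded together, which helps when a generalization later needs to be justified against the quotes.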
Overall, theme development here is less about forcing a framework and more about iteratively translating detailed codes into a defensible thematic structure that fits what participants actually said, while keeping an eye on how those themes could inform practical guidance for improving retention through job satisfaction.
Cornell Notes
The analysis builds themes from coded qualitative data by translating detailed “good” and “bad” experiences into two main thematic categories tied to job satisfaction. Because the dataset rarely mentions leaving or staying directly, the themes are framed as factors positively influencing job satisfaction and factors negatively affecting job satisfaction, not “job retention factors.” A key methodological step is separating workplace-driven influences (external factors) from individual dispositions (personal, or internal, factors), which is possible only because detailed coding preserved the nuance. After organizing subthemes, the analyst cleans the wording for consistency and uses aggregation checks to confirm the structure. The result is a thematic framework that can support practical implications for improving retention via satisfaction.
Why does the analysis avoid naming themes as “factors influencing job retention,” even though the study’s topic is retention?
How does the analyst decide to split overlapping ideas into external versus personal (internal) factors?
What kinds of subthemes appear under the positive job satisfaction theme?
How are negative experiences turned into a coherent second theme?
What does “cleaning” subthemes mean in this workflow?
Review Questions
- How would you justify reframing a retention-focused research question into job satisfaction themes based on what the dataset actually contains?
- What evidence from coding would convince you that an item should be treated as external versus personal (internal)?
- When a negative code seems too specific to one participant’s quote, what criteria should guide whether to generalize it into a broader subtheme?
Key Points
1. Build themes after focus coding, but treat theme development as a manual, interpretive step that must fit the story the data supports.
2. If the dataset rarely mentions leaving or staying, frame themes around job satisfaction rather than forcing “job retention factors.”
3. Keep coding detailed enough to detect when workplace conditions and personal dispositions are distinct, even if they overlap conceptually.
4. Separate subthemes into external factors (workplace and environment conditions) and personal, or internal, factors (attitudes, traits, readiness) to preserve nuance.
5. “Clean” subtheme wording for consistent grammar and parallel structure so the thematic framework reads coherently.
6. Use aggregation checks to confirm how many subthemes exist within each theme and category, supporting a structured final output.
7. Generalize overly specific negative statements carefully (e.g., recast a single dislike into a broader “work not matching personality” idea) when it improves transferability.