Open coding and Axial coding in Qualitative data analysis
Based on Qualitative Researcher Dr Kriukow's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Open coding is an initial, exploratory stage that tags text with descriptive, data-driven codes without forcing prior frameworks onto the data.
Briefing
Open coding and axial (focused) coding are practical, two-stage ways to analyze qualitative data: first tag the text with descriptive, data-driven codes, then group those codes into connected categories that can later become themes. Although the terms come from grounded theory, the workflow fits many other qualitative approaches—case studies, thematic analysis, and other coding-based designs—because it’s fundamentally about moving from raw text to organized meaning.
Open coding is the initial, exploratory phase. As researchers read transcripts or documents, they avoid imposing prior frameworks or preconceived ideas. Instead, they create codes that act like “sticky notes” attached to specific segments of text, describing what the segment is about and what it suggests. This stage is often detailed—sometimes even line-by-line or sentence-by-sentence—because the goal is to code a large portion of the material while the analyst is still unsure where patterns will emerge. The result is typically a high volume of codes, often numbering in the hundreds.
In an example scenario about online learning, open coding might generate codes such as “technical issues,” “distractions at home,” and “lack of engagement,” alongside positive codes like “convenience of learning” (no commute) and “more engaging learning context.” The specific labels depend on what appears in the data, but the principle stays the same: codes should be descriptive and grounded in the text.
Axial coding—also called focused coding in some constructivist grounded theory approaches—comes next. This stage reorganizes the initial codes by grouping them into categories and looking for connections among them. For instance, the “technical issues,” “distractions at home,” and “lack of engagement” codes could be grouped under a category such as “challenges of online learning,” while “convenience of learning” and “more engaging learning context” could be grouped under “advantages of online learning.” The “connections” part is not treated as a mysterious analytic leap; it’s essentially the act of sorting and categorizing codes based on how they relate.
As the categories take shape, the analyst becomes more familiar with the dataset, which supports the next step: developing themes. The workflow is therefore staged—open coding produces many descriptive codes, axial/focused coding organizes them into connected groups, and that organization helps themes emerge.
A final practical question is whether axial/focused coding involves additional coding or only organizing what already exists. Different approaches exist. One option is to complete open coding for all transcripts first, then only group and reorganize in the second stage. Another option is to code part of the data, pause to organize those codes, and then continue coding the remaining material using the more focused, grouped structure. Either way, the core idea remains: axial/focused coding makes the analysis more targeted by turning a long list of initial codes into an organized structure that can lead to themes.
Cornell Notes
Open coding (also called initial coding) is the first, exploratory stage of qualitative analysis. Analysts read the data without forcing prior frameworks, tagging segments of text with descriptive codes—often in fine detail—so most of the material gets coded and the number of codes can reach the hundreds. Axial coding (also called focused coding) follows by grouping and organizing those initial codes into categories based on connections, such as “challenges of online learning” versus “advantages of online learning.” This categorization increases familiarity with the dataset and sets up the development of themes. Whether axial coding includes new coding or mainly reorganizes existing codes depends on the chosen workflow.
What makes open coding “data-driven” and how should analysts treat prior frameworks?
Why is open coding often detailed (even line-by-line), and what outcome should researchers expect?
How do axial coding (focused coding) categories form from open codes?
What is the practical purpose of axial/focused coding beyond organizing codes?
Does axial/focused coding require re-coding the data, or can it be only reorganization?
Review Questions
- When generating open codes, what behaviors help prevent researchers from imposing preconceived frameworks on the data?
- In the online learning example, how would you justify grouping specific open codes into a single axial category?
- What decision determines whether axial/focused coding is treated as “only organizing” or as “coding again with a focused structure”?
Key Points
- 1
Open coding is an initial, exploratory stage that tags text with descriptive, data-driven codes without forcing prior frameworks onto the data.
- 2
Open coding is often detailed (sometimes line-by-line or sentence-by-sentence) to capture most meaningful segments and can produce hundreds of codes.
- 3
Axial coding (focused coding) reorganizes open codes by grouping them into categories based on connections among codes.
- 4
Categories formed during axial/focused coding help analysts become more familiar with the dataset and support the later development of themes.
- 5
Axial/focused coding can be handled either as pure reorganization after full open coding or as an iterative process where codes are grouped midstream and then used to guide further coding.