Open coding and Axial coding in Qualitative data analysis

TL;DR

Open coding is an initial, exploratory stage that tags text with descriptive, data-driven codes without forcing prior frameworks onto the data.

Briefing Cornell Notes

Briefing

Open coding and axial (focused) coding are practical, two-stage ways to analyze qualitative data: first tag the text with descriptive, data-driven codes, then group those codes into connected categories that can later become themes. Although the terms come from grounded theory, the workflow fits many other qualitative approaches—case studies, thematic analysis, and other coding-based designs—because it’s fundamentally about moving from raw text to organized meaning.

Open coding is the initial, exploratory phase. As researchers read transcripts or documents, they avoid imposing prior frameworks or preconceived ideas. Instead, they create codes that act like “sticky notes” attached to specific segments of text, describing what the segment is about and what it suggests. This stage is often detailed—sometimes even line-by-line or sentence-by-sentence—because the goal is to code a large portion of the material while the analyst is still unsure where patterns will emerge. The result is typically a high volume of codes, often numbering in the hundreds.

In an example scenario about online learning, open coding might generate codes such as “technical issues,” “distractions at home,” and “lack of engagement,” alongside positive codes like “convenience of learning” (no commute) and “more engaging learning context.” The specific labels depend on what appears in the data, but the principle stays the same: codes should be descriptive and grounded in the text.

Axial coding—also called focused coding in some constructivist grounded theory approaches—comes next. This stage reorganizes the initial codes by grouping them into categories and looking for connections among them. For instance, the “technical issues,” “distractions at home,” and “lack of engagement” codes could be grouped under a category such as “challenges of online learning,” while “convenience of learning” and “more engaging learning context” could be grouped under “advantages of online learning.” The “connections” part is not treated as a mysterious analytic leap; it’s essentially the act of sorting and categorizing codes based on how they relate.

As the categories take shape, the analyst becomes more familiar with the dataset, which supports the next step: developing themes. The workflow is therefore staged—open coding produces many descriptive codes, axial/focused coding organizes them into connected groups, and that organization helps themes emerge.

A final practical question is whether axial/focused coding involves additional coding or only organizing what already exists. Different approaches exist. One option is to complete open coding for all transcripts first, then only group and reorganize in the second stage. Another option is to code part of the data, pause to organize those codes, and then continue coding the remaining material using the more focused, grouped structure. Either way, the core idea remains: axial/focused coding makes the analysis more targeted by turning a long list of initial codes into an organized structure that can lead to themes.

Cornell Notes

Open coding (also called initial coding) is the first, exploratory stage of qualitative analysis. Analysts read the data without forcing prior frameworks, tagging segments of text with descriptive codes—often in fine detail—so most of the material gets coded and the number of codes can reach the hundreds. Axial coding (also called focused coding) follows by grouping and organizing those initial codes into categories based on connections, such as “challenges of online learning” versus “advantages of online learning.” This categorization increases familiarity with the dataset and sets up the development of themes. Whether axial coding includes new coding or mainly reorganizes existing codes depends on the chosen workflow.

What makes open coding “data-driven” and how should analysts treat prior frameworks?

Open coding is exploratory and data-driven, meaning codes are created directly from what appears in the text rather than from preconceived theories or earlier frameworks. The analyst reads the material and assigns descriptive tags to segments of text—like “sticky notes”—to capture what each segment is about and what it suggests. The key rule is to avoid imposing prior ideas on the data while generating these initial codes.

Why is open coding often detailed (even line-by-line), and what outcome should researchers expect?

Because the analyst is still unsure where patterns will emerge, open coding is often detailed—sometimes line-by-line or sentence-by-sentence—to ensure that most of the dataset is coded. This approach typically produces a large code set, commonly in the hundreds, since every meaningful segment can generate a new descriptive code.

How do axial coding (focused coding) categories form from open codes?

Axial/focused coding groups and organizes the initial codes by looking for connections among them. For example, codes like “technical issues,” “distractions at home,” and “lack of engagement” can be organized into a category such as “challenges of online learning.” Positive codes like “convenience of learning” and “more engaging learning context” can be grouped into a category such as “advantages of online learning.”

What is the practical purpose of axial/focused coding beyond organizing codes?

The purpose is to move from a long list of initial codes toward a structured understanding of the dataset. Through grouping and recognizing relationships among codes, the analyst becomes more familiar with the data, which supports developing themes later. In other words, axial/focused coding is a bridge from descriptive coding to thematic interpretation.

Does axial/focused coding require re-coding the data, or can it be only reorganization?

Different workflows exist. One approach is to complete open coding for all transcripts first, then only organize and group those codes in the second stage. Another approach is to code part of the data, pause to organize the emerging codes, and then continue coding the remaining data using the more focused, grouped structure. Either way, the second stage becomes more targeted than the first.

Review Questions

When generating open codes, what behaviors help prevent researchers from imposing preconceived frameworks on the data?
In the online learning example, how would you justify grouping specific open codes into a single axial category?
What decision determines whether axial/focused coding is treated as “only organizing” or as “coding again with a focused structure”?

Key Points

1
Open coding is an initial, exploratory stage that tags text with descriptive, data-driven codes without forcing prior frameworks onto the data.
2
Open coding is often detailed (sometimes line-by-line or sentence-by-sentence) to capture most meaningful segments and can produce hundreds of codes.
3
Axial coding (focused coding) reorganizes open codes by grouping them into categories based on connections among codes.
4
Categories formed during axial/focused coding help analysts become more familiar with the dataset and support the later development of themes.
5
Axial/focused coding can be handled either as pure reorganization after full open coding or as an iterative process where codes are grouped midstream and then used to guide further coding.

Highlights

Open coding treats codes like “sticky notes”: descriptive tags attached to text segments created directly from what the data says.

Axial (focused) coding is essentially sorting and grouping—turning an overwhelming list of initial codes into connected categories.

In an online learning example, “technical issues,” “distractions at home,” and “lack of engagement” can be grouped under “challenges,” while “convenience” and “more engaging context” can form “advantages.”

Whether axial/focused coding adds new coding or mainly reorganizes existing codes depends on the chosen workflow, but both aim for a more focused analysis.

Although grounded theory coined the terms, the same coding stages can be applied to many qualitative designs, including thematic analysis and case studies.

Topics

Open Coding
Axial Coding
Focused Coding
Thematic Development
Coding Workflow

Mentioned

Brandon Clark