
Don't use AI for Qualitative data analysis - use these tools instead

5 min read

Based on Qualitative Researcher Dr Kriukow's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Qualitative thematic analysis requires an auditable trail from coded excerpts to final themes, especially in academic settings.

Briefing

Qualitative thematic analysis succeeds or fails on one non-negotiable requirement: the work must produce an auditable trail from coded excerpts to final themes. That’s why many “upload your data, get your themes” AI tools fall short for academic and other high-stakes settings—there’s rarely a transparent path showing how codes were built, merged, renamed, and turned into themes grounded in the text.

The core workflow is straightforward in principle. Researchers start by coding—tagging segments of transcripts with short labels that summarize what each segment is saying. Those codes then get reshaped: merged, joined, renamed, and reorganized until they form a thematic framework—broad topics and recurring patterns that run through the dataset. For the themes to be credible, they must be grounded in the data rather than reflecting expectations, assumptions, or personal bias. The only way to demonstrate that grounding is to show the chain of decisions: which coded segments support each theme, and how the coding evolved into the final thematic structure.
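The chain of decisions described above can be sketched as a small data structure. This is a minimal, hypothetical illustration (all codes, quotes, and helper names are invented, not from the video): each segment keeps its quote alongside its code, every merge or rename is logged rather than silently applied, and themes are built by grouping codes so each theme remains traceable to its supporting excerpts.

```python
# Hypothetical sketch of an auditable coding-to-themes record.
# All codes, themes, and quotes below are invented for illustration.
from collections import defaultdict

# Step 1: coding -- each transcript segment gets a short label.
coded_segments = [
    {"quote": "I never know who to ask for help.", "code": "unclear support"},
    {"quote": "There was no one to turn to.",      "code": "lack of support"},
    {"quote": "The deadlines kept shifting.",      "code": "changing deadlines"},
]

# Step 2: reshaping -- merges/renames are logged, not applied silently.
audit_log = []

def merge_codes(old_codes, new_code):
    """Merge several codes into one, recording the decision."""
    for seg in coded_segments:
        if seg["code"] in old_codes:
            seg["code"] = new_code
    audit_log.append(f"merged {old_codes} -> '{new_code}'")

merge_codes(["unclear support", "lack of support"], "insufficient support")

# Step 3: themes -- grouped codes, each traceable back to its quotes.
themes = defaultdict(list)
for seg in coded_segments:
    themes[seg["code"]].append(seg["quote"])

print(audit_log)     # the decision trail
print(dict(themes))  # every theme with its supporting excerpts
```

The point of the sketch is the audit log: because every merge is recorded and every theme retains its quotes, the path from raw excerpts to final themes can be reproduced and checked.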

Dedicated AI analysis tools are criticized for two linked limitations. First, they typically don’t provide the audit trail needed to verify rigor—meaning it’s hard or impossible to show how the analysis moved from coding to themes. Second, they often output final themes that can’t be meaningfully altered or reused. In practice, that blocks a common scholarly need: recycling and adapting the same underlying coding framework for multiple outputs. A single dataset may support many articles or studies, each with a slightly different thematic emphasis; researchers rely on the ability to revisit and remodel codes and themes. Tools that only deliver a fixed end result make that iterative, publication-ready workflow difficult.

Instead of relying on AI “theme generators,” the recommended approach is to use professional qualitative data analysis software when possible—tools built for coding, retrieval, and traceability rather than automated interpretation. NVivo is highlighted as a primary choice, alongside ATLAS.ti and MAXQDA as strong alternatives. These packages support the essential tasks: coding the data, developing themes, and then drilling back into the underlying quotes. They also enable exporting and organizing evidence so findings can be written with defensible support.

But software isn’t the only route. Excel, visual mind-mapping tools like Miro, and even Microsoft Word (or pen-and-paper methods) can work if they support the two “major rules” of qualitative analysis. First, the method must allow systematic coding—assigning tags to transcript segments in a way that’s consistent and manageable. Second, it must allow traceability—linking each theme back to the original quotes so the analysis can be checked and written up. The emphasis is on meeting the process requirements, not chasing a particular brand of tool. As long as the final thematic framework is built rigorously and can be demonstrated through traceable evidence, the choice of tool is secondary to the integrity of the method.
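The Excel-style approach above can be sketched concretely. This is a hypothetical example (the themes, codes, and quotes are invented): a simple three-column sheet—theme, code, quote—already satisfies both rules, because every theme row carries its original excerpt, and retrieval is just a filter on the theme column.

```python
# Hypothetical sketch: a spreadsheet-style code sheet (CSV) where every
# theme row keeps a link back to the original quote, preserving traceability.
import csv
import io

rows = [
    # (theme, code, original quote) -- invented examples
    ("Support",  "insufficient support", "There was no one to turn to."),
    ("Workload", "changing deadlines",   "The deadlines kept shifting."),
]

# Write the sheet (in-memory here; a real workflow would use a file).
buf = io.StringIO()
writer = csv.writer(buf)
writer.writerow(["Theme", "Code", "Quote"])
writer.writerows(rows)

# Retrieval: pull every quote supporting a given theme -- the same
# "drill back to the evidence" step the professional packages offer.
buf.seek(0)
reader = csv.DictReader(buf)
support_quotes = [r["Quote"] for r in reader if r["Theme"] == "Support"]
print(support_quotes)
```

Whether the sheet lives in Excel, a Word table, or a CSV file, the design choice is the same: never store a theme without the excerpt that supports it.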

Cornell Notes

The central requirement for qualitative thematic analysis is an audit trail: themes must be traceable back to coded excerpts, and codes must be grounded in the data rather than expectations or bias. Many AI tools that generate themes from uploaded data are criticized for lacking this documentation and for producing fixed outputs that can’t be revised or reused for multiple publications. A workable workflow starts with coding (tagging transcript segments), then reshaping codes through merging, joining, and renaming until themes emerge. Whether using NVivo, ATLAS.ti, MAXQDA, Excel, Miro, Microsoft Word, or manual methods, the method must support two rules: systematic coding and traceability from themes back to original quotes.

Why is an “audit trail” so central to qualitative thematic analysis?

Because credibility depends on showing how themes were built from the text. The audit trail documents the path from coding to final themes—what segments were tagged, how codes evolved (merged, joined, renamed), and which coded excerpts support each theme. Without that chain, themes can’t be verified as grounded in the data, which matters in academic evaluation and rigor-focused settings.

What does the coding-to-themes process look like in practice?

Coding means tagging transcript segments with labels that summarize what each segment is about. After initial coding, researchers reorganize the code set—reshaping it by merging similar codes, joining related ones, and renaming to clarify meaning. Themes then emerge as broader topics or recurring patterns that run through the dataset, and those themes should be derived from the underlying codes.

What specific shortcomings make many dedicated AI qualitative analysis tools a poor fit?

Two major issues are emphasized: (1) lack of audit trail—tools often don’t show how the system moved from coding to themes; and (2) limited ability to revise or reuse outputs—final themes are typically hard to alter, rename, merge, or adapt for new publication purposes. That undermines iterative scholarly work where one dataset can support many different articles or studies.

Which professional software options are recommended for qualitative analysis, and what do they do well?

NVivo is presented as a main choice, with ATLAS.ti and MAXQDA also recommended. These tools support the essential human workflow: coding the data, developing themes, and then retrieving supporting quotes. They also make traceability practical—double-clicking codes to jump directly to the relevant excerpts—plus they support exporting and organizing evidence for writing.

How can non-specialized tools like Microsoft Word, Excel, or Miro support thematic analysis?

They can work if they enable systematic coding and traceability. For example, Microsoft Word can be used with tables (transcript on one side, codes on the other) or comments to link themes to codes. Excel can track codes with color or formatting. Miro can organize mind maps to structure codes and themes. Manual methods (printouts, highlighting, cutting) are also viable when they preserve consistent tagging and clear links from themes back to original quotes.

What matters most when choosing tools for qualitative analysis?

The process requirements matter more than the tool brand. The method must let researchers (1) assign tags to transcript segments in a systematic way and (2) trace themes back to the original quotes so evidence can be checked and written up. If those two conditions are met, the choice of tool is largely secondary to rigor and demonstrable traceability.

Review Questions

  1. What two capabilities must any tool (AI, software, or manual) provide to support credible thematic analysis?
  2. Describe how codes typically evolve into themes, and explain how that evolution should be documented for auditability.
  3. Why does the inability to revise or reuse AI-generated themes create problems for researchers working with one dataset across multiple outputs?

Key Points

  1. Qualitative thematic analysis requires an auditable trail from coded excerpts to final themes, especially in academic settings.

  2. Themes must be grounded in the data through coding; they can’t rely on expectations or personal bias.

  3. Many AI “theme generator” tools are criticized for lacking audit trail documentation and for producing fixed outputs that can’t be revised or reused.

  4. Professional qualitative software like NVivo, ATLAS.ti, and MAXQDA supports coding, theme development, and quote-level traceability.

  5. Non-specialized tools (Excel, Miro, Microsoft Word, or manual methods) can work if they enable systematic coding and traceability back to original quotes.

  6. Tool choice is secondary to meeting the process requirements and demonstrating rigor in how the thematic framework was built.

Highlights

The decisive test for any qualitative method is whether themes can be traced back to the original coded quotes.
Dedicated AI tools are faulted for missing the audit trail and for locking in final themes that can’t be meaningfully reworked.
NVivo is emphasized as a primary option, with ATLAS.ti and MAXQDA presented as strong alternatives for quote-level retrieval.
Excel, Miro, Microsoft Word, and even pen-and-paper approaches are viable when they support systematic coding and traceability.
A single coding framework can be remolded for multiple publications—something fixed AI outputs often prevent.
