Ethical Thematic Analysis with ChatGPT: Step-by-step Tutorial

TL;DR

Use a three-stage workflow: initial descriptive coding, manual focus coding, then theme development.

Briefing Cornell Notes

Briefing

Ethical thematic analysis with ChatGPT hinges on one practical idea: keep tight control of the coding workflow so the model can’t “shortcut” to ready-made themes, and preserve traceability from every code back to the original participant quotes. The payoff is credibility—results can be treated as trustworthy and valid—because the process stays transparent and bias is actively managed rather than outsourced to an AI output.

The method is built around a three-stage thematic analysis workflow. Stage one creates a foundation of detailed, descriptive initial codes. These codes must be granular and grounded in what participants actually say, not abstract labels. The workflow also demands breadth: the coder should ask for lots of codes so every part of the transcript is captured, rather than letting the model return a small set of broad categories that resemble themes too early. Crucially, the prompt should require a full list of quotes associated with each code. Without quote-level traceability, later reporting becomes guesswork and transparency collapses.

Stage two refines and organizes those initial codes through “focus coding.” Here, the transcript-level outputs are transferred into Microsoft Word (or similar external tools like Excel) so the researcher can manage the audit trail. Each participant’s material is kept separate using distinct color formatting in Word, creating a visual map of where every code and quote came from. Focus coding is then done primarily manually: codes are grouped into higher-level clusters that represent patterns or trends, not final themes. This manual step helps prevent the model from collapsing the process into a theme list. When duplicates appear across participants—codes that mean essentially the same thing but use different wording—the researcher consolidates them by creating a new code description and attaching the original code names as comments, preserving the lineage back to the original quotes.

Stage three develops themes from the focus-coded structure. The recommended approach is manual again: themes and sub-themes are organized to communicate what the data supports, ensuring the reader understands the findings as well as the researcher does. ChatGPT can be used at this stage only if the researcher chooses, but the emphasis remains on human judgment for the final narrative structure.

To keep the workflow ethical, the tutorial repeatedly warns against two shortcuts: asking ChatGPT to analyze transcripts thematically in one step, and disclosing the study’s research questions or exact aims in the prompt. Revealing those details encourages the model to steer toward predetermined answers. Instead, the prompt should provide context (e.g., the transcript is from teacher interviews about remote teaching) while withholding the specific research question framing. The workflow also recommends uploading or pasting one transcript at a time (even with ChatGPT Plus) to reduce confusion and maintain consistent prompting.

Overall, the tutorial’s core message is that AI can assist coding, but ethical thematic analysis requires the researcher to control inputs, enforce quote-level traceability, and do the key organizing and theme-making steps themselves—using Microsoft Word as the transparency backbone and manual focus coding as the safeguard against bias and premature conclusions.

Cornell Notes

Ethical thematic analysis with ChatGPT depends on preventing the model from jumping straight to themes. The workflow starts with detailed initial coding: prompts should request many descriptive codes and require a full list of participant quotes for each code, so every later claim can be traced back to evidence. Next, codes are transferred into Microsoft Word with participant-specific color coding, then focus coding is performed mainly manually to group similar codes and consolidate duplicates while preserving links to original code names and quotes. Theme development is ideally manual as well, using the organized focus-coded structure to craft a coherent set of themes and sub-themes that reflect what the data supports. This approach supports transparency, bias control, and validity through an auditable chain from transcript → code → quote → theme.

Why does the tutorial insist on “descriptive” initial codes rather than abstract labels?

Descriptive codes stay close to what participants actually said, which makes later refinement meaningful. Abstract labels (for example, “professional skill set”) can obscure the specific content under that code, making it harder to understand and compare what was said across the dataset. The tutorial contrasts this with more concrete phrasing (e.g., “skills are important in being a good teacher”) that preserves the substance of the participant’s meaning.

What prompt requirements are treated as non-negotiable for ethical, transparent analysis?

The prompt should require (1) lots of codes so the analysis doesn’t collapse into a small set of broad categories, and (2) a full list of quotes associated with each code. Quote-level traceability is what allows later stages—refining codes, grouping them, and reporting themes—to remain transparent and verifiable rather than relying on AI-generated summaries.

How does the workflow reduce the risk that ChatGPT will “shortcut” to themes?

It avoids asking for thematic analysis in one step and avoids giving away the study’s research questions or exact aims. The tutorial argues that disclosing those elements encourages the model to steer toward predetermined answers. Instead, it provides general context (e.g., the transcript is from teacher interviews about remote work) while requesting coding outputs that can be monitored and revised.

Why upload or paste one transcript at a time, and why keep participant materials separate in Microsoft Word?

One-at-a-time processing reduces confusion and helps maintain consistent prompting. In Microsoft Word, participant-specific color coding creates a clear audit trail: codes and quotes can be traced back to the correct participant file when consolidating duplicates or checking what a code originally meant.

What exactly happens during focus coding, and why is it recommended to be manual?

Focus coding groups initial codes into clusters representing patterns or trends, not final themes. It’s recommended to be manual to preserve control and prevent the model from reorganizing the data into themes prematurely. During this step, duplicates across participants are consolidated by creating a new code description and attaching the original code names as comments so the researcher can trace back to the original quotes.

When should ChatGPT be used for theme development, and what’s the preferred approach?

The preferred approach is manual theme development: themes and sub-themes are organized by reading the focus-coded structure and deciding what story the data supports. ChatGPT can be used at this stage if desired (for example, by asking it to propose theme structures based on focus codes), but the tutorial emphasizes that human judgment is central for the final narrative and communication of findings.

Review Questions

What two prompt features most directly support transparency and validity in the initial coding stage?
How does participant-specific color coding in Microsoft Word help during focus coding and duplicate consolidation?
What kinds of information should be withheld from the prompt to reduce the risk of premature theme formation?

Key Points

1
Use a three-stage workflow: initial descriptive coding, manual focus coding, then theme development.
2
Require many descriptive initial codes and demand quote-level traceability by asking for a full list of quotes per code.
3
Prevent shortcutting by not asking for thematic analysis in one step and by withholding the study’s exact research questions and aims from the prompt.
4
Process transcripts one at a time and keep participant materials separate, using Microsoft Word color coding to preserve an audit trail.
5
Perform focus coding mainly manually: group codes into pattern clusters, consolidate duplicates, and preserve lineage by commenting original code names.
6
Develop final themes ideally through manual interpretation of the organized focus codes so the narrative reflects what the data supports.

Highlights

Ethical thematic analysis requires an auditable chain from transcript → code → associated quotes → grouped codes → themes.

The tutorial treats “lots of descriptive codes” plus “full quotes per code” as the backbone of transparency and validity.

Withholding research questions and exact aims from the prompt helps stop ChatGPT from steering toward predetermined themes.

Microsoft Word color coding by participant turns traceability into a practical, visual workflow.

Focus coding is positioned as the key control point—done mainly manually to avoid premature, AI-driven theme formation.

Topics

Thematic Analysis
Initial Coding
Focus Coding
Theme Development
Ethical AI Coding

Mentioned

Microsoft Word
nvivo