LESSON 38 - METHODS OF DATA COLLECTION: TYPES OF DATA || SOURCES OF DATA
Based on RESEARCH METHODS CLASS WITH PROF. LYDIAH WAMBUGU's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Data collection determines whether researchers can answer research questions, test hypotheses, and produce conclusions and recommendations.
Briefing
Data collection is the make-or-break step in research: without it, researchers can’t answer research questions, test hypotheses, or produce conclusions, recommendations, and suggestions for further study. Choosing the right data collection approach starts with three decisions—what kind of data is needed (numerical or narrative), which collection methods fit that data (questionnaires, observation, interviews, or document analysis), and how the resulting data will be analyzed (since the data type directly shapes the analysis plan).
“Data” is defined as quantitative or qualitative information about a variable gathered from research subjects. Variables are measurable characteristics that vary across people, objects, or events, and the information collected about those characteristics can appear in many forms: images, numbers, words, figures, facts, and ideas. The lesson also distinguishes between a tool and a method. A tool (or instrument) is the device used to collect data—such as an interview guide—while a method is the technique for collecting data—such as conducting interviews. Instrumentation goes beyond designing the instrument; it includes how the instrument will be administered under real conditions.
The lesson then breaks data into two major types. Quantitative data are numerical and mathematically computable, split into discrete quantitative data (whole numbers only, like counts of people or animals) and continuous quantitative data (values along a continuum, like height, weight, or distance that can include decimals). Qualitative data are non-numerical and categorical, typically textual or descriptive. Examples include interview transcripts, reports or minutes, emails, field notes, and video or audio recordings, as well as photographs. Qualitative data are discussed alongside nominal and ordinal measurement scales: nominal for naming categories and ordinal for ordering categories where rank differences are not necessarily equal.
Measurement is treated as a broader concept than data collection or tools. It includes assigning scores, numbers, meanings, and descriptions to represent the characteristic of interest. At a minimum, measurement requires defining terms operationally and identifying the scale of measurement. Four scales of measurement are highlighted: nominal (names), ordinal (ordered ranks), interval (no absolute zero), and ratio (has an absolute zero).
Finally, the lesson outlines three sources of data—primary, secondary, and tertiary. Primary sources are original materials created at the time of an event or by someone who directly experienced it (e.g., handwritten manuscripts, government documents, public records, photographs). Secondary sources are later accounts based on primary sources, used to interpret or evaluate them (e.g., textbooks, biographies, analytical articles). Tertiary sources compile or digest other sources (e.g., dictionaries, encyclopedias, directories, guidebooks, manuals, Wikipedia, and similar reference digests). The primary-versus-secondary boundary isn’t always clear; classification depends on how and why a source is used, not only on what it is.
When using secondary data, the lesson emphasizes that it must be available, relevant, accurate, and sufficient for the research question. Across both primary and secondary work, data quality depends on validity and reliability, piloting instruments before full administration, and collecting data only with authorization—typically involving ethics approval and permission from those controlling access to participants. The lesson closes by reiterating four core data collection methods—questionnaires, observation, interviews, and document analysis—setting up the next session on questionnaires.
Cornell Notes
The lesson frames data collection as essential for turning research questions into testable evidence and actionable conclusions. “Data” is defined as quantitative or qualitative information about variables, gathered from research subjects, and it can take forms ranging from numbers and images to words and facts. Data types split into quantitative (discrete vs. continuous) and qualitative (categorical, often textual/descriptive), with measurement scales ranging from nominal and ordinal to interval and ratio. Data collection methods rely on correct distinctions between methods (techniques) and instruments/tools (devices), plus proper instrumentation (how tools are administered). Sources of data fall into primary, secondary, and tertiary, and the primary/secondary label depends on how the source is used, not just what it is.
How do researchers distinguish “data,” “variables,” “measurement,” and “instruments” in this lesson?
What are the two main types of data, and how do discrete vs. continuous quantitative data differ?
Why does the lesson treat measurement scales (nominal, ordinal, interval, ratio) as central to data collection and analysis?
What are the three sources of data, and what’s the key rule about whether something is “primary” or “secondary”?
What requirements does the lesson list for using secondary data, and what quality/ethics steps apply to data collection generally?
Which factors guide the choice of data collection method, and how do they connect to unit of analysis and audience preferences?
Review Questions
- How does the lesson explain the difference between a method of data collection and an instrument/tool, and where does instrumentation fit in?
- Give one example each of discrete quantitative, continuous quantitative, and qualitative data, and match each to the appropriate measurement scale type discussed (nominal/ordinal/interval/ratio).
- Why can a source be treated as primary or secondary depending on how it is used, and what checks should be applied when relying on secondary data?
Key Points
- 1
Data collection determines whether researchers can answer research questions, test hypotheses, and produce conclusions and recommendations.
- 2
“Data” is quantitative or qualitative information about variables collected from research subjects, and it can appear as numbers, words, images, or facts.
- 3
Tools/instruments are devices used to collect data, while methods are the techniques; instrumentation includes both instrument design and how it is administered.
- 4
Quantitative data are numerical and split into discrete (whole numbers) and continuous (values along a continuum), while qualitative data are non-numerical and categorical.
- 5
Measurement is broader than data collection: it includes operational definitions and selecting measurement scales (nominal, ordinal, interval, ratio).
- 6
Primary, secondary, and tertiary sources differ by originality and timing, but the primary/secondary label depends on how the source is used.
- 7
Secondary data must be available, relevant, accurate, and sufficient; instruments should be piloted, data quality should be valid and reliable, and collection requires authorization and ethics approval.