LESSON 38 - METHODS OF DATA COLLECTION: TYPES OF DATA || SOURCES OF DATA

TL;DR

Data collection determines whether researchers can answer research questions, test hypotheses, and produce conclusions and recommendations.

Briefing Cornell Notes

Briefing

Data collection is the make-or-break step in research: without it, researchers can’t answer research questions, test hypotheses, or produce conclusions, recommendations, and suggestions for further study. Choosing the right data collection approach starts with three decisions—what kind of data is needed (numerical or narrative), which collection methods fit that data (questionnaires, observation, interviews, or document analysis), and how the resulting data will be analyzed (since the data type directly shapes the analysis plan).

“Data” is defined as quantitative or qualitative information about a variable gathered from research subjects. Variables are measurable characteristics that vary across people, objects, or events, and the information collected about those characteristics can appear in many forms: images, numbers, words, figures, facts, and ideas. The lesson also distinguishes between a tool and a method. A tool (or instrument) is the device used to collect data—such as an interview guide—while a method is the technique for collecting data—such as conducting interviews. Instrumentation goes beyond designing the instrument; it includes how the instrument will be administered under real conditions.

The lesson then breaks data into two major types. Quantitative data are numerical and mathematically computable, split into discrete quantitative data (whole numbers only, like counts of people or animals) and continuous quantitative data (values along a continuum, like height, weight, or distance that can include decimals). Qualitative data are non-numerical and categorical, typically textual or descriptive. Examples include interview transcripts, reports or minutes, emails, field notes, and video or audio recordings, as well as photographs. Qualitative data are discussed alongside nominal and ordinal measurement scales: nominal for naming categories and ordinal for ordering categories where rank differences are not necessarily equal.

Measurement is treated as a broader concept than data collection or tools. It includes assigning scores, numbers, meanings, and descriptions to represent the characteristic of interest. At a minimum, measurement requires defining terms operationally and identifying the scale of measurement. Four scales of measurement are highlighted: nominal (names), ordinal (ordered ranks), interval (no absolute zero), and ratio (has an absolute zero).

Finally, the lesson outlines three sources of data—primary, secondary, and tertiary. Primary sources are original materials created at the time of an event or by someone who directly experienced it (e.g., handwritten manuscripts, government documents, public records, photographs). Secondary sources are later accounts based on primary sources, used to interpret or evaluate them (e.g., textbooks, biographies, analytical articles). Tertiary sources compile or digest other sources (e.g., dictionaries, encyclopedias, directories, guidebooks, manuals, Wikipedia, and similar reference digests). The primary-versus-secondary boundary isn’t always clear; classification depends on how and why a source is used, not only on what it is.

When using secondary data, the lesson emphasizes that it must be available, relevant, accurate, and sufficient for the research question. Across both primary and secondary work, data quality depends on validity and reliability, piloting instruments before full administration, and collecting data only with authorization—typically involving ethics approval and permission from those controlling access to participants. The lesson closes by reiterating four core data collection methods—questionnaires, observation, interviews, and document analysis—setting up the next session on questionnaires.

Cornell Notes

The lesson frames data collection as essential for turning research questions into testable evidence and actionable conclusions. “Data” is defined as quantitative or qualitative information about variables, gathered from research subjects, and it can take forms ranging from numbers and images to words and facts. Data types split into quantitative (discrete vs. continuous) and qualitative (categorical, often textual/descriptive), with measurement scales ranging from nominal and ordinal to interval and ratio. Data collection methods rely on correct distinctions between methods (techniques) and instruments/tools (devices), plus proper instrumentation (how tools are administered). Sources of data fall into primary, secondary, and tertiary, and the primary/secondary label depends on how the source is used, not just what it is.

How do researchers distinguish “data,” “variables,” “measurement,” and “instruments” in this lesson?

Variables are measurable characteristics that vary across people, objects, or events. Data is the quantitative or qualitative information collected about those variables from research subjects (e.g., numbers, words, images, facts). Measurement is the broader process of assigning scores, numbers, meanings, or descriptions so the assigned values represent the characteristic of interest; it includes operational definitions and choosing the scale of measurement. Instruments (tools) are the devices used to collect data—like an interview guide—while methods are the techniques for collecting data—like conducting interviews. Instrumentation also includes the conditions and procedures for administering the instrument, not just designing it.

What are the two main types of data, and how do discrete vs. continuous quantitative data differ?

Quantitative data are numerical and mathematically computable. Discrete quantitative data can take only whole-number values (e.g., counts of people or animals; you can’t meaningfully speak of 10.5 people or 50.7 animals). Continuous quantitative data can take any value within a continuum, including decimals (e.g., height, weight, and distance such as 16.576 kilometers). Qualitative data are non-numerical and categorical, typically textual or descriptive (e.g., yes/no responses, marital status, satisfaction levels, and materials like interview transcripts, reports, emails, field notes, and recordings).

Why does the lesson treat measurement scales (nominal, ordinal, interval, ratio) as central to data collection and analysis?

Measurement scales determine how data can be categorized or quantified. Nominal scales name categories; ordinal scales order categories where rank gaps aren’t necessarily equal; interval scales have no absolute zero; ratio scales have an absolute zero. Because the scale of measurement shapes what can be computed or compared, it links directly to the type of data (quantitative vs. qualitative) and therefore influences how the data can be analyzed.

What are the three sources of data, and what’s the key rule about whether something is “primary” or “secondary”?

Primary sources are original materials created at the time of an event or by someone who directly experienced it (e.g., handwritten manuscripts, government documents, public records, photographs). Secondary sources are later accounts based on primary sources, used to analyze, evaluate, interpret, or criticize them (e.g., textbooks, biographies, analysis articles). Tertiary sources compile or digest other sources (e.g., dictionaries, encyclopedias, directories, guidebooks, manuals, Wikipedia, and similar digests). The primary/secondary distinction isn’t always clear-cut; classification depends on how and why the source is used (content and purpose), not only on the source’s origin.

What requirements does the lesson list for using secondary data, and what quality/ethics steps apply to data collection generally?

Secondary data must be available, relevant, accurate, and sufficient to answer the research question. For quality, instruments should produce data that is valid and reliable, and researchers should pilot test instruments before administering them to the main sample. For ethics and access, data collection must not proceed without authorization, including ethics approval and permission from those who control access to potential participants.

Which factors guide the choice of data collection method, and how do they connect to unit of analysis and audience preferences?

Method choice depends on (1) the type of data needed—numerical vs. narrative (quantitative vs. qualitative), (2) the source—whether primary, secondary, or a combination, and whether data come from individuals or groups (which determines the unit of analysis), and (3) the audience’s preference for numerical or narrative evidence. These factors help align the method with the research design and intended interpretation of findings.

Review Questions

How does the lesson explain the difference between a method of data collection and an instrument/tool, and where does instrumentation fit in?
Give one example each of discrete quantitative, continuous quantitative, and qualitative data, and match each to the appropriate measurement scale type discussed (nominal/ordinal/interval/ratio).
Why can a source be treated as primary or secondary depending on how it is used, and what checks should be applied when relying on secondary data?

Key Points

1
Data collection determines whether researchers can answer research questions, test hypotheses, and produce conclusions and recommendations.
2
“Data” is quantitative or qualitative information about variables collected from research subjects, and it can appear as numbers, words, images, or facts.
3
Tools/instruments are devices used to collect data, while methods are the techniques; instrumentation includes both instrument design and how it is administered.
4
Quantitative data are numerical and split into discrete (whole numbers) and continuous (values along a continuum), while qualitative data are non-numerical and categorical.
5
Measurement is broader than data collection: it includes operational definitions and selecting measurement scales (nominal, ordinal, interval, ratio).
6
Primary, secondary, and tertiary sources differ by originality and timing, but the primary/secondary label depends on how the source is used.
7
Secondary data must be available, relevant, accurate, and sufficient; instruments should be piloted, data quality should be valid and reliable, and collection requires authorization and ethics approval.

Highlights

Without collected data, research can’t reliably produce conclusions, recommendations, or suggestions for further study.

Discrete quantitative data are whole-number counts, while continuous quantitative data can include decimals along a continuum.

Measurement is not just assigning numbers—it also includes assigning meanings and descriptions through operational definitions and scale selection.

A source’s classification as primary or secondary depends on purpose and use, not only on the source itself.

Using secondary data requires checks for availability, relevance, accuracy, and sufficiency, alongside validity, reliability, piloting, and ethics authorization.

Topics

Data Collection Methods
Types of Data
Sources of Data
Measurement Scales
Primary vs Secondary Sources

Mentioned

Lydiah Wambugu