How to Start AI and Robotics Research (No Matter Your Background)
Based on Code Mechanics: My PhD Life in AI & Robotics's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Start research now by aligning expectations with current background knowledge rather than waiting for an arbitrary skill threshold.
Briefing
Starting AI and robotics research doesn’t require a specific degree level or a pre-built “research aptitude” resume. The practical unlock is to set expectations that match current background knowledge—and then follow a paper-reading strategy that turns curiosity into a repeatable workflow. The core message is that waiting for an arbitrary threshold usually backfires: the more people learn, the more they realize what they don’t know, so the goalposts keep moving. Instead, researchers should begin now, because early momentum opens doors and makes later learning faster.
The strategy begins with “appropriate expectations,” meaning a realistic view of what someone can gain right now from reading and doing research. Two archetypes illustrate the difference. Student one—new to the field, early in a bachelor’s program or just out of high school—may recognize some math and key terms, and understand the introduction’s language, but still miss how the work fits into the broader literature, what the contributions are, what the results mean, and how the results were generated. That’s not a failure; it’s a valid starting point. Student two—master’s level or later in a PhD, with literature review experience and at least one paper—reads with stronger context, understands the math formalism more deeply, and focuses on contributions, how results were produced, and how the work could be extended or improved. Even critique should be handled carefully: jumping straight to criticism can hide useful insights, so it’s often better to identify what’s strong first, then critique with awareness of potential bias.
With expectations aligned, the workflow shifts to “strategy,” starting by choosing a research area and learning how the field validates work. For robotics, major venues mentioned include IRA, IROS, and JRR, and the video emphasizes using Google Scholar to check where papers were published, how peer review signals value, and citation counts as a rough (imperfect) proxy for impact—especially noting that very new papers may have low citations simply due to recency.
Next comes the most actionable step: begin with a survey paper found via Google Scholar. Survey papers consolidate dozens of related works into a digestible map, offering multiple paths for deeper exploration. Access can be a constraint, so the transcript points to arXiv preprints as alternatives when journal or conference versions are paywalled, with the caveat that preprints may differ from final publications.
Finally, reading is done strategically rather than top-to-bottom. The method uses two handwritten “concept map” sheets. One sheet tracks keywords: as terms appear in the abstract, figures, and tables, unfamiliar words get written down without immediately looking them up. The second sheet captures figures and tables with one-sentence summaries in the reader’s own words. After this first pass, the reader uses the keyword list differently depending on experience level. Student one asks what background knowledge is needed—for example, if “convolutions” appear, the likely prerequisite is matrix multiplication and, more broadly, linear algebra. Student two asks how a term or architecture matters in context—why a specific design (like a D3QN architecture) is used, what it enables, and how it fits into the larger problem (such as terrain traversibility for mobile robots).
The payoff is faster comprehension, better paper selection, and a clearer sense of “what and why” behind each reading. The approach is intentionally slow at first, but writing in one’s own words and tying concepts to a bigger research question accelerates learning over time. Teaching what’s learned is recommended as a motivation and a diagnostic for understanding gaps.
Cornell Notes
The transcript argues that starting AI and robotics research is less about having the “right” background and more about setting expectations that match where you are today, then using a repeatable strategy to learn from papers. Two reader profiles illustrate how expectations differ: beginners focus on grasping context, contributions, and result generation at a high level, while more advanced researchers analyze contributions, methods, and how work could be extended. The core workflow centers on choosing a research area, checking major venues and impact signals, starting with a survey paper, and then reading strategically using handwritten concept maps. Instead of reading top-to-bottom, readers first map keywords and summarize figures/tables, then look up prerequisites (for beginners) or contextual meaning and implications (for advanced readers). This matters because it turns curiosity into momentum and prevents unrealistic expectations from causing discouragement.
How should a beginner set expectations when reading a research paper for the first time?
What changes in expectations once someone is further along (master’s/PhD stage)?
Why start with survey papers instead of diving into a random research article?
What is the transcript’s paper-reading method, and why avoid top-to-bottom reading first?
How does the strategy differ for beginners versus advanced researchers when handling keywords?
How do venue knowledge and citation counts fit into the research-start strategy?
Review Questions
- When reading a paper, what are the two handwritten outputs the transcript recommends creating first, and what does each one capture?
- How should a beginner decide what to learn next after encountering an unfamiliar keyword, and how is that different from what an advanced researcher should do?
- Why does the transcript recommend starting with a survey paper, and what practical workaround does it suggest if the survey is paywalled?
Key Points
- 1
Start research now by aligning expectations with current background knowledge rather than waiting for an arbitrary skill threshold.
- 2
Use “appropriate expectations” to decide what you can realistically understand from a paper at your current level.
- 3
Choose a research area and identify major venues in that field to understand how peer review and community validation work.
- 4
Begin with a survey paper found on Google Scholar to get a structured map of the topic before reading primary work.
- 5
Read strategically: first extract keywords and summarize figures/tables in your own words, then do a full top-to-bottom read after you’ve grounded the basics.
- 6
For beginners, treat keywords as signals for prerequisite learning (e.g., convolutions → matrix multiplication → linear algebra).
- 7
For advanced readers, treat keywords as signals for contextual meaning and implications (e.g., why a specific architecture is used and how it affects outcomes).