Get AI summaries of any video or article — Sign up free
How to Start AI and Robotics Research (No Matter Your Background) thumbnail

How to Start AI and Robotics Research (No Matter Your Background)

6 min read

Based on Code Mechanics: My PhD Life in AI & Robotics's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Start research now by aligning expectations with current background knowledge rather than waiting for an arbitrary skill threshold.

Briefing

Starting AI and robotics research doesn’t require a specific degree level or a pre-built “research aptitude” resume. The practical unlock is to set expectations that match current background knowledge—and then follow a paper-reading strategy that turns curiosity into a repeatable workflow. The core message is that waiting for an arbitrary threshold usually backfires: the more people learn, the more they realize what they don’t know, so the goalposts keep moving. Instead, researchers should begin now, because early momentum opens doors and makes later learning faster.

The strategy begins with “appropriate expectations,” meaning a realistic view of what someone can gain right now from reading and doing research. Two archetypes illustrate the difference. Student one—new to the field, early in a bachelor’s program or just out of high school—may recognize some math and key terms, and understand the introduction’s language, but still miss how the work fits into the broader literature, what the contributions are, what the results mean, and how the results were generated. That’s not a failure; it’s a valid starting point. Student two—master’s level or later in a PhD, with literature review experience and at least one paper—reads with stronger context, understands the math formalism more deeply, and focuses on contributions, how results were produced, and how the work could be extended or improved. Even critique should be handled carefully: jumping straight to criticism can hide useful insights, so it’s often better to identify what’s strong first, then critique with awareness of potential bias.

With expectations aligned, the workflow shifts to “strategy,” starting by choosing a research area and learning how the field validates work. For robotics, major venues mentioned include IRA, IROS, and JRR, and the video emphasizes using Google Scholar to check where papers were published, how peer review signals value, and citation counts as a rough (imperfect) proxy for impact—especially noting that very new papers may have low citations simply due to recency.

Next comes the most actionable step: begin with a survey paper found via Google Scholar. Survey papers consolidate dozens of related works into a digestible map, offering multiple paths for deeper exploration. Access can be a constraint, so the transcript points to arXiv preprints as alternatives when journal or conference versions are paywalled, with the caveat that preprints may differ from final publications.

Finally, reading is done strategically rather than top-to-bottom. The method uses two handwritten “concept map” sheets. One sheet tracks keywords: as terms appear in the abstract, figures, and tables, unfamiliar words get written down without immediately looking them up. The second sheet captures figures and tables with one-sentence summaries in the reader’s own words. After this first pass, the reader uses the keyword list differently depending on experience level. Student one asks what background knowledge is needed—for example, if “convolutions” appear, the likely prerequisite is matrix multiplication and, more broadly, linear algebra. Student two asks how a term or architecture matters in context—why a specific design (like a D3QN architecture) is used, what it enables, and how it fits into the larger problem (such as terrain traversibility for mobile robots).

The payoff is faster comprehension, better paper selection, and a clearer sense of “what and why” behind each reading. The approach is intentionally slow at first, but writing in one’s own words and tying concepts to a bigger research question accelerates learning over time. Teaching what’s learned is recommended as a motivation and a diagnostic for understanding gaps.

Cornell Notes

The transcript argues that starting AI and robotics research is less about having the “right” background and more about setting expectations that match where you are today, then using a repeatable strategy to learn from papers. Two reader profiles illustrate how expectations differ: beginners focus on grasping context, contributions, and result generation at a high level, while more advanced researchers analyze contributions, methods, and how work could be extended. The core workflow centers on choosing a research area, checking major venues and impact signals, starting with a survey paper, and then reading strategically using handwritten concept maps. Instead of reading top-to-bottom, readers first map keywords and summarize figures/tables, then look up prerequisites (for beginners) or contextual meaning and implications (for advanced readers). This matters because it turns curiosity into momentum and prevents unrealistic expectations from causing discouragement.

How should a beginner set expectations when reading a research paper for the first time?

A beginner (the transcript’s “student one”) should expect partial understanding: they may recognize some math and key terms, and understand the introduction’s language, but they likely won’t grasp how the paper fits into the broader literature, what the contributions are, how the results were generated, or what the results mean in context. The key is to treat that as a normal starting point, not a sign to stop. As repeated reading builds familiarity, themes and concepts start connecting, and the reader gradually gains the background needed to understand the full arc of a paper.

What changes in expectations once someone is further along (master’s/PhD stage)?

A more advanced reader (“student two”) should expect stronger context and deeper technical comprehension. They’re more likely to understand where the paper sits in the domain, interpret the math formalism more accurately, and analyze algorithmic or architectural differences versus prior work. Their focus shifts toward contributions and potential extension: how the paper’s method could improve their own results, and how to evaluate the work critically. The transcript also warns against rushing into criticism; it recommends identifying what’s good first, then critiquing while checking for bias.

Why start with survey papers instead of diving into a random research article?

Survey papers provide a “lay of the land” view by compiling many related works (often 50–100) and summarizing techniques and themes in one place. That breadth creates multiple routes for deeper exploration and helps readers understand the structure of a topic before committing to details. The transcript also notes practical access issues: survey papers may be paywalled, so arXiv preprints can be used as a starting point, with the caveat that preprints may differ from final versions.

What is the transcript’s paper-reading method, and why avoid top-to-bottom reading first?

The method uses two handwritten concept-map sheets. One sheet captures keywords: as unfamiliar terms appear in the abstract, figures, and tables, the reader writes them down without looking them up immediately. The second sheet summarizes figures and tables in one sentence each, written in the reader’s own words (not copied captions). This approach prevents getting lost in rabbit holes, grounds understanding early, and helps the reader decide whether the paper is worth deeper attention before doing a full read.

How does the strategy differ for beginners versus advanced researchers when handling keywords?

For beginners, keywords trigger prerequisite learning: the reader asks what background knowledge is needed to understand the term. Example given: if “convolutions” appear, the reader may need matrix multiplication and, more broadly, linear algebra. For advanced researchers, keywords trigger contextual analysis: the reader asks what the term means in that specific paper, how it’s used, why it’s chosen over alternatives, and how it connects to the larger research problem. The transcript uses D3QN as an example of an architecture to evaluate in terms of expected outcomes and tradeoffs.

How do venue knowledge and citation counts fit into the research-start strategy?

The transcript recommends identifying major publishing venues in the field (robotics examples listed include IRA, IROS, and JRR) to gauge whether work is peer reviewed and valued by the community. It also suggests using citation counts as a rough impact indicator, while acknowledging limitations: very recent papers may have low citations simply because they haven’t had time to be adopted.

Review Questions

  1. When reading a paper, what are the two handwritten outputs the transcript recommends creating first, and what does each one capture?
  2. How should a beginner decide what to learn next after encountering an unfamiliar keyword, and how is that different from what an advanced researcher should do?
  3. Why does the transcript recommend starting with a survey paper, and what practical workaround does it suggest if the survey is paywalled?

Key Points

  1. 1

    Start research now by aligning expectations with current background knowledge rather than waiting for an arbitrary skill threshold.

  2. 2

    Use “appropriate expectations” to decide what you can realistically understand from a paper at your current level.

  3. 3

    Choose a research area and identify major venues in that field to understand how peer review and community validation work.

  4. 4

    Begin with a survey paper found on Google Scholar to get a structured map of the topic before reading primary work.

  5. 5

    Read strategically: first extract keywords and summarize figures/tables in your own words, then do a full top-to-bottom read after you’ve grounded the basics.

  6. 6

    For beginners, treat keywords as signals for prerequisite learning (e.g., convolutions → matrix multiplication → linear algebra).

  7. 7

    For advanced readers, treat keywords as signals for contextual meaning and implications (e.g., why a specific architecture is used and how it affects outcomes).

Highlights

The transcript’s central claim is that research momentum matters more than having the “right” degree level—start now with realistic expectations.
Survey papers are positioned as the fastest way to build a topic map, especially when arXiv preprints provide access.
A two-sheet handwritten concept map (keywords + one-sentence figure/table summaries) replaces inefficient top-to-bottom reading.
Beginners use keywords to find prerequisites; advanced researchers use keywords to evaluate design choices and broader implications.
Critique should come after identifying what’s strong, to reduce bias and improve learning.

Mentioned