Get AI summaries of any video or article — Sign up free
The Simplest Math Problem No One Can Solve - Collatz Conjecture thumbnail

The Simplest Math Problem No One Can Solve - Collatz Conjecture

Veritasium·
6 min read

Based on Veritasium's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Collatz (3x+1) applies n → 3n+1 when n is odd and n → n/2 when n is even, and the conjecture claims every positive starting value eventually reaches the loop 4 → 2 → 1.

Briefing

The Collatz conjecture—also known as 3x+1, the Syracuse problem, and several other names—asks whether repeatedly applying two simple rules to any positive integer always eventually lands in the loop 4 → 2 → 1. Despite the problem’s elementary setup, no one has proved the claim for all starting values, and mathematicians treat it as a rare example of a question that is easy to state but stubbornly hard to resolve. Its importance is partly cultural—Erdős reportedly warned that mathematics wasn’t “ripe” enough for questions like this—and partly technical, because progress has required blending number theory with probabilistic reasoning, computation, and dynamical systems.

The rules are straightforward: if the current number is odd, replace it with 3n + 1; if it’s even, divide by 2. Starting from 7, the sequence eventually falls into the 4-2-1 loop after a long chain of ups and downs. The behavior varies wildly: 27 rockets up to 9,232 before descending, taking 111 steps to reach the loop. These “hailstone numbers” are named for their bounce-like trajectories. Even adjacent starting values can diverge dramatically, which makes pattern-hunting difficult.

Why does the conjecture seem plausible at all? One reason is that the process has a built-in bias toward shrinking. Although odd numbers get multiplied by 3, they are immediately followed by an even number and then a division by 2. More carefully, the path from one odd number to the next odd number behaves like multiplying by about 3/2, but with frequent extra halvings: sometimes the next odd appears after dividing by 2 once (a factor near 3/4 overall), sometimes after dividing by 4 (near 3/8), and so on. Using a geometric-mean argument, the average multiplicative effect from one odd to the next is less than 1, suggesting that typical trajectories should drift downward.

Another line of evidence comes from computation. Every starting value up to 2^68 has been checked, roughly 295 quintillion cases, and none escapes the 4-2-1 loop. If a different loop existed, calculations indicate it would have to be enormous—at least 186 billion numbers long. Yet brute force can’t settle the conjecture, because a counterexample could be astronomically large and effectively impossible to “guess” without a guiding principle.

Mathematicians have also proved weaker “almost all” results. Riho Terras showed in 1976 that almost all sequences dip below their starting value; later work tightened the bound, and in 2019 Terry Tao proved that almost every starting number eventually reaches values below any slowly growing function f(x) that still tends to infinity. Tao described this as close to the conjecture without actually solving it. The gap remains crucial: “almost all” can be true even when infinitely many exceptional cases exist.

The transcript also highlights why proof is so elusive. A single divergent trajectory to infinity or a closed loop disconnected from 4-2-1 would refute the conjecture, but no such structure has been found. The problem’s complexity is underscored by the fact that negative integers behave differently, producing three independent loops rather than just one. Finally, there’s a philosophical edge: generalizations like John Conway’s FRACTRAN are Turing-complete and tied to the halting problem, raising the possibility that Collatz might be undecidable with current methods. In the meantime, the conjecture remains a benchmark for how far mathematics can go with a rule set that fits on a chalkboard.

Cornell Notes

Collatz (3x+1) starts with any positive integer and repeatedly applies: if odd, map n → 3n+1; if even, map n → n/2. The conjecture claims every starting value eventually reaches the unique loop 4 → 2 → 1. Evidence includes a strong “downward drift” argument: odd numbers lead to even numbers that are then halved, and the average effect from one odd to the next is less than 1. Computation has verified all starting values up to 2^68 (about 295 quintillion) without finding a counterexample. Partial results by Riho Terras and especially Terry Tao show that “almost all” sequences eventually fall below arbitrarily slowly growing thresholds, but that still doesn’t prove the claim for every single starting value.

What exactly is the Collatz conjecture, and what does it mean for a sequence to “end up” in the 4-2-1 loop?

For a starting positive integer n, repeatedly apply two rules: if n is odd, replace it with 3n+1; if n is even, replace it with n/2. The conjecture says that no matter which positive integer is chosen, the resulting sequence eventually reaches the repeating cycle 4 → 2 → 1 → 4 → … . Once the sequence hits 4, the deterministic rules force it into that loop forever.

Why does the process look like it should grow, and what mechanism actually biases it toward shrinking?

At first glance, odd numbers are multiplied by 3 while even numbers are only halved, so one might expect growth. The key catch is that 3n+1 always turns an odd number into an even number, so the next step is a division by 2. More refined reasoning tracks the jump from one odd term to the next odd term: after multiplying by 3 and adding 1, the sequence may divide by 2 once, or by 4, or by 8, etc., before reaching the next odd. Using a geometric-mean calculation over these possibilities yields an average multiplicative factor less than 1, implying a statistical downward drift.

How much has brute-force computation verified, and what does that verification not guarantee?

All starting values up to 2^68 have been checked, totaling about 295,147,905,179,352,825,856 cases, and every one eventually reaches 1 and the 4-2-1 loop. This does not prove the conjecture because a counterexample could exist beyond 2^68. The transcript notes that if any loop other than 4-2-1 exists, it would have to be at least 186 billion numbers long, but that still leaves infinitely many possibilities outside the checked range.

What do “almost all” results mean in this context, and why don’t they settle the conjecture?

“Almost all” has a technical meaning: as the starting values grow without bound, the fraction of starting numbers whose sequences satisfy a property approaches 1. Riho Terras showed in 1976 that almost all sequences eventually go below their initial value, and later bounds were tightened. Terry Tao’s 2019 result is stronger: for almost all starting numbers, the sequence contains values smaller than any function f(x) that still tends to infinity, even if f grows extremely slowly (examples given include log x and iterated logs). But “almost all” can still allow exceptional starting values; the conjecture requires the property for every starting integer.

How does the behavior differ for negative integers, and why is that surprising?

Applying the same odd/even rules to negative integers does not produce just one loop. Instead, there are three independent loops on the negative side, starting at low values such as -17 and -5. That contrast—multiple disconnected loops for negatives but only the 4-2-1 loop for positives so far—raises questions about why the positive and negative dynamics differ.

What role does the idea of undecidability or Turing completeness play in the discussion?

The transcript points to John Conway’s FRACTRAN, a generalization of 3x+1 that is Turing-complete. Turing completeness implies the system can simulate any computation and is therefore subject to the halting problem, meaning some inputs might never produce an output. While this doesn’t prove Collatz itself is undecidable, it suggests a plausible scenario: even if the conjecture is true or false, there may be no method to prove it within standard mathematical frameworks.

Review Questions

  1. What is the average-effect argument that makes 3x+1 seem more likely to shrink than grow, and how does it use the behavior between odd terms?
  2. Why does verifying all starting values up to 2^68 still leave the conjecture unresolved?
  3. What does Terry Tao’s “almost all” theorem guarantee, and what specific gap remains compared with proving the conjecture for every starting value?

Key Points

  1. 1

    Collatz (3x+1) applies n → 3n+1 when n is odd and n → n/2 when n is even, and the conjecture claims every positive starting value eventually reaches the loop 4 → 2 → 1.

  2. 2

    Despite occasional large climbs (like 27 reaching 9,232), the process shows a statistical downward drift because odd steps force an even number followed by at least one halving.

  3. 3

    A geometric-mean argument tracks how often sequences divide by 2, 4, 8, and so on before reaching the next odd term, yielding an average factor less than 1 from odd to odd.

  4. 4

    Brute-force verification has confirmed the conjecture for all starting values up to 2^68 (about 295 quintillion), but that cannot rule out counterexamples beyond that range.

  5. 5

    Riho Terras and later results proved that “almost all” sequences eventually fall below their starting value or below arbitrarily slowly growing thresholds, but “almost all” still allows exceptional cases.

  6. 6

    Negative integers behave differently: the same rules produce three independent loops (including ones starting at -17 and -5), unlike the single known positive loop.

  7. 7

    The discussion raises the possibility that Collatz-like systems could be undecidable in principle, drawing an analogy to Turing-complete generalizations such as Conway’s FRACTRAN.

Highlights

The conjecture’s core claim is deceptively simple: every positive integer should eventually fall into the repeating cycle 4 → 2 → 1 under the odd/even rules.
The “downward drift” intuition comes from analyzing the jump from one odd number to the next odd number, where repeated halving dominates the occasional multiplication by 3.
Terry Tao’s 2019 result proves a near-miss: for almost all starting values, sequences dip below any function f(x) that still grows to infinity—yet that still doesn’t cover every starting value.
Even after checking roughly 295 quintillion cases up to 2^68, no proof exists, and a counterexample could be far beyond any feasible search.
Negative starting values generate multiple loops, suggesting the dynamics on the negative side are structurally different from the positive side.

Topics

  • Collatz Conjecture
  • 3x+1
  • Hailstone Numbers
  • Benford's Law
  • Turing Completeness

Mentioned

  • Paul Erdos
  • Luther Collatz
  • Jeffrey Lagarias
  • Alex Kontorovich
  • Yakov Sinai
  • Riho Terras
  • Terry Tao
  • John Conway
  • C. Brian Haselgrove
  • George Polya