Get AI summaries of any video or article — Sign up free
Midjourney v6, Altman 'Age Reversal' and Gemini 2 - Christmas Edition thumbnail

Midjourney v6, Altman 'Age Reversal' and Gemini 2 - Christmas Edition

AI Explained·
5 min read

Based on AI Explained's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Midjourney v6 improves prompt adherence by preserving spatial relationships and internal structural details that earlier versions often missed.

Briefing

Midjourney v6 is making image generation more obedient to real-world composition—especially spatial relationships—pushing outputs closer to photo realism. Compared with Midjourney 5.2, v6 better preserves details like “bigger than,” “next to,” and “on top of,” and it more reliably includes elements specified in prompts, such as a stream running through the center of a Roman triumphal arch. Earlier Midjourney versions often captured the general subject while dropping relational or structural specifics; v6 narrows that gap, which is why the results feel less like an approximation and more like a faithful rendering of the prompt’s layout.

The workflow around v6 also matters. After generating an image, upscaling can shift the look from “glossy AI” toward more lifelike texture and lighting. The transcript highlights Magnific AI as a quick drag-and-drop tool for upscaling Midjourney v6 (and DALL·E 3) images in under a minute, producing a more photo-real finish with minimal parameter tweaking. Another prompt tactic is to avoid generic “photo realism” or “4K” language and instead use “--style raw,” which the transcript claims nudges Midjourney away from stylized output and toward more realistic rendering even before upscaling. Text rendering remains a tradeoff: Gemini 3 is described as better at text, while it tends to avoid “photo realism,” possibly because OpenAI wants clearer signals that images are AI-generated and uses watermarking.

Beyond image quality, the transcript ties the holiday AI moment to a broader acceleration in healthspan and compute. OpenAI CEO Sam Altman is cited as investing $180 million in Retro, a longevity company focused on adding roughly “10 good years” rather than immortality. Retro’s core approach is framed as partial cell reprogramming: introducing transcription factor genes that push mature cells toward a more youthful state without changing their type or function or converting them into stem cells. The transcript notes that some researchers view age reversal as increasingly an engineering problem, but it also flags skepticism—earlier longevity efforts have struggled to show clear wins, and PubMed language quoted in the transcript suggests biological immortality remains unproven.

Altman’s public remarks are also used to spotlight a shift in tone across big tech longevity: talk of “imminence” and the 2030s as a potentially revolutionary decade is becoming more common. At the same time, an OpenAI employee’s counterpoint is included: chasing immortality could undermine the ecological benefits of death and renewal, and the pursuit of stability may keep damaged systems alive longer than they should.

Finally, the transcript connects momentum in longevity and generative media to who controls compute at scale. A report from SemiAnalysis is cited claiming Google is training “Gemini 2” before “Gemini 1 Ultra” is released and that Google’s compute buildout—especially TPU v5 capacity—will outstrip OpenAI’s near-term resources. The implication is that scale and training influence capacity could determine who leads across photo realism, multimodal models, and the next wave of AI capabilities in 2024 and beyond.

Cornell Notes

Midjourney v6 improves prompt adherence, especially for spatial and relational details, making generated images more faithful to instructions and closer to photo realism. Upscaling tools like Magnific AI and prompt tweaks like “--style raw” can further reduce the “glossy AI” look. The transcript then pivots to longevity, citing Sam Altman’s $180 million investment in Retro and describing partial cell reprogramming via transcription factor genes as a route to adding years of healthy life. It also notes a growing industry shift toward “imminent” healthspan breakthroughs and highlights internal debate about whether extending life indefinitely is desirable. Compute scale is framed as a key driver, with claims that Google’s TPU v5 buildout could outpace OpenAI’s training capacity for models like Gemini 2.

What specific change in Midjourney v6 makes it feel more reliable than Midjourney 5.2?

Midjourney v6 is described as adhering to prompts more sensitively, particularly preserving relational and structural details. Examples include keeping elements consistent with instructions like “something being bigger than,” “next to,” or “on top of” other objects, and retaining prompt components such as a stream running through the center of a Roman triumphal arch. Midjourney 5 is said to capture the main topic but often misses these spatial relationships and internal elements.

How do upscaling and prompt wording influence the move toward photo realism?

The transcript highlights Magnific AI as a fast upscaling step that turns Midjourney v6 (and DALL·E 3) outputs into more lifelike, photo-real images in under a minute, with minimal settings changes. It also recommends using “--style raw” instead of adding words like “photo realism” or “4K” to the prompt; this is claimed to shift outputs away from glossy, clearly AI-styled rendering toward more realistic results even before upscaling.

Why does the transcript suggest Gemini 3 may avoid “photo realism”?

Gemini 3 is described as better at text, but it tends to avoid photo realism. The transcript’s guess is that OpenAI wants users to know when an image is AI-generated and relies on watermarking, which may discourage outputs that fully blend into real photographs.

What is Retro’s longevity strategy as described here, and what does it aim to achieve?

Retro is framed as pursuing partial cell reprogramming using transcription factor genes. The goal is to reprogram mature cells toward a more youthful state without changing their type or function and without turning them into stem cells—compared to changing code without changing what the program does. The transcript emphasizes that Retro’s aim is adding about 10 good years of life rather than living forever.

What compute-scale claim is made about Google versus OpenAI, and why does it matter?

A SemiAnalysis report is cited claiming Google is training Gemini 2 ahead of Gemini 1 Ultra and that Google’s compute resources will outstrip OpenAI’s near-term. The transcript asserts Google will have more TPU v5s, while OpenAI’s compute is a fraction of Google’s, implying Google’s training influence capacity could be larger—potentially affecting leadership in photo realism and next-generation model performance.

Review Questions

  1. How does Midjourney v6 handle relational details differently from Midjourney 5.2, and why does that matter for prompt-based image editing?
  2. What does partial cell reprogramming aim to change in Retro’s approach, and what does the transcript say about the evidence for “age reversal” being imminent?
  3. According to the transcript’s compute comparison, what role do TPU v5 and H100-class GPUs play in predicting who will lead next in model training?

Key Points

  1. 1

    Midjourney v6 improves prompt adherence by preserving spatial relationships and internal structural details that earlier versions often missed.

  2. 2

    Upscaling can materially change the perceived realism of Midjourney outputs; Magnific AI is presented as a fast, low-friction option for that step.

  3. 3

    Using “--style raw” in prompts is suggested as a more effective path to realism than adding “photo realism” or “4K” keywords.

  4. 4

    Sam Altman’s $180 million investment in Retro is tied to a healthspan goal of adding roughly 10 good years, not immortality.

  5. 5

    Retro’s approach centers on partial cell reprogramming via transcription factor genes to rejuvenate mature cells without changing their type or function.

  6. 6

    Longevity discourse is shifting toward talk of imminence and the 2030s as a potentially revolutionary decade, alongside internal debate about the desirability of indefinite life extension.

  7. 7

    Compute scale is framed as a decisive advantage, with claims that Google’s TPU v5 buildout could outpace OpenAI’s near-term training capacity for models like Gemini 2.

Highlights

Midjourney v6 is credited with better preserving “bigger than/next to/on top of” relationships and internal prompt elements like a stream through an arch—details that Midjourney 5 often dropped.
Magnific AI is described as turning glossy AI renders into more lifelike, photo-real images in under a minute using simple drag-and-drop upscaling.
Retro’s longevity thesis is partial cell reprogramming: transcription factor genes push mature cells toward youth without converting them into stem cells.
The transcript links model leadership to compute scale, citing claims that Google’s TPU v5 capacity will outstrip OpenAI’s resources for training Gemini 2.

Topics

  • Midjourney v6
  • Prompt Adherence
  • Healthspan Longevity
  • Partial Cell Reprogramming
  • Gemini 2 Compute

Mentioned