Midjourney Surpasses DALL-E 2 - Incredible Midjourney V4 Upgrade
Based on MattVidPro's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Midjourney V4 is presented as a substantial improvement over Midjourney V3 in prompt coherence and background integration.
Briefing
Midjourney V4’s public release is being treated as a real turning point: with the same prompts used to benchmark earlier models, V4 produces more coherent, more “art-directed” images than Midjourney V3 and often matches or beats DALL·E 2 on detail and prompt fidelity. The practical takeaway is simple—people who previously chose DALL·E 2 for consistency and Midjourney for style now have a stronger case that Midjourney V4 can deliver both, at least across a range of common prompt types.
A core comparison centers on a straightforward test: “penguin in Venice.” Midjourney V3 already renders the scene with recognizable elements, but V4 adds noticeably sharper structure and clearer environmental cues (water, boats, buildings, and color palette) while keeping the subject coherent. DALL·E 2, by contrast, is described as leaning toward photo realism: its images can look more like actual photographs, though they interpret the same prompt differently. The result is framed as subjective: DALL·E 2 may win for “photographic” output, while Midjourney V4 is credited with higher artistic coherence and stronger background integration.
The transcript then shifts from one-off tests to a broader “through-the-wringer” set of prompts. A “lemon wearing sunglasses relaxing on the beach” prompt is used to show that Midjourney V4 can generate more cohesive character styling and background integration than Midjourney V3, while DALL·E 2’s outputs are described as more photographic but sometimes less detailed or more “mushy.” For a “character concept old Mage warlock” prompt, Midjourney V4 is portrayed as adding richer character detail from a short prompt (more expressive faces, stronger visual identity, and clearer magical elements), where DALL·E 2 is characterized as producing simpler, clip-art-like results unless more prompt work is done.
Content-policy differences also enter the comparison. When generating a detailed portrait depicting Walter White from Breaking Bad, DALL·E 2 is said to be constrained by policy, leading to vague resemblance, while Midjourney is described as producing a more recognizable likeness (wrinkles, hair color, and overall facial features). Another OpenAI-style benchmark—an armchair in the shape of an avocado—serves as a creativity and texture test, where Midjourney is credited with more detailed, layered avocado skin and pit features, and with upscaled results that retain the concept more convincingly.
Across more complex scenes, such as a Shih Tzu puppy dressed as a pirate sailing on a pirate ship, an iPhone selfie of Bigfoot and the Loch Ness monster playing video games, and various logo-style prompts, Midjourney V4 is repeatedly described as producing clearer faces, better environmental detail, and more consistent concept execution. The Bigfoot “iPhone selfie” prompt is a notable exception where neither model performs perfectly, but Midjourney is still framed as producing more coherent results overall.
By the end, the verdict is blunt: Midjourney V4 is portrayed as head-to-head with DALL·E 2, with the added argument that Midjourney’s improvements over V3 are large enough to justify switching or at least testing again. Cost is also mentioned as a factor in the broader “fight” between the two systems, with Midjourney positioned as the cheaper option while closing the quality gap.
Cornell Notes
Midjourney V4’s release is presented as a major quality jump over Midjourney V3, especially for prompt coherence and artistic detail. Using repeated benchmarks with the same prompts, V4 is credited with clearer subject-background integration (e.g., “penguin in Venice”) and richer character concepts (e.g., “old Mage warlock”) even from short prompts. DALL·E 2 is still described as strong for photo-realistic output, but it’s portrayed as less detailed or more “mushy” in several side-by-side tests. Content policy constraints are also highlighted: DALL·E 2 is said to struggle with generating a Walter White portrait, while Midjourney produces a more recognizable likeness. Overall, the comparison lands on Midjourney V4 being competitive with DALL·E 2, with cost mentioned as an extra advantage.
What benchmark prompt best illustrates the coherence jump from Midjourney V3 to V4?
How does the comparison treat “artistic coherence” versus “photo realism”?
Why does the Walter White test produce a different outcome for DALL·E 2 and Midjourney?
What does the “armchair in the shape of an avocado” prompt reveal about creativity and texture?
How do the models compare on short prompt character concepts like “old Mage warlock”?
Which prompt category is used to test logo-style outputs?
Review Questions
- In the “penguin in Venice” comparison, what specific visual elements are cited as improving from V3 to V4?
- How does content policy influence the Walter White portrait results, and why does that matter for interpreting benchmark outcomes?
- Across the lemon and warlock prompts, what pattern suggests Midjourney V4’s strengths with short prompts?
Key Points
1. Midjourney V4 is presented as a substantial improvement over Midjourney V3 in prompt coherence and background integration.
2. The “penguin in Venice” test highlights V4’s clearer environmental structure (water, boats, buildings) while keeping the subject consistent.
3. DALL·E 2 is repeatedly characterized as stronger for photo-realistic output, but sometimes less detailed or less cohesive in side-by-side comparisons.
4. Short, concept-heavy prompts (like “old Mage warlock”) tend to produce richer, more characterful results in Midjourney V4 than in DALL·E 2.
5. Content policy constraints can materially affect benchmark comparisons, illustrated by the Walter White portrait test.
6. Midjourney V4 is described as competitive with DALL·E 2 across multiple prompt types, including concept mashups and logo-style prompts.
7. Cost is mentioned as an additional reason many users may prefer to try Midjourney V4 even if DALL·E 2 remains a strong option.