DALL-E 2 FREE & FREE TRIAL Alternatives! Best Text to Image AI & Midjourney is PUBLIC!
Based on MattVidPro's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Midjourney’s outputs can be used as an input structure for DALL·E 2 variations to improve coherence and detail finishing.
Briefing
Text-to-image AI access is widening fast, and a handful of tools are now either free, free-trial, or effectively public—making it possible to get high-quality results without paying for the most famous names. The standout theme is that “best” depends less on raw image quality and more on what each model does well: Midjourney’s artistic color and composition, DALL·E 2’s coherence and upscaling, and several open/free alternatives that can produce surprisingly usable images (and sometimes near-photographic text).
A key practical insight comes from comparing Midjourney and DALL·E 2. Midjourney can generate strong outlines and creative “taste,” but it can fall short on coherence—especially when details don’t fully lock together. DALL·E 2 can then take Midjourney’s rough structure and enhance it into a more finished, higher-quality result. Examples include a cute creature in an alleyway where DALL·E 2 adds clarity, and a frog image where Midjourney produced a deformed frog but DALL·E 2 recognized it as a frog and remade it in higher quality. The takeaway: using Midjourney for style and composition, then DALL·E 2 for refinement, can outperform relying on either model alone.
From there, the list ranks nine text-to-image options, mostly emphasizing free or public access. NightCafe is positioned as a strong VQGAN-based option with high coherence, easy controls, high-resolution outputs, and a community/discovery angle via Discord—though it’s placed near the bottom of the list because Midjourney is still viewed as better overall. Crayon (formerly DALL·E mini) is described as fast and easy, with improving updates, but it’s also flagged for long wait times and server access issues after popularity surged; it also lacks strict content filtering, which can lead to “tasteless” outputs.
Latin diffusion is highlighted as completely free and unusually strong at generating text—so strong that it’s claimed to be better than most competitors for typography, while still producing coherent art overall. It does include an NSFW filter, described as stricter than Crayon but looser than DALL·E 2. Another free route is a DALL·E flow variant that runs in Google Colab, producing DALL·E 2–sized, highly upscaled images; it’s described as more coherent than Crayon and potentially with minimal content filtering. Mind’s Eye beta (from multimodal art) also starts in Google Colab but moves into a dedicated interface with options like Latin diffusion, VQGAN, and upscaling methods, plus the ability to run locally.
Midjourney itself is treated as the major creative benchmark: it’s moved into open beta with a free trial, uses a Discord-based workflow (including /imagine prompts and aspect-ratio commands), and generates four images per prompt with upscaling/variation options. ShonenCove AI is placed above Midjourney in the tiering, described as a 12 billion parameter model with extremely coherent, camera-like results and no content filter—but it requires translating prompts into Russian and runs through Discord with a limited daily prompt allowance.
At the top end sits DALL·E 2, still framed as the “holy grail” for public access, with OpenAI reportedly working on pricing before broader release. Finally, Google’s “party” (spelled as “party” in the transcript) is presented as a massive 20 billion parameter system that produces clear, well-formed text and high-resolution images with few failures—though release timing remains uncertain. Overall, the practical message is clear: free access is now good enough to experiment widely, and combining tools (especially Midjourney for style plus DALL·E 2 for coherence) can yield the most reliable results.
Cornell Notes
The transcript lays out a ranked set of text-to-image AI tools that are free, free-trial, or publicly accessible, emphasizing that “best” depends on what you need—style, coherence, or text accuracy. A major practical point is the workflow of using Midjourney to generate the initial artistic outline, then feeding that into DALL·E 2 to enhance coherence and finish details. Several alternatives are highlighted for specific strengths: Latin diffusion for unusually strong text generation, NightCafe for high-coherence VQGAN outputs, and Google Colab-based DALL·E flow for DALL·E 2–sized upscaling. Midjourney is described as moving into open beta with a free trial and a Discord-based prompt workflow, while ShonenCove AI is positioned as highly coherent but requires Russian prompt translation and runs via Discord. DALL·E 2 and Google’s large-parameter model remain the top targets, but access and release timing differ.
Why does combining Midjourney and DALL·E 2 often produce better results than using either alone?
Which free option is singled out for text quality, and what tradeoffs come with it?
What makes Crayon (formerly DALL·E mini) appealing, and why might someone skip it?
How does Midjourney’s workflow work in practice, according to the transcript?
What special requirement does ShonenCove AI impose, and how does that affect usability?
What’s the main difference between the Colab-based DALL·E flow and other tools on the list?
Review Questions
- If Midjourney sometimes struggles with coherence, what specific role does DALL·E 2 play in the workflow described?
- Which tool is claimed to be especially strong at generating text, and how does its NSFW filtering compare to DALL·E 2 and Crayon?
- What operational or input constraints (wait times, translation requirements, prompt limits) affect access to Crayon, ShonenCove AI, and Midjourney?
Key Points
- 1
Midjourney’s outputs can be used as an input structure for DALL·E 2 variations to improve coherence and detail finishing.
- 2
DALL·E 2 is presented as the “coherence/upscaling” upgrade, while Midjourney is credited for artistic taste and color variety.
- 3
NightCafe is a VQGAN-based free option with high coherence, easy controls, and community sharing, but it’s ranked below Midjourney.
- 4
Crayon is fast and easy but can be hard to access due to long wait times and server issues after rapid popularity growth.
- 5
Latin diffusion is completely free and especially strong at generating text, with an NSFW filter described as stricter than Crayon but looser than DALL·E 2.
- 6
DALL·E flow (Google Colab) and Mind’s Eye beta (multimodal art) provide free experimentation paths with upscaling and interface options, at the cost of setup time.
- 7
ShonenCove AI is described as extremely coherent with no content filter, but it requires translating prompts into Russian and limits usage to about 10 prompts per day.