Google Dropped Lyria 3 AI Music but Sunauto v3 STOLE the show!
Based on MattVidPro's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
Lyria 3 in Gemini can generate only up to 30 seconds and blocks requests for specific artists or artist-specific sounds.
Briefing
Google’s Lyria 3 (spelled “LIIA/Lyria 3” in the transcript) lands inside Gemini with a clear tradeoff: it’s a polished, safety-locked music generator that can produce only up to 30 seconds, but it brings a notable “watermarked” output that’s detectable by computers as AI-made. In Gemini, Lyria 3 delivers catchy, genre-flexible tracks—90s rap, Latin pop, and eight-bit metal—while refusing requests for specific artists or artist-specific sounds. The model’s 30-second limit also shapes expectations: prompts can yield something that sounds professional and clinical, yet it’s inherently built for short, gimmicky results rather than full-length songwriting.
The broader Gemini push matters because the transcript ties Lyria 3’s release to Gemini 3.1 Pro, described as a significant intelligence upgrade with measurable benchmark gains. The standout capability highlighted for Gemini 3.1 Pro is its ability to generate SVGs—illustrated with an Xbox controller rendered entirely via code—plus a claim that it produces far more detailed 3D-style outputs than the prior Gemini 3 Pro. That matters to music users because it signals tighter multimodal tooling around the same ecosystem: music generation in Gemini is paired with stronger general creative generation elsewhere in the platform.
Still, the transcript’s main comparison is less about Google’s model and more about what competitors can do with longer, more permissive generation. Suno v3 is positioned as the better overall music engine, with the creator arguing it can do “way more” than Lyria 3. The biggest disruption comes from Tsunado AI’s upcoming V3 preview, which the transcript frames as unusually unrestricted. In that preview, the model can be steered with tags that include direct artist inputs—explicitly demonstrated with Juice Wrld—while producing voice-like results that the creator claims are highly accurate. The transcript also notes a legal gray area: U.S. copyright law would likely treat profit-making from voice likeness as problematic, while Tsunado’s terms of service reportedly push responsibility onto users (“don’t generate anything that infringes… and then go trying to make money”).
Beyond artist mimicry, Tsunado V3 is shown generating long-form-feeling tracks from simple prompts: a “eat the periodic table” rock/rap concept, style swaps that mimic Queen, Eminem, Taylor Swift, and Michael Jackson, and even genre mashups like country ballads about herding jellyfish or arena-rock lyrics that drift into trap-like rhythms. The transcript repeatedly returns to a practical point: each model has strengths and weaknesses tied to how it handles tags, tokens, and style nuance. Lyria 3 is praised for polish and safety, but limited by length and restrictions; Tsunado V3 is praised for breadth and creative freedom, but sometimes produces muddier audio or quality drop-offs when the prompt demands complex rendering.
Overall, the transcript paints AI music as a fast-moving marketplace where “best” depends on the job: short, clean, safety-locked tracks in Gemini versus more permissive, tag-driven generation elsewhere—plus a reminder that the legal and quality tradeoffs are still evolving.
Cornell Notes
Lyria 3 in Gemini is presented as a polished AI music generator that outputs only up to 30 seconds and blocks requests for specific artists or artist-specific sounds. A key differentiator is a computer-detectable watermark, meant to signal AI origin even when humans can’t easily notice it. The transcript then contrasts this with Suno v3 and Tsunado AI’s upcoming V3 preview, which is described as far less restricted and able to follow artist tags (e.g., Juice Wrld) to produce voice-like results. Tsunado V3 is also shown handling wide genre swings—rap, pop, metal, country, and mashups—though it can occasionally sound muddled or distort when prompts get complex. The practical takeaway: model choice hinges on desired freedom, length, and how well tags map to the right musical “tokens.”
What are the most important constraints of Lyria 3 in Gemini?
Why does the transcript connect Gemini 3.1 Pro to the music discussion?
How does Tsunado AI’s V3 preview differ from Lyria 3 in creative freedom?
What legal and policy tension appears when using artist voice likeness in AI music?
Why do the results vary even when prompts seem similar?
What practical workflow lesson emerges from the comparisons?
Review Questions
- Which two limitations most define Lyria 3’s usefulness according to the transcript, and how do they shape the kinds of songs it’s best at?
- What evidence is given that Tsunado V3 preview can follow artist tags, and what risks does the transcript associate with that capability?
- How do tag choice and token mapping explain why one model might nail a genre while another produces muddier results?
Key Points
- 1
Lyria 3 in Gemini can generate only up to 30 seconds and blocks requests for specific artists or artist-specific sounds.
- 2
Lyria 3 outputs include a computer-detectable watermark indicating AI origin, even when humans may not notice it.
- 3
Gemini 3.1 Pro is highlighted for broader creative capability, including SVG generation and improved detail compared with Gemini 3 Pro.
- 4
Suno v3 is positioned as offering more overall capability than Lyria 3, while Tsunado AI’s V3 preview is described as far less restricted.
- 5
Tsunado V3 preview is demonstrated using artist tags (including Juice Wrld) to produce voice-like results, raising legal and monetization concerns.
- 6
The transcript emphasizes that each model’s tag/token system affects quality, so “best” depends on the style and the prompt strategy.