Get AI summaries of any video or article — Sign up free
Google Dropped Lyria 3 AI Music but Sunauto v3 STOLE the show! thumbnail

Google Dropped Lyria 3 AI Music but Sunauto v3 STOLE the show!

MattVidPro·
5 min read

Based on MattVidPro's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Lyria 3 in Gemini can generate only up to 30 seconds and blocks requests for specific artists or artist-specific sounds.

Briefing

Google’s Lyria 3 (spelled “LIIA/Lyria 3” in the transcript) lands inside Gemini with a clear tradeoff: it’s a polished, safety-locked music generator that can produce only up to 30 seconds, but it brings a notable “watermarked” output that’s detectable by computers as AI-made. In Gemini, Lyria 3 delivers catchy, genre-flexible tracks—90s rap, Latin pop, and eight-bit metal—while refusing requests for specific artists or artist-specific sounds. The model’s 30-second limit also shapes expectations: prompts can yield something that sounds professional and clinical, yet it’s inherently built for short, gimmicky results rather than full-length songwriting.

The broader Gemini push matters because the transcript ties Lyria 3’s release to Gemini 3.1 Pro, described as a significant intelligence upgrade with measurable benchmark gains. The standout capability highlighted for Gemini 3.1 Pro is its ability to generate SVGs—illustrated with an Xbox controller rendered entirely via code—plus a claim that it produces far more detailed 3D-style outputs than the prior Gemini 3 Pro. That matters to music users because it signals tighter multimodal tooling around the same ecosystem: music generation in Gemini is paired with stronger general creative generation elsewhere in the platform.

Still, the transcript’s main comparison is less about Google’s model and more about what competitors can do with longer, more permissive generation. Suno v3 is positioned as the better overall music engine, with the creator arguing it can do “way more” than Lyria 3. The biggest disruption comes from Tsunado AI’s upcoming V3 preview, which the transcript frames as unusually unrestricted. In that preview, the model can be steered with tags that include direct artist inputs—explicitly demonstrated with Juice Wrld—while producing voice-like results that the creator claims are highly accurate. The transcript also notes a legal gray area: U.S. copyright law would likely treat profit-making from voice likeness as problematic, while Tsunado’s terms of service reportedly push responsibility onto users (“don’t generate anything that infringes… and then go trying to make money”).

Beyond artist mimicry, Tsunado V3 is shown generating long-form-feeling tracks from simple prompts: a “eat the periodic table” rock/rap concept, style swaps that mimic Queen, Eminem, Taylor Swift, and Michael Jackson, and even genre mashups like country ballads about herding jellyfish or arena-rock lyrics that drift into trap-like rhythms. The transcript repeatedly returns to a practical point: each model has strengths and weaknesses tied to how it handles tags, tokens, and style nuance. Lyria 3 is praised for polish and safety, but limited by length and restrictions; Tsunado V3 is praised for breadth and creative freedom, but sometimes produces muddier audio or quality drop-offs when the prompt demands complex rendering.

Overall, the transcript paints AI music as a fast-moving marketplace where “best” depends on the job: short, clean, safety-locked tracks in Gemini versus more permissive, tag-driven generation elsewhere—plus a reminder that the legal and quality tradeoffs are still evolving.

Cornell Notes

Lyria 3 in Gemini is presented as a polished AI music generator that outputs only up to 30 seconds and blocks requests for specific artists or artist-specific sounds. A key differentiator is a computer-detectable watermark, meant to signal AI origin even when humans can’t easily notice it. The transcript then contrasts this with Suno v3 and Tsunado AI’s upcoming V3 preview, which is described as far less restricted and able to follow artist tags (e.g., Juice Wrld) to produce voice-like results. Tsunado V3 is also shown handling wide genre swings—rap, pop, metal, country, and mashups—though it can occasionally sound muddled or distort when prompts get complex. The practical takeaway: model choice hinges on desired freedom, length, and how well tags map to the right musical “tokens.”

What are the most important constraints of Lyria 3 in Gemini?

Lyria 3 is limited to generating up to 30 seconds, making it better suited for short, catchy “gimmicky” outputs than full songs. It also enforces safety restrictions: users can’t request specific artists, and they can’t ask for those artists’ specific sounds. Despite the restrictions, the transcript highlights that the output is often polished and hard to distinguish from professional music, and it includes a watermark that computers can detect as AI-generated.

Why does the transcript connect Gemini 3.1 Pro to the music discussion?

The transcript links Lyria 3’s release to Gemini 3.1 Pro, described as a major intelligence upgrade with benchmark improvements. The most concrete capability mentioned is SVG generation—illustrated with a detailed Xbox controller made from code—plus claims of much higher detail in 3D-style outputs compared with Gemini 3 Pro. The implication is that the Gemini ecosystem is strengthening multimodal creative tools alongside music generation.

How does Tsunado AI’s V3 preview differ from Lyria 3 in creative freedom?

Tsunado AI’s V3 preview is portrayed as substantially less restricted. The transcript claims it can accept direct artist tags (including Juice Wrld) and still generate tracks that closely resemble the targeted voice likeness. It’s also described as capable of producing long, concept-driven songs from prompts, not just short clips—contrasting with Lyria 3’s 30-second ceiling.

What legal and policy tension appears when using artist voice likeness in AI music?

The transcript raises U.S. copyright concerns, suggesting profit-making from voice likeness would likely be challenged. It then contrasts that with Tsunado’s terms of service, which reportedly instruct users not to generate infringing content and warns that if users monetize it, responsibility falls on them rather than the company. The transcript treats this as an unresolved long-term risk rather than a settled legality.

Why do the results vary even when prompts seem similar?

The transcript repeatedly points to differences in how models respond to tags and style tokens. It notes that Tsunado can nail certain styles (e.g., when it “adheres” correctly), but sometimes produces muddled or distorted audio. It also suggests that some styles—like epic orchestral stadium rock—may not translate well unless the right tags and token pathways are used, and that model behavior can change before official releases.

What practical workflow lesson emerges from the comparisons?

Model selection should match the intended output. Lyria 3 is framed as best for short, clean, safety-locked tracks inside Gemini. Suno v3 is positioned as stronger overall for broader music creation. Tsunado V3 preview is framed as best when users want maximum creative freedom, including artist-tag steering—while accepting quality variability and legal uncertainty.

Review Questions

  1. Which two limitations most define Lyria 3’s usefulness according to the transcript, and how do they shape the kinds of songs it’s best at?
  2. What evidence is given that Tsunado V3 preview can follow artist tags, and what risks does the transcript associate with that capability?
  3. How do tag choice and token mapping explain why one model might nail a genre while another produces muddier results?

Key Points

  1. 1

    Lyria 3 in Gemini can generate only up to 30 seconds and blocks requests for specific artists or artist-specific sounds.

  2. 2

    Lyria 3 outputs include a computer-detectable watermark indicating AI origin, even when humans may not notice it.

  3. 3

    Gemini 3.1 Pro is highlighted for broader creative capability, including SVG generation and improved detail compared with Gemini 3 Pro.

  4. 4

    Suno v3 is positioned as offering more overall capability than Lyria 3, while Tsunado AI’s V3 preview is described as far less restricted.

  5. 5

    Tsunado V3 preview is demonstrated using artist tags (including Juice Wrld) to produce voice-like results, raising legal and monetization concerns.

  6. 6

    The transcript emphasizes that each model’s tag/token system affects quality, so “best” depends on the style and the prompt strategy.

Highlights

Lyria 3’s outputs are capped at 30 seconds and carry a detectable watermark—polished, but safety-locked and short by design.
Gemini 3.1 Pro’s standout feature in the transcript is SVG generation, paired with claims of much higher detail than Gemini 3 Pro.
Tsunado AI’s V3 preview is portrayed as unusually unrestricted, including direct artist-tag steering (e.g., Juice Wrld), but with unresolved legal risk around voice likeness and profit-making.

Topics

Mentioned