
AI Recap: New Models, Jailbreaks, and Future Tech!

MattVidPro · 5 min read

Based on MattVidPro's video on YouTube. If you like this content, support the original creator by watching, liking, and subscribing.

TL;DR

A prominent jailbreak researcher reportedly jailbroke OpenAI’s deep research model quickly, with prompts that could elicit dangerous, harmful outputs.

Briefing

AI safety and access are colliding with speed: OpenAI’s new “deep research” model was quickly jailbroken by a well-known jailbreak researcher, with prompts that can drive it toward highly dangerous instructions such as drug synthesis and other harmful activities. That development raises a sharp question: why should heavily safety-tuned, closed models be treated as meaningfully safer when jailbreak prompts can be shared publicly and reused? It also reframes the competitive landscape: closed models may be powerful, but their behavior can be rapidly replicated in practice, not just in code.

The same rapid-fire pattern shows up across the open-source ecosystem. Hugging Face released an open “deep research” competitor that can autonomously browse the web, search, download and manipulate files, and run calculations — built in roughly 24 hours as a replication of OpenAI’s deep research. On the GAIA benchmark, OpenAI’s deep research hit 67% accuracy while the open-source version landed at 55%, but the open approach matters: it can be modified and improved without waiting for a closed vendor’s next release cycle. The broader takeaway is that the open-source field can iterate fast enough to narrow gaps, often without requiring the same jailbreak workarounds.

Beyond research agents, multiple product updates point to AI becoming more “interactive” and less “chat-only.” Hugging Face added AI-based search to its site, letting users describe goals and context to find relevant Spaces from a catalog of 400,000+. Replit launched a mobile app aimed at generating personalized mini-apps on demand — “make an app for that” — positioning generative UI as a future interface layer rather than a library of fixed screens. OpenAI’s o3-mini series also made chain-of-thought reasoning more visible in a readable, optionally translatable form, continuing a trend toward exposing more of the model’s internal reasoning style.

Video generation and real-time media manipulation are accelerating just as quickly. Meta’s VideoJAM research emphasizes improved body deformation, motion coherence, and physics handling, with examples showing better stability on hard prompts like juggling, hooping, and figure skating. ByteDance’s OmniHuman animates a person from an image plus audio, producing surprisingly lifelike motion and lighting; other demos show object insertion that preserves the original footage while adding new elements with realistic lighting (Pikadditions), plus dynamic video effects that can transform scenes dramatically (DynVFX). Separate work also pushes toward real-time avatar streaming: an AI-generated talking head can sync to voice and stream into Zoom-like calls, with the main tell being the absence of occasional human micro-actions.

Meanwhile, speech and multimodal models keep expanding. Play AI’s “Dialogue 1.0” claims ultra-emotional text-to-speech with low latency and strong benchmarks against ElevenLabs. Google rolled out new Gemini 2.0 options (2.0 Flash and 2.0 Flash Thinking), while additional open-source speech-to-speech translation work aims to run directly on phones. Taken together, the throughline is clear: AI capability is rising, but so is the speed at which workarounds, replications, and new interfaces spread, making “hard launches” feel less like a single release and more like a continuous, fast-moving arms race.

Cornell Notes

The transcript highlights how quickly advanced AI capabilities — especially “deep research” agents and video generation — are being matched, replicated, and sometimes bypassed. OpenAI’s deep research model was reportedly jailbroken by a prominent jailbreak researcher, including prompts that could enable harmful wrongdoing, raising doubts about closed, safety-tuned models’ resilience. In parallel, Hugging Face released an open-source deep research competitor that can browse, search, download files, and run calculations, achieving 55% on the GAIA benchmark versus OpenAI’s 67% — with the advantage that it can be modified and improved faster. The same rapid iteration shows up in product updates like AI search on Hugging Face, mobile app generation on Replit, and new reasoning visibility features in OpenAI’s o3-mini series. Video and speech systems are also advancing toward more realistic, interactive, and even real-time experiences.

Why does the jailbreaking of a “deep research” model matter beyond one specific exploit?

It suggests that safety-tuned, closed models can be made to produce disallowed outputs through reusable prompt-based jailbreaks. The transcript frames this as a practical access problem: if jailbreak prompts are publicly shared and can be applied to new models quickly, then “closed” access doesn’t necessarily translate into durable safety. It also shifts the competitive focus—closed models may be powerful, but open-source and community replication can catch up quickly, reducing the advantage of secrecy.

What capabilities define the open-source “deep research” agent described from Hugging Face?

The agent is presented as able to navigate the web autonomously, scroll and search through pages, download and manipulate files, and run calculations on data. Performance is compared on the GAIA benchmark: OpenAI’s deep research scores 67% accuracy versus 55% for the open-source agent. The transcript emphasizes that openness enables modification and improvement without waiting for a vendor update.

How does Hugging Face’s updated website search change the user workflow?

Instead of keyword searching, users can describe what they’re trying to do along with their context. The system then searches across Hugging Face Spaces—over 400,000—to find relevant projects. A demo is mentioned where an image-to-3D (and potentially image-to-video) task pulls up suitable Spaces based on the input and goal.

What’s the significance of making chain-of-thought more visible in OpenAI’s o3-mini models?

The transcript says chain-of-thought is now visible in a readable, organized form rather than raw internal reasoning, with optional translation. It also notes that users can already see reasoning behavior in ChatGPT and that the chain-of-thought style appears similar to DeepSeek R1 examples, implying alignment in how reasoning is presented. The practical impact is that users can inspect the model’s intermediate reasoning style more directly.

Which video-generation advances are emphasized as solving long-standing hard problems?

Meta’s VideoJAM is highlighted for improved body deformation, more coherent motion, and better physics stability — especially on difficult prompts like juggling, hooping, figure skating, and candle-blowing. The transcript contrasts VideoJAM’s stability with Sora’s failures on those same kinds of tasks. Other systems add realism through audio-driven animation (OmniHuman) and instant object insertion with lighting consistency (Pikadditions).

What does “real-time” AI avatar streaming imply for everyday communication tools?

The transcript describes an AI-generated avatar that streams into Zoom-like calls and moves in sync with voice, including facial motion like eyebrow movement. The main limitation mentioned is that over long sessions humans do micro-behaviors (itching, adjusting hair/glasses), which can reveal the non-human nature if the avatar doesn’t replicate those actions convincingly.

Review Questions

  1. How does openness (open-source) change the way performance gaps between deep research agents are likely to evolve over time?
  2. What kinds of behaviors in video generation are presented as the hardest to replicate, and how does VideoJAM claim to address them?
  3. Why might chain-of-thought visibility affect user trust or debugging compared with purely final answers?

Key Points

  1. A prominent jailbreak researcher reportedly jailbroke OpenAI’s deep research model quickly, with prompts that could elicit dangerous, harmful outputs.

  2. Hugging Face released an open-source deep research agent that can browse, search, download/manipulate files, and run calculations, achieving 55% on the GAIA benchmark versus 67% for OpenAI’s deep research.

  3. Open-source deep research can be modified and improved by the community, potentially closing benchmark gaps faster than closed, vendor-controlled iterations.

  4. Hugging Face added AI-based search that uses user goals and context to find relevant Spaces from a catalog of 400,000+ projects.

  5. Replit’s new mobile app aims to generate personalized mini-apps on demand, pushing toward generative, on-the-fly user interfaces.

  6. OpenAI’s o3-mini models make chain-of-thought more readable and optionally translatable, continuing a trend toward exposing reasoning traces.

  7. Video generation is moving toward more stable physics and motion (VideoJAM), audio-driven animation (OmniHuman), and instant object insertion with realistic lighting (Pikadditions).

Highlights

OpenAI’s deep research model was reportedly jailbroken quickly, with examples reaching into harmful instruction territory—undercutting assumptions that closed, safety-tuned models are inherently resilient.
Hugging Face’s open deep research agent matches many “agentic” behaviors (web navigation, file handling, calculations) and can be improved by anyone, even if it trails on benchmarks.
VideoJAM’s pitch centers on fixing physics and motion coherence for hard prompts like juggling and hooping, where other systems struggle.
Real-time avatar streaming into Zoom-like calls is presented as nearly indistinguishable over short periods, with the absence of human micro-actions as the main tell.
