OpenAI DevDay 2025 - What Hit What Missed

TL;DR

OpenAI announced “apps inside ChatGPT” with an apps SDK in preview and a later ChatGPT app-store rollout, enabling third-party functionality directly within chat.

Briefing Cornell Notes

Briefing

OpenAI’s DevDay 2025 keynote centered on four practical product moves: apps inside ChatGPT, a new Agent Kit for building agentic systems, Codex becoming generally available, and fresh API/model updates—especially for video and real-time voice. The biggest “why it matters” thread running through all of them is speed-to-build: fewer barriers for turning ideas into working AI experiences, whether that’s a workflow app, an internal agent, or a voice-enabled application.

First came “apps inside ChatGPT,” positioned as an apps SDK in preview, with a ChatGPT app-store rollout later. The pitch is straightforward: developers can build app experiences that run within ChatGPT, letting users trigger third-party capabilities without leaving the chat. Examples included using Canva to convert an outline into a deck and using Coursera to watch a course inside ChatGPT and then chat with the video content. The unresolved question is discovery—how a new app gets found and adopted when users can already access established partners like Canva, Expedia, and Zillow through their own channels and generic search.

Second, OpenAI introduced Agent Kit, a bundled toolkit for building, deploying, and optimizing agents. It includes Agent Builder, a drag-and-drop visual canvas for assembling agent workflows, plus a connector registry intended to standardize how tools and services connect—potentially including third-party connectors and MCPs. Agent Builder drew the most attention, with demos that leaned toward workflow-style agent apps but also suggested the system can support more complex, recursive agent behavior. The target audience appears to be teams that want agentic functionality without a dedicated AI engineering staff, enabling internal sharing of agent experiences. Notably, the discussion didn’t emphasize A2A communication with other agents or multi-agent systems, though the connector registry hints at broader integration.

OpenAI also paired Agent Kit with additional “guardrails” and agent evaluation (evals) support, aiming to reduce the pain of testing and measuring agent behavior—an area where many teams struggle with reliability and iteration.

Third, Codex was announced as generally available, with usage momentum highlighted: daily usage reportedly grew 10x since early August, and the first three weeks reportedly exceeded 40 trillion tokens. The emphasis here was less on flashy new features and more on scaling and expanding deployment, including integrations such as Codex in Slack and upcoming admin tooling. A demo also encouraged voice-based usage.

Finally, API updates focused on Sora 2 and real-time voice. Sora 2 is coming via API with controls like aspect ratio and video length, plus the ability to inject images as context. Pricing was framed as a lever for market competition: basic Sora at 10 cents per second, and Sora 2 Pro at 30–50 cents per second depending on resolution. OpenAI also announced a new “GPT real-time mini,” described as 70% cheaper, with lower costs for both audio input and audio output. Combined with falling image costs, the economics look more favorable for real-time voice applications that rely on function calling and related capabilities.

Overall, the keynote landed as a more polished rollout than earlier launches, with a clear emphasis on developer tooling, integration, and cost reductions that could make agentic and real-time experiences easier to build and deploy.

Cornell Notes

OpenAI’s DevDay 2025 keynote laid out four product directions: apps inside ChatGPT (with an apps SDK preview), Agent Kit for building and optimizing agents, Codex moving to general availability, and new API/model updates for video and real-time voice. Apps inside ChatGPT aims to let third-party services run directly in the chat experience, but adoption may hinge on discovery against major partners. Agent Kit’s centerpiece is Agent Builder, a drag-and-drop interface for assembling agent workflows, supported by a connector registry and added guardrails/evals tooling. Codex’s general availability was backed by strong usage growth and massive token throughput. On the API side, Sora 2 and a cheaper GPT real-time mini lower the cost of producing video and building voice-first apps, making real-time experiences more feasible.

What does “apps inside ChatGPT” change for developers and users, and what risk remains?

It introduces an apps SDK in preview, with a ChatGPT app-store rollout later, so developers can build app experiences that run within ChatGPT. Users can trigger capabilities without leaving the chat—examples included turning an outline into a deck via Canva and watching Coursera content inside ChatGPT and then chatting with the video. The main adoption risk raised is discovery: new apps may struggle to surface against established partners like Canva, Expedia, and Zillow when users rely on generic search.

How does Agent Kit aim to make agent building easier than existing approaches?

Agent Kit bundles tools for building, deploying, and optimizing agents. Agent Builder provides a drag-and-drop UI for assembling agent workflows, targeting teams that want agentic functionality without a large AI engineering staff. A connector registry standardizes how components connect, including pre-made third-party connectors and MCPs. Additional guardrails and agent eval tooling are meant to reduce reliability and testing friction.

What’s notable about Agent Builder’s capabilities based on the demos and commentary?

The examples emphasized workflow-style agent apps, but the system also appears capable of more advanced behavior such as recursive agents. The keynote framing suggested the fastest path is building agentic experiences for internal use and sharing, rather than immediately focusing on multi-agent coordination.

What metrics were used to signal Codex momentum, and what was the tone of the Codex update?

Codex’s general availability came with usage signals: daily usage reportedly grew 10x since early August, and within the first three weeks it processed over 40 trillion tokens. The update felt more like a scaling and investment signal—plus integrations like Codex in Slack and upcoming admin tools—than a list of major new features. A demo also highlighted voice usage.

Why could the Sora 2 and GPT real-time mini pricing matter for real-time applications?

Sora 2 via API adds practical controls (aspect ratio, video length) and image context injection, with pricing framed as competitive: basic Sora at 10 cents per second and Sora 2 Pro at 30–50 cents per second depending on resolution. Separately, GPT real-time mini is described as 70% cheaper, including lower costs for audio input and audio output. Together with cheaper image costs, the economics improve for voice-first, function-calling real-time apps.

Review Questions

Which parts of Agent Kit are intended to reduce integration and reliability friction, and how do they work together?
What discovery challenge could limit adoption of ChatGPT in-app apps, even if the apps SDK is available?
How do the announced pricing structures for Sora 2 and GPT real-time mini influence the feasibility of real-time voice and video applications?

Key Points

1
OpenAI announced “apps inside ChatGPT” with an apps SDK in preview and a later ChatGPT app-store rollout, enabling third-party functionality directly within chat.
2
Discovery may become the bottleneck for new ChatGPT apps because users can already find major partner services through established channels and generic search.
3
Agent Kit’s Agent Builder offers a drag-and-drop way to assemble agent workflows, aiming to help teams without dedicated AI engineering resources.
4
A connector registry is designed to standardize integrations, including pre-made third-party connectors and MCPs, even though multi-agent/A2A capabilities weren’t emphasized.
5
Agent Kit adds guardrails and agent eval support to address common reliability and testing challenges.
6
Codex moved to general availability with reported 10x daily usage growth since early August and over 40 trillion tokens processed in the first three weeks.
7
API updates include Sora 2 with aspect ratio, video length, and image context controls, plus a cheaper GPT real-time mini that lowers audio input/output costs for real-time voice apps.

Highlights

Apps inside ChatGPT are rolling out via an apps SDK preview, but the biggest practical hurdle is likely discovery against entrenched partners like Canva, Expedia, and Zillow.

Agent Builder’s drag-and-drop canvas is positioned as a shortcut for teams that want agentic experiences without building everything from scratch.

Codex’s general availability was backed by scale signals: 10x daily usage growth since early August and 40 trillion tokens in the first three weeks.

Sora 2 API pricing and the 70% cheaper GPT real-time mini both point to a cost-driven push for more real-time video and voice applications. 

Topics

ChatGPT Apps SDK
Agent Kit
Agent Builder
Codex General Availability
Sora 2 API
GPT Real-time mini
Agent Evals
Connector Registry