
I Tested ChatGPT's New Overnight Mode—It Changed How I Work

5 min read

Based on AI News & Strategy Daily | Nate B Jones's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Pulse delivers proactive assistance tied to very recent chats, which can shift users toward earlier planning so insights arrive by morning.

Briefing

Overnight Mode is pushing ChatGPT toward a more proactive, workflow-changing future—and it may also open a new, easier path to ads inside the product. OpenAI’s Pulse offers proactive assistance tied to very recent chats (roughly the last day or two), arriving without users explicitly requesting it. In practice, the feature nudges users to plan conversations earlier so Pulse can “work overnight” and deliver insights by morning. That behavioral shift matters because it signals a broader industry move from reactive chat to scheduled, background help.

Pulse also changes the economics of attention. Because Pulse is proactive—an experience users didn’t necessarily ask for—sponsored content becomes simpler to integrate without undermining the “objectivity” of a user-initiated conversation. The transcript points to the likelihood of sponsored cards appearing in the Pulse-style interface, with users able to ignore them. At the same time, OpenAI has opened an ads monetization role for ChatGPT, suggesting Pulse is connected to a larger push on monetization. Beyond near-term revenue mechanics, Pulse is framed as an early widely available consumer example of AI becoming proactive, aligning with a broader expectation that 2026 will bring more proactive systems across major model makers.

The week’s other big thread is how model competition is reshaping enterprise toolchains. Microsoft is diversifying Copilot by adding Anthropic models, with Claude Opus 4.1 highlighted as particularly strong for work tasks like slide decks and spreadsheets. The transcript ties this to OpenAI’s GDPval study, which benchmarks models on economically useful tasks using expert-prepared contexts and blind evaluations. Even though OpenAI sponsored and ran the study, it reportedly found Opus 4.1 outperforming ChatGPT models on those structured, real-world-style tasks—an endorsement that helps explain Microsoft’s move.

Meta’s situation looks more precarious. Early discussions with Google Cloud about integrating Gemini into Meta’s ad operations suggest Gemini’s multimodal capabilities could improve how ads match users on Facebook and Instagram. The transcript treats this as a step back for Meta’s own AI ambitions, especially given Meta’s investment in Llama and recent researcher departures. The underlying question: if Llama isn’t “in the conversation” for key business use cases, where does Meta’s planned massive AI spending actually go?

Meanwhile, OpenAI’s compute buildout continues to dominate the infrastructure narrative. The Stargate project is described as an umbrella for a multi-year, potentially 100x+ compute scale-up, supported by major partners and financing. The vision extends further: autonomous data center construction, including robotic assembly and chip-fab processes, aimed at matching surging AI demand.

On the agent frontier, Moonshot's Kimi K2 (a trillion-parameter model) is now available as an agent called OK Computer, giving it access to a virtual environment with a file system, browser, and terminal to execute multi-step tasks—turning requests into websites, dashboards, or documents. The transcript frames this as a push toward agentic autonomy, with Chinese developers pushing the frontier and potentially accelerating adoption among individuals.

Adobe’s strategy also shifts from building to buying. After public market pressure over AI performance, Adobe announced broad third-party model integration across Firefly, including video via Luma Ray 3 and additional model support from Google, OpenAI, and others. The implication: competing directly on model quality is too costly, forcing even SaaS-style companies to let external “brains” into their products.

Finally, California’s Senate Bill 53 moves AI safety into enforceable territory. The bill requires major AI developers to disclose safety and security protocols, adds whistleblower protections and incident reporting, and creates CalCompute, a public compute cluster for AI research. Because major model makers are based in California, the transcript expects immediate operational changes at OpenAI, Anthropic, and Google—and a ripple effect for other states and future federal policy.

Cornell Notes

Pulse, OpenAI’s proactive ChatGPT feature, ties assistance to very recent chats and can change user behavior by encouraging earlier planning so insights arrive “overnight.” The same proactive design may also make sponsored cards easier to place, supported by OpenAI’s ads monetization hiring. Microsoft is expanding Copilot with Anthropic’s Claude Opus 4.1, backed by OpenAI’s GDPval benchmarking that favors Opus 4.1 on structured, economically useful tasks. Meta’s early talks with Google Cloud about Gemini for ad targeting suggest Llama may not be sufficient for key business needs. California’s Senate Bill 53 adds enforceable AI safety disclosure, whistleblower protections, and incident reporting, forcing major model makers to adjust operations.

How does Pulse change day-to-day work habits, not just outputs?

Pulse is designed to proactively surface assistance based on what a user discussed in the last day or two. That proactive timing pushes users to start conversations earlier—often a day or two ahead of when they think they’ll need help—so Pulse can “work overnight” and deliver morning insights for the actual task.
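Pulse's internals are not public, so the following is only an illustrative sketch of the behavior described above: a nightly job filters chats down to the last day or two and queues follow-up work so results are waiting in the morning. All names (`select_recent_topics`, `build_morning_brief`, the 48-hour window) are assumptions for illustration, not OpenAI's implementation.

```python
from datetime import datetime, timedelta

def select_recent_topics(chats, now, window_hours=48):
    """Keep only topics from the recency window Pulse-style features
    key on (roughly the last day or two of conversations)."""
    cutoff = now - timedelta(hours=window_hours)
    return [c["topic"] for c in chats if c["timestamp"] >= cutoff]

def build_morning_brief(chats, now):
    """Turn recent topics into queued overnight follow-up tasks."""
    return [f"Overnight research queued: {t}"
            for t in select_recent_topics(chats, now)]

# The nightly run, e.g. 2 a.m.
now = datetime(2025, 9, 26, 2, 0)
chats = [
    {"topic": "Q4 launch plan", "timestamp": datetime(2025, 9, 25, 16, 30)},
    {"topic": "vendor pricing", "timestamp": datetime(2025, 9, 20, 9, 0)},
]
# Only "Q4 launch plan" falls inside the 48-hour window.
print(build_morning_brief(chats, now))
```

The behavioral nudge follows directly from the window: a conversation started today is eligible for tonight's run, while last week's is not, so users who want morning insights learn to raise topics a day ahead.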

Why does Pulse make ads easier to integrate than ads inside normal chat?

The transcript argues that ad placement is harder to reconcile with the “objectivity” of user-initiated conversations. Pulse is proactive and arrives as an experience users didn’t explicitly request, so a sponsored card in a Pulse-style interface can be ignored if unwanted. OpenAI’s ads monetization role for ChatGPT is presented as a sign that this direction is already underway.

What evidence is used to justify Microsoft adding Anthropic models to Copilot?

Microsoft’s shift toward Claude Opus 4.1 is supported by task performance claims for work like slide decks and spreadsheets. The transcript also cites OpenAI’s GDPval study, which benchmarks models on expert-prepared, blind-evaluated tasks designed to be economically useful. Even with OpenAI running the study, Opus 4.1 is reported to outperform ChatGPT models on that category of tasks.
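GDPval's exact protocol is not detailed in the source, but the general shape of a blind pairwise evaluation can be sketched: graders see anonymized outputs for the same expert-prepared task, pick the better one, and a per-model win rate is computed. The model names and verdicts below are hypothetical placeholders.

```python
def blind_win_rate(comparisons, model):
    """Fraction of pairwise comparisons the given model won.
    Each comparison is a (model_a, model_b, winner) tuple, where the
    grader saw both outputs without knowing which model produced which."""
    relevant = [c for c in comparisons if model in (c[0], c[1])]
    wins = sum(1 for c in relevant if c[2] == model)
    return wins / len(relevant) if relevant else 0.0

# Hypothetical grader verdicts on anonymized task outputs.
comparisons = [
    ("opus-4.1", "gpt-x", "opus-4.1"),
    ("opus-4.1", "gpt-x", "opus-4.1"),
    ("opus-4.1", "gpt-x", "gpt-x"),
]
print(blind_win_rate(comparisons, "opus-4.1"))  # two of three wins
```

Blinding matters here because the study's sponsor also fielded one of the models: graders who cannot see model identity cannot favor it, which is why the reported Opus 4.1 result carries weight despite OpenAI running the study.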

What does Meta’s Gemini ad-targeting discussion imply about Llama’s competitiveness?

Early talks with Google Cloud about integrating Gemini into Meta ad operations—leveraging Gemini’s multimodal capabilities—suggest Meta may need capabilities beyond what its own Llama model can deliver for ad targeting. The transcript frames this as a step back for Meta’s AI posture and raises a strategic question about where Meta’s planned large AI spending will go if Llama isn’t leading in key business use cases.

How do agentic tools like Okay Computer signal a shift in AI autonomy?

Moonshot’s Kimi K2 is offered as an agent called OK Computer, giving it access to a virtual computer environment with a file system, browser, and terminal. The agent can execute multi-step tasks autonomously, such as turning requests into websites, data dashboards, or production documents—pushing beyond chat into tool-using, task-executing autonomy.
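OK Computer's implementation is not public; the pattern the transcript describes, a model emitting a multi-step plan where each step calls a tool in a sandboxed environment, can be sketched minimally. Everything here (`run_agent`, the tool names, the toy plan) is an assumption for illustration.

```python
def run_agent(plan, tools):
    """Execute a multi-step plan by dispatching each step to a tool
    and collecting the results, the core loop of a tool-using agent."""
    log = []
    for step in plan:
        tool = tools[step["tool"]]
        log.append(tool(step["arg"]))
    return log

# Toy "virtual environment". A real agent would sandbox these tools.
files = {}

def write_file(arg):
    """Toy file-system tool: store content under a path."""
    path, content = arg
    files[path] = content
    return f"wrote {path}"

tools = {
    "write_file": write_file,
    "terminal": lambda cmd: f"ran: {cmd}",  # stub, does not execute anything
}

# A hypothetical plan the model might emit for "build me a dashboard".
plan = [
    {"tool": "write_file", "arg": ("index.html", "<h1>Dashboard</h1>")},
    {"tool": "terminal", "arg": "python -m http.server"},
]
print(run_agent(plan, tools))
```

The shift the transcript points to is visible in the loop itself: the model's output is no longer the final answer but a sequence of tool invocations whose side effects (files written, commands run) are the deliverable.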

What changes does California’s Senate Bill 53 force on major model makers?

SB 53 requires major AI developers to publicly disclose safety and security protocols, establishes whistleblower protections, and mandates incident reporting. It also creates CalCompute, a public computing cluster for AI research. Since OpenAI, Anthropic, and Google are all based in California, the transcript expects immediate compliance changes and broader influence on other states’ approaches.

Review Questions

  1. Which design choice in Pulse makes it both workflow-changing and potentially more compatible with sponsored content?
  2. How does GDPval differ from evaluating models on messy real-world jobs, and why does that distinction matter?
  3. What specific compliance obligations does SB 53 impose, and why could they affect daily operations at California-based model makers?

Key Points

  1. Pulse delivers proactive assistance tied to very recent chats, which can shift users toward earlier planning so insights arrive by morning.
  2. Proactive experiences like Pulse may lower friction for ads, including sponsored cards, because they’re not embedded inside a user-initiated conversation.
  3. Microsoft’s Copilot diversification toward Claude Opus 4.1 is supported by task benchmarking (GDPval) that favors Opus 4.1 on structured, economically useful work tasks.
  4. Meta’s early Gemini partnership talks for ad targeting suggest Llama may not meet performance needs for key business workflows, raising questions about where Meta’s AI spending goes.
  5. OpenAI’s Stargate is framed as a multi-year compute scale-up with partners and financing, plus a longer-term goal of autonomous data center production.
  6. Moonshot’s OK Computer agent (built on Kimi K2) demonstrates tool-using autonomy via a virtual environment with file system, browser, and terminal access.
  7. California’s Senate Bill 53 creates enforceable AI safety disclosure and incident reporting requirements that major model makers must implement immediately.

Highlights

Pulse’s proactive timing nudges users to start key conversations a day or two early so the system can deliver overnight insights.
GDPval uses expert-prepared, blind-evaluated tasks to measure economically useful performance—reportedly favoring Claude Opus 4.1 over ChatGPT models.
Meta’s Gemini ad-targeting discussions point to a practical gap between Llama’s current capabilities and what ad operations require.
SB 53 requires public disclosure of safety/security protocols plus whistleblower protections and incident reporting, with compliance pressure landing immediately on California-based model makers.
OK Computer turns Kimi K2 into an agent that can autonomously execute multi-step tasks inside a virtual computer environment.

Topics

  • Pulse Proactivity
  • Copilot Model Mix
  • Gemini Ad Targeting
  • Stargate Compute Buildout
  • AI Agent Autonomy
  • Third-Party Model Integration
  • AI Safety Legislation

Mentioned