Get AI summaries of any video or article — Sign up free
OpenAI’s new browser feels familiar… thumbnail

OpenAI’s new browser feels familiar…

Fireship·
4 min read

Based on Fireship's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Atlas integrates ChatGPT directly into a Chromium-based browser workflow, aiming to make browsing feel like using an assistant that can act.

Briefing

OpenAI’s new AI-powered browser, Atlas, aims to make web browsing feel like using a chat assistant that can act on a user’s behalf—turning routine tasks like ordering food into simple natural-language requests. Built around ChatGPT and available on Mac, Atlas integrates an assistant that can “see” what the user is doing, use browsing history as context, and run in agent mode to take actions such as placing orders. The pitch is less about adding a chatbot sidebar and more about rethinking how a browser is used, even if the day-to-day experience still resembles a familiar Chromium-based interface.

Atlas is positioned as a browser that can remember and help across sessions, which is where the convenience comes with a tradeoff. The system can draw on browsing history to answer questions and guide next steps, while also giving users controls to manage what it remembers and to delete specific memories. Still, agentic browser features—where an AI can take actions—raise the stakes for privacy and security, because browsers contain sensitive cookies and personal data.

That risk is not theoretical. Earlier this year, a Brave team demonstrated prompt injection attacks against Perplexity’s Comet AI browser by hiding malicious instructions inside an image; the payload would trigger when the image was screenshotted and analyzed by the model. The same team reported that Fellow, which markets itself as an agentic browser, was also vulnerable to prompt injection by directing the model to visit a website containing malicious instructions. The takeaway from those findings is that indirect prompt injection is systemic across AI-powered browser categories, not a one-off bug.

The transcript also contrasts Atlas with another browser effort: Ladybird. Unlike Chromium forks, Ladybird is described as fully independent, rebuilding both the engine and browser from scratch. The project recently cleared a 90% threshold on web platform tests, making it eligible to serve as an alternative browser engine on iOS. Ladybird is not pursuing AI features, but its existence underscores a broader point: browser engines and platforms are still in flux, and competition is intensifying as companies chase the next major AI product.

Finally, the discussion pivots to a practical constraint on AI usefulness: large language models are often “worthless” without strong search. The sponsor, Melee Search, is presented as a solution—an API that returns relevant results in milliseconds using hybrid semantic and multimodal search, designed to understand user intent and provide rich context. The underlying message is that the next wave of AI experiences in browsers and apps depends not just on agent behavior, but on fast, accurate retrieval that makes AI responses feel grounded rather than vague.

Cornell Notes

OpenAI’s Atlas browser brings ChatGPT into the browsing workflow, aiming to let users delegate tasks through agent mode—such as ordering food by describing what they want—while using browsing history as context. Atlas is available on Mac and is built on Chromium, so the interface still feels familiar even if the interaction model is meant to be “reinvented.” The convenience depends on AI access to sensitive browser data, which creates security and privacy concerns. Brave researchers have shown prompt-injection vulnerabilities in other agentic browsers (Perplexity’s Comet and Fellow), including indirect attacks triggered via images or malicious webpages. The transcript also highlights Ladybird as an independent engine project and argues that AI needs strong search to be genuinely useful.

What does Atlas try to change about browsing, beyond adding a chatbot?

Atlas integrates ChatGPT as an assistant inside a Chromium-based browser experience. The assistant can “see” what the user is doing, use browsing history as context for questions, and operate in agent mode to take actions on the user’s behalf—illustrated by ordering food through natural-language instructions instead of manually navigating apps and steps.

How does Atlas handle memory, and why does that matter?

Atlas can use browsing history as context, but it also provides controls to manage what it remembers and to delete certain memories. That control is meant to mitigate privacy concerns, since the browser environment contains sensitive data like cookies and personal information.

What security problem has shown up in agentic AI browsers, according to Brave’s findings?

Brave demonstrated prompt injection attacks that work indirectly. One attack against Perplexity’s Comet hid malicious instructions inside an image that would trigger when a screenshot was taken and analyzed by the model. Another attack against Fellow involved instructing the model to visit a website containing malicious instructions. The broader conclusion is that indirect prompt injection is a systemic challenge across AI-powered browser categories.

How does Ladybird differ from Chromium-based approaches like Atlas?

Ladybird is described as fully independent, rebuilding both the engine and browser from scratch rather than forking existing browser code. It recently passed a 90% threshold on web platform tests, making it eligible to be an alternative browser engine on iOS, even though it does not pursue AI features.

Why does the transcript argue that LLMs need search to be useful?

It claims an LLM is “basically worthless” without a good search engine, because retrieval grounds answers and enables relevant actions. The sponsor, Melee Search, is pitched as a fast hybrid semantic and multimodal search API designed to return relevant results in milliseconds and understand user intent.

Review Questions

  1. What specific capabilities does Atlas provide through ChatGPT integration (e.g., context, memory, agent actions), and what user controls exist?
  2. Describe the two prompt-injection scenarios Brave researchers demonstrated and explain why they are considered systemic rather than isolated.
  3. Compare Ladybird’s approach to browser engineering with Chromium forks, and connect that difference to the transcript’s broader theme about AI browser competition.

Key Points

  1. 1

    Atlas integrates ChatGPT directly into a Chromium-based browser workflow, aiming to make browsing feel like using an assistant that can act.

  2. 2

    Agent mode enables the AI to take actions on a user’s behalf, such as completing tasks from natural-language instructions.

  3. 3

    Atlas can use browsing history as context, but it includes controls to manage and delete what it remembers.

  4. 4

    Prompt injection attacks have been demonstrated in other agentic browsers via indirect channels like images and malicious webpages, suggesting a category-wide security risk.

  5. 5

    Brave’s findings emphasize that indirect prompt injection is not an edge case but a systemic challenge for AI-powered browsers.

  6. 6

    Ladybird is pursuing an independent browser engine built from scratch and recently cleared a 90% web platform test threshold for iOS eligibility.

  7. 7

    AI experiences depend on strong search; Melee Search is presented as a fast hybrid semantic and multimodal retrieval layer to improve relevance and reduce vagueness.

Highlights

Atlas positions agentic browsing as a task-delegation experience—users describe what they want, and the browser assistant performs the steps.
Brave’s prompt-injection demonstrations show how malicious instructions can be smuggled through images or websites and triggered during model analysis.
Indirect prompt injection is framed as a systemic risk across AI-powered browsers because browsers hold sensitive cookies and data.
Ladybird’s independence from Chromium forks and its 90% web platform test milestone make it a credible alternative engine path for iOS.
The transcript links AI usefulness to retrieval quality, arguing that LLMs need strong search to deliver grounded results.

Topics

Mentioned

  • LLM
  • iOS
  • API
  • US