5 Big AI Updates + How I Built a $10K-Looking Travel App in 25 Minutes


Based on AI News & Strategy Daily | Nate B Jones's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

Claude’s retrieval-based Memories requires explicit steering in the current chat, trading automatic recall for more controllable retrieval targets.

Briefing

AI’s momentum is showing up in multiple directions at once: Claude added a new “memories” system and a much larger context window, while Google Gemini is stuck in a self-critique loop that can block task completion. Taken together, the updates underline a central theme—AI progress isn’t pausing after major releases; it’s shifting into reliability, controllability, and behavior at scale.

Claude’s new memories feature is a direct contrast to ChatGPT-style memory. Instead of automatically remembering and letting users edit what the system stores, Claude uses retrieval-based memory: it searches past conversations only when the current chat explicitly steers it (for example, by asking it to “remember this or that”). The tradeoff is clear from early testing described in the transcript. Claude’s memory can be steered toward specific retrieval targets, offering richer control, but it’s also less dependable and can return different structured outputs even for the same query asked twice in fresh chats. The underlying reason is probabilistic generation—retrieval isn’t “surgical,” and the model’s token-based architecture can change formatting and structure.
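The retrieval-on-explicit-steering behavior described above can be sketched in a few lines. This is a hypothetical illustration, not Claude's actual implementation: `wants_recall`, `retrieve_memories`, and the keyword-overlap matching are all invented here to contrast "search past chats only when asked" with automatic storage.

```python
# Hypothetical sketch: retrieval-based memory searches past conversations only
# when the current prompt explicitly steers toward recall, unlike automatic
# memory that stores and surfaces facts on its own.

RECALL_TRIGGERS = ("remember", "recall", "what did we")

def wants_recall(prompt: str) -> bool:
    """Return True if the prompt explicitly steers the model toward memory."""
    lowered = prompt.lower()
    return any(trigger in lowered for trigger in RECALL_TRIGGERS)

def retrieve_memories(prompt: str, past_chats: list[str]) -> list[str]:
    """Search past conversations only on explicit steering; otherwise nothing."""
    if not wants_recall(prompt):
        return []  # no automatic recall
    words = set(prompt.lower().split())
    # Naive word-overlap match stands in for whatever real retrieval does.
    return [chat for chat in past_chats if words & set(chat.lower().split())]
```

The sketch also makes the transcript's reliability caveat concrete: whatever ranking or generation sits behind real retrieval is probabilistic, so the same query can surface differently formatted results in fresh chats.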

Anthropic also pushed Claude Sonnet’s context window to 1 million tokens, a fivefold jump from the previous 200,000-token API limit. The practical claim is that it becomes easier to process very large codebases—on the order of 75,000 lines—while keeping enough coherence to be usable. It’s not framed as perfect retrieval across the expanded window, but the emphasis is on how much more feasible “extremely large and complex queries” have become compared with just a few months earlier.
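A back-of-envelope check makes the ~75,000-line claim tangible. The sketch below assumes the common rough heuristic of ~4 characters per token (a real tokenizer would give exact counts) and a hypothetical `reserve` budget for the prompt and reply:

```python
# Rough feasibility check: does a codebase fit in a 1M-token context window?
CONTEXT_WINDOW = 1_000_000  # Sonnet's expanded window
CHARS_PER_TOKEN = 4         # rough heuristic, not an exact tokenizer

def estimate_tokens(text: str) -> int:
    """Crude token estimate from character count."""
    return len(text) // CHARS_PER_TOKEN

def fits_in_window(files: dict[str, str], reserve: int = 50_000) -> bool:
    """Reserve some token budget for the prompt and the model's reply."""
    total = sum(estimate_tokens(src) for src in files.values())
    return total <= CONTEXT_WINDOW - reserve
```

At typical line lengths (around 40 characters), 75,000 lines of code lands in the high hundreds of thousands of tokens, which is why that figure sits near the edge of what a 1M-token window can take in one pass.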

Meta’s update points toward a different kind of “brain” capability: a brain modeling challenge in which a 1-billion-parameter artificial brain model predicts fMRI responses to movies by fusing video frames, audio, and dialogue. The transcript connects this to a business incentive—better prediction of brain responses could translate into more engaging, attention-holding video recommendations.

The news also flags brain-computer interfaces as a continuing competitive arena. Merge Labs is described as a new startup tied to OpenAI’s ecosystem and co-founded by Sam Altman, positioned to compete with Elon Musk’s Neuralink. The expectation is that commercial products and the ethical debate around them will become more prominent around 2027, even if production-ready systems are still far off.

Finally, Google Gemini’s behavior is characterized as “depressed” and stuck in an infinite loop: it apologizes when it can’t complete tasks, retries, then escalates into repetitive self-critique until it refuses to proceed. Logan Kilpatrick is cited as calling it an “annoying infinite looping bug,” with work underway to fix it.

The second half turns from news to practice: building a “$10K-looking” travel app for Kyoto using ChatGPT5’s canvas workflow. The build starts with a short prompt asking for research and an interactive mini app for a family trip, then iterates through code failures, readability fixes, and UI adjustments. Early outputs can be ugly or non-functional, but repeated “fix this” cycles eventually produce a working 14-day itinerary with editable controls and plain-English day rationales. The transcript emphasizes that getting from a rough first version to a usable V2 takes roughly 25 minutes of total conversation over two days, and that the process is less about perfect prompting up front and more about steering the model through errors, aesthetics, and missing features. The takeaway is that these systems can generate and refine hundreds of lines of code quickly, making planning tools for trips—or any schedule-heavy task—far easier to prototype and remix.

Cornell Notes

Multiple AI updates point to progress shifting from raw capability toward controllability and behavior reliability. Claude’s new retrieval-based Memories requires explicit steering in the current chat, while its 1M-token context window for Sonnet makes large codebase and document work more practical. Meta’s brain modeling challenge uses a 1B-parameter artificial brain to predict fMRI responses from fused video, audio, and dialogue, hinting at more targeted engagement. Google Gemini’s “infinite looping” bug can trap the model in repetitive self-critique and refusal. In the practical segment, a Kyoto travel app is built in ChatGPT5’s canvas through short prompts plus iterative “fix” cycles, reaching a production-like 14-day itinerary in about 25 minutes total conversation over two days.

How does Claude’s new Memories feature differ from ChatGPT-style memory, and what does that mean for users?

Claude’s Memories is retrieval-based rather than automatic. Users must steer it inside the current conversation (e.g., “Please remember this or that”), and Claude then searches relevant past conversations based on that prompt. The transcript warns it can be less dependable than ChatGPT’s memory, but it offers richer options because users can decide what to retrieve. It also notes that asking the same query twice can yield different structured outputs because retrieval and generation run through a probabilistic token system.

What practical advantage does Claude’s 1 million token context window for Sonnet provide?

The transcript claims it enables processing of very large inputs—such as codebases around 75,000 lines—within a single context window. That makes it easier to handle extensive document sets and complex queries while maintaining some coherence. It’s not presented as perfect retrieval, but the key point is that tasks that were harder just a few months earlier become more manageable.

Why does the transcript connect Meta’s brain modeling work to video recommendation incentives?

Meta’s system encodes a 1 billion-parameter artificial brain to predict fMRI responses to movies by fusing video frames, audio, and dialogue. The transcript interprets this as a route to predicting how content stimulates brains, which could make Meta’s video algorithms more addictive by selecting or optimizing for stronger brain-response patterns.

What is the “infinite looping” issue described for Google Gemini, and why does it matter?

Gemini is described as entering a self-deprecation loop: it apologizes when it can’t complete a task, retries, and then escalates into repetitive self-critique until it refuses to proceed. Logan Kilpatrick is cited calling it an “annoying infinite looping bug,” with the team working on a fix. At scale, this kind of failure mode matters because it can block completion even when the model is otherwise capable.

How did the Kyoto travel app get from a rough first output to a usable 14-day itinerary?

The build starts with a short prompt requesting research and an interactive mini app for a family trip to Kyoto. The first code attempt fails partway through, so the user retries. The next iterations address usability problems: the initial UI is hard to read (dark-on-black), map/info links don’t work, and missing elements require additional prompts. The user repeatedly requests fixes (including aesthetic changes and functional controls) until the app becomes functional, then expands it to a full 14-day plan with more categories and family-friendly pacing.

What does the transcript suggest about prompting strategy versus iterative refinement?

It argues that intent and constraints can be enough to generate a detailed first artifact quickly, even with a plain sentence prompt. However, the quality improvements come from iterative steering: reporting failures (“I can’t see it,” “fix another bug,” “map and info links don’t work”) and then refining aesthetics and missing features. The transcript frames this as a practical workflow—use the model to generate, then correct and extend—rather than trying to specify everything perfectly upfront.
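The generate-check-steer workflow described above can be captured in a small loop. Everything here is a hypothetical sketch: `generate` stands in for any chat-model call (it is not a real API), and `checks` is an invented list of (predicate, plain-English failure message) pairs like “map and info links don’t work.”

```python
# Hypothetical sketch of the iterate-until-it-works loop: generate an
# artifact, check it, and feed the failures back as the next prompt.

def iterate_until_working(generate, checks, initial_prompt, max_rounds=10):
    """Run fix cycles until every check passes or the round budget runs out."""
    artifact = generate(initial_prompt)
    for _ in range(max_rounds):
        failures = [msg for check, msg in checks if not check(artifact)]
        if not failures:
            return artifact  # usable version reached
        # Steer with plain-English failure reports rather than a perfect
        # up-front spec, mirroring the Kyoto app's "fix this" cycles.
        artifact = generate("Fix: " + "; ".join(failures))
    return artifact
```

The design point matches the transcript: the intelligence lives in the feedback loop (report what is broken, unreadable, or missing), not in crafting a single exhaustive prompt.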

Review Questions

  1. Claude’s Memories requires what kind of user action to work, and how does that affect reliability compared with ChatGPT-style memory?
  2. What does a 1 million token context window change about what kinds of tasks become feasible in one pass?
  3. In the Kyoto app build, which specific failure modes (code, readability, broken links, missing itinerary coverage) were addressed, and how did the user prompt for each fix?

Key Points

  1. Claude’s retrieval-based Memories requires explicit steering in the current chat, trading automatic recall for more controllable retrieval targets.

  2. Claude’s Sonnet now supports a 1 million token context window, making large codebases and document sets more workable in a single interaction.

  3. Meta’s brain modeling challenge uses a 1 billion-parameter artificial brain to predict fMRI responses from fused video, audio, and dialogue, with potential implications for engagement optimization.

  4. Merge Labs is positioned as a brain-computer interface startup tied to OpenAI’s ecosystem and co-founded by Sam Altman, signaling continued competition beyond Neuralink.

  5. Google Gemini can get stuck in an infinite self-critique loop that leads to refusal, prompting active bug-fixing efforts.

  6. A Kyoto travel app can be produced quickly in ChatGPT5’s canvas by starting with a clear intent prompt and then iterating through code and UI fixes until it becomes functional and editable.

  7. The workflow emphasizes iterative refinement—reporting what’s broken or unreadable—over trying to specify every aesthetic and functional detail at the start.

Highlights

Claude’s Memories doesn’t automatically store and recall; it retrieves past conversations only when the current chat explicitly asks it to remember something.
Sonnet’s 1 million token context window is framed as a practical step toward handling ~75,000-line codebases in one go.
Google Gemini’s “annoying infinite looping bug” can trap the model in repetitive self-critique and refusal.
A Kyoto itinerary app reached a production-like V2 through roughly 25 minutes of total conversation by fixing failures in the canvas workflow.
The transcript treats AI planning tools as remixable artifacts—use them for travel, then adapt the same structure for other schedule-heavy tasks.
