
Why DeepSeek beat ChatGPT in the App Store, plus Privacy, Data Center Investment, AI Acceleration

5 min read

Based on AI News & Strategy Daily | Nate B Jones's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.

TL;DR

DeepSeek’s App Store surge is attributed to transparent, user-editable reasoning outputs and free availability of its reasoning model (R1), not to algorithm gaming.

Briefing

DeepSeek’s sudden rise to the top of the App Store is tied less to marketing and more to two product choices that make the model feel more controllable: it exposes reasoning in a way users can edit, and it delivers a free, widely available “reasoning model” experience that pulls in non-technical users. The reasoning display functions like a user-facing control surface—people can adjust prompts and immediately see what the system is doing. That transparency also appears to be feeding back into the broader industry: OpenAI is reportedly using DeepSeek’s openly shown reasoning outputs for model distillation, turning a UI feature into training signal.

A second driver is distribution strategy. By offering R1 for free in the App Store, DeepSeek reaches the “autocomplete crowd”—people outside tech who often dismiss chatbots as shallow, generic response generators. The reasoning-first interface undermines that critique by making outputs feel more grounded and inspectable. The result is a product that’s easy to evaluate in real time, which helps explain why the app climbed without relying on “gaming” the algorithm.

The momentum, however, comes with a sharp privacy and legal warning. DeepSeek’s terms of service are described as “creepy and concerning,” including claims that users can only seek redress through Chinese courts, that account deletion may not remove data, and that the company keeps a monitoring table for undefined “illegal activities.” The terms also reportedly don’t clearly grant users rights to model outputs. More troubling, the app is said to log keystrokes—an issue the transcript argues is easy to dismiss only if someone plans to run models locally. For the vast majority of users, the app route means accepting those data practices, and the concern is framed as worse than typical social-app tracking because it captures direct thinking and model outputs that may be difficult to reuse safely.

Beyond user impact, the competitive response in the US is portrayed as a catch-up sprint shaped by economics. Model makers are investing heavily in chips for next-generation systems, but the transcript distinguishes training from inference: most purchased compute goes to serving models to users, not training them. The claim is that DeepSeek’s earlier drop was tied to insufficient chips for inference at scale, and that industry defensiveness is understandable because serving demand is expensive.

Under-discussed—but presented as potentially transformative—is how DeepSeek’s approach is being replicated. The transcript argues that “reasoning models” can be trained more effectively now because large amounts of reasoning behavior have entered public datasets. Techniques involving group policy reinforcement and extracting reasoning from the training data were attempted earlier and failed; they now work because there are enough examples—cited as on the order of hundreds of thousands of responses—to reach a critical mass. The long-term implication is a feedback loop: models learn reasoning from data, produce better reasoning outputs, and those outputs become training material for the next generation, accelerating development toward something closer to self-improvement.

In short: DeepSeek’s App Store win is attributed to transparent reasoning and free access, while the broader industry is reacting on three fronts—privacy scrutiny, inference-heavy investment, and rapid replication of reasoning training methods that may speed up the next wave of model progress.

Cornell Notes

DeepSeek’s App Store success is linked to two practical product innovations: it shows reasoning in a way users can edit and adjust, and it makes its reasoning model (R1) widely available for free. That transparency appears to be useful beyond UX—OpenAI is reportedly using DeepSeek’s displayed reasoning outputs for model distillation. The transcript then shifts to risk: DeepSeek’s terms of service are described as invasive and unclear, including keystroke logging and limited user rights, with account deletion not necessarily removing data. Finally, the competitive landscape is framed as an inference-and-data race: US model makers need massive compute to serve models, and developers can now replicate reasoning training because large volumes of reasoning examples have become available in public internet data.

Why does showing reasoning function as a competitive advantage, not just a transparency feature?

Reasoning display turns the model into something users can actively steer. The transcript emphasizes that people can edit, change, and adjust prompts while seeing the reasoning output, making evaluation and iteration faster. It also suggests the reasoning output becomes training signal for others—OpenAI is reportedly using DeepSeek’s openly displayed reasoning outputs for model distillation, meaning a UI choice can translate into improved model performance.
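To make the distillation claim concrete, here is a minimal sketch of how a visible reasoning trace could be packaged into a supervised fine-tuning pair for a student model. The function name, the `<think>` delimiter, and the data shape are illustrative assumptions, not DeepSeek's or OpenAI's actual pipeline.

```python
# Illustrative sketch: turning a displayed reasoning trace into a
# distillation (supervised fine-tuning) example. All names and
# formats here are assumptions, not any lab's real API.

def build_distillation_pair(prompt: str, reasoning: str, answer: str) -> dict:
    """Package a visible reasoning trace plus final answer as one
    training example for a student model."""
    return {
        "input": prompt,
        # The student is trained to reproduce the teacher's chain of
        # thought before its answer, so the displayed reasoning itself
        # becomes training signal.
        "target": f"<think>{reasoning}</think>\n{answer}",
    }

pair = build_distillation_pair(
    prompt="Is 91 prime?",
    reasoning="91 = 7 * 13, so it has divisors other than 1 and itself.",
    answer="No, 91 is not prime.",
)
```

The point of the sketch is that nothing beyond the public UI output is needed: whatever the interface shows can be harvested as (input, target) pairs.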

How does offering R1 for free change who uses the product and why that matters for rankings?

Free availability in the App Store broadens the audience to non-technical users who often view chatbots as generic “autocomplete.” The transcript argues that the reasoning-first experience makes the system easier to judge and harder to dismiss as shallow. That wider adoption is presented as a key factor behind the app’s rapid climb, with no reliance on “gaming” the algorithm.

What privacy and legal concerns are raised about DeepSeek’s terms of service?

The transcript highlights several red flags: redress is reportedly limited to Chinese courts; deleting an account may not delete data; a monitoring table is kept for undefined “illegal activities”; and rights to model outputs aren’t clearly defined. It also claims the app logs keystrokes. The argument is that even if someone can run models locally, most users will use the app and therefore accept these data practices without noticing.

Why does inference compute—not just training—dominate the economics of competing in AI?

The transcript distinguishes training from serving: most chips purchased by model makers are for inference (responding to user queries), not for training. It cites Nvidia's sales, Jensen Huang's company, as going largely toward serving compute. DeepSeek’s earlier difficulty is attributed to insufficient chips to serve at scale, and the broader industry’s catch-up is framed as constrained by the cost and availability of inference capacity.
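A back-of-envelope calculation shows why serving can dwarf training over a model's lifetime. Every number below is a hypothetical placeholder chosen only to illustrate the shape of the argument, not a figure from the transcript.

```python
# Back-of-envelope illustration (all numbers hypothetical) of why
# cumulative serving compute can exceed a one-time training run.
TRAIN_GPU_HOURS = 3_000_000        # assumed one-time training budget
QUERIES_PER_DAY = 50_000_000       # assumed serving load
GPU_SECONDS_PER_QUERY = 2.0        # assumed inference cost per query
DAYS_DEPLOYED = 365

inference_gpu_hours = (
    QUERIES_PER_DAY * GPU_SECONDS_PER_QUERY * DAYS_DEPLOYED / 3600
)
# Even with a modest assumed per-query cost, a year of serving at this
# load (~10.1M GPU-hours) exceeds the assumed training budget, which
# is the transcript's point about most purchased chips going to inference.
```

Training is a one-time cost; serving scales with users and time, so a popular app's compute bill is dominated by inference.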

What changed technically to make reasoning-model replication work now?

The transcript claims that techniques like group policy reinforcement and reading reasoning out of the data stream were tried earlier but didn’t work. The change is the availability of enough reasoning examples in public datasets—driven by reasoning models generating many tokens and by humans publicly praising or criticizing specific responses. With a critical mass (roughly cited as ~800,000 samples), models can learn what reasoning looks like more organically from the data.
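The transcript's "group policy reinforcement" most likely refers to group relative policy optimization (GRPO), the reinforcement learning method associated with DeepSeek's reasoning models. The sketch below shows only GRPO's core economy, computing each sampled response's advantage relative to its group instead of training a separate value model; the reward values are illustrative placeholders.

```python
# Sketch of the core idea behind group relative policy optimization
# (GRPO), which "group policy reinforcement" most likely refers to.
# Rewards below are illustrative placeholders, not real model scores.
from statistics import mean, pstdev

def group_relative_advantages(rewards: list[float]) -> list[float]:
    """Normalize each sampled response's reward against its group:
    advantage_i = (r_i - mean(r)) / std(r). No learned value
    function ("critic") is needed, which is GRPO's main economy."""
    mu = mean(rewards)
    sigma = pstdev(rewards) or 1.0  # avoid divide-by-zero when all tie
    return [(r - mu) / sigma for r in rewards]

# For one prompt, sample a group of G responses and score them
# (e.g., 1.0 if the final answer is correct, 0.0 otherwise):
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
# Responses above the group mean get positive advantage and are
# reinforced; those below the mean are pushed down.
```

This is why having many public reasoning examples matters: with enough sampled responses whose correctness can be judged, the group statistics alone supply the training signal.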

How could reasoning-model training create a faster improvement loop over time?

If models learn reasoning from data, then produce higher-quality reasoning outputs, those outputs can enter the next training dataset. The transcript frames this as an acceleration mechanism: better reasoning leads to better training material, which can improve subsequent generations more quickly, moving toward models that effectively improve by learning from their own ecosystem’s outputs.

Review Questions

  1. What two DeepSeek product choices are credited with driving App Store success, and how does each affect user behavior?
  2. Which specific terms-of-service and data-handling concerns are raised, and why does the transcript argue they matter even if local running is possible?
  3. What conditions does the transcript say made reasoning-model replication feasible now, and how does that enable a feedback loop for future model improvement?

Key Points

  1. DeepSeek’s App Store surge is attributed to transparent, user-editable reasoning outputs and free availability of its reasoning model (R1), not to algorithm gaming.

  2. OpenAI is reportedly using DeepSeek’s openly displayed reasoning outputs for model distillation, turning UI transparency into training leverage.

  3. DeepSeek’s terms of service raise legal and privacy concerns, including keystroke logging, unclear data deletion, and limited user redress options.

  4. Most AI compute spending is framed as inference (serving responses), so chip availability and scaling capacity can make or break competitiveness.

  5. US model makers are portrayed as racing to secure funding and inference chips for next-generation systems while trying to catch up to reasoning-focused approaches.

  6. Developers can replicate reasoning-model improvements because large volumes of reasoning examples have become publicly available, enabling group policy reinforcement methods that previously failed.

  7. A long-term acceleration loop is proposed: better reasoning outputs feed future training data, potentially speeding up model development cycles.

Highlights

Reasoning transparency is framed as a UI innovation that helps users steer prompts and also supplies distillation signal to other labs.
Free distribution of R1 is linked to attracting the “autocomplete crowd,” undermining the idea that chatbots are just generic response generators.
The transcript flags keystroke logging and unclear deletion/legal rights as major privacy and legal risks for app users.
Inference compute is emphasized as the real bottleneck—chips are mostly needed to serve models at scale, not only to train them.
A reasoning “takeoff moment” is attributed to public availability of enough reasoning examples, enabling previously ineffective training techniques.

Topics

Mentioned

  • R1