Get AI summaries of any video or article — Sign up free

Jerry — Person Summaries

AI-powered summaries of 4 videos about Jerry.

4 summaries

No matches found.

Building OpenAI o1 (Extended Cut)

OpenAI · 3 min read

OpenAI’s latest preview models, o1 and o1 mini, put “reasoning” at the center: they spend more time thinking before answering, aiming to turn extra...

Reasoning ModelsReinforcement LearningModel Evaluation

Long term credit assignment with temporal reward transp… | Cathy Yeh | OpenAI Scholars Demo Day 2020

OpenAI · 3 min read

Long-delayed rewards can make standard reinforcement learning painfully slow because discounting shrinks the learning signal for actions that only...

Temporal Reward TransportLong-Horizon Credit AssignmentDiscounted Returns

Mamba part 2 - Can it replace Transformers?

West Coast Machine Learning · 3 min read

Mamba’s core pitch is simple: it aims to match—and in some settings surpass—Transformer-style language modeling while scaling linearly with sequence...

Mamba vs TransformersSelective State SpacesS4 State Space Models

Consistency Models

West Coast Machine Learning · 3 min read

Consistency models aim to cut diffusion sampling time by replacing many denoising steps with a learned, one-step (or few-step) mapping from a noisy...

Diffusion SDEProbability-Flow ODEScore Matching