Get AI summaries of any video or article — Sign up free

West Coast Machine Learning — Channel Summaries

AI-powered summaries of 15 videos about West Coast Machine Learning.

15 summaries

No matches found.

Diffusion Policy Controlling Robots - Part 1

West Coast Machine Learning · 3 min read

Diffusion policy is being positioned as a practical way to teach robots dexterous, vision-guided manipulation from relatively few...

Diffusion PolicyRobot ControlScore Function

Tree of Thought Prompting

West Coast Machine Learning · 3 min read

Tree of Thought prompting reframes large language model problem-solving as an explicit search process: generate candidate intermediate “thoughts,”...

Tree of ThoughtPrompt EngineeringChain of Thought

Mamba sequence model - part 1

West Coast Machine Learning · 2 min read

Mamba’s core pitch is that sequence models can match Transformer-quality results on language and other modalities while scaling linearly with...

Selective State SpacesStructured State Space ModelsS4 and HiPPO

State of GPT

West Coast Machine Learning · 3 min read

Large language models are built through a pipeline that starts with internet-scale next-token pre-training and then progressively adds human...

GPT Training PipelinePre-Training vs Fine-TuningRLHF Reward Modeling

DeepSeek Multihead Latent Attention

West Coast Machine Learning · 3 min read

DeepSeek V2’s standout inference optimization is Multi-Head Latent Attention (MLA), a transformer attention redesign that slashes the size of the KV...

Multi-Head Latent AttentionKV Cache CompressionRoPE Positional Encoding

Mamba part 2 - Can it replace Transformers?

West Coast Machine Learning · 3 min read

Mamba’s core pitch is simple: it aims to match—and in some settings surpass—Transformer-style language modeling while scaling linearly with sequence...

Mamba vs TransformersSelective State SpacesS4 State Space Models

Consistency Models

West Coast Machine Learning · 3 min read

Consistency models aim to cut diffusion sampling time by replacing many denoising steps with a learned, one-step (or few-step) mapping from a noisy...

Diffusion SDEProbability-Flow ODEScore Matching

Diffusion Policy Controlling Robots - Part 2

West Coast Machine Learning · 3 min read

Diffusion policy for robot control turns a noisy guess of future actions into a smooth, goal-reaching trajectory by repeatedly denoising an action...

Diffusion PolicyRobot ControlScore-Based Models

Mamba part 4 - System Details and Implementation

West Coast Machine Learning · 3 min read

Mamba’s core implementation hinges on a state-space “mixer” that updates a hidden state sequentially while keeping most computations...

Mamba System ArchitectureState-Space DiscretizationΔ Time-Step Modulation

Mamba part 3 - Details of Mamba and Structured State Space

West Coast Machine Learning · 3 min read

Mamba’s core pitch is that sequence modeling can be made both fast and selective without attention’s quadratic cost. The approach builds on state...

State Space ModelsS4 DiscretizationSelective State Spaces

Alpha Geometry

West Coast Machine Learning · 3 min read

Alpha Geometry is a system that solves a difficult subset of geometry proofs—specifically “plane geometry” problems—without human demonstrations, by...

Auxiliary ConstructionSymbolic DeductionSynthetic Proof Generation

Biology of LLMs - Part 1

West Coast Machine Learning · 3 min read

Mechanistic interpretability is moving from “what concepts are stored where” toward “how those concepts get used to produce the next token.” The...

Mechanistic InterpretabilityTransformer CircuitsSuperposition

Parameter Efficient Fine Tuning

West Coast Machine Learning · 2 min read

Parameter-efficient fine-tuning is presented as a practical way to adapt large Transformer and language models to new tasks without retraining the...

Fine-TuningAdaptersPrefix Tuning

Rotary Positional Embeddings (RoPE): Part 1

West Coast Machine Learning · 3 min read

Rotary Positional Embeddings (RoPE) replace the usual “add a position vector” approach with a rotation-based scheme that bakes relative distance...

Rotary Positional EmbeddingsRelative PositioningSinusoidal Positional Encoding

Transformer Circuits Part 1

West Coast Machine Learning · 3 min read

Transformer circuits work centers on a simple but powerful claim: even in a stripped-down, one-layer attention-only Transformer, the model’s behavior...

Transformer ArchitectureResidual StreamsAttention Circuits