Jeff Dean — Person Summaries

AI-powered summaries of 10 videos about Jeff Dean.

Google's New AI Is Smarter Than Everyone's But It Costs HALF as Much. Here's Why They Don't Care.

AI News & Strategy Daily | Nate B Jones · 3 min read

Gemini 3.1 Pro signals a strategic shift in AI: Google is optimizing for “pure reasoning” at frontier quality and at a price that makes that...

Gemini 3.1 Pro · ARC AGI2 · Model Routing

Gemini 1.5 Pro for Video Analysis

Sam Witteveen · 2 min read

Gemini 1.5 Pro can extract highly specific information from a long video—down to approximate timestamps for when key topics appear—making video-based...

Video Analysis · Gemini 1.5 Pro · Long Context Window

Mistral 8x7B Part 1- So What is a Mixture of Experts Model?

Sam Witteveen · 2 min read

Mistral’s newly released “8x7B” model is a Mixture of Experts (MoE) system: eight separate expert networks, each roughly the size of Mistral 7B, are...

Mixture of Experts · Gating Networks · Mistral 8x7B

Vicuna - 90% of ChatGPT quality by using a new dataset?

Sam Witteveen · 3 min read

Vicuna is being positioned as an open-source-style chat model that delivers roughly “90% of ChatGPT quality” by fine-tuning a LLaMa base model on...

Vicuna · LLaMa Fine-Tuning · ShareGPT Dataset

Gemini 2.0 Flash Thinking

Sam Witteveen · 3 min read

Google has released an experimental Gemini 2.0 Flash model branded “Gemini 2.0 Flash Thinking,” notable for exposing full reasoning traces...

Gemini 2.0 Flash Thinking · Chain of Thought Traces · Test-Time Compute

Googles Attempt to take on Open AI

MattVidPro · 3 min read

Google’s Gemini 1.5 Pro is positioned as a direct leap in long-context, multimodal AI—capable of handling up to a 1 million token context window and...

Gemini 1.5 Pro · Long Context · Multimodal AI

Distributed Representations of Words and Phrases and their Compositionality

arXiv (Cornell University) · 2013 · 18,083 citations · 5 min read

This paper asks how to efficiently learn high-quality distributed vector representations for words and phrases, and whether these representations...

Paper · Natural language processing · Representation learning · Word embeddings

Caught Distilling from Claude?

Sam Witteveen · 3 min read

A fresh wave of allegations claims Chinese AI labs are running large-scale “distillation attacks” to copy capabilities from Claude—using fleets of...

Distillation Attacks · Claude · Reinforcement Learning

Lecture 04: Data Management (FSDL 2022)

The Full Stack · 3 min read

Data management is the hidden driver of machine-learning performance: spending far more time on data than on models—especially on dataset quality,...

Data Exploration · Storage Architecture · SQL And Data Frames

Masterclass: Knowledge Graphs & Massive Language Models — The Future of AI, RelationalAI | KGC 2023

The Knowledge Graph Conference · 3 min read

Conversational AI is being treated as a new “computer for humans,” but the practical breakthrough isn’t that it behaves like people—it’s that it can...

Instructable Computers · Transformer Language Models · RLHF Alignment
