John Schulman — Person Summaries

AI-powered summaries of 5 videos about John Schulman.

5 summaries

No matches found.

AI Declarations and AGI Timelines – Looking More Optimistic?

AI Explained · 3 min read

Predictions about when “human-level” AI arrives are getting more specific—and the policy response is getting more concrete—at the same time that...

AGI TimelinesAI Safety PolicyCompute Regulation

Kimi K2.5- The Agent Swarm

Sam Witteveen · 2 min read

Moonshot AI’s Kimi K2.5 positions itself less as a single “bigger model” and more as a platform for task-specialized reasoning—especially through an...

Kimi K2.5Agent SwarmVision Coding

OpenAI Scholars Demo Day 2019

OpenAI · 3 min read

OpenAI Scholars Demo Day 2019 showcased how machine learning research ideas—from reinforcement learning and language modeling to model compression...

Reinforcement LearningIntrinsic MotivationDiscount Factor

Episode 15 - Inside the Model Spec

OpenAI · 3 min read

OpenAI’s “model spec” is a public, human-readable set of rules meant to steer how AI models should behave—especially when instructions collide. It...

Model SpecChain of CommandPolicy Authority Levels

Continuous control with deep reinforcement learning

arXiv (Cornell University) · 2016 · 6,777 citations · 5 min read

This paper asks whether deep reinforcement learning for continuous-action control can be made stable and effective without discretizing actions, and...

PaperReinforcement learningDeep reinforcement learningContinuous control