John Schulman — Person Summaries
AI-powered summaries of 5 videos about John Schulman.
5 summaries
AI Declarations and AGI Timelines – Looking More Optimistic?
Predictions about when “human-level” AI arrives are getting more specific—and the policy response is getting more concrete—at the same time that...
Kimi K2.5- The Agent Swarm
Moonshot AI’s Kimi K2.5 positions itself less as a single “bigger model” and more as a platform for task-specialized reasoning—especially through an...
OpenAI Scholars Demo Day 2019
OpenAI Scholars Demo Day 2019 showcased how machine learning research ideas—from reinforcement learning and language modeling to model compression...
Episode 15 - Inside the Model Spec
OpenAI’s “model spec” is a public, human-readable set of rules meant to steer how AI models should behave—especially when instructions collide. It...
Continuous control with deep reinforcement learning
This paper asks whether deep reinforcement learning for continuous-action control can be made stable and effective without discretizing actions, and...