Reflection AI’s Misha Laskin on the AlphaGo Moment for LLMs | Training Data
LLMs are democratizing digital intelligence, but we’re all waiting for AI agents to take this to the next level by planning tasks and executing actions to actually transform the way we work and live our lives.
Yet despite incredible hype around AI agents, we’re still far from that “tipping point” with best in class models today. As one measure: coding agents are now scoring in the high-teens % on the SWE-bench benchmark for resolving GitHub issues, which far exceeds the previous unassisted baseline of 2% and the assisted baseline of 5%, but we’ve still got a long way to go.
Why is that? What do we need to truly unlock agentic capability for LLMs? What can we learn from researchers who have built both the most powerful agents in the world, like AlphaGo, and the most powerful LLMs in the world?
To find out, we’re talking to Misha Laskin, former research scientist at DeepMind. Misha is embarking on his vision to build the best agent models by bringing the search capabilities of RL together with LLMs at his new company, Reflection AI. He and his cofounder Ioannis Antonoglou, co-creator of AlphaGo and AlphaZero and RLHF lead for Gemini, are leveraging their unique insights to train the most reliable models for developers building agentic workflows.
Hosted by: Stephanie Zhan and Sonya Huang, Sequoia Capital
00:00 Introduction
01:11 Leaving Russia, discovering science
10:01 Getting into AI with Ioannis Antonoglou
15:54 Reflection AI and agents
25:41 The current state of Ai agents
29:17 AlphaGo, AlphaZero and Gemini
32:58 LLMs don’t have a ground truth reward
37:53 The importance of post-training
44:12 Task categories for agents
45:54 Attracting talent
50:52 How far away are capable agents?
56:01 Lightning round
1 view
92
29
4 months ago 00:00:07 1
You want me to what?..
4 months ago 00:04:04 1
SIMONE SIMONS - (OFFICIAL MUSIC VIDEO)
4 months ago 00:03:53 1
Farewell of Slavianka (Прощание славянки) | Orchestral Performance
4 months ago 00:46:40 1
“19-YEARS IS A LOT“ | Alessandro Del Piero Reflects on his Career | Calcio & Chill | Serie A
4 months ago 01:07:05 1
Reflection AI’s Misha Laskin on the AlphaGo Moment for LLMs | Training Data
4 months ago 00:04:25 1
If Bad Omens, Sleep Token and Bring Me The Horizon Were to Collaborate (An AI Experiment)
4 months ago 01:00:00 1
Joker | Ambient Soundscape
4 months ago 00:32:05 1
Joe Rogan AI Experience Episode #002 - Donald Trump
4 months ago 00:00:28 1
Druski removes cable from backpackkid’s piano 😂🎹 #kai #kaicenatfunny
4 months ago 00:04:41 1
SWFT | MY AIR FLOW | INSTRUMENTAL SINGLE | AI VIDEO
4 months ago 00:01:33 1
Manuel Sainsily & Will Selviz · Sora Showcase
4 months ago 00:01:55 6
Tim Fu · Sora Showcase
4 months ago 00:04:39 1
AMV - Nostromo - Light Years (Schiller - Lichtjahre)
4 months ago 00:01:02 1
Ben Desai · Sora Showcase
4 months ago 00:02:28 1
Disney Mulan Reflection - Madarin Taiwanese
4 months ago 01:00:08 1
Dusk Delight Lounge | Enjoy Uplifting and Radiant Rhythms
4 months ago 00:03:56 1
Bossa Roma (Original) - . Gypsy Guitar Duo | Vadim Kolpakov & Nicolas Adams
4 months ago 00:01:49 1
At the Beach Film: Restored to Amazing Life
4 months ago 00:00:15 1
Music Ai Meditation 240715: Moonlit Night Sea, Video by Jechang Kim, Music by #shorts #AOMA