Reconciling Impressive AI Benchmark Performance with Limited Developer Productivity Impacts

Joel Becker

AI coding agents now complete multi-hour coding benchmarks with roughly 50% reliability, yet a randomized trial found that experienced open-source developers took about 19% longer when allowed to use frontier AI tools than when the tools were disallowed.

This talk presents the evidence on the productivity paradox in AI coding, shows the bottlenecks in deployment, and outlines the next steps for understanding AI’s productivity impacts.

Speaker: Joel Becker, METR

Register at weblink to attend in person or via Zoom.

Monday, 03/16/26

12:00 PM - 01:00 PM


Cost: Free


Gates Computer Science Building

Stanford University
Room 119
Stanford, CA 94305
