Date: April 28 2025, Monday Time: 10:00 AM – 12:00 PM SGT Venue: University Hall Auditorium (Level 2, Lee Kong Chian Wing), 21 Lower Kent Ridge Rd, Singapore 119077
Abstract:
Large language models (LLMs) have demonstrated remarkable capabilities in generating coherent text and completing various natural language tasks. Nevertheless, their ability to perform complex, general reasoning has remained limited. In this talk, I will describe OpenAI’s new o-series models, which are trained via reinforcement learning to generate a hidden chain of thought before its response. We have found that the performance of these models consistently improve with more reinforcement learning compute and with more inference compute. The latest model, o3, surpasses previous state-of-the-art models in a variety of benchmarks that require reasoning, including mathematics competitions, programming contests, and advanced science question sets. I will discuss the implications of scaling this paradigm even further.
Biography:
Noam Brown is a research scientist at OpenAI investigating reasoning and multi-agent AI. He co-created Libratus and Pluribus, the first AIs to defeat top humans in two-player no-limit poker and multiplayer no-limit poker, respectively, and Cicero, the first AI to achieve human-level performance in the natural language strategy game Diplomacy. He has received the Marvin Minsky Medal for Outstanding Achievements in AI, was named one of MIT Tech Review’s 35 Innovators Under 35, and his work on Pluribus was named by Science as one of the top 10 scientific breakthroughs of 2019. Noam received his PhD from Carnegie Mellon University.