Many complex tasks can be decomposed into simpler, independent parts. Can neural networks systematically capture such discrete, compositional structure despite their continuous, distributed nature? The impressive capabilities of large-scale neural networks suggest the answer is yes. Yet even the most capable models exhibit failure cases that cast doubt on their compositionality. Do we need to endow neural network architectures with modular or even symbolic structure, or will scaling suffice? In this talk, Simon will shed light on this question and identify conditions under which compositional generalization succeeds.