Main content start

FriSem

Date
Fri December 6th 2024, 3:15 - 4:30pm
Location
Building 420, Room 050

Daniel Wurgaft, Ph.D. Student in Psychology, admitted Autumn 2023, presenting his First-Year Project (FYP)

Title: Towards A Normative Theory Explaining The Emergence Of In-context Learning

Abstract: Large language models (LLMs) demonstrate a striking ability to learn without updates to their weights, a process known as in-context learning (ICL). This capacity enables LLMs to rapidly adapt to novel tasks, achieving generality that was previously unseen in neural networks. Despite its importance, the mechanisms behind ICL and its emergence during training remain poorly understood. Recent research has identified several factors influencing the emergence of ICL, including the necessity of diverse training tasks and the impact of task dimensionality and context length. In this work, we provide a Bayesian perspective on the training dynamics of ICL, offering early empirical evidence to explain these phenomena. Additionally, we introduce a novel experimental setting that replicates key ICL findings while allowing for clear analysis. We conclude by highlighting parallels to human meta-learning and suggesting directions for future research.