Colin headshot

Colin Raffel

Associate Research Director, AI Engineering & Infrastructure

Assistant Professor, Department of Computer Science, University of Toronto

Canada CIFAR Artificial Intelligence Chair

Colin Raffel is an Associate Professor at the University of Toronto and an Associate Research Director at the Vector Institute. His research in machine learning aims to make it easy to get computers to do new things. Areas of focus in his lab include machine learning algorithms that require little or no labeled data in order to perform a task and systems for collaborative and continual development of machine learning models. He received a PhD in Electrical Engineering from Columbia University in 2016, a Master’s in Music from Stanford University in 2010, and a Bachelor’s in Mathematics from Oberlin College in 2009.

Research Interests

  • Scalable Deep Learning
  • Collaborative Learning
  • Large Language Models
  • Open-Source Software

Highlights

  • NSF CAREER award
  • Runner-up for the Caspar Bowden award for Outstanding Research in Privacy Enhancing Technologies
  • CACM and SIGPLAN Research Highlights
  • Best paper and best poster awards from ISMIR

Publications

The fineweb datasets: Decanting the web for the finest text data at scale

Guilherme Penedo and Hynek Kydlicek and Anton Lozhkov and Margaret Mitchell and Colin Raffel and Leandro Von Werra and Thomas Wolf

2024

A survey on model moerging: Recycling and routing among specialized experts for collaborative learning

Prateek Yadav and Colin Raffel and Mohammed Muqeeth and Lucas Caccia and Haokun Liu and Tianlong Chen and Mohit Bansal and Leshem Choshen and Alessandro Sordoni

2024

AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution

Fengyuan Liu and Nikhil Kandpal and Colin Raffel

2024