TalkRL: The Reinforcement Learning Podcast

Updated: 08 Apr 2024 • 53 episodes
www.talkrl.com

TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.

Show episodes

08 Apr 2024 • EN

Vincent Moens on TorchRL

Dr. Vincent Moens is an Applied Machine Learning Research Scientist at Meta, and an author of TorchRL and TensorDict in pytorch.  Featured References TorchRL: A data-driven decision-making library for PyTorch Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis

40 min
00:00
40:14
No file found

Arash Ahmadian is a Researcher at Cohere and Cohere For AI focussed on Preference Training of large language models. He’s also a researcher at the Vector Institute of AI. Featured Reference Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs Arash Ahmadian, Chris Cremer, Mat

33 min
00:00
33:30
No file found

Glen Berseth is an assistant professor at the Université de Montréal, a core academic member of the Mila - Quebec AI Institute, a Canada CIFAR AI chair, member l'Institute Courtios, and co-director of the Robotics and Embodied AI Lab (REAL).  Featured Links  Reinforcement Learning Conference  Closing the Gap between TD

21 min
00:00
21:38
No file found
07 Mar 2024 • EN

Ian Osband

Ian Osband is a Research scientist at OpenAI (ex DeepMind, Stanford) working on decision making under uncertainty.   We spoke about:  - Information theory and RL  - Exploration, epistemic uncertainty and joint predictions  - Epistemic Neural Networks and scaling to LLMs  Featured References  Reinforcement Learning, Bit

68 min
00:00
01:08:26
No file found
12 Feb 2024 • EN

Sharath Chandra Raparthy

Sharath Chandra Raparthy on In-Context Learning for Sequential Decision Tasks, GFlowNets, and more!   Sharath Chandra Raparthy is an AI Resident at FAIR at Meta, and did his Master's at Mila.   Featured Reference  Generalization to New Sequential Decision Making Tasks with In-Context Learning    Sharath Chandra Raparth

40 min
00:00
40:41
No file found

Pierluca D'Oro and Martin Klissarov on Motif and RLAIF, Noisy Neighborhoods and Return Landscapes, and more!   Pierluca D'Oro is PhD student at Mila and visiting researcher at Meta. Martin Klissarov is a PhD student at Mila and McGill and research scientist intern at Meta.   Featured References  Motif: Intrinsic Motiva

57 min
00:00
57:24
No file found