argmax by Vahe Hagopian, Taka Hasegawa, Farrukh Rahman

Last Updated: April 29, 2026

A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm

Mixture of Experts

Published: October 8, 2024

In this episode we talk about the paper "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean.

LoRA

Published: September 2, 2023

We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y

15: InstructGPT

Published: March 28, 2023

In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.

14: Whisper

Published: March 17, 2023

This week we talk about Whisper. It is a weakly supervised speech recognition model.

13: AlphaTensor

Published: March 11, 2023

We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.

12: SIRENs

Published: October 25, 2022

In this episode we talked about "Implicit Neural Representations with Periodic Activation Functions" and the strength of periodic non-linearities.

11: CVPR Workshop on Autonomous Driving Keynote by Ashok Elluswamy, a Tesla engineer

Published: September 30, 2022

In this episode we discuss this video: https://youtu.be/jPCV4GKX9Dw

10: Outracing champion Gran Turismo drivers with deep reinforcement learning

Published: August 23, 2022

We discuss Sony AI's accomplishment of creating a novel AI agent that can beat professional racers in Gran Turismo. Some topics include:

9: Heads-Up Limit Hold'em Poker Is Solved

Published: July 29, 2022

Today we talk about recent AI advances in Poker; specifically the use of counterfactual regret minimization to solve the game of 2-player Limit Texas Hold'em.

8: GATO (A Generalist Agent)

Published: July 29, 2022

Today we talk about GATO, a multi-modal, multi-task, multi-embodiment generalist agent.

7: Deep Unsupervised Learning Using Nonequilibrium Thermodynamics (Diffusion Models)

Published: June 14, 2022

We start talking about diffusion models as a technique for generative deep learning.

6: Deep Reinforcement Learning at the Edge of the Statistical Precipice

Published: June 6, 2022

We discuss NeurIPS outstanding paper award winning paper, talking about important topics surrounding metrics and reproducibility.

5: QMIX

Published: April 26, 2022

We talk about QMIX https://arxiv.org/abs/1803.11485 as an example of Deep Multi-agent RL.

4: Can Neural Nets Learn the Same Model Twice?

Published: April 6, 2022

Todays paper: Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility

3: VICReg

Published: March 21, 2022

Todays paper: VICReg (https://arxiv.org/abs/2105.04906)

2: data2vec

Published: March 7, 2022

Todays paper: data2vec (https://arxiv.org/abs/2202.03555)

1: Reward is Enough

Published: February 21, 2022

This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.