Course Schedule

Weekday Regular Schedule

Group Type Hours Location
01 Lecture Monday 13-16 Orentstein 103

Resources

Introduction to Multi-Armed Bandits, Aleksandrs Slivkins

Bandit Algorithms Tor Lattimore and Csaba Szepesvari

Detailed Schedule

Week Date Lecture topics references Lecture Slides Lecture Scribes
1 Jan 1 Introduction and Stochastic Bandits Chapters 1 and Introduction from Slivkins,
Chapter 5.3 from Lattimore and Szepesvari,
Sleeping bandits from Regret bounds for sleeping experts and bandits
by Robert Kleinberg, Alexandru Niculescu-Mizil, Yogeshwer Sharma
Hebrew Notes 1 Scribe 1
2 Jan 8 Lower Bounds MAB Chapters 2 from Slivkins,
Sleeping bandits from Regret bounds for sleeping experts and bandits
by Robert Kleinberg, Alexandru Niculescu-Mizil, Yogeshwer Sharma
Hebrew Notes 2 Scribe 2
3 Jan 15 Bayesian bandits and Thompson sampling algorithm Chapters 3 from Slivkins,
Near-optimal Regret Bounds for Thompson Sampling
by SHIPRA AGRAWAL and NAVIN GOYAL,
Hebrew Notes 3 Scribe 3
week 4 Jan 22 No new material
4 Jan 29 Lipschitz bandits Chapter 4 from Slivkins,
The Value of Knowing a Demand Curve: Bounds on Regret for On-line Posted-Price Auctions
by Robert Kleinberg and Tom Leighton
Hebrew Notes 4 Scribe 4
5 Feb 5 Adversarial costs: Full feedback Chapter 5 from Slivkins,
Learning, Regret minimization, and Equilibria (Section 4.3)
by A. Blum and Y. Mansour
From External to internal regret (Section 7)
by A. Blum and Y. Mansour
Hebrew Notes 5 Scribe 5
6 Feb 12 Adversarial costs: MAB Reduction: previous years class notes
THE NONSTOCHASTIC MULTIARMED BANDIT PROBLEM
by P. AUER, N. CESA-BIANCHI, Y. FREUND, and R. SCHAPIRE
Explore no more: Improved high-probability regret bounds for non-stochastic bandits
by G. Neu
Hebrew Notes 6 Scribe 6
7 Feb 19 Linear Cost Full information:
Efficient algorithms for online decision problems Kalai and Vempala 2005
Reduction and Brycentic spanner
Online linear optimization and adaptive routing Awerbuch and Kleinberg 2008
Stochastic:
Chapter 19 Bandit Algorithms book (Lattimore and Szepesvari)
Additional resources:
Slivkins book Chapter 7
Stochastic Linear Optimization under Bandit Feedback Dani, Hayes, Kakade 2008
Hebrew Notes 7 Scribe 7
8 Feb 26 Contextual Bandits Lecture notes from 2018 course
Efficient Algorithms for Adversarial Contextual Learning
Vasilis Syrgkanis, Akshay Krishnamurthy, Robert E. Schapire, ICML 2016
Hebrew Notes 8 Scribe 8
9 March 4 Zero-sum games and applications Lecture notes from 2018/19 and 2009/10 course
Slivkins chapter 9
Hebrew Notes 9
10 March 11 Swap and and correlated equilibrium Lecture notes from 2018/19 Hebrew Notes 10
Unless otherwise stated, the content of this page is licensed under Creative Commons Attribution-ShareAlike 3.0 License