Πηγαίνετε εκτός σύνδεσης με την εφαρμογή Player FM !
Shimon Whiteson
Manage episode 279413376 series 2536330
Shimon Whiteson is a Professor of Computer Science at Oxford University, the head of WhiRL, the Whiteson Research Lab at Oxford, and Head of Research at Waymo UK.
Featured References
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Luisa Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson
Additional References
- Shimon Whiteson - Multi-agent RL, MIT Embodied Intelligence Seminar
- The StarCraft Multi-Agent Challenge, Samvelyan et al 2019
- Direct Policy Transfer with Hidden Parameter Markov Decision Processes, Yao et al 2018
- Value-Decomposition Networks For Cooperative Multi-Agent Learning, Sunehag et al 2017
- Whiteson Research Lab
- Waymo acquires Latent Logic to accelerate progress towards safe, driverless vehicles, Oxford News
- Waymo
61 επεισόδια
Manage episode 279413376 series 2536330
Shimon Whiteson is a Professor of Computer Science at Oxford University, the head of WhiRL, the Whiteson Research Lab at Oxford, and Head of Research at Waymo UK.
Featured References
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Luisa Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson
Additional References
- Shimon Whiteson - Multi-agent RL, MIT Embodied Intelligence Seminar
- The StarCraft Multi-Agent Challenge, Samvelyan et al 2019
- Direct Policy Transfer with Hidden Parameter Markov Decision Processes, Yao et al 2018
- Value-Decomposition Networks For Cooperative Multi-Agent Learning, Sunehag et al 2017
- Whiteson Research Lab
- Waymo acquires Latent Logic to accelerate progress towards safe, driverless vehicles, Oxford News
- Waymo
61 επεισόδια
सभी एपिसोड
×Καλώς ήλθατε στο Player FM!
Το FM Player σαρώνει τον ιστό για podcasts υψηλής ποιότητας για να απολαύσετε αυτή τη στιγμή. Είναι η καλύτερη εφαρμογή podcast και λειτουργεί σε Android, iPhone και στον ιστό. Εγγραφή για συγχρονισμό συνδρομών σε όλες τις συσκευές.