Connections between POMDPs and partially observed n-player mean-field games
Bora Yongacoglu – University of Toronto, Canada
In this talk, we will study a discrete-time model of mean-field games with finitely many players and partial observability of the global state, and we will describe the deep connection between such n-player mean-field games and partially observed Markov decision problems (POMDPs). We focus primarily on settings with mean-field observability, where each player privately observes its own local state as well as the complete mean-field distribution. We prove that if one's counterparts use symmetric stationary memoryless policies, then a given agent faces a fully observed, time homogenous MDP. We leverage this to prove the existence of a memoryless, stationary perfect equilibrium in the n-player game with mean-field observability. We also show, by example, that the symmetry condition cannot be relaxed without loss of generality. Under narrower observation channels, in which the mean-field information is compressed before being observed by each agent, we show that the agent faces a POMDP rather than an MDP, even when its counterparts use symmetric policies.
Bio : Bora Yongacoglu is a post-doctoral fellow in the Department of Electrical and Computer Engineering at University of Toronto, where he studies learning in multi-agent systems. He received his PhD and MSc. degrees in mathematics from Queen's University, and his B.A. in mathematics and economics from McGill University.
![Peter Caines](/system/assets/000/000/410/410.CainesP2_card.jpg)
![Aditya Mahajan](/system/assets/000/000/465/465.MahajanA2_card.jpg)
![Shuang Gao](/system/assets/000/001/287/1287.GaoShuang3_card.jpg)
![Rinel Foguen](/system/assets/000/001/180/1180.FoguenRinel_card.jpg)
![Borna Sayedana](/system/assets/000/001/929/1929.BornaSayedana_card.jpg)
![Alex Dunyak](/system/assets/000/002/007/2007.AlexDunyak_card.jpg)
Lieu
CIM
Pavillon McConnell
Université McGill
Montréal QC H3A 0E9
Canada