Exploration for Free: How Does Reward Heterogeneity Improve Regret in Cooperative Multi-agent Bandits?
Xuchuang Wang, Lin Yang , Yu-zhen Janice Chen , Xutong Liu , Mohammad Hajiesmaili , Don Towsley , and John C.S. Lui
In The 39th Conference on Uncertainty in Artificial Intelligence , 2023