Multi-agent reinforcement learning
An introduction to Multi-Agents Reinforcement Learning (MARL) - Hugging Face Deep RL Course
https://huggingface.co/learn/deep-rl-course/unit7/introduction-to-marl
Search-Improved Game-Theoretic Multiagent Reinforcement Learning in General and Negotiation Games
Multiagent reinforcement learning (MARL) has benefited significantly from population-based and game-theoretic training regimes. Many applications have focused on two-player zero-sum games, employing standard reinforcement learning to compute "oracle" or "exploiter" response policies via approximate best response. In this paper, we introduce Monte Carlo tree search with generative world state sampling to augment the best response steps. We show empirical convergence to Nash equilibria and the effects of various choices of meta-solvers across a suite of general-sum and n-player sequential games. We then present case studies on negotiation games including Colored Trails and a multi-issue bargaining game "Deal or no Deal". We propose two new forms of meta-solvers based on the Nash Bargaining Solution (NBS) and simple gradient ascent algorithms to solve them. The NBS meta-solvers produce agents that achieve higher social welfare than purely Nash-inspired ones, and reach closest to the Pareto-frontier in Colored Trails. Finally, we report on the generalization capabilities of agents trained via this regime by evaluating them against human participants. Overall, we find that search and generative modeling helps find stronger policies during training, enables online Bayesian co-player prediction, and trains fair agents that can achieve comparable social welfare negotiating with humans as humans trading among themselves.
https://www.deepmind.com/publications/search-improved-game-theoretic-multiagent-reinforcement-learning-in-general-and-negotiation-games
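
The abstract above mentions the Nash Bargaining Solution and gradient ascent. As a rough illustration of the concept only (this is not the paper's meta-solver), the sketch below finds the split of a unit resource that maximizes the product of both players' gains over a disagreement point, by gradient ascent on the log of that product. The function names, the utility functions, the disagreement point (0, 0), and the step size are all assumptions chosen for the toy example.

```python
import math


def log_nash_product(x, u1, u2, d1, d2):
    """Log of the Nash product (u1 - d1) * (u2 - d2) at split x."""
    return math.log(u1(x) - d1) + math.log(u2(x) - d2)


def solve_nbs(u1, u2, d1=0.0, d2=0.0, lr=0.01, steps=2000, h=1e-6):
    """Toy Nash Bargaining Solution over a split x in (0, 1).

    Player 1 receives x and player 2 receives 1 - x. The gradient is
    estimated with a central finite difference (an assumption made to
    keep the sketch short; an analytic gradient would work as well).
    """
    x = 0.5  # start from an even split
    for _ in range(steps):
        grad = (log_nash_product(x + h, u1, u2, d1, d2)
                - log_nash_product(x - h, u1, u2, d1, d2)) / (2 * h)
        # Ascent step, clipped so both gains stay strictly positive.
        x = min(max(x + lr * grad, 1e-3), 1 - 1e-3)
    return x


# Hypothetical utilities: player 1 is risk-neutral, player 2 is risk-averse.
share = solve_nbs(lambda x: x, lambda x: math.sqrt(1 - x))
```

For these utilities the first-order condition 1/x = 0.5/(1 - x) gives x = 2/3, so the risk-averse player ends up with the smaller share. With two linear utilities the same routine returns an even split.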
Multi-agent reinforcement learning
Multi-agent reinforcement learning (MARL) is a sub-field of reinforcement learning. It studies the behavior of multiple learning agents that coexist in a shared environment. Each agent is motivated by its own rewards and takes actions to advance its own interests; in some environments these interests conflict with those of other agents, resulting in complex group dynamics.
https://en.wikipedia.org/wiki/Multi-agent_reinforcement_learning
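
To make the description above concrete, here is a minimal sketch of two agents that each learn from their own reward in a shared environment. It uses independent Q-learning on a repeated prisoner's dilemma, which is one simple baseline and not a method taken from the linked pages. The payoff values, hyperparameters, and names such as `train_independent_q` are assumptions for illustration.

```python
import random

C, D = 0, 1  # cooperate, defect

# PAYOFF[(a0, a1)] = (reward to agent 0, reward to agent 1)
PAYOFF = {
    (C, C): (3, 3),
    (C, D): (0, 5),
    (D, C): (5, 0),
    (D, D): (1, 1),
}


def train_independent_q(steps=20000, alpha=0.1, epsilon=0.1, seed=0):
    """Two stateless Q-learners, each updating only from its own reward."""
    rng = random.Random(seed)
    q = [[0.0, 0.0], [0.0, 0.0]]  # q[agent][action]
    for _ in range(steps):
        actions = []
        for agent in range(2):
            if rng.random() < epsilon:
                # Explore: pick a uniformly random action.
                actions.append(rng.randrange(2))
            else:
                # Exploit: pick the action with the highest estimate.
                actions.append(max((C, D), key=lambda a: q[agent][a]))
        rewards = PAYOFF[tuple(actions)]
        for agent in range(2):
            a = actions[agent]
            # Move the estimate toward the reward this agent just received.
            q[agent][a] += alpha * (rewards[agent] - q[agent][a])
    return q


q_values = train_independent_q()
greedy = [max((C, D), key=lambda a: row[a]) for row in q_values]
```

Each learner treats the other as part of the environment, so the rewards it observes shift as its co-player's policy changes. In this setup both learners tend to end up preferring defection, even though mutual cooperation would pay each of them more, which is a small instance of the opposed interests described above.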


Seonglae Cho
