Learning Roles with Emergent Social Value Orientations

01/31/2023
by   Wenhao Li, et al.
0

Social dilemmas can be considered situations where individual rationality leads to collective irrationality. The multi-agent reinforcement learning community has leveraged ideas from social science, such as social value orientations (SVO), to solve social dilemmas in complex cooperative tasks. In this paper, by first introducing the typical "division of labor or roles" mechanism in human society, we provide a promising solution for intertemporal social dilemmas (ISD) with SVOs. A novel learning framework, called Learning Roles with Emergent SVOs (RESVO), is proposed to transform the learning of roles into the social value orientation emergence, which is symmetrically solved by endowing agents with altruism to share rewards with other agents. An SVO-based role embedding space is then constructed by individual conditioning policies on roles with a novel rank regularizer and mutual information maximizer. Experiments show that RESVO achieves a stable division of labor and cooperation in ISDs with different complexity.

READ FULL TEXT

page 6

page 10

page 18

page 19

page 20

page 21

page 22

research
03/18/2020

Multi-Agent Reinforcement Learning with Emergent Roles

The role concept provides a useful tool to design and understand complex...
research
08/10/2022

The emergence of division of labor through decentralized social sanctioning

Human ecological success relies on our characteristic ability to flexibl...
research
04/22/2020

Evolving Dyadic Strategies for a Cooperative Physical Task

Many cooperative physical tasks require that individuals play specialize...
research
12/25/2019

A Logical Model for Supporting Social Commonsense Knowledge Acquisition

To make machine exhibit human-like abilities in the domains like robotic...
research
03/18/2020

ROMA: Multi-Agent Reinforcement Learning with Emergent Roles

The role concept provides a useful tool to design and understand complex...
research
01/18/2023

Learning to Participate through Trading of Reward Shares

Enabling autonomous agents to act cooperatively is an important step to ...
research
07/02/2021

User Role Discovery and Optimization Method based on K-means + Reinforcement learning in Mobile Applications

With the widespread use of mobile phones, users can share their location...

Please sign up or login with your details

Forgot password? Click here to reset