Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling

11/08/2022
by Ou Deng, et al.

This paper explores human behavior in virtual networked communities, specifically the potential and expressive capacity of individuals or groups to respond to internal and external stimuli, taking assortative matching as a typical example. We propose a modeling approach based on Multi-Agent Reinforcement Learning (MARL) that adds a multi-head attention function to the Asynchronous Advantage Actor-Critic (A3C) algorithm to improve learning effectiveness. The approach simulates human behavior in selected scenarios by varying environmental parameter settings and agent action strategies. In our experiment, reinforcement learning serves specific agents, which learn from the environment state and from competitors' behavior and optimize their strategies to achieve better results. The simulation covers both the individual and group levels and shows possible paths to forming competitive advantages. This modeling approach provides a means for further analysis of the evolutionary dynamics of human behavior, communities, and organizations in various socioeconomic issues.
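
To make the term concrete, positive assortative matching can be sketched as pairing agents from two sides by the rank of a single score, highest with highest. The snippet below is only an illustration of that idea, not the paper's MARL model: the names `side_a`, `side_b`, `assortative_match`, and `match_surplus`, the example scores, and the multiplicative payoff are all assumptions introduced here.

```python
from typing import Dict, List, Tuple


def assortative_match(
    side_a: Dict[str, float], side_b: Dict[str, float]
) -> List[Tuple[str, str]]:
    """Pair agents across two sides by score rank (highest with highest)."""
    ranked_a = sorted(side_a, key=lambda name: side_a[name], reverse=True)
    ranked_b = sorted(side_b, key=lambda name: side_b[name], reverse=True)
    # zip stops at the shorter side, so surplus low-ranked agents stay unmatched.
    return list(zip(ranked_a, ranked_b))


def match_surplus(
    pairs: List[Tuple[str, str]],
    side_a: Dict[str, float],
    side_b: Dict[str, float],
) -> float:
    """Total payoff of a matching under an ASSUMED multiplicative payoff."""
    return sum(side_a[a] * side_b[b] for a, b in pairs)


# Hypothetical scores, e.g. an agent's attractiveness or capability.
side_a = {"a1": 0.9, "a2": 0.4, "a3": 0.7}
side_b = {"b1": 0.2, "b2": 0.8, "b3": 0.5}

pairs = assortative_match(side_a, side_b)
surplus = match_surplus(pairs, side_a, side_b)
print(pairs)
print(round(surplus, 2))
```

Under a multiplicative payoff, the rank-ordered pairing yields the largest total surplus among all one-to-one matchings (the rearrangement inequality). That gives a simple benchmark against which learned agent strategies in a simulation could be compared.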

