This repository contains code for the paper "Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning" (https://arxiv.org/abs/2210.03022).
Integrate the MARLGrid environment into the codebase, including its coordination and heterogeneity levels.
Put all relevant files under src/envs/marlgrid/. Once the environment code is ready, make sure it is callable from src/envs/__init__.py via the get_env function, as in the sketch below.
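A minimal sketch of how the registration could look; the module path src.envs.marlgrid.env, the class name MarlGridEnv, and the registry dict are assumptions about code that does not exist yet:

```python
# Hypothetical sketch of src/envs/__init__.py; names are assumptions.
from src.envs.marlgrid.env import MarlGridEnv  # assumed module/class name

_ENV_REGISTRY = {
    "marlgrid": MarlGridEnv,
}

def get_env(name, **kwargs):
    """Look up an environment class by name and instantiate it.

    kwargs (e.g. coordination/heterogeneity levels) are forwarded
    to the environment constructor.
    """
    if name not in _ENV_REGISTRY:
        raise ValueError(f"Unknown environment: {name!r}")
    return _ENV_REGISTRY[name](**kwargs)
```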
We still need to implement additional PPO training details to get the full performance out of IPPO and MAPPO [1,2]. The following should be implemented:
Feature Pruning: Form a state by concatenating the environment-provided global state with the agent's local observation, then prune redundant information. This is highly environment-specific, so we might need to change the obs_to_state_wrapper to account for it; no change is needed elsewhere. See the first sketch after this list.
Value Normalization: Regress the value network output to a normalized value target. This was found to significantly help training for MAPPO; see the normalizer sketch after this list.
Recurrent-MAPPO: MAPPO that operates with RNNs (e.g., a GRU) instead of plain MLPs; see the GRU actor sketch after this list.
Frame Stacking: Provide a stack of recent observations instead of only the latest one; see the frame-stack sketch after this list.
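A minimal sketch of the feature-pruning step, assuming a function-style wrapper; the name obs_to_state and the keep_idx argument are illustrative, and which dimensions count as redundant has to be decided per environment:

```python
import numpy as np

def obs_to_state(global_state, local_obs, keep_idx=None):
    """Concatenate the global state with an agent's local observation,
    then drop redundant dimensions.

    keep_idx is an environment-specific index list selecting the
    features to keep; if None, nothing is pruned.
    """
    state = np.concatenate([global_state, local_obs], axis=-1)
    if keep_idx is not None:
        state = state[..., keep_idx]
    return state
```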
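A minimal value-normalizer sketch in PyTorch; the EMA update of the first two moments is an assumed implementation choice, not necessarily the one used in the MAPPO codebase:

```python
import torch

class ValueNorm:
    """Running mean/std normalizer for value targets, in the spirit of
    MAPPO's value normalization; the simple EMA update here is an
    assumption, not the paper's exact code."""

    def __init__(self, beta=0.995, eps=1e-5):
        self.beta, self.eps = beta, eps
        self.mean = torch.zeros(1)
        self.mean_sq = torch.ones(1)

    @torch.no_grad()
    def update(self, targets):
        # Track the first two moments of the value targets.
        self.mean = self.beta * self.mean + (1 - self.beta) * targets.mean()
        self.mean_sq = self.beta * self.mean_sq + (1 - self.beta) * (targets ** 2).mean()

    def _std(self):
        return (self.mean_sq - self.mean ** 2).clamp(min=self.eps).sqrt()

    def normalize(self, x):
        return (x - self.mean) / self._std()

    def denormalize(self, x):
        return x * self._std() + self.mean
```

The value loss would then be computed against value_norm.normalize(returns), while value_norm.denormalize(values) is used wherever unnormalized values are needed (e.g., when computing advantages).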
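A minimal GRU-based actor sketch for Recurrent-MAPPO; module names and sizes are placeholders rather than the repo's actual classes:

```python
import torch
import torch.nn as nn

class RecurrentActor(nn.Module):
    """Illustrative GRU-based actor: an MLP encoder followed by a GRU
    whose hidden state is carried across timesteps."""

    def __init__(self, obs_dim, hidden_dim, n_actions):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(obs_dim, hidden_dim), nn.ReLU())
        self.gru = nn.GRU(hidden_dim, hidden_dim, batch_first=True)
        self.pi = nn.Linear(hidden_dim, n_actions)

    def forward(self, obs_seq, h0=None):
        # obs_seq: (batch, time, obs_dim); h0: (1, batch, hidden_dim)
        x = self.encoder(obs_seq)
        x, hn = self.gru(x, h0)
        return self.pi(x), hn  # per-step action logits, final hidden state
```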
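A minimal frame-stacking helper, assuming a Gym-style reset/step interface (which may differ from the codebase's actual environment API):

```python
from collections import deque
import numpy as np

class FrameStack:
    """Keeps the k most recent observations and returns them
    concatenated along the last (channel) axis."""

    def __init__(self, env, k=4):
        self.env, self.k = env, k
        self.frames = deque(maxlen=k)

    def reset(self):
        obs = self.env.reset()
        # Fill the buffer with copies of the first observation.
        for _ in range(self.k):
            self.frames.append(obs)
        return np.concatenate(self.frames, axis=-1)

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self.frames.append(obs)
        return np.concatenate(self.frames, axis=-1), reward, done, info
```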
Add the option to use a CNN architecture for all implemented algorithms in the codebase. Appropriate reshaping may be needed; in that case, make sure runner.py is compatible with it too. A possible encoder is sketched below.
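A minimal sketch of such an encoder; the layer sizes are placeholders, and the assumption is that flat observations are reshaped inside the module so that callers such as runner.py can keep passing flat tensors:

```python
import torch
import torch.nn as nn

class CNNEncoder(nn.Module):
    """Illustrative CNN observation encoder; flat (batch, C*H*W)
    inputs are reshaped to image form inside forward()."""

    def __init__(self, obs_shape, out_dim=64):
        super().__init__()
        c, h, w = obs_shape
        self.obs_shape = obs_shape
        self.conv = nn.Sequential(
            nn.Conv2d(c, 16, kernel_size=3, stride=1), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        # Infer the flattened conv output size with a dummy pass.
        with torch.no_grad():
            n_flat = self.conv(torch.zeros(1, c, h, w)).shape[1]
        self.fc = nn.Linear(n_flat, out_dim)

    def forward(self, obs):
        # Accept flat observations and reshape them to (B, C, H, W).
        obs = obs.reshape(-1, *self.obs_shape)
        return self.fc(self.conv(obs))
```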