some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》

DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.

homework

Assignments for CS294-112.

llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

make-an-agent

marl-papers

Paper list of multi-agent reinforcement learning (MARL)

marl-tutorial

mfrl

Mean Field Multi-Agent Reinforcement Learning

models

Models and examples built with TensorFlow

moon

Moon is a minimal, one column jekyll theme.

multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

nerfies.github.io

neural-network-diffusion

We introduce a novel approach for parameter generation, named neural network diffusion (\textbf{p-diff}, p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters

nxdo

Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

oyster

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

paper-list-of-marl

A new paper list for multi-agent reinforcement learning (actively maintained)

policygenerator

pomdps.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating discrete and continuous, fully and partially observable Markov decision processes.

pytorch-maddpg

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

pytorch-summary

Model summary in PyTorch similar to `model.summary()` in Keras

recurrent-deep-q-learning

Solving POMDP using Recurrent networks

robust_trainer

Code for robust trainer on MuJoCo

sa_dqn

[NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning

cheryyunl Goto Github PK

Cheryl Liang's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs