GithubHelp home page GithubHelp logo

Cheryl Liang's Projects

adhoc_aamas-17 icon adhoc_aamas-17

Codification used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"

ai-toolbox icon ai-toolbox

A C++ framework for MDPs and POMDPs with Python bindings

atla_robust_rl icon atla_robust_rl

Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework

deepmind_mas_enviroment icon deepmind_mas_enviroment

some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》

drm-pretrain icon drm-pretrain

DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

marl-papers icon marl-papers

Paper list of multi-agent reinforcement learning (MARL)

mfrl icon mfrl

Mean Field Multi-Agent Reinforcement Learning

models icon models

Models and examples built with TensorFlow

moon icon moon

Moon is a minimal, one column jekyll theme.

multiagent-particle-envs icon multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

neural-network-diffusion icon neural-network-diffusion

We introduce a novel approach for parameter generation, named neural network diffusion (\textbf{p-diff}, p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters

nxdo icon nxdo

Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games

open_spiel icon open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

oyster icon oyster

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

pomdps.jl icon pomdps.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating discrete and continuous, fully and partially observable Markov decision processes.

pytorch-maddpg icon pytorch-maddpg

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

sa_dqn icon sa_dqn

[NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.