Comments (7)
是否critic没有用history obs结构太过简单?无法学习?
from marl-algorithms.
震荡下降的原因我也不太清楚。
Critic没有使用history obs确实是一方面因素,另外还有Advantage的计算、更新的方式都会影响AC方法的性能。
from marl-algorithms.
对的,应要比性能的话应该把G2ANet这种加到Critic上,Critic不行Actor应该很难学习。项目很好,对我这个刚开始搞MARL的很有用。AC算法都能跑AlphaStar了按理说不应该像图标上这么差的~
from marl-algorithms.
嗯,如果以后有时间了我会把MAAC也加进来去比一比。
from marl-algorithms.
发表一下浅见,我自己实现的SAC和PPO,在SMAC上测试的结果是简单地图3m 8m 2s3z能达到95%以上胜率,中等难度的3s5z能达到80%,MMM能达到95%,但是训练很慢,要1千万个sample以上,在不对称地图上就不大行。
from marl-algorithms.
我最近看了张崇洁老师的论文DOP,是基于AC的值分解,他的效果就很好。
from marl-algorithms.
我最近看了张崇洁老师的论文DOP,是基于AC的值分解,他的效果就很好。
实验灌水的文章罢了
from marl-algorithms.
Related Issues (20)
- 关于参数reuse_network HOT 3
- 关于COMA critic网络输入 HOT 3
- 关于g2anet中hard_weights的问题 HOT 1
- 可以使用其他的环境跑这里面的算法吗? HOT 1
- 自定义的环境能使用这里面的算法跑吗? HOT 1
- custom data traing HOT 1
- 策略函数中的eval_hidden和target_hidden如何理解 HOT 2
- None
- 关于qtran_base.py中_get_individual_q的一个小问题 HOT 2
- 关于qtran的问题 HOT 1
- Translate code comments to English
- Quick Start 会报错,请问是什么问题。 HOT 2
- 关于GA-Common和GA-AC的问题 HOT 1
- 关于evaluate的胜率
- 关于attention的训练依据的问题 HOT 1
- 关于在别的环境应用qmix出现episodes rewards dropout的问题 HOT 2
- 关于训练得到的模型的问题 HOT 1
- 关于get_action_weights的问题 HOT 1
- 关于涉及环境参数的一些疑问 HOT 1
- 关于QMIX的Trick:Eligibility traces HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from marl-algorithms.