GithubHelp home page GithubHelp logo

Comments (11)

zcysun avatar zcysun commented on August 29, 2024

Hello, I have another question. The current average reward graph is drawing a point every 200eposide, how can I modify the frequency to require him to update it every specific eposides, so that the points sampled by the graph can be controlled?

from marl_cavs.

DongChen06 avatar DongChen06 commented on August 29, 2024

Hi: As shown in the question, I want to get the observation information of the surrounding vehicles (or all the vehicles in the area). For example, I want to calculate the combined cost of all the vehicles in the area, view the speed information of all the vehicles (including self vehicles and HVs), etc., how do I get this information first?

Thank you very much for your answer.

Hi, if you know how to debug, then you may look at the merge_env_v1.py, there should be some variables called road.vehicles. It contains all the vehicle information

from marl_cavs.

DongChen06 avatar DongChen06 commented on August 29, 2024

Hello, I have another question. The current average reward graph is drawing a point every 200eposide, how can I modify the frequency to require him to update it every specific eposides, so that the points sampled by the graph can be controlled?

For this question, you may look at the training codes and modify the log frequency. thanks

from marl_cavs.

zcysun avatar zcysun commented on August 29, 2024

Thank you very much for your answer!

Currently the code outputs an average reward graph not with one coordinate point per eposide, but with one reward value per "EVAL_INTERVAL", and it renders and outputs three training videos per EVAL_INTERVAL; this way, when I turn down EVAL_INTERVAL (for debugging purposes) I will be able to save the training videos frequently. I don't actually want the training videos right now, so I have the following questions:

  1. Where can I change the code so that each eposide outputs a reward value so that I end up with a densely-populated average reward plot?
  2. how do I turn off rendering and not have the training video appear and save, I only care about the average reward results.

Also, besides the average reward plot, what other meaningful result plots can I output for analysis?

Thank you very much for your answer!

from marl_cavs.

zcysun avatar zcysun commented on August 29, 2024

I'm really sorry for the interruption, but I'd like to ask you a few more questions!

Looking at the information in road.vehicles as you said, I printed them and the output is as follows:
! output
It seems that each vehicle will have three pieces of information, the first being the number (again, a question. Why is this number all three digits when the training video is something like #1#2?) The remaining two numbers I can't seem to find out exactly what they mean, can you tell me?

Thank you very much for your answer!

from marl_cavs.

zcysun avatar zcysun commented on August 29, 2024

I'm very sorry for the inconvenience.
! Screenshot 2023-11-29 09-43-10!
As you can see above, I noticed that the eval_logs and eval_videos folders are empty after each training session. Is this normal? Also, are the training logs saved and where should I look for them?

I would appreciate if you could answer!

from marl_cavs.

DongChen06 avatar DongChen06 commented on August 29, 2024

I'm very sorry for the inconvenience. ! Screenshot 2023-11-29 09-43-10! As you can see above, I noticed that the eval_logs and eval_videos folders are empty after each training session. Is this normal? Also, are the training logs saved and where should I look for them?

I would appreciate if you could answer!

Yes, it is normal. When you implemented the evaluation, there will be videos in the folders

from marl_cavs.

zcysun avatar zcysun commented on August 29, 2024

Hello!

After debugging your publicly available code and running MAACKTR algorithm for approximately 1000 episodes, here is the result graph of average rewards. It seems there is no clear convergence trend(I ran it for several more times, but there still seems to be no convergence trend.). I would like to ask how many episodes it typically takes to achieve convergence, and if it doesn't converge, what are the key parameters to adjust?
Figure_5

Thank you for your response!

from marl_cavs.

DongChen06 avatar DongChen06 commented on August 29, 2024

Hello!

After debugging your publicly available code and running MAACKTR algorithm for approximately 1000 episodes, here is the result graph of average rewards. It seems there is no clear convergence trend(I ran it for several more times, but there still seems to be no convergence trend.). I would like to ask how many episodes it typically takes to achieve convergence, and if it doesn't converge, what are the key parameters to adjust? Figure_5

Thank you for your response!

Hi, which traffic modes you used? You may need 5000-10000 episodes to see an apparent increasing trend.

from marl_cavs.

zcysun avatar zcysun commented on August 29, 2024

Thanks for your reply!

The traffic modes is medium, when I increase the number of training rounds to around 10,000 the running time is around 12hours, is this within reason?

Also, I would like to output the same type of average speed graph as in your paper, where do I modify the code?
截图 2023-12-01 09-23-36

Thank you!

from marl_cavs.

DongChen06 avatar DongChen06 commented on August 29, 2024

The plot is based on three random seeds. So you may need to run the codes for three times with different seeds.

from marl_cavs.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.