First of all thanks for releasing the source code for the successful training of Doom

Hi, The run.sh file here <a href="h

I used the following command to train a model on track_1. <div class="highlight hi

I used the following command to train a model on track_1. <div class=

What's the training settings for track_1 and track_2 model about arnold HOT 6 CLOSED

glample commented on July 30, 2024

What's the training settings for track_1 and track_2 model

from arnold.

Comments (6)

glample commented on July 30, 2024

Hi,

The run.sh file here https://github.com/glample/Arnold/blob/master/run.sh will contain the running commands to visualize the agents, and these commands also contain the training parameters. The parameters we used that are not specified in the commands are the default ones in the code.

We did not use curriculum learning or anything like this, although it might help a bit.

from arnold.

flyers commented on July 30, 2024

Hi,
Thanks for your quick reply. I will try to reproduce this from scratch.
By the way, how long does it take for the model to reach the performance of the released ones?

from arnold.

glample commented on July 30, 2024

On a P100 it should take a couple of days.

from arnold.

flyers commented on July 30, 2024

I used the following command to train a model on track_1.

python3 arnold.py --exp_name track_1 --main_dump_path $PWD/dumped \
--frame_skip 3 --action_combinations "attack+move_lr;turn_lr;move_fb" \
--network_type "dqn_rnn" --recurrence "lstm" --n_rec_layers 1 --hist_size 4 --remember 1 \
--labels_mapping "" --game_features "target,enemy" --bucket_size "[10, 1]" --dropout 0.5 \
--speed "on" --crouch "off" --map_ids_test 1 --manual_control 1 \
--scenario "deathmatch" --wad "deathmatch_rockets" --gpu_id 0

The training process runs up to 10119600 iterations while the log shows that the best performance model is best-120000.pth, achieving a frag score of 62. I also note that the variance of the frags score is quite large between different iterations. Is that normal?

from arnold.

glample commented on July 30, 2024

What is the K/D ratio? The number of frags isn't necessarily very relevant because a bot can get quite a good number of frags by shooting randomly. Did you visualize the agent to have a look at how it behaves?

Regarding the variance between different iterations, this is normal yes. Variance should decrease if you increase the evaluation time, but the number of frags usually oscillates quite a lot (as opposed to the K/D which is usually more stable).

from arnold.

Maxwell2017 commented on July 30, 2024

I used the following command to train a model on track_1.
python3 arnold.py --exp_name track_1 --main_dump_path $PWD/dumped \
--frame_skip 3 --action_combinations "attack+move_lr;turn_lr;move_fb" \
--network_type "dqn_rnn" --recurrence "lstm" --n_rec_layers 1 --hist_size 4 --remember 1 \
--labels_mapping "" --game_features "target,enemy" --bucket_size "[10, 1]" --dropout 0.5 \
--speed "on" --crouch "off" --map_ids_test 1 --manual_control 1 \
--scenario "deathmatch" --wad "deathmatch_rockets" --gpu_id 0
The training process runs up to 10119600 iterations while the log shows that the best performance model is best-120000.pth, achieving a frag score of 62. I also note that the variance of the frags score is quite large between different iterations. Is that normal?

Hello flyers. Did you reproduce the Pretrain model? I don't know if it is caused by my parameter setting. After three days of training, the track1 result is not very high, K/D ratio less than 1 @flyers

from arnold.

What's the training settings for track_1 and track_2 model about arnold HOT 6 CLOSED

Comments (6)

Related Issues (13)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs