I tried to implement deepstack with python, and generated 4M training samples for the

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

A question about training the neural networks about deepholdem HOT 8 OPEN

zhoujz10 commented on July 18, 2024

A question about training the neural networks

from deepholdem.

Comments (8)

zhoujz10 commented on July 18, 2024

@happypepper

from deepholdem.

happypepper commented on July 18, 2024

what was your river loss? or did you solve 2 streets?

from deepholdem.

zhoujz10 commented on July 18, 2024

@happypepper Hi, thank you for your reply. I solve 2 streets instead of using a river network. And I calculated my exploitability of a turn case, the exploitability is around 2 mbb, so I guess my resolving process is right. Maybe there are bugs in my bucketing?

How many epochs did you use to train your network? I used thousands of epochs but my training loss is still very high.

from deepholdem.

happypepper commented on July 18, 2024

After around 80 epochs, it stopped improving. Validation loss after first epoch was 0.08 already.

How did you do bucketing? k means + EMD?

from deepholdem.

zhoujz10 commented on July 18, 2024

@happypepper I use k-means on the river round, and EMD on other rounds. I used the same bucketing in the reference papers.

I noticed that in your code, you made a change when calculating the loss.

In line 64 in masked_huber_loss.lua, your code is:

local loss_multiplier = (batch_size * feature_size) / self.mask_sum:sum()

This means you average the loss on valid buckets, not on all the 1000 buckets. I think this makes sense, and the author's repo has a bug here.

Is there any way to debug my bucketing? I'm at the end of my rope.

from deepholdem.

happypepper commented on July 18, 2024

how is it possible to use k-means for river? There is only one number instead of distribution. EMD is usually used in combination with k-means.

You can email me and we can communicate outside of github somehow, it's easier

from deepholdem.

zhoujz10 commented on July 18, 2024

@happypepper Hi, I just sent an email to you and described the method of generating river clusters.

from deepholdem.

aligatorblood commented on July 18, 2024

Hi, can you send me this email as well?

from deepholdem.

Recommend Projects

A question about training the neural networks about deepholdem HOT 8 OPEN

Comments (8)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs