Comments (8)
from deepholdem.
what was your river loss? or did you solve 2 streets?
from deepholdem.
@happypepper Hi, thank you for your reply. I solve 2 streets instead of using a river network. And I calculated my exploitability of a turn case, the exploitability is around 2 mbb, so I guess my resolving process is right. Maybe there are bugs in my bucketing?
How many epochs did you use to train your network? I used thousands of epochs but my training loss is still very high.
from deepholdem.
After around 80 epochs, it stopped improving. Validation loss after first epoch was 0.08 already.
How did you do bucketing? k means + EMD?
from deepholdem.
@happypepper I use k-means on the river round, and EMD on other rounds. I used the same bucketing in the reference papers.
I noticed that in your code, you made a change when calculating the loss.
In line 64 in masked_huber_loss.lua, your code is:
local loss_multiplier = (batch_size * feature_size) / self.mask_sum:sum()
This means you average the loss on valid buckets, not on all the 1000 buckets. I think this makes sense, and the author's repo has a bug here.
Is there any way to debug my bucketing? I'm at the end of my rope.
from deepholdem.
how is it possible to use k-means for river? There is only one number instead of distribution. EMD is usually used in combination with k-means.
You can email me and we can communicate outside of github somehow, it's easier
from deepholdem.
@happypepper Hi, I just sent an email to you and described the method of generating river clusters.
from deepholdem.
Hi, can you send me this email as well?
from deepholdem.
Related Issues (20)
- Unnecessary init checks?
- Warm-up opponent ranges HOT 3
- can i have your model files? HOT 21
- bucket_conversion invalid arguments while running raw_converter.lua HOT 2
- Bug in terminal_equity.set_call_matrix HOT 3
- bad argument #2 to '?'
- Be the Dealer HOT 2
- arguments.lua - Change SB / BB
- Invalid index in scatterAdd at .../HTensorMath.c:495
- "Not our Turn" with protocol demo data HOT 2
- Assertion "indexValue >=0" failed in next_round_value_pre.lua:114 HOT 2
- Setup environment
- main_train does nothing HOT 1
- Multi agents or change the blind structure HOT 1
- main_train does still nothing HOT 4
- Not playing best strategy?
- Have anyone created a docker file?
- Using Lua and tricky torch makes the project quite unfriendly to newbees HOT 1
- What is to be expected from this project? HOT 2
- Deepholdem.lua Segmentation fault HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deepholdem.