Comments (8)
Successfully reproduced on the adult dataset with your specified preprocessing. As verification, the changes in the last few commits in project/scaffolding have only been syntactic, right?
from sdgym.
Tried it on both the Census and the Adult datasets, and with both the GMMTransformer and the BGMTransformer. Neither change seems to make a difference.
Narrowed the problem down to something specifically with TGAN. Using the latest project_scaffolding the problem persists with both the census and adult dataset. TableGAN does not seem to have the problem. MedGAN also doesn't seem to suffer from this problem.
Thanks for reporting this. We will have a look and see how it can be fixed.
Good to hear. I have the old TGAN working so I don't necessarily need this TGAN. Thus, I likely won't investigate any further into this issue either.
@Baukebrenninkmeijer we just released the paper about SDGym here. It explains the latest TGAN model, which is the one in the current SDGym repo (the older version was in this repo). The paper introduces a substantially new approach (we should really call it CTGAN to differentiate it from TGAN; I can see how that can be confusing). It also explains the reasons behind SDGym and why we created it.
Haha you guys beat me to it! I was hoping your paper would be finished after my thesis.
Could you maybe elaborate on the usage of the Gumbel softmax? I've been confused by when to and when not to use adapted softmax functions to overcome the differentiation problem. In the old TGAN paper you stated that it was not necessary to use the Gumbel softmax because the number of categories is fairly small, so the probability distribution can be generated directly with softmax. However, in the new TGAN paper you decide to use it to overcome the same stated problem.
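For readers following the question, here is a minimal NumPy sketch of the two options being contrasted: a plain softmax over the category logits versus a Gumbel-softmax, which adds Gumbel noise and divides by a temperature before the softmax. This is not the code from either paper or from SDGym; the function names, the temperature value `tau=0.2`, and the example logits are assumptions made purely for illustration.

```python
import numpy as np


def softmax(logits, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def gumbel_softmax(logits, tau=0.2, rng=None):
    """Relaxed categorical sample: softmax((logits + Gumbel noise) / tau).

    tau is an illustrative temperature (an assumption, not a value taken
    from the thread). Smaller tau pushes the output closer to one-hot.
    """
    rng = np.random.default_rng() if rng is None else rng
    # Draw U in [1e-12, 1) so that both logarithms below stay finite.
    u = rng.uniform(low=1e-12, high=1.0, size=logits.shape)
    gumbel_noise = -np.log(-np.log(u))
    return softmax((logits + gumbel_noise) / tau)


# Hypothetical logits for one categorical column with three categories.
logits = np.log(np.array([0.7, 0.2, 0.1]))

plain = softmax(logits)  # deterministic: the same probabilities every call
relaxed = gumbel_softmax(logits, rng=np.random.default_rng(0))  # stochastic sample
```

The plain softmax always returns the same probability vector for the same logits, while each Gumbel-softmax call returns a different, nearly one-hot vector whose largest entry falls on category `k` with probability `softmax(logits)[k]`. Both outputs remain differentiable with respect to the logits.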