Comments (5)
- There is no difference between the train.src_path and generator.src_path. Actually, they are not used at the same time. The former is used to pre-train the model and the latter is used to train the generator during the gan training.
- The dis_positive_data and the dis_negative_data is generated automatically. In our experiments, it will includes 5000 sentence for training the discriminator during gan training.
- the parameter λ is not defined in the config file, just set in the model.py
- We stop the training when the model achieves no improvement on the development sets. The process for the evaluating is the same with the testing. We didn't test the PPL score for evaluation.
from nmt_gan.
Hi @ZhenYangIACAS ,
Thanks for the answer!
You said that the dis_positive_data and the dis_negative_data are generated automatically, but we need to generate it for the first iteration of the first epoch right?
Another question about the evaluation - did you use evaulate.sh for evaluation?
When did you evaluate bleu score on the dev set? every epoch?
Did you stop training when the model achieves no improvement for the tenth evaluation on the dev set or first evaluation or another number?
Thanks!
from nmt_gan.
After the preprocessing you start training with GANs, do you use early stopping there as well (with patience 10?) or do you have another way to decide when to stop training? (similar to @zoharai's last question)
from nmt_gan.
@zoharai @borgr During the GAN training, you do not need to generate the dis_positive_data and dis_negative_data even for the first epoch. You can find the logic behind in the gan_train.py. For the evaluation, we stop training when the model achieves no improvement for the tenth evaluation on the dev set, which has been clarified in our paper. And we evaluate the model at regular intervals.
from nmt_gan.
@zoharai @ZhenYangIACAS Sorry to bother both of you. I am still confused about the evaluation. I can't find it where it is used. Could help me ?
from nmt_gan.
Related Issues (20)
- Preprocessing of en2de data HOT 4
- InvalidArgumentError: Assign requires shapes of both tensors to match.
- How to get the vocab files? HOT 7
- Dropout HOT 1
- NoneType HOT 2
- file not found HOT 1
- the g_loss in function gan_output (in model.py) HOT 1
- Hi,Where is the training data? Thanks.
- Mistake in vocab.py?
- How to get the dis_negative_data in the config_discriminator_pretrain.yaml? HOT 2
- dis_saveto HOT 10
- list index out of range HOT 2
- Can't find data
- GAN training is too slow.
- dis_negative_data HOT 2
- the Adversal training is very very very slow!!!!
- About discriminator
- Nmt GANs
- Training data size for the generator and discriminator HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nmt_gan.