Comments (9)
Have you ever taken a look at the generated results?
from bob.
The ppl calculation: https://huggingface.co/transformers/perplexity.html
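For reference, here is a minimal sketch of the general idea behind that page: perplexity is the exponential of the average negative log-likelihood per token. It is not the code used in this repository; the function name `perplexity` and the example probabilities are made up for illustration, and it assumes you already have the natural-log probability the model assigned to each reference token.

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative log-likelihood) over the given tokens.

    token_log_probs: natural-log probabilities the model assigned to each
    reference token (hypothetical input; how you extract these depends on
    your model and tokenizer).
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Made-up example: three tokens with probabilities 0.25, 0.5 and 0.125.
example_log_probs = [math.log(0.25), math.log(0.5), math.log(0.125)]
ppl = perplexity(example_log_probs)
print(f"perplexity = {ppl:.3f}")
```

A lower value means the model assigns higher probability to the reference tokens, which is not the same thing as producing better responses.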
from bob.
However, as long as the final output of the model, the d2 score, improves, we don't need to worry about the d1 score.
Please tell us why you decided that epoch 7 (perplexity on the test set is 21.037 and 7.813) is optimal.
PPL is just one of the indicators, and there are many other metrics. Our goal is to generate good dialogue responses rather than to reach an extremely low ppl. Epochs beyond 15 usually overfit to the ppl metric and suffer a significant drop in response quality. In our test run, epoch 7 delivers good responses and performs competitively on all metrics, including a relatively good ppl (cf. baselines).
from bob.
I see. I understand now.
Thank you very much for answering my question.
from bob.
Hi, bro, I would like to ask: have you reproduced the results mentioned in the paper?
from bob.
Hi, bro. I could not reproduce the results mentioned in the paper.
from bob.
That's too bad. I tried to contact the author, but I never received a reply. Do you know of any papers that use NLI and whose results can be reproduced?
from bob.
Uhm...
My friend also used this model, but couldn't reproduce the results. I think it is difficult to reproduce the results in this paper.
I changed the model used in my research. I don't know of other models that use NLI.
from bob.
Fine, thanks!
from bob.
Related Issues (19)
- File name "xlib.modeling_tf_auto" not found HOT 1
- per_input_ids=persona_input_ids?
- Error in the code when training with DDP HOT 1
- `per_input_ids=persona_input_ids` seems not be used when `ul_training` HOT 3
- Could you provide your model? HOT 1
- Which PPL to be used?
- Some points I don't fully understand about `ul_training` HOT 3
- How to get the results of other metrics?
- Question about nliset of PersonaChat HOT 3
- 1
- How to run this project in "pytorch1.8"
- Question about C.score
- Question about CUDA out of memory HOT 1
- Question about the PersonaChat data HOT 4
- Could you provide GDR code? HOT 1
- Question about Dist.1 / 2 HOT 4
- Can't use model.generate when beam_size>1 HOT 1
- Data inconsistency problem HOT 3