First of all, I would like to know how to calculate the ppl of d1 and d2. <p dir="

The ppl calculation: <a href="https://huggingface.co/transformers/perplexity.html" rel

How to calculate and evaluate the ppl of D1 and D2. about bob HOT 9 CLOSED

songhaoyu commented on July 24, 2024

How to calculate and evaluate the ppl of D1 and D2.

from bob.

Comments (9)

haoyusoong commented on July 24, 2024

have you ever taken a look at the generated results?

from bob.

haoyusoong commented on July 24, 2024

The ppl calculation: https://huggingface.co/transformers/perplexity.html

from bob.

haoyusoong commented on July 24, 2024

However, as long as the final output of the model, the d2 score, improves, we don't need to worry about the d1 score.
Please tell us why you decided that epoch 7 (Perplexity on test set is 21.037 and 7.813.) is optimal.

PPL is just one of the indicators, and there are many other metrics. Our goal is to generate good dialogue responses rather than getting the extremely low ppl. The Epochs > 15 are usually overfitted on the ppl metric and suffer a significant quality drop of the responses. In our test run, epoch 7 delivers good responses and has a competitive performance on all metrics, including the relatively good ppl (cf. baselines).

from bob.

iyo-0713 commented on July 24, 2024

I see. I understand now.
Thank you very much for answering my question.

from bob.

Wenze7 commented on July 24, 2024

Hi, bro, I would like to ask that have you reproduced the results mentioned in paper?

from bob.

iyo-0713 commented on July 24, 2024

Hi, bro. I could not reproduce the results mentioned in paper.

from bob.

Wenze7 commented on July 24, 2024

That's too bad, i tried to contact with the author, but I never receive reply. Do you know of any papers that use NLI and can reproduce the results？

from bob.

iyo-0713 commented on July 24, 2024

Uhm...
My friend also used this model, but couldn't reproduce the result. I think it is difficult to reproduce the result in this paper.
I changed the model using in my research. I don't know other models using NLI.

from bob.

Wenze7 commented on July 24, 2024

Fine, thanks!

from bob.

Recommend Projects

How to calculate and evaluate the ppl of D1 and D2. about bob HOT 9 CLOSED

Comments (9)

Related Issues (19)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs