Hi, I have some questions about retrieval evaluation of the response generation. I

Question about retrieval evaluation (simmc) [CLOSED]

facebookresearch commented on June 11, 2024

Question about retrieval evaluation

from simmc.

Comments (3)

satwikkottur commented on June 11, 2024

Hello @billkunghappy

Thanks for your interest.

First problem: Each turn in a dialog comes with 100 candidates that need to be scored and ranked. At test time, you do not know which candidate is the "ground truth", so you need to score each candidate independently. The choice of scoring function is completely up to you. Using cross_entropy_loss is one option when the model is trained as a conditional language model. With that choice, you use the candidate as both the input and the target, since you have no knowledge of the "ground truth".
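To illustrate the "score each candidate independently, then rank" step, here is a minimal sketch. It is not the SIMMC baseline code: `rank_candidates` and `overlap_score` are hypothetical names, and the word-overlap scorer is only a stand-in for whatever scoring function you choose (for example, a negative cross-entropy from your model).

```python
from typing import Callable, List, Sequence


def rank_candidates(
    context: str,
    candidates: Sequence[str],
    score_fn: Callable[[str, str], float],
) -> List[int]:
    """Return candidate indices sorted from best to worst score.

    Every candidate is scored on its own; nothing here needs to know
    which candidate is the "ground truth".
    """
    scores = [score_fn(context, cand) for cand in candidates]
    return sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)


def overlap_score(context: str, candidate: str) -> float:
    """Toy scorer (an assumption for this sketch): count candidate words
    that also appear in the dialog context."""
    context_words = set(context.lower().split())
    return float(sum(1 for w in candidate.lower().split() if w in context_words))


context = "do you have this jacket in blue"
candidates = [
    "the dress is red",
    "yes the jacket comes in blue",
    "thank you",
]
ranking = rank_candidates(context, candidates, overlap_score)
print(ranking)  # [1, 2, 0]
```

With a real model you would swap `overlap_score` for your own scoring function and pass all 100 candidates for the turn; the ranking logic stays the same.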

Second problem: During retrieval, you do not feed the candidate sentence to the model in order to "generate" it, but to score its likelihood under the model. If this is what you are asking about, then you feed the ~~ground truth tokens~~ actual candidate tokens (not those predicted by the model) to obtain the probability of each next token in the candidate given the previous ones (teacher forcing).
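A small sketch of what teacher-forced scoring looks like, under loud assumptions: `ToyBigramLM` is a hypothetical stand-in for a trained conditional language model (a real model would condition on the full dialog history, not just the previous token), and `teacher_forced_score` is an illustrative name, not part of the SIMMC code.

```python
import math
from collections import Counter, defaultdict


class ToyBigramLM:
    """Hypothetical stand-in for a trained language model (bigram counts)."""

    def __init__(self, corpus):
        self.vocab = sorted({tok for sent in corpus for tok in sent})
        self.counts = defaultdict(Counter)
        for sent in corpus:
            for prev, nxt in zip(sent, sent[1:]):
                self.counts[prev][nxt] += 1

    def log_prob(self, prev, nxt):
        # Add-one smoothing so unseen bigrams keep a small nonzero probability.
        total = sum(self.counts[prev].values()) + len(self.vocab)
        return math.log((self.counts[prev][nxt] + 1) / total)


def teacher_forced_score(model, context, candidate):
    """Sum of log P(candidate[t] | preceding tokens).

    The history is extended with the candidate's own tokens, never with
    the model's predictions, so no "ground truth" response is required.
    """
    history = list(context)
    score = 0.0
    for token in candidate:
        score += model.log_prob(history[-1], token)
        history.append(token)  # feed the actual candidate token
    return score


corpus = [
    ["<s>", "the", "jacket", "is", "blue", "</s>"],
    ["<s>", "the", "dress", "is", "red", "</s>"],
    ["<s>", "the", "jacket", "is", "red", "</s>"],
]
model = ToyBigramLM(corpus)
context = ["<s>"]

good = teacher_forced_score(model, context, ["the", "jacket", "is", "red", "</s>"])
bad = teacher_forced_score(model, context, ["red", "is", "the", "jacket", "</s>"])
print(good > bad)  # True: the fluent candidate is more likely under the model
```

The same loop is applied to every candidate in turn, so the "input" and the "target" are both the candidate being scored.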

Hope this answers your questions.

P.S.: Edited to avoid overloading the term "ground truth".

satwikkottur commented on June 11, 2024 1

Hello @billkunghappy,

I edited the above comment to add more clarity. Hope this addresses your question.

billkunghappy commented on June 11, 2024

Thanks @satwikkottur.
For the second problem, you said to feed the ground truth tokens to obtain the probability of the next token.
But for the first problem, you said that at test time you would not know the "ground truth" candidate.
My question is: since we don't have the ground truth during testing, how can we feed the ground truth into the model and obtain the candidate's probability?
