Comments (3)
Hello @billkunghappy
Thanks for your interest.
First Problem: Each turn in a dialog contains 100 candidates that need to be scored and ranked. At test time, you would not know the "ground truth" candidate and thus need to score each candidate independently. Further, the scoring function to use is completely up to you. Using cross_entropy_loss
is one way when training the model as a conditional language model. For this choice of scoring function, you have to use the candidate as both the input and target as you have no knowledge of the "ground truth".
Second Problem: During retrieval, one does not feed the candidate sentence to "generate" it but to score its likelihood under the model. If this is what you're talking about, then you feed the actual candidate tokens (not those predicted by the model) ground truth tokens to obtain the probability of the next token in the candidate given the previous ones (teacher forcing).
Hope this answers your questions.
P.S.: Edited to avoid overload of the word "ground truth".
from simmc.
Hello @billkunghappy ,
I edited the above comment to add more clarity. Hope this addresses your question.
from simmc.
Thanks @satwikkottur
For the second problem, you said to feed the ground truth token to obtain the probability of the next token
But in first problem, you said At test time, you would not know the "ground truth" candidate
The question is that since we don't have the ground truth during testing, how are we able to feed the ground truth into the model and acquire the candidate's probability?
from simmc.
Related Issues (20)
- Incorrect evaluation script provided for MM-DST baseline HOT 1
- Incorrect Hyperparameters ? HOT 8
- Baselines results for API call prediction HOT 1
- action_evaluation expected file format HOT 4
- Baselines results HOT 1
- Bug in baseline? (missing sigmoid) HOT 1
- Possible bugs in evaluation script in SubTask #1 HOT 2
- Are we allowed to use "turn_label" fields for subtasks 1-2 ? HOT 8
- Question about Fashion attributes HOT 1
- SubTask #3 evaluation lower case issue HOT 1
- Bug in mm_dst baseline HOT 1
- Question about submission models HOT 2
- Question about test-std files HOT 9
- Question about the new evaluation method for Task 1&2 HOT 1
- KeyError caused by ~teststd_dials_retrieval_candidates_public.json HOT 1
- How to get images HOT 2
- bug in run scripts/preprocess_simmc.sh HOT 3
- Question about mm_action_prediction/scripts/train_simmc_model.sh HOT 1
- How can I get images of fashion items?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from simmc.