Comments (4)
same error - commenting to see if there's a followup
from mt-dnn.
As an initial diagnosis, it seems that the QNNLI loader starts by loading the QNLI train TSV file, which has 104744 lines including the header. We remove the header, leaving an odd number of lines which fails the assert. This is true for the dev and test set files as well, which have 5464 lines.
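To make the parity argument concrete, here is a quick check using only the line counts quoted above. The helper name `data_line_count` is made up for illustration and is not part of mt-dnn.

```python
def data_line_count(total_lines, has_header=True):
    """Return how many data lines remain once the header is dropped."""
    return total_lines - 1 if has_header else total_lines


# Line counts as reported in this comment (header included).
for name, total in [("train", 104744), ("dev/test", 5464)]:
    n = data_line_count(total)
    print(name, n, "even" if n % 2 == 0 else "odd")
```

Both counts come out odd, so a check that requires an even number of data lines would fail on all three files.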
I'm not familiar with what data is supposed to be read in, but it seems that only one of the line pairs in the QNLI data doesn't trigger the if statement here. Not sure if that's expected, but it seemed weird.
I tried dropping the assert and breaking if there's no subsequent line (if idx+1 >= len(lines)), but that fails on the test set on the assertion here. I tried adding code that simply continues the loop if block1[1] != block2[1].
That code is still running, but it's possible that breaking the preprocessing this way might screw up later stages of the model and make fine-tuning worse. @namisan, please let us know if there's a stable commit that doesn't have this problem.
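For anyone who wants to try the same workaround, here is a minimal sketch of what I mean. It is not the actual mt-dnn loader: the function name `pair_rows` is hypothetical, and it assumes that column 1 of each tab-separated row is the field that has to match between the two lines of a pair.

```python
def pair_rows(lines, has_header=True):
    """Group consecutive TSV lines into pairs, skipping unusable ones.

    Assumption (not confirmed against mt-dnn): column 1 of each row is
    the field that must be identical for two rows to form a pair.
    """
    if has_header:
        lines = lines[1:]
    pairs = []
    for idx in range(0, len(lines), 2):
        if idx + 1 >= len(lines):
            break  # dangling last line: nothing to pair it with
        block1 = lines[idx].rstrip("\n").split("\t")
        block2 = lines[idx + 1].rstrip("\n").split("\t")
        if len(block1) < 2 or len(block2) < 2:
            continue  # malformed row, no column 1 to compare
        if block1[1] != block2[1]:
            continue  # rows do not match: skip instead of asserting
        pairs.append((block1, block2))
    return pairs


# Made-up sample: one matching pair, one mismatched pair, one dangling line.
sample = [
    "index\tquestion\tsentence\tlabel",
    "0\tq1\ts1\tentailment",
    "1\tq1\ts2\tnot_entailment",
    "2\tq2\ts3\tentailment",
    "3\tq3\ts4\tentailment",
    "4\tq4\ts5\tentailment",
]
pairs = pair_rows(sample)
print(len(pairs))
```

Note that this silently drops every mismatched or unpaired row, which is exactly why I'm worried about the effect on later stages.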
Thanks for the suggestions. I've fixed the issue; please refer to the README for more information.
Confirmed this works - thanks @namisan!