Thanks for your wonderful work. Can it be used for other stance detection settings, such as SemEval-2016 Task 6 Sub-task A, instead of the US presidential election? Many thanks for your response.
Hi, I am facing an issue when trying to load the model in Colab. I used `git clone` to download all the files.
Also, downloading by model name is not working for your model in my testing. Here is a screenshot from trying to use the model name instead of the file path.
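For reference, this is roughly how I am trying to load it (a sketch assuming the standard `transformers` auto-class API; the local path and Hub id below are placeholders, not your actual names):

```python
from transformers import AutoTokenizer, AutoModel

# Loading from the locally cloned files (placeholder path to the clone):
local_path = "./path-to-cloned-repo"
tokenizer = AutoTokenizer.from_pretrained(local_path)
model = AutoModel.from_pretrained(local_path)

# Loading by model name is the call that fails for me
# (placeholder id, not the real Hub name):
# model = AutoModel.from_pretrained("your-username/your-model-name")
```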
Thank you for the NAACL '21 paper. It presents an interesting way to incorporate knowledge when fine-tuning BERT, and the experimental results are comprehensive and persuasive.
But I have a couple of questions about the fine-tuning process.
In Eqn 5, it seems to me that both $y_i$ and $\hat{y}_i$ have dimension equal to the vocabulary size. This makes the output of the model a vector rather than a scalar. How do you compute the gradient in this setting?
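To make the question concrete, here is a minimal sketch of how I currently read Eqn 5, assuming it is standard cross-entropy over the vocabulary (please correct me if the paper's loss differs; the token index is arbitrary):

```python
import torch

vocab_size = 30522  # BERT-base vocabulary size
logits = torch.randn(1, vocab_size, requires_grad=True)  # model output at one masked position
y_hat = torch.softmax(logits, dim=-1)                    # \hat{y}_i: distribution over the vocabulary
y = torch.zeros(1, vocab_size)
y[0, 12345] = 1.0                                        # y_i: one-hot vector for the gold token

# Both y and y_hat are vocab-sized vectors, but the cross-entropy
# sum collapses them into a single scalar loss:
loss = -(y * torch.log(y_hat)).sum()
loss.backward()  # the gradient is taken of this scalar, not of the vector output
```

Is this the right reading, i.e. the vector outputs only enter the loss through a scalar sum?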
If multiple words are masked in the same sentence (see the example below, followed by a probing sketch), how do you decide the output of the second masked token when the first masked token has already been predicted (for example, [MASK] -> hilarious)? Do you use the new token (hilarious), or do you still stick to the original token (happy)?
ORIGINAL: I'm so happy Biden beat Trump in the debate.
MASKED: I'm so [MASK] Biden [MASK] Trump in the debate.
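For concreteness, this is how I would probe the two masked positions with a standard BERT MLM (a sketch using `bert-base-uncased` as a stand-in for your model). Here both positions are predicted in a single forward pass, each conditioned on the other still being [MASK]; is that what happens during your fine-tuning, or is there an iterative scheme where predicted tokens are fed back in?

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

text = "I'm so [MASK] Biden [MASK] Trump in the debate."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # one forward pass covers both masks

# Find every [MASK] position and print its top-1 prediction:
mask_positions = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
for pos in mask_positions:
    top_id = logits[0, pos].argmax().item()
    print(pos.item(), tokenizer.decode([top_id]))
```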