facebookresearch / dme
Dynamic Meta-Embeddings for Improved Sentence Representations
License: Other
Thanks for this great repo. Is the code for CDME going to be available?
I believe the DME checkpoint you provide is incompatible with the model that is constructed from SNLI data for task=snli. The vocabularies appear to mismatch, so the checkpoint's outermost layers (the embeddings) cannot be loaded into the model constructed this way. Of course, we could train our own model using the script, but I thought I should mention this in case other people are facing the same issue.
I see that the README links to the corresponding arXiv paper. Please update the link to point to the HTML abstract page, i.e. https://arxiv.org/abs/1804.07983 , and not to the PDF. Some readers only want to read the abstract. If they want the PDF, they can easily click the PDF link on the arXiv abstract page.
"out = F.dropout(out, p=self.args.emb_dropout)"
It seems that this line will trigger dropout during inference.
Line 67 in a3217ee
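To illustrate the concern, here is a minimal sketch, not the repository's code. `torch.nn.functional.dropout` has a `training` argument that defaults to `True`, so a call that omits it keeps dropping values even after `model.eval()`. The `EmbDropout` class and its `respect_mode` flag are hypothetical names introduced only for this comparison.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class EmbDropout(nn.Module):
    """Hypothetical stand-in for the embedding-dropout step quoted above."""

    def __init__(self, p, respect_mode):
        super().__init__()
        self.p = p
        self.respect_mode = respect_mode

    def forward(self, out):
        if self.respect_mode:
            # Dropout is active only while the module is in training mode.
            return F.dropout(out, p=self.p, training=self.training)
        # training defaults to True, so values are dropped at inference too.
        return F.dropout(out, p=self.p)


x = torch.ones(4, 1000)

# Both modules are switched to evaluation mode, as they would be at inference.
always_on = EmbDropout(p=0.5, respect_mode=False).eval()
mode_aware = EmbDropout(p=0.5, respect_mode=True).eval()

always_on_changed = not torch.equal(always_on(x), x)
mode_aware_unchanged = torch.equal(mode_aware(x), x)
```

Passing `training=self.training`, or using an `nn.Dropout` submodule, are two common ways to make the call follow `train()` and `eval()`; which one fits this codebase is for the maintainers to decide.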
Why does the preprocessing function filter out words that are not in the combined embedding vocabulary, instead of replacing them with <unk>?
For example, take the sentence "hide new secretions from the parental units". If the word "secretions" is not in emb_vocab, the sentence will be converted into "hide new from the parental units". I think the correct sentence should be "hide new <unk> from the parental units".
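The two behaviours being compared can be sketched as follows. This is an illustration only: `filter_oov`, `replace_oov`, and the toy `emb_vocab` set are hypothetical and are not taken from the repository's preprocessing code.

```python
UNK = "<unk>"


def filter_oov(tokens, emb_vocab):
    """Drop every token that is missing from the embedding vocabulary."""
    return [tok for tok in tokens if tok in emb_vocab]


def replace_oov(tokens, emb_vocab, unk=UNK):
    """Keep sentence length and word positions by mapping unknown tokens to <unk>."""
    return [tok if tok in emb_vocab else unk for tok in tokens]


# Toy vocabulary (an assumption) that lacks the word "secretions".
emb_vocab = {"hide", "new", "from", "the", "parental", "units"}
tokens = "hide new secretions from the parental units".split()

filtered = filter_oov(tokens, emb_vocab)
replaced = replace_oov(tokens, emb_vocab)
```

Filtering shortens the sentence and shifts the positions of the remaining words, while replacement keeps the original length at the cost of needing an `<unk>` entry in the vocabulary.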
Hi, thanks for sharing the code! It really helped me understand the paper.
I have a question about the way you calculate the attention score, especially about which dimension the softmax is applied over.
From the formula in the paper, it seems that the softmax function (g) takes as input the word vectors FROM TIME STEP 1 TO S, meaning that the alpha distribution is over the SEQUENCE, not over the NUMBER OF PRETRAINED VECTORS.
Lines 183 to 203 in 97631c4
However, line 203 takes the softmax over dim=2, which is the NUMBER OF PRETRAINED VECTORS (self.n_emb).
I am a little confused by the mismatch here. I think the formula in the paper should be revised. Am I missing something? Please help me!
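The difference between the two readings can be made concrete with a small pure-Python sketch. The scores below are made-up toy values, and the layout `scores[t][j]` (word position `t`, embedding type `j`) is an assumption for illustration; it does not reproduce the repository's tensors.

```python
import math


def softmax(values):
    """Numerically stable softmax over a flat list of scores."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


# scores[t][j]: toy attention score for word t and embedding type j
# (3 words, 2 embedding types; the values are invented for illustration).
scores = [[1.0, 2.0], [0.5, 0.5], [3.0, 0.0]]

# Reading 1: normalise over embedding types, separately for each word.
# This corresponds to a softmax over the embedding axis (dim=2 in the issue).
over_embeddings = [softmax(row) for row in scores]

# Reading 2: normalise over the sequence, separately for each embedding type.
columns = [list(col) for col in zip(*scores)]
normalised_columns = [softmax(col) for col in columns]
over_sequence = [list(row) for row in zip(*normalised_columns)]
```

Under the first reading, the weights for each word sum to 1 across embedding types, so each word gets its own mixture of embeddings. Under the second, the weights for each embedding type sum to 1 across the sentence.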
The README mentions Python 2, which is almost obsolete. Please upgrade to and formally support Python 3.7, which is the current version. Python 2 code cannot be readily integrated into modern codebases.