separius / awesome-sentence-embedding Goto Github PK
View Code? Open in Web Editor NEWA curated list of pretrained sentence and word embedding models
License: GNU General Public License v3.0
A curated list of pretrained sentence and word embedding models
License: GNU General Public License v3.0
ul and li tags are not rendered in the github page, we should somehow put unordered lists in table cells with pure markdown(using 2 or more spaces didn't work)
I recently found a recent sentence embedding model that isn't on this list. If you think it's interesting, it might be worthwhile to include it ๐
Thanks for creating this resource! ๐
add this when they released their codes
Where do you see that the code linked to this paper is indeed a implementation of the paper? I can't see anything of this in the LASER repo of facebookresearch.
Can you help me? I'm just searching that code.
Thanks for your great job!
make sure to add XLM-R
I just found a recent sentence embedding model that doesn't show up on this list. If you think it's interesting enough, it might make sense to include here ๐
Hi there,
Not sure if this list is still being maintained but if so, might I (shamelessly) recommend adding DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations. It is an unsupervised method for learning high-quality sentence embeddings that we recently developed. It is similar to Sentence Transformers in that it pre-trains a transformer-based language model, but because it is unsupervised, you do not need any labels!
add this when the code is ready.
I'm curious how CLIP performs when treated simply as a sentence embedding. Is it competitive?
python scholar.py -t -p "title"
)add InferLite when it has codes
Hi Separius,
As you have described how the sentence embedding work, I have some questions about applying a model into the framework. For example, for doc2vec, what is the encoder to generate contextualized embeddings, and what is the pooling method for those embeddings to build the sentence embedding? Also, for CNN encoder, what output of the encoder can be viewed as the contextualized embeddings? If the framework does not apply for these models, may I ask why ?
Thanks.
include magnitude somewhere
make sure to add ELECTRA
Great work and would you mind to add https://github.com/hanxiao/bert-as-service/ into the list?
add this when they released their code
add MT-DNN when they released their codes
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.