GithubHelp home page GithubHelp logo

clips / dutchembeddings Goto Github PK

View Code? Open in Web Editor NEW
82.0 10.0 14.0 1.6 MB

Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", presented at LREC 2016.

License: GNU General Public License v2.0

Python 100.00%
word embeddings vector dutch sonar500

dutchembeddings's People

Contributors

cmry avatar guydepauw avatar stephantul avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dutchembeddings's Issues

no acces to word embedding sets

There is no acces to the word embeddings with the links provided.
They result in the following webpage:
https://www.clips.uantwerpen.be/dutchembeddings/combined-320.tar.gz
which shows:

Forbidden
You don't have permission to access this resource.

Apache/2.4.18 (Ubuntu) OpenSSL/1.0.2g mod_wsgi/4.3.0 Python/2.7.12 mod_perl/2.0.9 Perl/v5.22.1 Server at www.clips.uantwerpen.be Port 443

I'm quite desperate, I planned on using these sets in my bachalor thesis of my study (artificial intelligence, Utrecht University), but now I just found out that I don't have acces. Is there a way to receive this acces or the datasets?

Pascal Verkade
Bachelor student AI @ Utrecht University

Vector for unknown tokens

Hello, I was wondering if the token 'unk' in the COW models corresponds to the vector trained for unknown tokens. Or is it just a word encountered in the corpus? Best regards.

Memory error. Op windows kan ik de most similar functie niet uitvoeren.

Hoi,

Ik heb een probleem met het gebruiken van de embeddings. Ik weet haast zeker dat het met mijn lokale computer te maken heeft. Ik denk dat het om de overcomitting ratio gaat. Nou weet ik niet hoe of dat ik deze kan aanpassen in windows.

Kan een van jullie mij misschien op weg helpen?

Groeten,

Giovanni

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.