GithubHelp home page GithubHelp logo

nevmenandr / word2vec-russian-novels Goto Github PK

View Code? Open in Web Editor NEW
46.0 3.0 12.0 9.51 MB

Inspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris Orekhov ๐Ÿ“š

Home Page: https://nevmenandr.github.io/novel2vec/

Jupyter Notebook 100.00%
word2vec russian-literature word2vec-russian-novels digital-humanities

word2vec-russian-novels's Introduction

DOI

Jupyter Notebook

word2vec-russian-novels ๐Ÿ“–

Fun digital humanities project by Boris Orekhov

Inspired by this work the replacement of words of Russian most valuable novels text with closest word2vec model words.

I used a model (ruwikiruscorpora) from RusVectลrฤ“s project.

Other dependencies:

Possible applications:

  • Fun
  • Source for tests for so called "olympic" competitions in literature
  • Base for literary studies that include the principle question "why this word, not the other?"

word2vec-russian-novels's People

Contributors

nevmenandr avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

word2vec-russian-novels's Issues

Request for English version or how to edit original

Hello:

Thanks for posting this repo. I have your original Russian word2novel.py running 100% with the ruwikiruscorpora.

Unfortunately I don't speak/read Russian I was wondering if you could post an English version that runs with GoogleNews-vectors-negative300.bin.gz or similar English model?

I've hacked my way through your script by changing all Russian characters to English but it only outputs the original English files to the "books_after" folder.

Thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.