asirem16 / kor-sentence-similarity

This project forked from yoongi0428/kor-sentence-similarity

Sentence/Text Similarity Models for Korean

License: MIT License

Python 100.00%

Kor-Sentence-Similarity

Sentence/Text Similarity for Korean (in a simple way)

Models

Details

  • Data

    • In the data, each line holds two questions separated by '\t'
  • Preprocessing

    • Character level: phoneme (음소) or syllable (음절)
    • Digits and special characters
    • For eumjeol (syllable level), use the 2,350 frequent syllables
  • Configuration

    • main.py : main run file
    • --epochs : # of training epochs
    • --batch : Batch Size
    • --lr : Learning rate
    • --strmaxlen : Maximum Limit of String Length
    • --charsize : Vocab Size
    • filter_num : # of filters in each CNN layer
    • --emb : Embedding Dimension
    • --eumjeol : Use Eumjeol(Syllable-level) if specified
    • threshold : Threshold to determine Similar or not
    • --model : Model Selection (CNN, MLP)
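The data and preprocessing notes above can be illustrated with a minimal sketch. It is not the code in this repository: the helper names (split_pair, to_phonemes, build_vocab, encode), the PAD/UNK ids, and the toy vocabulary are assumptions made for illustration. Phoneme-level splitting is done here with Unicode NFD normalization, which decomposes each Hangul syllable into its jamo; the project itself may decompose characters differently.

```python
import unicodedata

# Hypothetical reserved ids (assumption): 0 pads short inputs, 1 marks unknown characters.
PAD, UNK = 0, 1


def split_pair(line):
    """Split one data line into its two tab-separated questions."""
    first, second = line.rstrip("\n").split("\t", 1)
    return first, second


def to_phonemes(text):
    """Decompose Hangul syllables into jamo (phoneme level) via Unicode NFD."""
    return unicodedata.normalize("NFD", text)


def build_vocab(chars):
    """Map each character to an integer id, starting after the reserved ids."""
    return {ch: i + 2 for i, ch in enumerate(chars)}


def encode(text, vocab, strmaxlen):
    """Turn text into a fixed-length id list: truncate, then right-pad with PAD."""
    ids = [vocab.get(ch, UNK) for ch in text[:strmaxlen]]
    return ids + [PAD] * (strmaxlen - len(ids))


# Toy syllable-level vocabulary; a real one would hold the frequent syllables,
# digits, and special characters.
vocab = build_vocab("안녕하세요?")
q1, q2 = split_pair("안녕하세요?\t안녕\n")
print(encode(q1, vocab, 8))
print(encode(q2, vocab, 8))
print(len(to_phonemes("한")))  # one syllable becomes three jamo
```

In this sketch, strmaxlen plays the role of --strmaxlen, and the vocabulary size (plus the reserved ids) corresponds to --charsize.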

To Run

  • Set the FC layers and CNN layers in 'main.py'
  • Run 'main.py' with the arguments you want
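As a rough guide to what running with arguments could look like, the sketch below wires the options from the Configuration list into argparse. It is an illustration, not the actual contents of 'main.py': every default value is a placeholder, and exposing filter_num and threshold as --filter_num and --threshold is an assumption, since the list above shows them without a leading "--". The is_similar helper is likewise hypothetical.

```python
import argparse


def build_parser():
    """Build a parser mirroring the options listed under Configuration."""
    p = argparse.ArgumentParser(description="Korean sentence similarity (sketch)")
    # All defaults below are placeholders, not the project's real values.
    p.add_argument("--epochs", type=int, default=10, help="# of training epochs")
    p.add_argument("--batch", type=int, default=64, help="batch size")
    p.add_argument("--lr", type=float, default=1e-3, help="learning rate")
    p.add_argument("--strmaxlen", type=int, default=100, help="max string length")
    p.add_argument("--charsize", type=int, default=2500, help="vocab size")
    p.add_argument("--filter_num", type=int, default=64, help="# of CNN filters")
    p.add_argument("--emb", type=int, default=32, help="embedding dimension")
    p.add_argument("--eumjeol", action="store_true", help="use syllable level")
    p.add_argument("--threshold", type=float, default=0.5, help="similarity cutoff")
    p.add_argument("--model", choices=["CNN", "MLP"], default="CNN", help="model")
    return p


def is_similar(score, threshold):
    """Label a pair as similar when the model score reaches the threshold."""
    return score >= threshold


# Parse an explicit list here so the sketch runs without command-line input;
# a real script would call parse_args() with no arguments.
args = build_parser().parse_args(["--model", "CNN", "--eumjeol", "--epochs", "5"])
print(args.model, args.eumjeol, args.epochs)
print(is_similar(0.7, args.threshold))
```

With a parser like this, a run would pass flags such as --model, --epochs, and --eumjeol on the command line after 'main.py'.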
