GithubHelp home page GithubHelp logo

ml-ai-nlp-ir / treelstm.pytorch Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dasguptar/treelstm.pytorch

0.0 2.0 0.0 25 KB

Tree LSTM implementation in PyTorch

License: MIT License

Python 70.15% Shell 0.49% Java 29.36%

treelstm.pytorch's Introduction

Tree-Structured Long Short-Term Memory Networks

A PyTorch based implementation of Tree-LSTM from Kai Sheng Tai's paper Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks.

Requirements

  • PyTorch
  • tqdm
  • Java >= 8 (for Stanford CoreNLP utilities)
  • Python >= 2.7

Usage

First run the script ./fetch_and_preprocess.sh, which downloads:

The preprocessing script also generates dependency parses of the SICK dataset using the Stanford Neural Network Dependency Parser.

To try the Dependency Tree-LSTM from the paper to predict similarity for pairs of sentences on the SICK dataset, run python main.py to train and test the model, and have a look at config.py for command-line arguments.

The first run takes a few minutes because the GLOVE embeddings for the words in the SICK vocabulary will need to be read and stored to a cache for future runs. In later runs, only the cache is read in during later runs.

This code with --lr 0.01 --wd 0.0001 --optim adagrad --batchsize 25 gives a Pearson's coefficient of 0.8336 and a MSE of 0.3119, as opposed to a Pearson's coefficient of 0.8676 and a MSE of 0.2532 in the original paper. The difference might be because of differences in the way the word embeddings are updated.

Notes

PyTorch 0.1.12 has support for sparse tensors in both CPU and GPU modes. This means that nn.Embedding can now have sparse updates, potentially reducing memory usage. Enable this by the --sparse argument, but be warned of two things:

  • Sparse training has not been tested by me. The code works, but performance has not been benchmarked for this code.
  • Weight decay does not work with sparse gradients/parameters.

Acknowledgements

Shout-out to Kai Sheng Tai for the original LuaTorch implementation, and to the Pytorch team for the fun library.

Author

Riddhiman Dasgupta

This is my first PyTorch based implementation, and might contain bugs. Please let me know if you find any!

License

MIT

treelstm.pytorch's People

Contributors

dasguptar avatar vinhdv avatar huangshenno1 avatar

Watchers

Apurv Verma avatar James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.