GithubHelp home page GithubHelp logo

zzoy / machinecomprehension-1 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from jaonm/machinecomprehension

0.0 1.0 0.0 83 KB

Machine comprehension for recent papers using pytorch

Python 99.04% Shell 0.96%

machinecomprehension-1's Introduction

MachineComprehension

Introduction

Pytorch implementation of machine comprehension papers for SQuAD v1.1

BIDAF (Minjoon Seo et al.,2016) https://arxiv.org/abs/1611.01603
QANet (Adams Wei Yu et al.,2018) https://arxiv.org/abs/1804.09541
Ensemble model for BIDAF and QANet

Structure

  • preproc.py: dataset preprocessing and build training features
  • config.py: determine which model to train and hype-parameters setting
  • evaluate.py: evaluate script
  • main.py: program entry
  • models/qanet.py: QANet model
  • models/bidaf.py: BIDAF model
  • models/ensemble.py QANet and BIDAF ensemble

Result

Result on dev dataset

QANet BIDAF Ensemble
F1 76.3 74.1 77.6
EM 67.5 63.3 68

Difference with papers

  • Context length is set to 300 due to limit of memory
  • Char embedding doesn't be connected with convolution layer
  • Difference hype parameters setting

Todo

  • Achieve the result in paper
  • Reduce memory cost
  • Complete R-net model
  • Do the ensemble

machinecomprehension-1's People

Contributors

jaonm avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.