GithubHelp home page GithubHelp logo

smai-project's Introduction

SMAI Project: Sentence Classification using deep learning

Directory Structure:

.
├── datasets/
│   ├── <dataset_name>/
│   └── ...
├── doc/
├── src/
│   ├── models/
│   │   ├── <model_name.py>
│   │   └── ...
│   ├── parsers/
│   │   ├── <parser_name.py>
│   │   └── ...
│   ├── train.py
│   └── test.py
└── var/ (not committed)
    ├── train/
    │   └── <model_name>/
    │       └── <dataset_name>/
    └── wordvec/
        └── <dataset_name>/

Usage:

make.py [-h] [--model MODEL] [--dataset DATASET] [--parser PARSER] [--log-level LOG_LEVEL] TASK

  • TASK: Task to perform {preprocess, train, test}
  • --model MODEL: Model to use (src/models/MODEL.py)
  • --dataset DATASET: Dataset to use (dataset/DATASET/)
  • --parser PARSER: Data parser to use (src/parsers/PARSER.py)
  • --log-level LOG_LEVEL: Logging level (10 for testing, and 40 for production. Default: 30)

Workflow

  • preprocess.py:
    • Learns word vectors, and saves them to var/wordvec/...
  • train.py:
    • Trains the model, and saves weights to var/train/...
    • Logs training reports to var/log/train/...
  • test.py:
    • Loads weights from var/train/..., and predicts the labels for test data
    • Logs training reports to var/log/test/...
    • If labels are known (validation), then scores are reported.
  • models/MODEL.py
    • class Model implements the CNN.
  • parsers/PARSER.py
    • Implements classes to lazy-load data, and for generating word vectors using the trained word vector vocab.

Adding Models:

Check Readme in src/models/

Adding Parsers:

Adding a parser for new datasets - to handle preprocessing and conversion to word vectors. Check Readme in src/parsers/

Adding Datasets:

Add datasets to the datasets folder. Parsers should know the corresponding internal directory structure.

Contributors

Team: \sigma \sqrt -1

  • Arjun P
  • Gokul B Nair
  • Soumya Vadlamannati
  • Sai Anurudh Reddy Peduri

smai-project's People

Contributors

marauderer97 avatar superty avatar dependabot[bot] avatar codelegend avatar anurudhp avatar

Watchers

 avatar  avatar Gokul B. Nair avatar  avatar

Forkers

marauderer97

smai-project's Issues

Training

Implement the train function in src/train.py.
Trains a model, and saves weights to var/train/... (exact folder depending on config)

Testing

Implement the test function in src/test.py.
Load weights from the checkpoint, and run the test.
If labels are known, report scores.

Word2Vec

Implement the word vector generator in src/preprocess.py
Learn the word vectors, and store them in var/wordvec/...

Extra: Add support for minimal user validation (random sampling etc.)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.