GithubHelp home page GithubHelp logo

question-answering-system's Introduction

Question-Answering-System

This system implements a Deep NLP pipeline which is capable of extracting the exact one word answer to any question entered in Natural Language by searching a scalable Corpora of over 30 Wikipedia Articles in the raw text format. The Question can be entered during runtime or a set of questions can be dumped in a text file. The System prints the fetched answers on the command line as well as in a well structured JSON file. The developed NLP pipeline exercises various concepts such as TF-IDF and Cosine similarity, Tokenization, Part of Speech Tagging, Lemmatizing, Dependency Parsing, Lesk Algorithm, Wordnet Interpretations, Named Entity Recognition and sentence overlapping and text matching.

Getting Started

Firstly, clone the repository to your local system. Make sure you have a well functioning Computer with a decent RAM and sufficient storage. This System is cross platform functional and will work on any known Operating System.

Prerequisites

Before you begin make sure the follwing things are installed on your computer: a. Python3 b. Data Science and scientific processing Libraries. It is not mandatory although recommended to install Anaconda for Python 3.7 as it pretty much contains all important data science libraries.
c. The libraries needed for this system to work are: sklearn, spacy, glob, nltk, sys, numpy, wordnet, string, json, pprint.

Deployment

Once the environment is set up and everthing is installed do the following: a. Navigate to the cloned directory through the command line shell. b. Execute the program named questionanswer.py by typing:

python questionanswer.py

c. The program will ask to input name of the file which contains all questions. Enter the name of that file d. The system will then execute and output a json file with the answers to the questions. Answers will also be printed on the command line interface.

Extras

The project also consists of some other .py Python files. These files exercise some of the NLP concepts used to develop the system in individuality. It is kind of fun to see how these concepts help in the development of an integrated system as a whole. Do try executing them too. Execute as:

python <program name>.py

Built With

Authors

  • Habil Damania

Acknowledgement

A big shout out to my friend and project partner Kanika Rawat for helping in developing this system.

question-answering-system's People

Contributors

kanika-rawat avatar habildamania avatar

Watchers

James Cloos avatar  avatar

Forkers

shrutin97

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.