GithubHelp home page GithubHelp logo

cqa's Introduction

Posted your question to CQA sites. Now what? Just wait? - Towards developing the next step...

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Important Keynote

You may want to read more about this project before working on the coding part.

Regarding the third and fourth tasks, we used DOC2VEC after some data analysis and comparison. There is also an aspect of context in our project. We want to return the most similar question based on the user’s asked question. We selected random examples and examined the original question and the related ones. We concluded that DOC2VEC returns better results than TFIDF contextually. You can find the data regarding to why we chose DOC2VEC instead of TFIDF under DOC2VEC.ipynb and TFIDF.ipynb files. Around line 22 we showed how DOC2VEC returns more related questions than TFIDF. For more details please check the Final_Report.

Prerequisites

Download the project by cloning the following link or download the zip version from Github.

https://github.com/utkueray/CQA.git

Please use Python3, if possible Python 3.6.X or Python 3.8.X version, assuming that it will work on Python 3.7.X but never tested it.

Also, please use version 3 of Pip while installing.

You may have to define Python and Pip version while running scripts, therefore becareful while copying commands.

Installing

After downloading the project file, you have to open a command line with Python 3.6.X or Python 3.8.X installed and navigate to the project folder.

You can install necessary python libraries with typing the following command in to your command line.

pip install -r requirements.txt

or

pip3 install -r requirements.txt

Also, you have to run the following script in command line to generate doc2vec model in the predefined path.

python generateModel.py

or

python3 generateModel.py

Additional Note (If you want to update the data dump.)

If you want to update the doc2vec model and the dataframes used in the system with latest datadump, you can download the data dump from https://archive.org/download/stackexchange/ai.stackexchange.com.7z and repace Posts.xml and Users.xml files in the root directory with the new ones. Now, you have to run both Task1 and Task2 ipynb files and they will generate necessary dataframes in predefined paths. Finally, you have to run following command again in the command line to generate the updated mode.

python generateModel.py

or

python3 generateModel.py

Running

Running the system is fairly easy after installing necessary python libraries. Enter the following command to your command line, or you can use a PyCharm which is tested and works fine.

python manage.py runserver 80

or

python3 manage.py runserver 80

or ( if you are using MAC OS)

sudo python3 manage.py runserver 80

Authors

Copyright and License

Copyright 2019-2020 Sabanci University. Code released under the MIT license - see the LICENSE.md file for details.

Acknowledgments

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.