GithubHelp home page GithubHelp logo

clumsyclover / wikiscraperbot Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 1.0 595 KB

A bot that scrapes data from wikipedia and answers questions regarding the collected data.

Python 18.88% JavaScript 55.68% HTML 17.73% CSS 7.70%

wikiscraperbot's Introduction

WikiScraperBot

This is a bot made using beautifulsoup.py, wikipedia pacakage for python, NLTK and some pacakages from sklearn.

Things you'll need to install:-

1) Node.js

2) Python

Things that can be done with this bot:-

1) Search for any topics avaliable on wikipedia.

2) Get relevant answers for any questions regarding any topics avaliable on Wikipedia.

Demo images:-

Some commands that can come in handy

1)'$$ loadfromurl-[last part of a wikipedia url]'

---- Directly scrapes data from a given url. Enter ONLY the last part of a url, for ex. 'Barack_Obama' in https://en.wikipedia.org/wiki/Barack_Obama

2) '$$ unloaddata'

---- Unloads the currently requested data (completely), query and makes the bot ask you for new query.

3) '$$ unloaddatabacktolist'

---- Unloads only the currently requested data and returns to the point where the bot asks you to select one of the suggestions.

wikiscraperbot's People

Contributors

athndev avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Forkers

sagarbhure

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.