GithubHelp home page GithubHelp logo

wazzabeee / scraping_tool Goto Github PK

View Code? Open in Web Editor NEW
3.0 3.0 0.0 755 KB

A python application providing a GUI to scrape tweets from Twitter using Tweepy library and scripts allowing cron jobs.

License: MIT License

Python 53.38% Tcl 46.62%
twitter scraping-python api tweets gui cron python tweepy tkinter

scraping_tool's Introduction

Twitter Scraping Tool

Python version

About

This program allow the user to retrieve tweets from Twitter's API using the library Tweepy via a GUI or a CRON automatization.

This project was made part of my internship at the "Human Computer Humans Interacting with Computers at University of Primorska" lab (HICUP Lab).

Setup

To use this program, you will need your API keys retrieved after creating a developer account at Twitter.

Once retrieved the keys must be filled as follows.

How to use

# Clone this repository
$ git clone https://github.com/scraping_tool

# Go into the repository
$ cd scraping_tool

# Install requirements
$ pip3 install -r requirements.txt

# Fill credentials in constants.py
# See "Setup"

# Run the app
$ python main.py

Parameter details

More details on the parameters available can be seen on Twitter's offical page.

CRON automatization

After trying out queries and requests with the GUI, you can setup automatizations.

You can define the parameters of the search via the cron_search.json file (or cron_user.json). All parameters are the same from the GUI’s tab.

When you’re ready you just have to setup the execution of the cron_search.py (or cron_user.py) script at the desired times and days.

scraping_tool's People

Contributors

wazzabeee avatar

Stargazers

 avatar Redson Fernando avatar  avatar

Watchers

Kostas Georgiou avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.