GithubHelp home page GithubHelp logo

scriptsmith / reaper Goto Github PK

View Code? Open in Web Editor NEW
357.0 29.0 70.0 7.5 MB

Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Home Page: https://reaper.social

License: GNU General Public License v3.0

Python 96.80% Inno Setup 2.30% Batchfile 0.43% Shell 0.47%
scraping facebook twitter reddit youtube pinterest tumblr api socialmedia data-mining

reaper's Introduction

Reaper

Website Github All Releases GitHub license Gitter

Reaper is a PyQt5 GUI that scrapes Facebook, Twitter, Reddit, Youtube, Pinterest, and Tumblr APIs using socialreaper

Are you a developer? Try the Python package

Features

  • Support for 6 social media platforms
  • CSV output
  • Instructions for getting API keys
  • API key management
  • Download queuing system
  • Error management
  • Disk caching for big data
  • Ability to read a list of inputs from CSV and text files
  • Ability to append to exsting data
  • Dark theme
  • UTF-8 and ASCII support

Download

To download the latest builds for your platform, check out the releases

Installers and standalone versions are available for Windows and macOS

Usage

Instructions for using Reaper are available on reaper.social

Run source

Reaper uses string formatting that was added in Python 3.6. You need to run Reaper with Python 3.6+ or download a pre-built version from the releases

Download

git clone https://github.com/ScriptSmith/reaper.git
cd reaper

Run

pip3 install -r requirements.txt
python3 reaper.py

reaper's People

Contributors

scriptsmith avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

reaper's Issues

dont have access public repository

when i tried to install the package.. i got the error..

[email protected]: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.
Command "git clone -q [email protected]:ScriptSmith/socialreaper.git /home/localadmin/reaper/src/socialreaper" failed with error code 128 in None

User Report

Submitter: Test User
This is a test of the gitreports system to assign issues to user's bugs without requiring them to sign up to github

SyntaxError

I Keep getting this error, how do i fix this error?

File "reaper.py", line 57
self.window.setWindowTitle(f"Reaper {self.version}")
^
SyntaxError: invalid syntax

Error on import qdarkstyle

After following the installation instructions I have the following error:
Traceback (most recent call last):
File "reaper.py", line 24, in
import qdarkstyle
ModuleNotFoundError: No module named 'qdarkstyle'

Not a real issue - Tumblr hashtags

Hi,

My name is Joe Hatfield, and I am a graduate student at the University of Colorado Boulder. First off, thank you so much for developing this tool; it has been a lifesaver for my research. Does the tool scrape for ALL hashtags included on individual Tumblr posts? In other words, is there a column(s) for additional hashtags included in a post other than the one used to scrape? My csv seems not to include that info, but I was wondering if perhaps I was missing it.

Many thanks,

Joe

Maximum recursion depth exceeded

The program quits on launching reaper.py

C:\Users\username\AppData\Local\Programs\Python\Python36-32\python.exe D:/reaper-master/reaper.py
Traceback (most recent call last):
File "D:\reaper-master\components\globals.py", line 17, in _calc_path
return _calc_path(head)
File "D:\reaper-master\components\globals.py", line 17, in _calc_path
return _calc_path(head)
File "D:\reaper-master\components\globals.py", line 17, in _calc_path
return _calc_path(head)
[Previous line repeated 993 more times]
File "D:\reaper-master\components\globals.py", line 13, in _calc_path
head, tail = os.path.split(path)
File "C:\Users\username\AppData\Local\Programs\Python\Python36-32\lib\ntpath.py", line 206, in split
d, p = splitdrive(p)
File "C:\Users\username\AppData\Local\Programs\Python\Python36-32\lib\ntpath.py", line 141, in splitdrive
if len(p) >= 2:
RecursionError: maximum recursion depth exceeded in comparison

404

I setup 3 keys following your guide; Twitter, FB and Pinterest.

When I try to scrape Pinterest or FB I get the following popup error.

"message": "404 Not Found: The requested URL was not found on the server. If you entered the URL manually please check your spelling and try again.",
"type": "http"
I tried resetting the Pinterest API key as per your instructions, same thing.

Twitter key works fine.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.