GithubHelp home page GithubHelp logo

wyrover / apple-store-crawler Goto Github PK

View Code? Open in Web Editor NEW

This project forked from obersoy/apple-store-crawler

0.0 1.0 0.0 412 KB

A simple, scalable scrapper for apps data being listed on Apple's AppStore

C# 100.00%

apple-store-crawler's Introduction

Apple Store Apps Crawler

Simple scalable crawler for Apps data from the iTunes Apple Store.

All the data you can see about a certain app once you open its page on the browser, is the data available from this project (minus the description and text reviews).

You don't have to input any of your Apple Account credentials since this Crawler acts like a "Logged Out" user.

Setting up your environment

If you want to host your own database, SQS queues and virtual machines, you can. All you have to do is change the config files and the "Consts" class on the SharedLibrary to the values of your own preference (QueueNames, MongoDB Credentials/Address) and Amazon Web Services Keys (that you will need in order to access your queues from code).

For more detailed information, please, refer to this project's Wiki (W.I.P)

Exporting the Database

As people kept requesting me, i decided to export the database on it's current state, which you can find on this link (containing the records as JSON): https://s3.amazonaws.com/GooglePlayStore/Apple%20Store%20JSON.zip

Have in mind that downloading the database costs me money, since i pay for the outbound traffic provided by AWS when one queries the database So, consider making a donation (via paypal) to [email protected] (the value is up to you).

If you need any specific extraction, let me know so we can figure out whats the best way to do it.

About me

My name is Marcello Lins, i am a 23 y/o developer from Brazil who works with BigData and DataMining techniques at the moment.

http://about.me/marcellolins

What is this project about ?

The main idea of this project is to gather/mine data about apps of the Google Play Store and build a rich database so that developers, android fans and anyone else can use to generate statistics about the current play store situation

There are many questions we have no answer at the moment and we should be able to answer than with this database.

What do i need before i start?

  • I highly recommend you read all the pages of this wiki, which won`t take long.

  • Know C#

How about the database?

  • I have made my MongoDB database public, including a user with read/write permissions so we can all use and populate the same database.

  • If you feel like, you can make your own MongoDB Database and change the code Consts to point the output to your own MongoDB Database. No Biggie

Refer to the Pages section of this wiki for individual information about each aspect of the project.

apple-store-crawler's People

Contributors

marcellolins avatar

Watchers

wyrover avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.