GithubHelp home page GithubHelp logo

msandt3 / gthockey-stats Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 120 KB

Scraping player statistics from achahockey.org. This is intended to be uses as a chron job for the gthockey php server

Python 100.00%

gthockey-stats's Introduction

STATS SCRAPING

This repo contains code to scrape player statistics from the achahockey.org web page. This is a proof of concept and being developed for use to minimize labor for data entry.

Dependencies

Running the Code

After cloning the repo you can scrape data as follows:

$ scrapy crawl acha

Saving the Results

Scrapy supports several standards for storing scraped data. In order to store them in JSON, CSV or XML execute the respective command:

$ scrapy crawl acha -o items.json -t json
$ scrapy crawl acha -o items.csv -t csv
$ scrapy crawl acha -o items.xml -t xml

Scripting the Routines

There is now an automated script for running the scraping routines as well. This is for future use in CGI on a nearlyfreespeech web server.

$ python crawl.py

The previous will automatically scrape and store the data in a json file

Issues

For more information on how to use Scrapy please see the Scrapy Reference

Contributing

This is an open source project. Feel free to fork it and submit pull requests at will.

gthockey-stats's People

Contributors

msandt3 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.