nmarek1269 / scrape_ufc_stats

This project forked from greco1899/scrape_ufc_stats

Scrape UFC stats

License: GNU General Public License v3.0

Python 17.38% Jupyter Notebook 82.62%

scrape_ufc_stats's Introduction

Scrape UFC Stats

Scrape all available UFC event data, fight stats, and fighter details from ufcstats.com and save them in CSV format.


Prerequisites

Python 3

Install requirements with pip install -r requirements.txt


Introduction

Data for all events, fights, and fighters has been scraped and saved as the following data files:

ufc_events.csv
ufc_fight_details.csv
ufc_fight_results.csv
ufc_fight_stats.csv
ufc_fighter_details.csv
ufc_fighter_tott.csv

To download the CSV files without running any code, click Code > Download ZIP or clone the repo.
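Once the CSV files are on disk, a quick way to inspect one is with Python's standard csv module. The snippet below is a minimal sketch, not part of this repository: the helper names load_rows and summarize are hypothetical, and UTF-8 encoding is an assumption. It makes no assumption about column names and reads them from each file's own header row.

```python
import csv
from pathlib import Path


def load_rows(csv_path):
    """Read one CSV file into a list of dicts keyed by the file's header row.

    Hypothetical helper for illustration; UTF-8 encoding is an assumption.
    """
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def summarize(csv_path):
    """Return (row_count, column_names) so a file can be inspected before analysis."""
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        row_count = sum(1 for _ in reader)  # counts data rows, not the header
        return row_count, list(reader.fieldnames or [])


# Example usage, assuming ufc_events.csv is in the current directory:
#   row_count, columns = summarize("ufc_events.csv")
#   print(row_count, columns)
```

All values are returned as strings, so numeric columns need converting before any calculation.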

You can also scrape all the fight stats data again using the notebook scrape_ufc_stats_all_historical_data.ipynb, and all the fighter tale of the tape data again using the notebook scrape_ufc_stats_fighter_tott.ipynb. Note that each of these takes a few hours to complete.

Once you have the up-to-date historical data for fight stats, you can run the notebook scrape_ufc_stats_unparsed_data.ipynb or the script scrape_ufc_stats_unparsed_data.py to scrape only the latest fights and refresh the data.
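The idea behind an incremental refresh can be sketched as a comparison between what is currently listed on the site and what has already been saved. The snippet below only illustrates that idea and is not the code in scrape_ufc_stats_unparsed_data.py: the function name find_unparsed and the use of event names as identifiers are assumptions.

```python
def find_unparsed(available, already_parsed):
    """Return items in `available` that are not in `already_parsed`.

    Hypothetical helper: `available` stands for identifiers currently listed
    on the site, `already_parsed` for identifiers already saved in the CSV
    files. Input order is kept and duplicates are dropped.
    """
    seen = set(already_parsed)
    unparsed = []
    for item in available:
        if item not in seen:
            seen.add(item)  # avoid returning the same item twice
            unparsed.append(item)
    return unparsed


# Example with made-up identifiers:
#   find_unparsed(["event_c", "event_b", "event_a"], ["event_a"])
#   gives ["event_c", "event_b"]
```

Only the items returned by such a comparison would need to be scraped, which is why a refresh is much faster than the full historical run.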

The notebook scrape_ufc_stats_working_example.ipynb can be used for testing or debugging. The code here is broken down into sections which can be executed to scrape single data points, e.g. scraping stats for one fight only.

Do feel free to reach out with any comments, suggestions, or issues. 😃


Data Refresh

The script scrape_ufc_stats_unparsed_data.py runs daily on PythonAnywhere. It checks for and scrapes new fights and fighters, then pushes the refreshed data files to this repository.

This keeps the data files up to date with the latest fight and fighter stats, so you can download and use the CSV files above without running any code.


Next Steps

Data Refresh for Fights - Parse and refresh new fight data programmatically when there are new events

Scrape Fighter Details - Get every fighter's tale of the tape (e.g. height, weight, etc.)

Data Refresh for Fighters - Parse and refresh new fighter details programmatically when there are new fighters added to the roster

Clean Data - Clean data to be used for further analysis

Data Exploration - Plot and visualise stats

Prediction - Build a machine learning model to predict the outcome of fights


scrape_ufc_stats's People

Contributors

greco1899
