GithubHelp home page GithubHelp logo

heykishan / tennis_slam_pointbypoint Goto Github PK

View Code? Open in Web Editor NEW

This project forked from jeffsackmann/tennis_slam_pointbypoint

0.0 1.0 0.0 30.75 MB

Point-by-point data for Grand Slams, 2011-current

tennis_slam_pointbypoint's Introduction

Grand Slam Point-by-Point Data, 2011-18

alt text

This repo contains point-by-point data for most[1] main-draw singles Grand Slam matches since 2011. It was scraped from the four Grand Slam websites shortly after each event.

There are two files for each tournament. "-matches.csv" contain metadata for all the matches included from the tournament, and '-points.csv' contains all the available data for each point.

Unfortunately, much of the most useful data isn't available for every tournament. (For instance, there is no first/second serve indicator for many events, and rally length isn't included after the first few.) Much of the metadata isn't available for the last few years of tournaments, and some point-level data (such as winner type) isn't represented the same way throughout the whole dataset.

[Update, Feb 2017: Rally length came back with the 2016 French. Also new in 2016 was the gradual introduction of distance run stats.]

Still, there's a lot that can be done with this[2], especially since point-by-point tennis data is not readily available.

I'll try to keep this updated after each tournament, but I can't make any promises as to punctuality.

Note: This data is not available for the 2018 Australian Open or 2018 French Open. Some similar data is available for the 2018 AO, and at some point I may assemble that into a format as close to the other majors as possible.

License

Creative Commons License
Tennis databases, files, and algorithms by Jeff Sackmann / Tennis Abstract is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Based on a work at https://github.com/JeffSackmann.

In other words: Attribution is required. Non-commercial use only.


[1] In general, this data is available for matches on courts with the Hawkeye system installed. The vast majority of missing matches are first-rounders.

[2] For instance, http://heavytopspin.com/2011/09/16/win-probability-graphs-and-stats/ http://heavytopspin.com/2011/08/07/do-points-get-shorter-as-the-match-progresses/ http://heavytopspin.com/2011/06/06/fun-with-french-open-rally-length/

tennis_slam_pointbypoint's People

Contributors

jeffsackmann avatar heykishan avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.