GithubHelp home page GithubHelp logo

itrustnumbers / exploratory_analysis_on_steam_game_statistics Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 26.46 MB

An Exploratory Analysis and data visualization project on Steam Gaming Statistics Datasets on Kaggle, To uncover trends in the data and answer the research question : " To Determine the relation if any between the rating of a game and its market Success (Measured as Hours the game was played) "

Jupyter Notebook 100.00%
pandas-python data-science matplotlib seaborn eda data-visualization scipy

exploratory_analysis_on_steam_game_statistics's Introduction

This is an Exploratory Analysis on Steam Game Statistics Focusing on the Rating of Games and Hours Played on steam

Research Question or Motivation For this Project/Analysis

Ѳ To Determine the relation if any between the rating of a game and its market Success (Measured as Hours the game was played)

The Data Sets Used in this Analysis :-

-> This Data Set contains different ratings and user feedback on different games on steam which are used to calculate the Normalized Rating Score for each game in this Analysis

-> This Data Set contains different game played on steam along with the number of hours the game was played by different users this information is used to calculate Normalized Play Score for each game in this Analysis

Normalization Method Used :

To have the rating and the play time on the same scale we have to Normalize Both values to a scale of [0,2] :

Normalizing the Rating Score To a Range of [0,2] by :-

Normalizing the Playtime To a Range of [0,2] by :-

Final Visualization :

Final Visualization

Corollary :-

First, As we can see there are really next to no co-relation between the rating and the play time of a game, This Peculiar phenomena is described below:

  1. The rating system is found to be skewed along the genre of a game i.e if a game is a FPS(first person shooting) type game then you cannot compare its rating to the rating of a non FPS game

  2. The Rating is developed as soon as the game hits the market whereas the play time increases gradually over the years therefor new game with better rating has infinitesimally small chance of having a play time more than the older games

  3. Then there are the challenge of competitive games which demands rigorous play time and determination, this act skews our analysis

  4. The nature of rating and repeated gaming is highlighted in the visualization in the fact that the set of top rated games and the set of most played games is disjoint in nature

Final Summary:

To Further increase the accuracy of this analysis clustering is a good option and as of now rating and Play Hours Does not carry or Does but very little co-relation and hence rating is not a good indictor of a games market success

Requirment for Reproduction

1. Python3

2. Jupyter Notebook

3. ipykernel

4. Pandas

5. Numpy

6. Matplotlib

7. Seaborn

8. Scipy

exploratory_analysis_on_steam_game_statistics's People

Contributors

itrustnumbers avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.