GithubHelp home page GithubHelp logo

paologriffo / bike-sharing Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 2.6 MB

A Stability Index for an iterative procedure of selecting and validating a statistical model.

bic bayesian linear-regression resampling-strategies

bike-sharing's Introduction

A Stability Index for statistical models. An iterative procedure of selection and validation through the Delta BIC.

[MS Thesis in Statistical Sciences for decision - Federico II University of Naples project files.]


This folder contains:

  • Bike sharing dataset
  • Markdown Project code 'Bestfit_bikesharing.rmd'
  • Thesis Brochure 'MS_Thesis.pdf'

Project scope and approach

The aim of the thesis is to develop a generalized and automatic procedure of selection and validation of a statistical model using the BIC criterion to derive a Stabiliy Index.

The Stability Index proposed in the thesis is tested in the case of the Linear Regression. The UCI bike sharing dataset take into account the number of random accesses to the bike sharing service in Washington D.C. during the weekends in the years 2011 and 2012 and the varying weather conditions.

A best model is selected with the Bestsubset selection from a group of similar in best performing models choosen on the Delta BIC ranks.

Index derivation scheme

The Stability Index is separately computed on different partitions of the original data, considering both anomalous wheather conditions days and usuals ones, as follow:

1) Complete sample (2011-2012 records)
1.1) Complete sample (2011-2012 records) without anomalous (approx. 1%)
2) First half of the sample (2011 records)
2.1) First half of the sample (2011 records) without anomalous (approx. 1%)
3) Second half of the sample (2012 records)
3.1) Second half of the sample (2012 records) without anomalous (approx. 1%)

For these samples, the estimation, selection and validation steps of the best model are iterated B times, in turn resampling the B times with two different percentage, that is:

a) 90%
b) 80%

Finally, for the sake of clarity, a comparison between all the combinations shows the insights on the Stability Index.

bike-sharing's People

Contributors

paologriffo avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.