GithubHelp home page GithubHelp logo

modeltools's People

Contributors

gaffney2010 avatar smith478 avatar

Watchers

 avatar  avatar

modeltools's Issues

Add Multivariate Linear Analysis

add multivariate linear analysis so that we can view how correlations affect the linear model this wil give
some insight into the ensemble/non-linear models. Could also add in interaction detection to non-linear models
using Friedman's H statistic, then add those interactions into the linear model to see how close we get in performance
since linear model will give intuitive interpretation

Fix Jupyter workbooks

I moved some workbooks to top-level, because I couldn't figure out how to reference this libraries otherwise. We could consolidate the two that I have working. The others may need to be updated for the changes made.

Build Variable Importance Function

Per Dan:

Build in variable importance function that uses:
built in functions with sci-kit learn
Shapley Value based importance (run-time would be 2^n (number of models to fit) where n is the number of predictors/features in the model)
Perhaps we could use correlation to make a network so that instead of testing all coalitions, we only test those with high correlation
The assumption would be that the contribution of independent variables woud be roughly additive. (this seems fair)
We would still look at all possible subsets, but for uncorrelated variables, we could just add up their contributions
If Shaply Value importance is fit on training and evaluated on holdout, then after we calculate Shapley we could just remove all variables with a negative shapley value
This would be an alternative to forward/backward regression for variable selection

Figure out a way to evaluate variable importance when using dummy variables

Audit seeding logic

My new Model class needs random seeds strung through it. As well, some of the existing logic could use a second look.

GLM, etc.

Per Dan:

Add multivariate GLM part to visualize predicted values, taking into account correlations

We should also look at using/testing polynomial (rather than just linear) regression for our continuous variables

Think about simplifications of variables.. Is there a way to automate this part. At very least, do forward/backward regression with
a linear model to see if strange/undesireable things are happening.

Implement Bayesian Optimization for Parameter Tuning

In Model selection piece, use Bayesian Optimization to do hyper-parameter tuning
Look at adding a double lift chart, this will also indicate model diversity for possible ensembling
Or for classification look at confusion matrix and see if two models are doing well on different segments

Implement a Particle Swarm Algorithm

If we pass through an already implemented library, this is a light-weight meta-heuristic. It will be a nice way to show off the flexibility of the Model class.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.