GithubHelp home page GithubHelp logo

bhaskardasari / predictingnationalhappiness Goto Github PK

View Code? Open in Web Editor NEW

This project forked from central-ldn-data-sci/predictingnationalhappiness

0.0 2.0 0.0 1 MB

Using the World Development Indicator Dataset to see which factors are most predictive of a nation's happiness

Jupyter Notebook 61.70% HTML 38.30%

predictingnationalhappiness's Introduction

predictingNationalHappiness

We'll be using the World Development Indicator Dataset (WDI) to see which factors are most predictive of a nation's happiness score:

[About the Metric] (https://en.wikipedia.org/wiki/World_Happiness_Report). Can we learn more about what might predict this score other than the factors wikipedia has listed? Let's find out.

Take the World Development Indicator Dataset (aka WDI) (which is available in both R and Python) and select some features you think would be most predictive of that happiness score - then give a go at modeling to predict national happiness.


Goal: Using the WDI dataset, figure out which factors are most strongly predictive of well being in a given country - for example, is lack of debt predictive of a country’s happiness score?

Topics: Data Munging, Joining, Feature Selection, Modeling or Visualization

  • I. Get the Well-Being Data - csv from github. Creat a dataswet with "country", and "happiness_score"
  • II. Get the Other Features on a Country Level from the R or Python library R - WDI dataset WDI::WDIsearch() to see factors - look at the documentation to see how to look up the other factors Python instructions [here] (https://pypi.python.org/pypi/wbdata)
  • III. Join the two datasets on the country name. Your output should look like… Country, happiness_score, feature_1, feature_2, etc where it’s up to you to select which features and how many to put into the dataset.

Potential Issues - joining on country name - it may not be the same string between the sets.

How you approach feature selection depends on what you want to focus on for the evening.

  • If you are less experienced, my recommendation would be to pick 2-3 features you think might be interesting and then run a multivariate linear regression on them. Focus on getting the mathematical concepts and the coding implementation (R, Python, etc) rather than making the perfect model. Alternatively, you could focus on visualization, and, although this isn't modeling directly, it will help you understand which features you may want to use if you did want to model it - download Tableau Public and pop the csv in to see what you learn.
  • If you are more experienced, it’s totally up to you how you approach the problem! Some ideas: (1) Practice a skill you want to improve at - for example, if you are usually a model-er, try doing a visualization approach, etc. If you usually use R, try python, etc. A new model you haven’t done, etc. (2) If mentoring is your interest (teaching helps you solidify learning), help out one of the n00bs.

Note we probably won’t finish this problem tonight - 2 hours likely too short, but give it a go and see what you learn. Follow up after if you liked it. Feedback is welcome.

Before you head out, take a few minutes to summarize how you approached the problem and then share with the group.For example…

  • Did you use R or Python?
  • What libraries/packages/models?
  • Did you find any interesting insights?
  • What did you find most challenging? etc.

Bring:

  • Laptops, Charging Cords, an Inquisitive Spirit

predictingnationalhappiness's People

Contributors

ojwatson avatar ericadohring avatar

Watchers

James Cloos avatar Bhaskar D avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.