Yang Shi's Projects

big-data-amazon-review

This project analyzes Amazon reviews of video games and luggage. Each dataset contains over 1.5 million rows. The ETL process was performed entirely in the cloud, and the resulting DataFrame was uploaded to an RDS instance. Basic statistical analysis was also performed on one of the datasets.

boardgame-analysis

BoardGameGeek is a valuable resource for board game information. We built a webpage to visualize data related to board games. Using machine learning, we predict board game ratings based on 10 independent variables.

boardgame-dashboard

We built a webpage to visualize data related to board games. The page scrapes and loads the latest news, loads its data from a SQL database, and uses JavaScript to visualize the filtered information.

boardgame_ratings

We explored the BoardGameGeek dataset. Using machine learning, we predict board game ratings based on 10 independent variables.

data-journalism-and-d3

I used JavaScript to create charts, graphs, and interactive elements to help readers understand the health risks facing particular demographics.

etl_tax_school_ohio

Education and housing are closely tied to socioeconomic life. In this project, we extracted data on both from three sources: Ohio Tax Information; the Census API's Social (DP02), Economic (DP03), and Housing (DP04) Characteristics for all 88 Ohio counties (state:39); and Greatschools.org. The gathered data is then cleaned, transformed, and loaded into a PostgreSQL database.

healtchare_fraud_detector

Healthcare Provider Fraud Detector is a machine learning classification project that predicts potentially fraudulent providers from the healthcare claims filed.

leaflet-usgs

In this project, I used Leaflet to visualize the United States Geological Survey (USGS) dataset of M4.5+ earthquakes from the past 30 days. I created map layers with circles representing every earthquake in the dataset, drawn according to tectonic plate, longitude, latitude, depth, and magnitude.
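As a rough illustration of the idea (not the project's actual code), the sketch below turns one earthquake record in the USGS GeoJSON layout, where `geometry.coordinates` holds longitude, latitude, and depth in km and `properties.mag` holds the magnitude, into the values a map circle needs. The function names, the radius scale factor, the colors, and the depth break points are all assumptions chosen for the example, and the sample record is made up.

```python
from typing import Any


def depth_color(depth_km: float) -> str:
    """Pick a fill color from depth. Break points and colors are illustrative assumptions."""
    if depth_km < 70:
        return "#fee08b"  # shallow
    if depth_km < 300:
        return "#f46d43"  # intermediate
    return "#a50026"      # deep


def feature_to_circle(feature: dict[str, Any]) -> dict[str, Any]:
    """Convert one GeoJSON earthquake feature into circle parameters."""
    # USGS GeoJSON stores coordinates as [longitude, latitude, depth_km].
    lon, lat, depth_km = feature["geometry"]["coordinates"][:3]
    mag = feature["properties"]["mag"]
    return {
        "lat": lat,
        "lon": lon,
        "radius_px": round(mag * 4, 1),  # hypothetical scale: 4 px per magnitude unit
        "fill": depth_color(depth_km),
    }


# Made-up sample record, shaped like one feature in the feed.
sample = {
    "type": "Feature",
    "properties": {"mag": 5.1, "place": "hypothetical location"},
    "geometry": {"type": "Point", "coordinates": [-122.5, 37.8, 12.0]},
}

if __name__ == "__main__":
    print(feature_to_circle(sample))
```

The same mapping could be passed to a mapping library's circle-marker call; keeping it as a plain function makes the magnitude and depth encoding easy to adjust on its own.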

machine-learning-basics

This project studies data from the NASA Kepler space telescope's planet-hunting mission. I created several machine learning models capable of classifying candidate exoplanets from the raw dataset.

pitch_pyfect

In this project, we mined the Spotify® music database and identified the optimal ranges of music theory and song structure features, such as key signature, beats per minute, beats per measure, and tempo.

plotly_bio

I built an interactive dashboard to explore the microbes that colonize human navels. It uses bar, pie, and gauge charts, with the data selected from a dropdown menu.

web-scraping-missions-to-mars

I scraped the NASA Mars News Site, JPL Featured Space Image website, Mars Facts webpage, and USGS Astrogeology site to obtain news titles, news content, tables, and high-resolution images. The information was stored in MongoDB and served on an HTML page by a Flask application.
