GithubHelp home page GithubHelp logo

datahacks2020's Introduction

DataHacks 2020

Welcome to DataHacks 2020! Out of hundreds of applicants, you’ve been selected because you display true potential for solving complex problems and exude a passion for comprehending and transforming data. Let’s begin the Hackathon!

READ BEFORE HACKING STARTS

Useful tools and websites

Rules

  • Each team consists of up to THREE people (≤ 3)
  • For beginner track, ONLY beginners (students who have not taken DSC 80 or any CSE/COGS/DSC upper-div class) can form a team

Competition format

  1. Each team pick a track to work on
  2. You have 24 hours until Sunday noon to work on your dataset
  3. Follow the prompt/README file for each track
  4. Prepare for a report with all of your findings in a reasonable length
  5. Zip your report (pdf) and code and submit as a group to Devpost (link above, come up with an appropriate team name!).
  6. Judges will read through your reports and pick the top three teams from each track
  7. Selected nine teams will go on stage and present their findings (maximum five minutes per team)
  8. Judges will announce one winner per track based on the presentations

Track information

Beginner Track

This is a beginner-friendly track! We will give you a dataset that contains San Diego Housing information from the 1970s. You may work on problems such as the relationship between housing quality and ethnic groups/genders, predictions of housing prices based on given conditions, etc. Don't worry if you don't have any knowledge in data science (including python, pandas, EDA, machine learning...). We'll have a series of workshops to help you build your project!

Find more information here.

Science Track

In this challenge, we’re interested in using Data Visualization and NLP (Natural Language Processing) to analyze chronic illnesses through accumulated survey data. The data was retrieved from the CDC website. The data is real-world data and can be messy, preprocessing may be required to extract trends and patterns in the data. The end goal is to create a report with at least 3 data visualizations and incorporate NLP to send an important message about a specific chronic illness to an audience. Also, make sure your report contains what you did (cleaning, processing, any modeling, etc) and is submitted as well to ensure good data science practices. This prompt is relatively open-ended: other data may be incorporated as deemed necessary and the message you decide to convey is up to you (however, do make sure to back it up with evidence and visuals).

Find more information here.

Business Track

Over the past decade, the transportation industry has become one of the most promising areas for careers in Data Science and/or Data Engineering. At UBER, the world’s most popular ride-share service, data scientists have access to billions of rows of data and are expected to showcase mastery over-processing, visualizing, and analyzing the company’s data. In this track, you will have the opportunity to work with real-world UBER time-series data from the San Francisco area, spanning across the first and second quarters of 2019. The time-series data will be centered on travel times for UBER trips in the overall San Francisco area.

Find more information here.

datahacks2020's People

Contributors

dybcyrus avatar annsudhart avatar datahacksds3 avatar ruizheng0521 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.