Several useful public datasets are included in this repository to practice your Data Science and Machine Learning skills. These datasets are also used in the course on "Data Science and Machine Learning using Python - A Bootcamp".
The course is available on the following platforms:
For free contents, please subscribe to our Youtube Channel.
The repository is created to ensure that the datasets remain available without any dependence on party involvement.
- 2010 Alcohol Consumption by Country
- 2011 US Agricultural Exports (modified)
- 2012 US Election Data
- 2014 World GDP
- 2016 World Happiness Index
- 2020 CoVID19 Geographic Distribution Worldwide Data click here to download the most recent one
- Bees Data
- Emergency Calls (911 Calls Data)
- Stocks (multiple csv files):
- BMO
- CIBC
- CNQ
- Encana
- RBC
- Suncor
- USO
- WTI
- bootstrapping (sample data from StartCraft game on AMP -- Actions Per Minute only)
- Australian Credit Approval
- Breast Cancer (Wisconsin)
- Breast Cancer (Yugoslavia)
- Bank Note Authentication
- Coded Data (Synthetically Created)
- Heart Disease Cleveland Data Clean
- Horse Colic
- Ionosphere
- Loan
- Mammographic Masses Data Clean
- Pima Indians Diabetes
- Sonar Returns
- Titanic Data (multiple csv files)
- BioAssay dataset (highly imbalanced data)
- Chronic Kidney Disease
- Abalone Age (or regression)
- Glass Identification
- Iris Flower Species
- Seed Quality Data
- Wheat Seeds
- Wine Quality (or regression)
- Wine Quality Merged (red & white => column "red_wine" 1/0)
- Auto Insurance Total Claims
- Big Mart Sales
- Boston Housing
- Kings County House Price
- Longley Economic
- StarCraft 1 dataset and description
- Armed Robberies in Boston (Monthly )
- Car Sales (Monthly)
- Champagne Sales (Monthly)
- Female Births in California (Daily )
- International Airline Passengers (Monthly )
- Shampoo Sales (Monthly)
- Specialty Writing Paper Sales (Monthly)
- Sunspots (Monthly)
- Temperatures in Melbourne (Daily Minimum )
- Temperatures in Melbourne (Daily Maximum )
- Temperatures in Nottingham Castle (Mean Monthly)
- Water Usage in Baltimore (Yearly )
- Historical Product Demand Dataset -- Forecasts the demand for thousands of different products
- Pollution Levels in Beijing (Hourly)
- Minutely Individual Household Electric Power Consumption
- Human Activity Recognition Using Smartphones
- Indoor Movement Prediction
- Movies and Rating data (two separate csv files)