GithubHelp home page GithubHelp logo

ganesh7625 / datasets Goto Github PK

View Code? Open in Web Editor NEW

This project forked from jbrownlee/datasets

0.0 0.0 0.0 220.53 MB

Machine learning datasets used in tutorials on MachineLearningMastery.com

Home Page: http://MachineLearningMastery.com

datasets's Introduction

Machine Learning Datasets

This repository contains a copy of machine learning datasets used in tutorials on MachineLearningMastery.com.

This repository was created to ensure that the datasets used in tutorials remain available and are not dependent upon unreliable third parties.

All regression and classification problem CSV files have no header line, no whitespace between columns, the target is the last column, and missing values are marked with a question mark character ('?').

In many cases, tutorials will link directly to the raw dataset URL, therefore dataset filenames should not be changed once added to the repository.

Datasets

This section provides a summary of the datasets in this repository.

Binary Classification Datasets

  • Breast Cancer (Wisconsin) (breast-cancer-wisconsin.csv)
  • Breast Cancer (Yugoslavia) (breast-cancer.csv)
  • Breast Cancer (Haberman's) (haberman.csv)
  • Bank Note Authentication (banknote_authentication.csv)
  • Horse Colic (horse-colic.csv)
  • Ionosphere (ionosphere.csv)
  • Pima Indians Diabetes (pima-indians-diabetes.csv)
  • Sonar Returns (sonar.csv)
  • German Credit (german.csv)
  • Credit Card Fraud (creditcard.csv.zip)
  • Adult Income (adult-all.csv)
  • Mammography (mammography.csv)
  • Oil Spill (oil-spill.csv)
  • Phoneme (phoneme.csv)

Multiclass Classification Datasets

  • Glass Identification (glass.csv)
  • Iris Flower Species (iris.csv)
  • Wheat Seeds (wheat-seeds.csv)
  • Wine (wine.csv)
  • Ecoli (ecoli.csv)
  • Thyroid Gland (new-thyroid.csv)

Regression Datasets

  • Boston Housing (housing.csv)
  • Auto Insurance Total Claims (auto-insurance.csv)
  • Auto Imports Prices (auto_imports.csv)
  • Abalone Age (abalone.csv)
  • Wine Quality Red (winequality-red.csv)
  • Wine Quality White (winequality-white.csv)

Univariate Time Series Datasets

  • Daily Minimum Temperatures in Melbourne (daily-min-temperatures.csv)
  • Daily Maximum Temperatures in Melbourne (daily-max-temperatures.csv)
  • Daily Female Births in California (daily-total-female-births.csv)
  • Monthly International Airline Passengers (monthly-airline-passengers.csv)
  • Monthly Armed Robberies in Boston (monthly-robberies.csv)
  • Monthly Sunspots (monthly-sunspots.csv)
  • Monthly Champagne Sales (monthly_champagne_sales.csv)
  • Monthly Shampoo Sales (monthly-shampoo-sales.csv)
  • Monthly Car Sales (monthly-car-sales.csv)
  • Monthly Mean Temperatures in Nottingham Castle (monthly-mean-temp.csv)
  • Monthly Specialty Writing Paper Sales (monthly-writing-paper-sales.csv)
  • Yearly Water Usage in Baltimore (yearly-water-usage.csv)

Multivariate Time Series Datasets

  • Hourly Pollution Levels in Beijing (pollution.csv)
  • Minutely Individual Household Electric Power Consumption (household_power_consumption.zip)
  • Human Activity Recognition Using Smartphones (HAR_Smartphones.zip)
  • Indoor Movement Prediction (IndoorMovement.zip)
  • Yearly Longley Economic Employment (longley.csv)

Natural Language Processing

  • Flickr 8k Photo Caption Dataset (Flickr8k_Dataset.zip, Flickr8k_text.zip)
  • Movie Review Polarity (review_polarity.tar.gz)
  • German to English Translation (deu-eng.txt)
  • The Republic, by Plato (republic.txt)

ARFF Datasets

  • Weka UCI Datasets (weka-datasets.zip)
  • Weka Numeric Datasets (weka-datasets-numeric.zip)

datasets's People

Contributors

jason2brownlee avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.