GithubHelp home page GithubHelp logo

cleaning_data_project's Introduction

Getting and Cleaning Data Project

The script run_analysis.R from this project can be executed to generate tidy data from the Samsumg dataset as described on the Getting and Cleaning Data project on Coursera.

The script will load the train and test datasets (eg. the "subjects", "labels" and "features" data files) and merge them together to produce a single data frame.

Note that in this data frame we only keep the mean and standard deviation variables from the initial features.

The script will also use appropriate names for the label values (eg. corresponding to different activities), and rename all the variables from the syntax "-mean()-X" to "MeanX" for instance (eg. removing "-" and parenthesis).

Once this tidy data frame is setup, the script performs the generation of the final "mean" dataset (written to file as "mean_set.txt" in the current working directory): this final data set contains the average value for each variable for each activity and for each subject.

Important note: this script excepts the Samsumg data do be available in the current working directory (eg. in a sub folder called "UCI HAR Dataset")

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.