GithubHelp home page GithubHelp logo

rickhenderson / smartphone-activity-study Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 237 KB

A repository containing data, a codebook, and an R analysis file which deal with cleaning a set of data from a study on human activity recognition by the UCI Centre for Machine Learning and Intelligent Systems.

R 100.00%

smartphone-activity-study's Introduction

Smartphone Activity Study

This repository contains a single, tidy dataset stored as a .txt file, a codebook, and an R analysis file which deals with cleaning up a set of data from a study on human activity recognition by the UCI Centre for Machine Learning and Intelligent Systems.

This repo has been created as a requirement for the final week of the course "Getting and Cleaning Data" offered via Coursera by the John Hopkins Bloomberg School of Public Health.

The link to the full study is http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones.

Description

In brief, the researchers observed 30 participants (called subjects for the rest of this repo) while they performed 6 different activities. The subjects were wearing a smartphone to track their movements so that data could be retrieved from the phone's sensors to try and determine which of the six activities the subject had performed, just based on the sensor reading. The data from the sensors was then split into two groups: 70% of the subjects' data was placed in a training group for training the algorithms for detecting which activity was being performed, and the other 30% was placed in a test group to test the algortithms.

In this repo, the file run_analysis.R is a set of R commands that will manipulate the data from the study to perform a number of steps including:

  • combine the test data with the training data
  • sets the variable names properly to also make the dataset easier to read
  • combine the raw data from the sensors with data regarding the activity that was performed by each subject and a subject ID to identify each subject
  • extract mean and standard deviation for the provided measurements from a large set of variables
  • insert the descriptive activity names into the dataset for readability
  • produces output as a .txt file which contains the average of each of the mean and standard deviation variables for each activity for each subject (participant in the study).

The output produced by the analysis file is contained in the file avg_by_actsub_tidy.txt.

In total, the resulting table will have 6 sets of mean and standard deviation measurements for each of the 30 subjects, as each subject performed all of the activities.

This file should be read into a data frame using df <- read.table("avg_by_actsub_tidy.txt", header=TRUE) or equivalent command as the file already contains column headings (names for the variables).

If you have any questions about this data set, please contact me at [email protected] with "Smartphone Activity Study" in the subject line.

This repo was created as a requirement of the final week of the Getting and Cleaning Data course on Coursera.

smartphone-activity-study's People

Contributors

rickhenderson avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.