Deep Learning for Human Activity Recognition

Deep learning is perhaps the nearest future of human activity recognition. While there are many existing non-deep method, we still want to unleash the full power of deep learning. This repo provides a demo of using deep learning to perform human activity recognition.

Prerequisites

Python 3.x
Numpy
Tensorflow

Dataset

There are many public datasets for human activity recognition. You can refer to this survey article Deep learning for sensor-based activity recognition: a survey to find more.

In this demo, we will use UCI HAR dataset as an example. This dataset can be found in here.

Of course, this dataset needs further preprocessing before being put into the network. I've also provided a preprocessing version of the dataset as a .npz file so you can focus on the network. We'll talk about this later. But it is also highly recommended you download the dataset so that you can experience all the process on your own.

#subject	#activity	Frequency
30	6	50 Hz

Network structure

What is the most influential deep structure? CNN it is. So we'll use CNN in our demo. Besides, since most of the signals for activity recognition is time series data, RNN is also needed. We'll cover that in another coming demo.

CNN structure

Convolution + pooling + convolution + pooling + dense + dense + dense + output

That is: 2 convolutions, 2 poolings, and 3 fully connected layers. Please check the following image.

RNN structure

Coming soon

Usage

Just run the activity_recognition.py file, that's all.

python activity_recognition.py or click the run button in you IDE.

About the inputs

That dataset contains 9 channels of the inputs: (acc_body, acc_total and acc_gyro) on x-y-z. So the input channel is 9.

Dataset providers have clipped the dataset using sliding window, so every 128 in .txt can be considered as an input. In real life, you need to first clipped the input using sliding window.

So in the end, we reformatted the inputs from 9 inputs files to 1 file, the shape of that file is [n_sample,128,9], that is, every windows has 9 channels with each channel has length 128. When feeding it to Tensorflow, it has to be reshaped to [n_sample,128,1,9] as we expect there is 128 X 1 signals for every channel.

About the result

The result can vary depending the network structure. For comparison, I have provide a result file called result_all.csv containing results of different dropout/learning_rate/training_epoch.

xjlian / deep-learning-activity-recognition Goto Github PK

deep-learning-activity-recognition's Introduction

Deep Learning for Human Activity Recognition

Prerequisites

Dataset

Network structure

CNN structure

RNN structure

Usage

About the inputs

About the result

Related projects

deep-learning-activity-recognition's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs