Solution for Project 3 (Behavioral Cloning) of Udacity Self-Driving Car Nanodegree

This solution uses a neural network with only 15 parameters.

Initial idea based on: https://github.com/xslittlegrass/CarND-Behavioral-Cloning

Video of a lap of the track and the output of convolution and max pooling layer available here:

Getting started

Model. py doesn't work directly on the data from Udacity, but instead on data that has already been scaled and augmented. This speeds up training a lot.
That being said, the first step is to place the data in the udacity directory
Run prepare_data.py. It will go over all the images, scale them to 32x16, add augmented data and save them as numpy array in the files x.data.npy and y.data.npy
Run model.py. It will train the model using the data from x.data.npy and y.data.npy
Run drive.py model.json
Repeat 4-5 until a model works correctly. This is due to issue #1

Creation of the training dataset

I've used some of the ideas outlined here: https://chatbotslife.com/using-augmentation-to-mimic-human-driving-496b569760a9#.w8yz2oycs

The dataset used for training is the one provided by Udacity with the following changes:

Left/right camera - Images from left/right camera are also used by modifiying the steering angle with 0.25 (6.25 degress). I've tried with different values and this works well. Using a constant is not exactly correct, because the correction should be dependent on the angle of the car, but this is not implemented
Flipping - Images with non-zero angle are flipped, including those from the left/right camera. This helps balance the neural network and not bias it to right or left turns. This is useful because the training data has more turns to the one side than to the other. There are a lot of zero angles in the dataset, so I don't flip them in order to balance them out.
Brightness adjusment - There is code for brightness adjusment, but is not currently used

Model architecture

The final model consists of only 4 layers:

Average Pooling with 2x2 pool size - Further shrinks the images to 16x8
Convolution with 2x2 kernel and 1x1 stride and ELU activation - Scans the 16x8 picture with kernel size of 2x2 that is moved by 1 pixel at a time and produces 7x15x1 output. It helps with finding features in the input irrespective of their position in the input
Max Pooling with 2x4 pool size - Reduces the dimensions to 3x3x1.
Dropout - Prevents overfitting. This model has only 10 connections at this stage, but still using dropout improves performance
Dense layer with 1 neuron - Used to sum up the data from the max pooling layer and output one variable - the steering angle

Training

Training is done using Adam optimizer
There is an early cutoff to prevent overfitting and to make it easier to change the model - I don't have to change the epoch count
20% of the data is used for validation, testing is performed on the simulator.

How did I reach this model:

I started playing with different models that were failing. After a lot of trial and error, I tried the NVIDIA model, documented here: https://images.nvidia.com/content/tegra/automotive/images/2016/solutions/pdf/end-to-end-dl-using-px.pdf
It worked with my data and I discovered that it has less params than my initial models
I started reducing the dimensions of the model - The NVIDIA model uses 200x66, I tried changing the input to 160x80 and modifying the convolutions accordingly. This further reduced the numbers of parameters, so I figured out that I can reduce them further
A fellow student (Mengxi Wu) posted his model on Udacity's slack with only 400 params and I got inspired to reduce it further. There is a link on top of this document
I started reducing the number of kernels in the convolution and the car was still able to drive itself.
I also noticed that the original model was modified and the max pooling modified to use 4x4 pool size.
For the final model I added the Average Pooling in front. This effectively reduces the image size to 16x8.
Also, because the images are not squares, I changed the MaxPooling to have the same aspect ratio as the input

nikidimi / carnd-behavioral-cloning Goto Github PK

carnd-behavioral-cloning's Introduction

Solution for Project 3 (Behavioral Cloning) of Udacity Self-Driving Car Nanodegree

Getting started

Creation of the training dataset

Model architecture

Training

How did I reach this model:

carnd-behavioral-cloning's People

Contributors

Stargazers

Watchers

Forkers

carnd-behavioral-cloning's Issues

Model training is not always successful

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs