Vehicle Detection and Tracking

In this project, the goal is to provide a vehicle detection and tracking pipeline to identify vehicles in a video from a front-facing camera on a car taken during highway driving.

Histogram of Oriented Gradients (HOG)

Extract HOG features from the training images

I started by reading in all the vehicle and non-vehicle images. Here are some examples of the vehicle and non-vehicle classes:

I then explored different color spaces and different HOG parameters (orientations, pixels_per_cell, and cells_per_block). I grabbed random images from each of the two classes and displayed them to get a feel how a HOG output looks like. Here is an example using the YCrCb color space and HOG parameters of orientations=9, pixels_per_cell=(16, 16) and cells_per_block=(2, 2):

Sample images of the two classes:

HOG output for vehicle sample image:

HOG output for non-vehicle sample image:

Final choice of HOG parameters

I've tried multiple combinations of parameters. So the final parameters are based on emperical results. Ideally, I would have used a grid search to test all different combinations including colour spaces. The best result obtained so far was using orientations=9, pixels_per_cell=(8, 8) and cells_per_block=(2, 2).

Train a classifier

I've trained a linear SVM classifier with default paramters. I used a random split of 20% for the test data. The classifier achieved over 99% accuracy on the test data.

Sliding Window Search

HOG Sub-sampling window search

I decided to use a HOG sub-sampling window search which is a more efficient method for doing the sliding window approach. Each window is defined by a scaling factor where a scale of 1 would result in a window that's 8 x 8 cells then the overlap of each window is in terms of the cell distance which I set to 1. Furthermore, I use different scale values to generate multiple-scaled search windows. Here is an example of the multiple-scaled search windows.

Examples of test images

Ultimately I searched on two scales using YCrCb 3-channel HOG features, which provided a nice result. Here are some example images:

Video Implementation

Test on project video

The pipeline was applied on the provided project video and the final video result was quite well.

Here's a link to my video result

Combining overlapping bounding boxes

I recorded the positions of positive detections in each frame of the video. From the positive detections I created a heatmap and then thresholded that map to identify vehicle positions. Further, I identified individual blobs in the heatmap where I assumed each blob corresponded to a vehicle. I constructed bounding boxes to cover the area of each blob detected.

Here are three frames and their corresponding heatmaps without threshold:

In order to reduce the number of false positives, the last 8 frames were stored and then thresholded on the sum of these heatmaps. As a side effect this techniqe results much more stable bounding boxes as well.

Discussion

One possible problem of the algorithm could occur in the detection of vehicles in different weather and light conditions. The used dataset is too small, so more data sources and data augmentation techniques can help to improve the classifier. Further, an ensemble of different classifiers for detection can make the pipeline more robust and improve the final accuracy. In addition to that, an exploration and combination of different color spaces and the usage of spatial binning and histogram features could be helpful but also imporove the complexity.

By increasing the complexity, the pipeline could suffer under performance issues. I believe that state-of-the art deep learning approaches like SSD and YOLO could help.

Another possible problem is the appearance of false positives which are caused by misclassification. In the current pipeline we are filtering out these false positives with heatmap thresholds. An more intelligent filter technique like tracking the bounding boxes over several frames could decrease the appearance of false positives and improve the overall algorithm.

Improvements:

Fine tune of hype parameters
Usage of histogram and spatial binning features
Ensemble of different classifiers
More training data and data augmetation techniques
More sophisticated filter techniques

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
examples		examples
images		images
input/train		input/train
output_videos		output_videos
test_images		test_images
utils		utils
.gitignore		.gitignore
README.md		README.md
VehicleDetection.ipynb		VehicleDetection.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vehicle Detection and Tracking

Histogram of Oriented Gradients (HOG)

Extract HOG features from the training images

Final choice of HOG parameters

Train a classifier

Sliding Window Search

HOG Sub-sampling window search

Examples of test images

Video Implementation

Test on project video

Combining overlapping bounding boxes

Discussion

About

Releases

Packages

Languages

apgeorg/Vehicle-Detection-And-Tracking

Folders and files

Latest commit

History

Repository files navigation

Vehicle Detection and Tracking

Histogram of Oriented Gradients (HOG)

Extract HOG features from the training images

Final choice of HOG parameters

Train a classifier

Sliding Window Search

HOG Sub-sampling window search

Examples of test images

Video Implementation

Test on project video

Combining overlapping bounding boxes

Discussion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages