https://github.com/danny-1k/comm.ai-calibration-challenge
Giving a go at comma.ai's calibration challenge with deep learning techniques
- Host: GitHub
- URL: https://github.com/danny-1k/comm.ai-calibration-challenge
- Owner: danny-1k
- License: MIT
- Created: 2022-07-28T11:04:52.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-07-30T15:01:11.000Z (almost 3 years ago)
- Last Synced: 2024-12-29T07:36:41.337Z (5 months ago)
- Language: Jupyter Notebook
- Size: 718 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
## README
## [Check out the challenge's github repo](https://github.com/commaai/calib_challenge)
-------
This is an attempt at Comma.ai's Calibration Challenge with DEEP LEARNING.
The approach taken was to use a 7-layer convnet, trained with PyTorch, to predict the pitch and yaw of the camera's motion.
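The repo doesn't spell out the exact architecture in this summary, so the following is only a hypothetical sketch of what a 7-layer convnet regressing pitch and yaw could look like; the layer widths, strides, and input size are my assumptions, not the author's.

```python
import torch
import torch.nn as nn

class CalibNet(nn.Module):
    """Hypothetical 7-layer convnet regressing (pitch, yaw) per frame.
    Layer widths and strides here are illustrative guesses."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            # 7 conv layers on a single-channel (grayscale) input,
            # each stride-2 conv halves the spatial resolution
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(128, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(128, 128, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),  # pool to 1x1 regardless of input size
            nn.Flatten(),
            nn.Linear(128, 2),        # two regression targets: pitch, yaw
        )

    def forward(self, x):
        return self.head(self.features(x))  # shape (batch, 2)

model = CalibNet()
# example: a batch of 4 downsampled grayscale frames (size is illustrative)
out = model(torch.randn(4, 1, 218, 291))
```

The `AdaptiveAvgPool2d` head keeps the sketch independent of the exact downsampled frame size.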
The dataset consisted of 5 videos, each ~1 min long, recorded at 20 fps.
That translates to about `5 x 60 x 20 = 6000` images. In reality there were slightly fewer than 6000 usable images, as frames captured while the car was moving below 4 m/s had no pitch/yaw recorded.
I removed every row that contained a NaN, since those frames are unusable for supervised training. Unlabeled videos were also provided, but for sanity's sake I split the labeled videos into 80% training and 20% testing.
The testing data would help to identify overfitting and determine overall generalization.
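The NaN filtering and 80/20 split described above can be sketched as follows, assuming label files with one whitespace-separated `pitch yaw` pair per frame (as in the challenge repo); the function names are mine, not the author's.

```python
import numpy as np

def load_labels(path):
    """Load per-frame (pitch, yaw) labels and drop frames where either
    value is NaN (low-speed frames are unlabeled in the challenge data)."""
    labels = np.loadtxt(path)                 # shape (n_frames, 2)
    keep = ~np.isnan(labels).any(axis=1)      # mask of fully-labeled frames
    return labels[keep], keep                 # mask maps back to frame indices

def train_test_split(indices, train_frac=0.8, seed=0):
    """Random 80/20 split of frame indices (fixed seed for reproducibility)."""
    rng = np.random.default_rng(seed)
    shuffled = rng.permutation(indices)
    cut = int(len(shuffled) * train_frac)
    return shuffled[:cut], shuffled[cut:]
```

Splitting by frame index (rather than by whole video) is one plausible reading of the 80/20 split; splitting by video would give a stricter generalization test.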
Because of the low data count, data augmentation was applied to improve the generalization of the model.
As for image processing, I converted the images to grayscale and then downsampled them by a factor of ~4.
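A NumPy-only sketch of that preprocessing step (the repo may well use OpenCV or torchvision instead; the luma weights and naive stride-4 decimation are my choices):

```python
import numpy as np

def preprocess(frame_rgb, scale=4):
    """Grayscale + ~4x downsample sketch.
    frame_rgb: (H, W, 3) uint8 array."""
    # ITU-R BT.601 luma weights for RGB -> grayscale
    gray = frame_rgb.astype(np.float32) @ np.array([0.299, 0.587, 0.114])
    small = gray[::scale, ::scale]   # naive every-4th-pixel decimation
    return small / 255.0             # normalise to [0, 1]
```

An area-averaging resize (e.g. `cv2.INTER_AREA`) would alias less than plain decimation, but the idea is the same.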
As an extra sanity check, I decided to investigate which parts of the image influenced the model's predictions; I found it hard to believe the model could get such a low error without overfitting.
The code for this is in `notebooks/Interpretability.ipynb`.
Pitch Saliency map
Yaw Saliency map
The model seems to focus more on the lanes on the left and right side.
There are also large activations for the exposed part of the car interior. But the initial finding kinda makes sense, as the parallax point would have to start from those parts of the image.
In the first visualization, there is an activation in what seems to be the parallax point of the road.
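The notebook isn't reproduced here, so the following is only a generic vanilla-gradient saliency sketch of the kind of check described: the absolute gradient of one output (pitch or yaw) with respect to the input pixels highlights which pixels most influence that prediction.

```python
import torch

def saliency(model, frame, output_index=0):
    """Vanilla gradient saliency for one regression output.
    Convention assumed here: index 0 = pitch, 1 = yaw.
    frame: (1, H, W) tensor."""
    model.eval()
    x = frame.unsqueeze(0).clone().requires_grad_(True)  # (1, 1, H, W)
    out = model(x)                                       # (1, 2)
    out[0, output_index].backward()                      # grad of one output
    return x.grad.abs().squeeze(0).squeeze(0)            # (H, W) saliency map
```

Plotting the returned map (e.g. with `matplotlib.pyplot.imshow`) gives the kind of pitch/yaw saliency images referenced above.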
This challenge was really fun, not gonna lie.
I started out with EDA, checking the data, and initially thought that applying deep learning techniques would be unfeasible because of the lack of data and the scale of the problem. Though the results were promising, I'm a bit sceptical about the model's generalization, as I can't validate on the unlabeled data.
I'm really grateful to the Comma AI team for hosting this challenge.