https://github.com/steph1793/Voxelnet
:mag_right: :oncoming_automobile: :articulated_lorry: :walking: :bike: 3D object detection in LIDAR point clouds for autonomous driving
- Host: GitHub
- URL: https://github.com/steph1793/Voxelnet
- Owner: steph1793
- License: gpl-3.0
- Created: 2019-12-25T21:02:04.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2021-09-06T03:31:36.000Z (about 3 years ago)
- Last Synced: 2024-08-01T05:13:20.104Z (4 months ago)
- Language: Python
- Size: 1.19 MB
- Stars: 57
- Watchers: 4
- Forks: 52
- Open Issues: 6
Metadata Files:
- Readme: README.md
- License: LICENSE
# Voxelnet (tensorflow 2.0.0)
![Image of Voxelnet Architecture](https://github.com/steph1793/Voxelnet/blob/master/images/pre.png)

Implementation of [VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection](https://arxiv.org/abs/1711.06396) in tensorflow 2.0.0.
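For orientation, the paper's core building block is the Voxel Feature Encoding (VFE) layer: a point-wise dense layer whose outputs are max-pooled over the points of each voxel and then concatenated back onto every point feature. Below is a minimal TF 2.0 sketch of that idea; the class and variable names are mine and not necessarily those used in this repo.

```python
import tensorflow as tf

class VFELayer(tf.keras.layers.Layer):
    """Sketch of a Voxel Feature Encoding layer from the VoxelNet paper."""
    def __init__(self, out_channels):
        super().__init__()
        self.units = out_channels // 2
        self.dense = tf.keras.layers.Dense(self.units)
        self.bn = tf.keras.layers.BatchNormalization()

    def call(self, x, mask, training=False):
        # x: (voxels, max_points, in_channels) point-wise features
        # mask: (voxels, max_points, 1), 1 for real points, 0 for padding
        pointwise = tf.nn.relu(self.bn(self.dense(x), training=training))
        # element-wise max over the points of each voxel -> locally aggregated feature
        aggregated = tf.reduce_max(pointwise, axis=1, keepdims=True)
        repeated = tf.tile(aggregated, [1, tf.shape(x)[1], 1])
        # concatenate point-wise and aggregated features, zero out padded slots
        return tf.concat([pointwise, repeated], axis=2) * tf.cast(mask, tf.float32)
```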
This project is based on the work of [Qiangui Huang](https://github.com/qianguih) ([project](https://github.com/qianguih/voxelnet)) and [Xiaojian Ma](https://github.com/jeasinema). Thanks to them for their tremendous work, which allowed me to rebuild this architecture and understand the parts that the paper leaves implicit.

# Dependencies
* Python 3.6
* Tensorflow 2.0.0
* opencv
* numba

# Installation
1. Clone this repository
2. Compile the Cython module
```bash
$ python3 setup.py build_ext --inplace
```
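For context, a Cython build script of this kind typically looks like the following. This is a generic sketch: the module name `box_overlaps.pyx` is illustrative, and the repo's actual `setup.py` may list different extension modules.

```python
from setuptools import setup
from Cython.Build import cythonize
import numpy as np

# Generic Cython build script: compiles the .pyx module in place when run
# with `python3 setup.py build_ext --inplace`. Module name is illustrative.
setup(
    name="voxelnet_ext",
    ext_modules=cythonize("box_overlaps.pyx"),
    include_dirs=[np.get_include()],
)
```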
3. Compile the evaluation code (optional)
This compiles the kitti_eval project, which I chose not to run during training (at the dump-test steps). Once training is done, you can run this evaluation code against the checkpoint of your choice. The call is commented out in the training script; un-comment it if you want kitti_eval to run during training.
```bash
$ cd kitti_eval
$ g++ -o evaluate_object_3d_offline evaluate_object_3d_offline.cpp
```
4. Grant execution permission to the evaluation script
```bash
$ cd kitti_eval
$ chmod +x launch_test.sh
```
# Data preparation
1. Download the 3D KITTI detection dataset from [here](http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d). The files to download are:
* Velodyne point clouds (29 GB): input data to VoxelNet
* Training labels of object data set (5 MB): input label to VoxelNet
* Camera calibration matrices of object data set (16 MB): for visualization of predictions
* Left color images of object data set (12 GB): for visualization of predictions
2. In this project, we use cropped point cloud data for training and validation: points that fall outside the image coordinates are removed. Update the directories in `data/crop.py` and run it to generate the cropped data. Note that the cropped point clouds overwrite the raw ones.
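For reference, cropping here means projecting each LIDAR point into the image plane with the KITTI calibration matrices and discarding points that fall outside. A hedged numpy sketch of the idea follows; the function and argument names are illustrative, and `data/crop.py` is authoritative.

```python
import numpy as np

def crop_to_image(points, P2, R0, Tr_velo_to_cam, img_w, img_h):
    """Keep only LIDAR points that project inside the camera image.
    points: (N, 4) array of x, y, z, reflectance in velodyne coordinates.
    P2: (3, 4) camera projection, R0: (3, 3) rectification,
    Tr_velo_to_cam: (3, 4) velodyne-to-camera transform (KITTI calib files)."""
    xyz1 = np.hstack([points[:, :3], np.ones((points.shape[0], 1))])  # homogeneous
    cam = R0 @ (Tr_velo_to_cam @ xyz1.T)             # (3, N) camera coordinates
    cam1 = np.vstack([cam, np.ones((1, cam.shape[1]))])
    img = P2 @ cam1                                  # (3, N) image-plane coordinates
    u, v = img[0] / img[2], img[1] / img[2]
    in_front = cam[2] > 0                            # keep points in front of the camera
    in_image = (u >= 0) & (u < img_w) & (v >= 0) & (v < img_h)
    return points[in_front & in_image]
```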
3. Split the training set into a training and a validation set according to the protocol [here](https://xiaozhichen.github.io/files/mv3d/imagesets.tar.gz), and rearrange the folders into the following structure:
```plain
└── DATA_DIR
    ├── training   <-- training data
    |   ├── image_2
    |   ├── label_2
    |   └── velodyne
    └── validation <-- evaluation data
        ├── image_2
        ├── label_2
        └── velodyne
```
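Before moving on to training, a quick sanity check of this layout can save a failed run. A throwaway sketch; the `DATA_DIR` path is a placeholder matching the `--data_root_dir` flag used below.

```python
import os

# Verify that every expected split/subfolder exists; path is a placeholder.
DATA_DIR = "../DATA_DIR/T_DATA"
for split in ("training", "validation"):
    for sub in ("image_2", "label_2", "velodyne"):
        path = os.path.join(DATA_DIR, split, sub)
        print(path, "OK" if os.path.isdir(path) else "MISSING")
```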
# Train

Run `train.py`. The meaning of each hyperparameter is documented in the script file.
```bash
$ python train.py \
--strategy="all" \
--n_epochs=160 \
--batch_size=2 \
--learning_rate=0.001 \
--small_addon_for_BCE=1e-6 \
--max_gradient_norm=5 \
--alpha_bce=1.5 \
--beta_bce=1 \
--huber_delta=3 \
--dump_vis="no" \
--data_root_dir="../DATA_DIR/T_DATA" \
--model_dir="model" \
--model_name="model6" \
--dump_test_interval=40 \
--summary_interval=10 \
--summary_val_interval=10 \
--summary_flush_interval=20 \
--ckpt_max_keep=10
```

# Evaluate
1. Run `predict.py`:
```bash
$ python predict.py \
--strategy="all" \
--batch_size=2 \
--dump_vis="yes" \
--data_root_dir="../DATA_DIR/T_DATA/" \
--dataset_to_test="validation" \
--model_dir="model" \
--model_name="model6" \
--ckpt_name="" \
```
2. Then run the kitti_eval project to compute the performance of the model:
```bash
./kitti_eval/evaluate_object_3d_offline [DATA_DIR]/validation/label_2 ./predictions [output file]
```

# What's new
* Tensorflow 2.0.0
* Data pipeline built with the tensorflow dataset API
* Eager mode (with autograph for speed)
* A variant of the smooth-L1 regression loss (Huber loss)
* tf.distribute for multi-GPU training (work in progress; only single-GPU training works for now)
* The first type of data augmentation from the paper is not used (this may decrease performance)
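To make the loss change concrete, below is a minimal sketch of a Huber-type regression loss compiled with `tf.function`/autograph. The variable names and masking scheme are mine, not the repo's; the actual loss is driven by the `--huber_delta` flag (3 in the command above).

```python
import tensorflow as tf

@tf.function  # autograph traces this into a graph for speed
def regression_loss(reg_target, reg_pred, pos_mask, delta=3.0):
    """Huber loss as a smooth-L1 variant, averaged over positive anchors.
    Assumes pos_mask broadcasts against the regression tensors."""
    abs_err = tf.abs(reg_target - reg_pred)
    quadratic = tf.minimum(abs_err, delta)   # quadratic region |err| <= delta
    linear = abs_err - quadratic             # linear region beyond delta
    huber = 0.5 * quadratic ** 2 + delta * linear
    pos = tf.cast(pos_mask, tf.float32)
    return tf.reduce_sum(huber * pos) / tf.maximum(tf.reduce_sum(pos), 1.0)
```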
# Performances
(ongoing)
I have just finished the project and started training it. Before that, I ran a number of tests to challenge the architecture; one of them was overfitting the model on a small training set in a few steps, to check that the model I built is able to learn anything at all. Results below. (PS: I tried to stay as faithful to the paper as I could.)

![perf](https://github.com/steph1793/Voxelnet/blob/master/images/Capture3.PNG)
![perf2](https://github.com/steph1793/Voxelnet/blob/master/images/Capture4.PNG)

# Coming next
* Train models for car detection
* Finish the multi-GPU interface
* Train models for Pedestrian and Cyclist detection
* Try new features that I'll communicate soon