




# Awesome-Aut-3D-Detection-Methods
## note
This repository collects notable work on 3D object detection for autonomous driving. New papers are added as they appear.
### keywords
#### inputs
__`monocular`__: monocular   __`stereo`__: stereo  __`lidar`__: point cloud  
__`image+lidar`__: image+lidar fusion  
#### datasets
experiments on datasets: __`kitti`__: KITTI   __`nusc`__: nuScenes   __`waymo`__: Waymo   __`ATG4D`__: ATG4D   __`lyft`__: Lyft 
#### code
framework: __`Tensorflow`__: TensorFlow   __`PyTorch`__: PyTorch  

## 2017
- [[CVPR](] PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. [[tensorflow](][[pytorch](] [__`lidar`__] :fire::star:
- [[CVPR](] Multi-View 3D Object Detection Network for Autonomous Driving. [[tensorflow](] [__`image+lidar`__] [__`kitti`__]:fire: :star:
- [[ICRA](] Vote3Deep: Fast object detection in 3D point clouds using efficient convolutional neural networks. [[code_matlab](] [__`lidar`__] [__`kitti`__]:star:
- [[IROS](] 3D fully convolutional network for vehicle detection in point cloud. [[tensorflow](] [__`lidar`__] [__`kitti`__]:fire: :star:
## 2018
- [[CVPR](] PIXOR: Real-time 3D Object Detection from Point Clouds. [[pytorch](] [__`lidar`__] [__`kitti`__][__`ATG4D`__]
- [[CVPR](] VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection. [[tensorflow](] [__`lidar`__] [__`kitti`__]:fire::fire::fire: :star:
- [[CVPR](] PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation. [[code](] [__`image+lidar`__] [__`kitti`__]
- [[CVPR](] Frustum PointNets for 3D Object Detection from RGB-D Data. [[tensorflow](] [__`image+lidar`__] [__`kitti`__] :fire: :star:
- [[ECCV](] Deep Continuous Fusion for Multi-Sensor 3D Object Detection. [__`image+lidar`__] [__`kitti`__] [__`ATG4D`__]
- [[ECCVW](] YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud. [__`lidar`__] [__`kitti`__]
- [[ICRA](] End-to-end Learning of Multi-sensor 3D Tracking by Detection. [__`image+lidar`__] [__`kitti`__]
- [[ICRA](] Robust Real-Time 3D Person Detection for Indoor and Outdoor Applications. [__`lidar`__] [__`kitti`__]
- [[ICRA](] A General Pipeline for 3D Detection of Vehicles. [__`lidar`__] [__`kitti`__]
- [[IROS](] Joint 3D Proposal Generation and Object Detection from View Aggregation. [__`image+lidar`__] [__`kitti`__]:star:
- [[IROS](] Edge and Corner Detection for Unorganized 3D Point Clouds with Application to Robotic Welding. [__`lidar`__] [__`kitti`__]
- [[SENSORS](] SECOND: Sparsely Embedded Convolutional Detection. [[pytorch](][__`lidar`__] [__`kitti`__] :fire::fire::fire::fire:
- [[arXiv](] IPOD: Intensive Point-based Object Detector for Point Cloud. [__`image+lidar`__] [__`kitti`__]
- [[arXiv](] Complex-YOLO: Real-time 3D Object Detection on Point Clouds. [[pytorch](] [__`lidar`__] [__`kitti`__] :fire:
## 2019
- [[CVPR](] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving. [[code](] [__`stereo`__][__`kitti`__]
- [[CVPR](] Stereo R-CNN based 3D Object Detection for Autonomous Driving. [[code](] [__`stereo`__][__`kitti`__]
- [[CVPR](] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud. [[pytorch](] [__`lidar`__] [__`kitti`__]:fire:
- [[CVPR](] PointPillars: Fast Encoders for Object Detection from Point Clouds. [[pytorch](] [__`lidar`__] [__`kitti`__]:fire:
- [[CVPR](] LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving. [__`lidar`__] [__`kitti`__][__`ATG4D`__]
- [[CVPRW](] Attentional PointNet for 3D-Object Detection in Point Clouds. [[pytorch](] [__`lidar`__] [__`kitti`__]
- [[ICCV](] Fast Point R-CNN. [__`lidar`__] [__`kitti`__]
- [[ICCV](] STD: Sparse-to-Dense 3D Object Detector for Point Cloud. [[pytorch](] [__`lidar`__] [__`kitti`__]
- [[ICCV](] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection. [[pytorch](] [__`monocular`__] [__`kitti`__]
- [[ICCVW](] Range Adaptation for 3D Object Detection in LiDAR. [__`lidar`__] [__`kitti`__]
- [[ICCVW](] Multi-View Reprojection Architecture for Orientation Estimation. [__`monocular`__] [__`kitti`__]
- [[NeurIPS](] Point-Voxel CNN for Efficient 3D Deep Learning. [__`lidar`__] [__`kitti`__]
- [[ICMLW](] LiDAR Sensor modeling and Data augmentation with GANs for Autonomous driving. [__`lidar`__]
- [[ICRA](] Focal Loss in 3D Object Detection. [[code](] [__`lidar`__] [__`kitti`__]
- [[ICRA](] SEG-VoxelNet for 3D Vehicle Detection from RGB and LiDAR Data. [__`image+lidar`__] [__`kitti`__]
- [[ICRA](] MVX-Net: Multimodal VoxelNet for 3D Object Detection. [__`image+lidar`__] [__`kitti`__]
- [[AAAI](] MonoGRNet: A Geometric Reasoning Network for 3D Object Localization. [__`monocular`__] [__`kitti`__]
- [[IROS](] EPN: Edge-Aware PointNet for Object Recognition from Multi-View 2.5D Point Clouds. [[tensorflow](] [__`lidar`__] [__`kitti`__]
- [[IROS](] Frustum ConvNet: Sliding Frustums to Aggregate Local Point-Wise Features for Amodal 3D Object Detection. [[pytorch](] [__`image+lidar`__] [__`kitti`__]
- [[IROS](] Improving 3D object detection for pedestrians with virtual multi-view synthesis orientation estimation. [__`lidar`__] [__`kitti`__]
- [[3DV](] IoU Loss for 2D/3D Object Detection. [__`lidar`__] [__`kitti`__]
- [[arXiv](] Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud. [__`monocular`__][__`kitti`__]
- [[arXiv](] FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds. [[code](] [__`lidar`__] [__`kitti`__]
- [[CVPRW](] Complexer-YOLO: Real-Time 3D Object Detection and Tracking on Semantic Point Clouds. [[pytorch](] [__`image+lidar`__][__`kitti`__]:fire:
- [[CVPR](] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction. [[pytorch](] [__`monocular`__][__`kitti`__]
- [[CVPR](] GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving. [__`monocular`__][__`kitti`__]
- [[CVPR](] ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape. [__`monocular`__][__`kitti`__]
- [[CVPR](] Triangulation Learning Network: from Monocular to Stereo 3D Object Detection. [[pytorch](] [__`stereo`__][__`kitti`__]
- [[CoRR](] 3D Backbone Network for 3D Object Detection. [[code](] [__`lidar`__] [__`kitti`__]
- [[arXiv](] nuScenes: A multimodal dataset for autonomous driving. [[link](] [__`dataset`__]
- [[arXiv](] Deformable Filter Convolution for Point Cloud Reasoning. [__`lidar`__] [__`kitti`__][__`ATG4D`__]
- [[arXiv](] PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement. [__`lidar`__] [__`kitti`__][__`ATG4D`__]
## 2020

- [[TPAMI](] Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud. [[pytorch](][__`lidar`__] [__`kitti`__]
- [[AAAI](] TANet: Robust 3D Object Detection from Point Clouds with Triple Attention. [[code](] [__`lidar`__] [__`kitti`__]
- [[AAAI](] PI-RCNN: An Efficient Multi-sensor 3D Object Detector with Point-based Attentive Cont-conv Fusion Module. [__`image+lidar`__] [__`kitti`__]
- [[AAAI](] ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection. [[code](] [__`stereo`__] [__`kitti`__]
- [[AAAI](] Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation. [__`monocular`__] [__`kitti`__]
- [[CVPR](] PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection. [[pytorch](] [__`lidar`__] [__`kitti`__] [__`waymo`__]:fire: :star: :fire: :star:
- [[CVPR](] Structure Aware Single-stage 3D Object Detection from Point Cloud. [[pytorch](] [__`lidar`__] [__`kitti`__] :fire: :star:
- [[CVPR](] 3DSSD: Point-based 3D Single Stage Object Detector. [[TensorFlow](] [__`lidar`__] [__`kitti`__][__`nusc`__] :fire: :star:
- [[CVPR](] Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud. [[TensorFlow](] [__`lidar`__] [__`kitti`__] :fire: :star:
- [[CVPR](] Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection. [__`lidar`__] [__`kitti`__]
- [[CVPR](] PnPNet: End-to-End Perception and Prediction with Tracking in the Loop. [__`lidar`__]
- [[CVPR](] Train in Germany, Test in The USA: Making 3D Object Detectors Generalize. [[code](] [__`lidar`__]
- [[CVPR](] PointPainting: Sequential Fusion for 3D Object Detection. [__`image+lidar`__] [__`kitti`__] [__`nusc`__]
- [[CVPR](] DSGN: Deep Stereo Geometry Network for 3D Object Detection. [__`stereo`__] [__`kitti`__]
- [[CVPR](] Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation. [[code](] [__`stereo`__] [__`kitti`__]
- [[CVPR](] Learning Depth-Guided Convolutions for Monocular 3D Object Detection. [[code](] [__`monocular`__] [__`kitti`__]
- [[CVPR](] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships. [__`monocular`__] [__`kitti`__]
- [[CVPR](] LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention. [__`lidar_video`__] [__`nusc`__]
- [[CVPR](] Physically Realizable Adversarial Examples for LiDAR Object Detection. [__`lidar`__]
- [[CVPR](] HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection. [__`lidar`__] [__`kitti`__]
- [[CVPR](] DOPS: Learning to Detect 3D Objects and Predict Their 3D Shapes. [__`lidar_video`__] [__`waymo`__]
- [[CVPR](] Learning to Evaluate Perception Models Using Planner-Centric Metrics. [__`lidar`__]
- [[CVPR](] What You See is What You Get: Exploiting Visibility for 3D Object Detection. [__`lidar`__] [__`nusc`__]
- [[CVPR](] MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird’s Eye View Maps. [__`lidar`__]
- [[ECCVW](] Deformable PV-RCNN: Improving 3D Object Detection with Learned Deformations. [[code](] [__`lidar`__] [__`kitti`__]
- [[ECCV](] Object as Hotspots. [__`lidar`__] [__`kitti`__]
- [[ECCV](] EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection. [__`image+lidar`__] [__`kitti`__]
- [[ECCV](] 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection. [__`image+lidar`__] [__`kitti`__]
- [[ECCV](] Kinematic 3D Object Detection in Monocular Video. [[code](] [__`monocular_video`__] [__`kitti`__]
- [[ECCV](] Rethinking Pseudo-LiDAR Representation. [[code](] [__`monocular`__] [__`kitti`__]
- [[ECCV](] An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds. [__`lidar`__] [__`waymo`__]
- [[ECCV](] Pillar-based Object Detection for Autonomous Driving. [__`lidar`__] [__`waymo`__]
- [[ECCV](] Active Perception using Light Curtains for Autonomous Driving. [[code](] [__`lidar`__]
- [[ECCV](] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution. [__`lidar`__]
- [[ECCV](] Improving 3D Object Detection through Progressive Population Based Augmentation. [__`lidar`__] [__`kitti`__]
- [[IROS](] MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views.[__`lidar`__] [__`nusc`__]
- [[ACMMM](] Weakly Supervised 3D Object Detection from Point Clouds.[__`lidar`__]
- [[BMVC](] RV-FuseNet: Range View based Fusion of Time-Series LiDAR Data for Joint 3D Object Detection and Motion Forecasting [__`lidar`__][__`nusc`__]
- [[Sensors]()] 3D-GIoU: 3D Generalized Intersection over Union for Object Detection in Point Cloud [__`lidar`__][__`kitti`__]
- [[arxiv](] 3D IoU-Net: IoU Guided 3D Object Detector for Point Clouds [__`lidar`__][__`kitti`__]
- [[arxiv](] Center-based 3D Object Detection and Tracking [[code](][__`lidar`__][__`nusc`__]
- [[arxiv](] Boundary-Aware Dense Feature Indicator for Single-Stage 3D Object Detection from Point Clouds [__`lidar`__][__`nusc`__]
- [[arxiv](] InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling [__`lidar`__][__`nusc`__]
- [[arxiv](] Quantifying Data Augmentation for LiDAR based 3D Object Detection [__`lidar`__][__`kitti`__]
- [[arxiv](] Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection [__`lidar`__][__`kitti`__][__`nusc`__]
- [[arxiv](] Real-time 3D object proposal generation and classification under limited processing resources [__`lidar`__][__`kitti`__]
- [[arxiv](] Safety-Aware Hardening of 3D Object Detection Neural Network Systems [__`lidar`__][__`kitti`__]
- [[arxiv](] Stereo RGB and Deeper LIDAR Based Network for 3D Object Detection [__`stereo`__][__`kitti`__]
- [[arxiv](] SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds [[code](][__`lidar`__][__`kitti`__]
- [[arxiv](] SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds
- [[arxiv](] GhostBuster: Looking Into Shadows to Detect Ghost Objects in Autonomous Vehicle 3D Sensing [__`lidar`__][__`kitti`__]
- [[arxiv](] Cross-Modality 3D Object Detection [__`lidar`__][__`kitti`__]
- [[arxiv](] Towards Autonomous Driving: a Multi-Modal 360° Perception Proposal [__`lidar`__][__`kitti`__]
- [[arxiv](] RangeRCNN: Towards Fast and Accurate 3D Object Detection with Range Image Representation [__`lidar`__][__`kitti`__]
- [[arxiv](] CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection [__`image+lidar`__][__`kitti`__]
- [[arxiv](] Reinforced Axial Refinement Network for Monocular 3D Object Detection [__`monocular`__][__`kitti`__]
- [[arxiv](] PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges [__`lidar`__][__`kitti`__][__`waymo`__]
- [[arxiv](] CenterNet3D: An Anchor free Object Detector for Autonomous Driving [__`lidar`__][__`kitti`__]
- [[arxiv](] Part-Aware Data Augmentation for 3D Object Detection in Point Cloud. [__`lidar`__][__`kitti`__]
- [[arxiv](] 1st Place Solution for Waymo Open Dataset Challenge - 3D Detection and Domain Adaptation. [__`lidar`__][__`waymo`__]
- [[arxiv](] RoIFusion: 3D Object Detection from LiDAR and Vision. [__`image+lidar`__][__`kitti`__]
- [[arxiv](] A Density-Aware PointRCNN for 3D Object Detection in Point Clouds. [__`lidar`__][__`kitti`__]
- [[arxiv](] Radar-Camera Sensor Fusion for Joint Object Detection and Distance Estimation in Autonomous Vehicles. [__`radar+image`__][__`nusc`__]
- [[arxiv](] 3D Object Detection and Tracking Based on Streaming Data. [__`lidar`__][__`kitti`__][__`det_and_tracking`__]
- [[arxiv](] Uncertainty-Aware Voxel based 3D Object Detection and Tracking with von-Mises Loss. [__`lidar`__][__`kitti`__][__`det_and_tracking`__]
- [[arxiv](] Faraway-Frustum: Dealing with Lidar Sparsity for 3D Object Detection using Fusion. [__`lidar`__][__`kitti`__]
- [[arxiv](] Multi-View Adaptive Fusion Network for 3D Object Detection. [__`lidar`__][__`kitti`__]
- [[arxiv](] Stereo Frustums: A Siamese Pipeline for 3D Object Detection. [__`stereo`__][__`kitti`__]
- [[arxiv](] MLOD: Awareness of Extrinsic Perturbation in Multi-LiDAR 3D Object Detection for Autonomous Driving. [__`lidar`__]
- [[arxiv](] SF-UDA3D: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection. [__`lidar`__][__`kitti`__][__`nusc`__]
- [[arxiv](] Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels. [__`monocular`__][__`kitti`__]
- [[arxiv](] SemanticVoxels: Sequential Fusion for 3D Pedestrian Detection using LiDAR Point Cloud and Semantic Segmentation. [__`lidar`__][__`kitti`__]
- [[arxiv](] CoFF: Cooperative Spatial Feature Fusion for 3D Object Detection on Autonomous Vehicles. [__`lidar`__][__`kitti`__]
- [[arxiv](] Multi-Frame to Single-Frame: Knowledge Distillation for 3D Object Detection. [__`lidar`__][__`waymo`__]
- [[arxiv](] MAFF-Net: Filter False Positive for 3D Vehicle Detection with Multi-modal Adaptive Feature Fusion. [__`lidar`__][__`kitti`__]
- [[arxiv](] CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection. [__`radar+image`__][__`nusc`__]
- [[arxiv](] A Neural Network for Fast and Efficient Edge Detection in 3D Point Clouds. [__`lidar`__][__`kitti`__]
- [[arxiv](] CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point Cloud. [__`lidar`__][__`kitti`__]
- [[arxiv](] It’s All Around You: Range-Guided Cylindrical Network for 3D Object Detection. [__`lidar`__][__`kitti`__]
## Survey
- [[TPAMI](] Deep Learning for 3D Point Clouds: A Survey [__`lidar`__]
- [[arxiv](] 3D Point Cloud Processing and Learning for Autonomous Driving [__`lidar`__]
- [[arxiv](] Deep Learning for 3D Point Cloud Understanding: A Survey [__`lidar`__]
- [[arxiv](] An Overview Of 3D Object Detection [__`lidar`__]

## code base
- [[lidar_only](] second.pytorch [__`kitti`__][__`nusc`__]
- [[lidar_only](] Det3D [__`kitti`__][__`nusc`__][__`lyft`__][__`waymo`__]
- [[lidar_only](] OpenPCDet [__`kitti`__][__`nusc`__][__`waymo`__]
- [[lidar_image](] mmdetection3d[__`kitti`__][__`nusc`__][__`lyft`__][__`waymo`__]