{"id":13631642,"url":"https://github.com/lhyfst/3d-detection-papers","last_synced_at":"2025-04-17T22:31:28.490Z","repository":{"id":48172913,"uuid":"175601364","full_name":"lhyfst/3d-detection-papers","owner":"lhyfst","description":"The papers in this list are about 3d detection, especially those using point clouds.","archived":false,"fork":false,"pushed_at":"2020-06-11T05:33:04.000Z","size":57,"stargazers_count":75,"open_issues_count":0,"forks_count":10,"subscribers_count":6,"default_branch":"master","last_synced_at":"2024-08-01T22:49:16.600Z","etag":null,"topics":["3d-detection","cloud-point","paper","read-list"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/lhyfst.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-03-14T10:39:02.000Z","updated_at":"2024-01-05T00:23:29.000Z","dependencies_parsed_at":"2022-09-02T09:10:54.984Z","dependency_job_id":null,"html_url":"https://github.com/lhyfst/3d-detection-papers","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lhyfst%2F3d-detection-papers","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lhyfst%2F3d-detection-papers/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lhyfst%2F3d-detection-papers/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lhyfst%2F3d-detection-papers/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/lhyfst","download_url":"https://codeload.github.com/lhyfst/3d-detection-papers/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":223768616,"owners_count":17199356,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["3d-detection","cloud-point","paper","read-list"],"created_at":"2024-08-01T22:02:33.066Z","updated_at":"2024-11-08T23:31:18.021Z","avatar_url":"https://github.com/lhyfst.png","language":null,"funding_links":[],"categories":["Others"],"sub_categories":[],"readme":"# 3d detection papers\n\n\u003e Author: Li Heyuan (李贺元)\u003cbr\u003e\n\u003e Email: lhyfst@gmail.com\u003cbr\u003e\n\u003e The papers in this list are about Autonomous Vehicles 3d detection \u0026 semantic segmentation\u003cbr\u003e\n\u003e especially those using point clouds and in deep learning methods.\u003cbr\u003e\n\u003e All rights reserved\n\n\nIf you have any suggestion or want to recommend new papers, please feel free to let me know.\u003cbr\u003e\nI have read most of the papers here, and am very happy to discuss with you if you have any issues on these papers.\u003cbr\u003e\nI will keep updating this project continuously.\u003cbr\u003e\n\n---\n\nI will improve this project in a few days.\u003cbr\u003e\n\ntodo list:\n1. links  -- done\n2. recommended papers  -- done. The ones that I particularly liked are marked with :star:. \n3. authors -- done\n4. models  -- remove this item\n5. relevant code  -- done\n6. rank by year  -- done\n7. active researchers  -- done\n8. write a brief review for each mainstream method\n9. try to write some reviews for the recommended papers, if I have time.\n\n\n## Mainstream Method Series\n- Voxel\n- Multi-view\n- Pointnet\n- Other\n\n## Voxel\n- [Sliding shapes for 3d object detection in depth images](http://slidingshapes.cs.princeton.edu/paper.pdf), \nShuran Song, Jianxiong Xiao, \n[code](http://slidingshapes.cs.princeton.edu/), \n[website](http://slidingshapes.cs.princeton.edu/), \nECCV, 2014\n- [Voxnet: A 3d convolutional neural network for real-time object recognition](https://www.ri.cmu.edu/pub_files/2015/9/voxnet_maturana_scherer_iros15.pdf), \nDaniel Maturana and Sebastian Scherer,\n[code](https://github.com/dimatura/voxnet), \n[website](http://dimatura.net/research/voxnet/), \nIROS, 2015 \n- [Deep sliding shapes for amodal 3d object detection in rgb-d images](http://dss.cs.princeton.edu/paper.pdf), \nShuran Song, Jianxiong Xiao, \n[code](https://github.com/shurans/DeepSlidingShape), \n[website](http://dss.cs.princeton.edu/), \nCVPR, 2016\n- [Octnet: Learning deep 3d representations at high resolutions](https://arxiv.org/pdf/1611.05009.pdf), \nGernot Riegler, Ali Osman Ulusoy, Andreas Geiger, \n[code](https://github.com/griegler/octnet), \nCVPR, 2017  :star:\n- [3d fully convolutional network for vehicle detection in point cloud](https://arxiv.org/pdf/1611.08069.pdf), \nGernot Riegler, Ali Osman Ulusoy, Andreas Geiger, \n[unofficial code](https://github.com/yukitsuji/3D_CNN_tensorflow), \nIROS, 2017\n- [Voting for voting in online point cloud object detection](http://www.robots.ox.ac.uk/~mobile/Papers/2015RSS_wang.pdf), \nDominic Zeng Wang, Ingmar Posner,\nRobotics: Science and System, 2015\n- [Vote3deep: Fast object detection in 3d point clouds using efficient convolutional neural networks](https://arxiv.org/pdf/1609.06666),\nMartin Engelcke, Dushyant Rao, Dominic Zeng Wang, Chi Hay Tong, Ingmar Posner, \nICRA, 2017\n\n\n## Multi-view\n- [Pedestrian detection combining RGB and dense LIDAR data](https://ieeexplore.ieee.org/document/6943141), \nCristiano Premebida, João Carreira, Jorge Batista, Urbano Nunes, \nIROS, 2014\n- [3d-assisted feature synthesis for novel views of an object](http://openaccess.thecvf.com/content_iccv_2015/papers/Su_3D-Assisted_Feature_Synthesis_ICCV_2015_paper.pdf), \nHao Su, Fan Wang, Eric Yi, Leonidas Guibas, \nICCV, 2015\n- [Multiview random forest of local experts combining rgb and lidar data for pedestrian detection](https://ieeexplore.ieee.org/document/7225711), \nA. González, G. Villalonga, J. Xu, D. Vázquez, J. Amores, and A. López., \nIEEE Intelligent Vehicles Symposium, 2015\n- [Multiview convolutional neural networks for 3d shape recognition](http://vis-www.cs.umass.edu/mvcnn/docs/su15mvcnn.pdf), \nHang Su Subhransu Maji Evangelos Kalogerakis Erik Learned-Miller, \n[code](https://github.com/jongchyisu/mvcnn_pytorch)\nICCV, 2015\n- [Vehicle detection from 3d lidar using fully convolutional network](https://arxiv.org/pdf/1608.07916.pdf), \nBo Li, Tianlei Zhang, Tian Xia, \nRobotics: Science and System, 2016\n- [Volumetric and multi-view cnns for object classification on 3d data](http://openaccess.thecvf.com/content_cvpr_2016/papers/Qi_Volumetric_and_Multi-View_CVPR_2016_paper.pdf), \nCharles R. Qi, Hao Su, Matthias Niessner, Angela Dai, Mengyuan Yan, Leonidas J. Guibas, \nCVPR, 2016\n- [Fusionnet: 3d object classification using multiple data representations](https://arxiv.org/pdf/1607.05695.pdf), \nVishakh Hegde, Reza Zadeh, \nCoRR, 2016\n- [Multi-view 3d object detection network for autonomous driving](https://arxiv.org/pdf/1611.07759.pdf), \nXiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia, \n[code](https://github.com/bostondiditeam/MV3D), \nCVPR, 2017  :star:\n- [Joint 3d proposal generation and object detection from view aggregation](https://arxiv.org/pdf/1712.02294.pdf), \nJason Ku, Melissa Mozifian, Jungwook Lee, Ali Harakeh, Steven L. Waslander, \n[code](https://github.com/kujason/avod), \nIROS, 2018\n- [Pixor: Real-time 3d object detection from point clouds](https://arxiv.org/pdf/1902.06326), \nBin Yang, Wenjie Luo, Raquel Urtasun, \nCVPR, 2018  :star:\n- [Deep continuous fusion for multi-sensor 3d object detection](http://openaccess.thecvf.com/content_ECCV_2018/papers/Ming_Liang_Deep_Continuous_Fusion_ECCV_2018_paper.pdf), \nMing Liang, Bin Yang, Shenlong Wang, and Raquel Urtasun, \nECCV, 2018  :star:\n- [MLOD: A multi-view 3D object detection based on robust feature fusion method](https://arxiv.org/pdf/1909.04163.pdf)\nJian Deng and Krzysztof Czarnecki, \n2019\n\n## PointNet\n- [Pointnet: Deep learning on point sets for 3d classification and segmentation](https://arxiv.org/abs/1612.00593), \nCharles R. Qi*, Hao Su*, Kaichun Mo, Leonidas J. Guibas, \n[code](https://github.com/charlesq34/pointnet), \n[website](http://stanford.edu/~rqi/pointnet/), \nCVPR, 2017  :star:\n- [Pointnet++: Deep hierarchical feature learning on point sets in a metric space](https://arxiv.org/abs/1706.02413), \nCharles R. Qi, Li (Eric) Yi, Hao Su, Leonidas J. Guibas, \n[code](https://github.com/charlesq34/pointnet2), \n[website](http://stanford.edu/~rqi/pointnet2/), \nNIPS, 2017\n- [Frustum pointnets for 3d object detection from rgb-d data](https://arxiv.org/abs/1711.08488), \nCharles R. Qi, Wei Liu, Chenxia Wu, Hao Su, Leonidas J. Guibas, \n[code](https://github.com/charlesq34/frustum-pointnets), \n[website](http://stanford.edu/~rqi/frustum-pointnets/), \nCVPR, 2018  :star:\n- [PointRCNN 3D Object Proposal Generation and Detection from Point Cloud](https://arxiv.org/pdf/1812.04244.pdf), \nShaoshuai Shi, Xiaogang Wang, Hongsheng Li, \n2018  :star:\n- [Pointfusion: Deep sensor fusion for 3d bounding box estimation](https://arxiv.org/pdf/1711.10871.pdf), \nDanfei Xu, Dragomir Anguelov, Ashesh Jain, \nCVPR, 2018\n- [Voxelnet: End-to-end learning for point cloud based 3d object detection](https://arxiv.org/pdf/1711.06396.pdf), \nYin Zhou, Oncel Tuzel, \n[unofficial code](https://github.com/jeasinema/VoxelNet-tensorflow), \nCVPR, 2018  :star:\n- [Second: Sparsely embedded convolutional detection](https://pdfs.semanticscholar.org/5125/a16039cabc6320c908a4764f32596e018ad3.pdf), \nYan Yan,, Yuxing Mao, and Bo Li, \n[code](https://github.com/traveller59/second.pytorch), \nSensors, 2018\n- [Pointpillars: Encoders for object detection from point clouds](https://arxiv.org/pdf/1812.05784.pdf), \nAlex H. Lang, Sourabh Vora, Holger Caesar, Lubing Zhou, Jiong Yang, Oscar Beijbom, \n[code](https://github.com/nutonomy/second.pytorch), \n2018\n- [RoarNet: A Robust 3D Object Detection based on RegiOn Approximation Refinement](https://arxiv.org/pdf/1811.03818), \nKiwoo Shin, Youngwook Paul Kwon, Masayoshi Tomizuka, \n[code](https://github.com/Kiwoo/RoarNet), \n2018  :star:\n- [IPOD: Intensive Point-based Object Detector for Point Cloud](https://arxiv.org/pdf/1812.05276.pdf), \nZetong Yang, Yanan Sun, Shu Liu, Xiaoyong Shen, Jiaya Jia, \n2018\n- [Frustum ConvNet: Sliding Frustums to Aggregate Local Point-Wise Features for Amodal 3D Object Detection](https://arxiv.org/pdf/1903.01864.pdf), \nZhixin Wang, Kui Jia, \n2019  :star:\n- [3D Object Detection Using Scale Invariant and Feature Reweighting Networks](https://arxiv.org/pdf/1711.10871.pdf), \nXin Zhao, Zhe Liu, Ruolan Hu, Kaiqi Huang, \nAAAI, 2019\n- [STD: Sparse-to-Dense 3D Object Detector for Point Cloud](https://arxiv.org/pdf/1907.10471.pdf)Zetong Yang, Yanan Sun, Shu Liu, Xiaoyong Shen, Jiaya Jia, \nICCV, 2019  :star:\n- [Patch Refinement - Localized 3D Object Detection](https://arxiv.org/pdf/1910.04093.pdf)\nJohannes Lehner, Andreas Mitterecker, Thomas Adler, Markus Hofmarcher, Bernhard Nessler, Sepp Hochreiter, \n2019 \n\n## Other\n- [Recurrent slice networks for 3d segmentation of point clouds](https://arxiv.org/pdf/1802.04402.pdf), \nQiangui Huang, Weiyue Wang, Ulrich Neumann, \n[code](https://github.com/qianguih/RSNet), \nCVPR, 2018  :star:\n- [Pointsift: A sift-like network module for 3d point cloud semantic segmentation](https://arxiv.org/abs/1807.00652), \nMingyang Jiang, Yiran Wu, Tianqi Zhao, Zelin Zhao, Cewu Lu, \n[code](https://github.com/MVIG-SJTU/pointSIFT), \nCoRR, 2018\n- [A General Pipeline for 3D Detection of Vehicles](https://ieeexplore.ieee.org/abstract/document/8461232)\nXinxin Du ; Marcelo H. Ang ; Sertac Karaman ; Daniela Rus,\n2018\n- [Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud](https://arxiv.org/pdf/1907.03670.pdf),Shaoshuai Shi, Zhe Wang, Xiaogang Wang, Hongsheng Li, \n2019  :star:\n- [Multi-Task Multi-Sensor Fusion for 3D Object Detection](http://openaccess.thecvf.com/content_CVPR_2019/papers/Liang_Multi-Task_Multi-Sensor_Fusion_for_3D_Object_Detection_CVPR_2019_paper.pdf), \nMing Liang, Bin Yang, Yun Chen, Rui Hu, Raquel Urtasun,\n2019, :star:\n- [Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression](https://arxiv.org/pdf/1902.09630.pdf),\nHamid Rezatofighi, Nathan Tsoi1, JunYoung Gwak, Amir Sadeghian, Ian Reid, Silvio Savarese,\n2019, :star:\n- [Voxel-FPN: multi-scale voxel feature aggregationin 3D object detection from point clouds](https://arxiv.org/ftp/arxiv/papers/1907/1907.05286.pdf),\nBei Wang, Jianping An and Jiayan Cao,\n2019, :star:\n- [PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud](http://openaccess.thecvf.com/content_CVPR_2019/papers/Shi_PointRCNN_3D_Object_Proposal_Generation_and_Detection_From_Point_Cloud_CVPR_2019_paper.pdf),\nShaoshuai Shi, Xiaogang Wang, Hongsheng Li,\n2019\n- [Fast Point R-CNN](https://arxiv.org/pdf/1908.02990.pdf),\nYilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia, \n2019\n- [Three-dimensional Backbone Network for 3D Object Detection in Traffic Scenes](https://arxiv.org/pdf/1901.08373.pdf)\nXuesong Li, Jose Guivant, Ngaiming Kwok, Yongzhi Xu, Ruowei Li, Hongkun Wu,\n2019\n- [SCNet: Subdivision Coding Network for Object Detection Based on 3D Point Cloud](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8813061)\nZHIYU WANG, HAO FU, LI WANG, LIANG XIAO, AND BIN DA, \n2019\n- [PI-RCNN: An Efficient Multi-sensor 3D Object Detector with Point-based Attentive Cont-conv Fusion Module](https://arxiv.org/pdf/1911.06084.pdf)\nLiang Xie, Chao Xiang, Zhengxu Yu, Guodong Xu, Zheng Yang, Deng Cai1, Xiaofei He, \nAAAI, 2019\n- [HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection]\n(https://arxiv.org/pdf/2003.00186.pdf)\nMaosheng Ye, Shuangjie Xu, Tongyi Cao,\nCVPR, 2020\n- [FPConv: Learning Local Flattening for Point Convolution]\n(https://arxiv.org/pdf/2002.10701.pdf)\nYiqun Lin, Zizheng Yan, Haibin Huang, Dong Du, Ligang Liu, Shuguang Cui, Xiaoguang Han,\nCVPR, 2020\n- [Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection]\n(https://arxiv.org/pdf/2002.10701.pdf)\nLiang Du, Xiaoqing Ye, Xiao Tan, Jianfeng Feng, Zhenbo Xu, Errui Ding and Shilei Wen, \nCVPR, 2020\n- [SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds]\n(https://arxiv.org/pdf/2006.04043.pdf)\nQingdong He, Zhengning Wang, Hao Zeng, Yi Zeng, Shuaicheng Liu, Bing Zeng, \nCVPR, 2020\n- [H3DNet: 3D Object Detection Using Hybrid Geometric Primitives]\n(https://arxiv.org/pdf/2006.05682.pdf)\nZaiwei Zhang, Bo Sun, Haitao Yang, Qixing Huang\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flhyfst%2F3d-detection-papers","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flhyfst%2F3d-detection-papers","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flhyfst%2F3d-detection-papers/lists"}