DeepLearning-pwcn

There are paper with code and note in terms of deep learning.
https://github.com/Gojay001/DeepLearning-pwcn

Last synced: 1 day ago
JSON representation

Image Classification
- Gradient-based learning applied to document recognition
- NIN - pwcn/tree/master/Classification/NIN/Code)
- Very Deep Convolutional Networks for Large-Scale Image Recognition
- GoogLeNet - foundation.org/openaccess/content_cvpr_2015/papers/Szegedy_Going_Deeper_With_2015_CVPR_paper.pdf) | CVPR(2015) | [PyTorch](https://github.com/Gojay001/DeepLearning-pwcn/tree/master/Classification/GoogLeNet/Code)
- ResNet - pwcn/tree/master/Classification/ResNet/Code)
- Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
- Deep Layer Aggregation
Object Detection
- Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
- Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
- Fast R-CNN
- Faster R-CNN - CNN: Towards Real-Time Object Detection with Region Proposal Networks](https://arxiv.org/abs/1506.01497) | NIPS(2015) | [PyTorch](https://github.com/Gojay001/faster-rcnn.pytorch)
- SSD: Single Shot MultiBox Detector
- You Only Look Once: Unified, Real-Time Object Detection
- YOLO9000: Better, Faster, Stronger
- Focal Loss for Dense Object Detection
- YOLOv3: An Incremental Improvement
- CornerNet: Detecting Objects as Paired Keypoints - vl/CornerNet)
- Objects as Points
- YOLOv4: Optimal Speed and Accuracy of Object Detection
- You Only Look One-level Feature - model/YOLOF)
- Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
- Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
- Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
Object Segmentation
- SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling
- Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs
- Pyramid Scene Parsing Network
- DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
- Fully convolutional networks for semantic segmentation
- U-Net: Convolutional Networks for Biomedical Image Segmentation
- Mask R-CNN - CNN](http://openaccess.thecvf.com/content_ICCV_2017/papers/He_Mask_R-CNN_ICCV_2017_paper.pdf) | ICCV / TPAMI(2017) | [PyTorch](https://github.com/facebookresearch/detectron2)
- Rethinking Atrous Convolution for Semantic Image Segmentation
- PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
- PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
- Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
- Dual Graph Convolutional Network for Semantic Segmentation - DGCNet)
- Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers - zvg/SETR)
- Segmenter: Transformer for Semantic Segmentation
- SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
- Fully Transformer Networks for Semantic ImageSegmentation
Object Tracking
- SORT
- DeepSORT
- Fully-Convolutional Siamese Networks for Object Tracking
- High Performance Visual Tracking with Siamese Region Proposal Network - pytorch)
- SiamRPN++
- SiamMask
- Tracktor
- GlobalTrack - term Tracking](https://arxiv.org/abs/1912.08531) | AAAI(2020) | [PyTorch](https://github.com/huanglianghua/GlobalTrack)
- SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking
- Siamese Box Adaptive Network for Visual Tracking
- Deformable Siamese Attention Networks for Visual Object Tracking - tech/research-siamattn)
- PAMCC-AOT - Assisted Multi-Camera Collaboration for Active Object Tracking](https://arxiv.org/abs/2001.05161) | AAAI(2020) | [code]
- FFT
- JRMOT - Time 3D Multi-Object Tracker and a New Large-Scale Dataset](https://arxiv.org/abs/2002.08397) | arXiv(2020) | [code]
- Tracklet - object Tracking via End-to-end Tracklet Searching and Ranking](https://arxiv.org/abs/2003.02795) | arXiv(2020) | [code]
- Real-time 3D Deep Multi-Camera Tracking
- FairMOT - Object Tracking](https://arxiv.org/abs/2004.01888) | arXiv(2020) | [PyTorch](https://github.com/Gojay001/FairMOT)
- TSDM - refiner and a Mask-generator](https://arxiv.org/abs/2005.04063) | arXiv(2020) | [PyTorch](https://github.com/Gojay001/TSDM)
- Graph Attention Tracking
- Rotation Equivariant Siamese Networks for Tracking - siamnet)
Few-Shot Segmentation
- OSLSM - Shot Learning for Semantic Segmentation](https://arxiv.org/abs/1709.03410) | BMVC(2017) | [Caffe](https://github.com/lzzcd001/OSLSM)
- co-FCN - Shot Semantic Segmentation](https://openreview.net/pdf?id=SkMjFKJwG) | ICLR(2018) | [code]
- AMP: Adaptive Masked Proxies for Few-Shot Segmentation
- SG-One - One: Similarity Guidance Network for One-Shot Semantic Segmentation](https://arxiv.org/abs/1810.09091) | arXiv(2018) / TCYB(2020) | [PyTorch](https://github.com/xiaomengyc/SG-One)
- Learning Combinatorial Embedding Networks for Deep Graph Matching - SJTU/PCA-GM)
- CANet - Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning](https://openaccess.thecvf.com/content_CVPR_2019/papers/Zhang_CANet_Class-Agnostic_Segmentation_Networks_With_Iterative_Refinement_and_Attentive_Few-Shot_CVPR_2019_paper.pdf) | CVPR(2019) | [PyTorch](https://github.com/icoz69/CaNet)
- FGN: Fully Guided Network for Few-Shot Instance Segmentation
- PGNet - Based One-Shot Semantic Segmentation](https://openaccess.thecvf.com/content_ICCV_2019/papers/Zhang_Pyramid_Graph_Networks_With_Connection_Attentions_for_Region-Based_One-Shot_Semantic_ICCV_2019_paper.pdf) | ICCV(2019) | [code]
- CRNet - Reference Networks for Few-Shot Segmentation](https://openaccess.thecvf.com/content_CVPR_2020/papers/Liu_CRNet_Cross-Reference_Networks_for_Few-Shot_Segmentation_CVPR_2020_paper.pdf) | CVPR(2020) | [code]
- On the Texture Bias for Few-Shot CNN Segmentation - segmentation)
- LTM - Shot Segmentation](https://arxiv.org/abs/1910.05886) | MMMM(2020) | [code]
- SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation
- PPNet - aware Prototype Network for Few-shot Semantic Segmentation](https://arxiv.org/abs/2007.06309) | ECCV(2020) | [PyTorch](https://github.com/Xiangyi1996/PPNet-PyTorch)
- PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation - Research-Lab/PFENet)
- Prototype Mixture Models for Few-shot Semantic Segmentation - Bob/PMMs)
- Generalized Few-Shot Semantic Segmentation
- Self-Guided and Cross-Guided Learning for Few-Shot Segmentation
- Adaptive Prototype Learning and Allocation for Few-Shot Segmentation
- Hypercorrelation Squeeze for Few-Shot Segmenation
- Learning What Not to Segment: A New Perspective on Few-Shot Segmentation
- Self-Guided and Cross-Guided Learning for Few-Shot Segmentation
- Adaptive Prototype Learning and Allocation for Few-Shot Segmentation
3D Face Reconstruction and Facial Animation
Attention or Transformer
- Attention Is All You Need
- Non-local Neural Networks - nonlocal-net)
- Image Transformer
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale - research/vision_transformer)
- Swin Transformer: Hierarchical Vision Transformer using Shifted Windows - Transformer)
- ResT: An Efficient Transformer for Visual Recognition
- Dual-stream Network for Visual Recognition
- Transformer in Convolutional Neural Networks - liu/TransCNN)
- Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer
Salient Object Detection
Unsupervised Learning
- Exploring Simple Siamese Representation Learning
3D Object Detection
- PV-RCNN - RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection](http://openaccess.thecvf.com/content_CVPR_2020/papers/Shi_PV-RCNN_Point-Voxel_Feature_Set_Abstraction_for_3D_Object_Detection_CVPR_2020_paper.pdf) | CVPR(2020) | [PyTorch](https://github.com/sshaoshuai/PV-RCNN)
Few-Shot Learning
- RN - Shot Learning](https://arxiv.org/abs/1711.06025) | CVPR(2018) | [PyTorch](https://github.com/Gojay001/LearningToCompare_FSL)
Generative Adversarial Network
- Generative Adversarial Networks
- BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network - group.com/projects/BeautyGAN)
Optimization
Survey
- A Survey on 3D Object Detection Methods for Autonomous Driving Applications
- FSL-Survey-2019 - Shot Learning](https://arxiv.org/abs/1904.05046) | CSUR(2019)
- Deep Learning in Video Multi-Object Tracking: A Survey
- A Survey of Transformers

Categories

Few-Shot Segmentation 22 Object Tracking 20 Object Segmentation 16 Object Detection 16 3D Face Reconstruction and Facial Animation 13 Attention or Transformer 9 Image Classification 7 Optimization 5 Salient Object Detection 5 Survey 4 Generative Adversarial Network 2 Unsupervised Learning 1 Few-Shot Learning 1 3D Object Detection 1

Sub Categories