https://github.com/Yangzhangcst/Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.
https://github.com/Yangzhangcst/Transformer-in-Computer-Vision

awesome computer-vision deep-learning detr papers transformer transformer-awesome transformer-cv vit

Last synced: about 1 year ago
JSON representation

A paper list of some recent Transformer-based CV works.

Host: GitHub
URL: https://github.com/Yangzhangcst/Transformer-in-Computer-Vision
Owner: Yangzhangcst
Created: 2021-04-14T05:14:41.000Z (about 5 years ago)
Default Branch: main
Last Pushed: 2025-05-01T01:49:05.000Z (about 1 year ago)
Last Synced: 2025-05-01T02:38:16.075Z (about 1 year ago)
Topics: awesome, computer-vision, deep-learning, detr, papers, transformer, transformer-awesome, transformer-cv, vit
Homepage:
Size: 4.72 MB
Stars: 1,257
Watchers: 41
Forks: 144
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-machine-learning-resources - **[List - in-Computer-Vision?style=social) (Table of Contents)
Awesome-Transformer-Attention - Transformer-in-Computer-Vision (GitHub)
awesome-ai-data-github-repos - Transformer in Vision: paper list of some recent Transformer-based CV works

README

          Transformer-in-Vision[![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)

A paper list of some recent Transformer-based CV works. If you find some ignored papers, please open issues or pull requests.

!!The latest version has been updated, and you can click on the following links to view the list of papers and the codes (if available). The old version is [20240323](list_old_20240323.md).

**Last updated: 2025/05/05

## Table of Contents

- [Survey](main/survey.md)

- Recent Papers

  - [Action](main/action.md)

  - [Active Learning](main/active-learning.md)

  - [Adversarial Attacks](main/adversarial-attacks.md)

  - [Anomaly Detection](main/anomaly-detection.md)

  - [Assessment](main/assessment.md)

  - [Augmentation](main/augmentation.md)

  - [Audio](main/audio.md)

  - [Bird's-Eye-View](main/birds-eye-view.md)

  - [Captioning](main/captioning.md)

  - [Change Detection](main/change-detection.md)

  - [Classification (Backbone)](main/classification-backbone.md)

  - [Clustering](main/clustering.md)

  - [Completion](main/completion.md)

  - [Compression](main/compression.md)

  - [Cross-view](main/cross-view.md)

  - [Crowd](main/crowd.md)

  - [Deblurring](main/deblurring.md)

  - [Depth](main/depth.md)

  - [Deepfake Detection](main/deepfake-detection.md)

  - [Dehazing](main/dehazing.md)

  - [Deraining](main/deraining.md)

  - [Denoising](main/denoising.md)

  - [Detection](main/detection.md)

  - [Diffusion](main/diffusion.md)

  - [Edge](main/edge.md)

  - [Enhancement](main/enhancement.md)

  - [Face](main/face.md)

  - [Federated Learning](main/federated-learning.md)

  - [Few-shot Learning](main/few-shot-learning.md)

  - [Fusion](main/fusion.md)

  - [Gait](main/gait.md)

  - [Gaze](main/gaze.md)

  - [Generative Model](main/generative-model.md)

  - [Graph](main/graph.md)

  - [Hand Gesture](main/hand-gesture.md)

  - [High Dynamic Range Imaging](main/high-dynamic-range-imaging.md)

  - [HOI](main/hoi.md)

  - [Hyperspectral](main/hyperspectral.md)

  - [Illumination](main/illumination.md)

  - [Incremental Learning](main/incremental-learning.md)

  - [In-painting](main/in-painting.md)

  - [Instance Segmentation](main/instance-segmentation.md)

  - [Knowledge Distillation](main/knowledge-distillation.md)

  - [Lane](main/lane.md)

  - [Layout](main/layout.md)

  - [Lighting](main/lighting.md)

  - [LLM](main/llmlvm.md)

  - [Matching](main/matching.md)

  - [Matting](main/matting.md)

  - [Medical](main/medical.md)

  - [Mesh](main/mesh.md)

  - [Metric learning](main/metric-learning.md)

  - [Motion](main/motion.md)

  - [Multi-label](main/multi-label.md)

  - [Multi-task/modal](main/multi-taskmodal.md)

  - [Multi-view Stereo](main/multi-view-stereo.md)

  - [NAS](main/nas.md)

  - [Navigation](main/navigation.md)

  - [Neural Rendering](main/neural-rendering.md)

  - [OCR](main/ocr.md)

  - [Octree](main/octree.md)

  - [Open World](main/open-world.md)

  - [Optical Flow](main/optical-flow.md)

  - [Panoptic Segmentation](main/panoptic-segmentation.md)

  - [Point Cloud](main/point-cloud.md)

  - [Pose](main/pose.md)

  - [Planning](main/planning.md)

  - [Pruning & Quantization](main/pruning--quantization.md)

  - [Recognition](main/recognition.md)

  - [Reconstruction](main/reconstruction.md)

  - [Referring](main/referring.md)

  - [Registration](main/registration.md)

  - [Re-identification](main/re-identification.md)

  - [Remote Sensing](main/remote-sensing.md)

  - [Restoration](main/restoration.md)

  - [Retrieval](main/retrieval.md)

  - [Robotic](main/robotic.md)

  - [Salient Detection](main/salient-detection.md)

  - [Scene](main/scene.md)

  - [Self-supervised Learning](main/self-supervised-learning.md)

  - [Semantic Segmentation](main/semantic-segmentation.md)

  - [Shape](main/shape.md)

  - [SLAM](main/slam.md)

  - [SNN](main/snn.md)

  - [Style Transfer](main/style-transfer.md)

  - [Super-Resolution](main/super-resolution.md)

  - [Synthesis](main/synthesis.md)

  - [Text-to-Image/Video](main/text-to-imagevideo.md)

  - [Texture](main/texture.md)

  - [Time Series](main/time-series.md)

  - [Tracking](main/tracking.md)

  - [Traffic](main/traffic.md)

  - [Transfer learning](main/transfer-learning.md)

  - [Translation](main/translation.md)

  - [Unsupervised learning](main/unsupervised-learning.md)

  - [UAV](main/uav.md)

  - [Video](main/video.md)

  - [Visual Grounding](main/visual-grounding.md)

  - [Visual Question Answering](main/visual-question-answering.md)

  - [Visual Reasoning](main/visual-reasoning.md)

  - [Visual Relationship Detection](main/visual-relationship-detection.md)

  - [Voxel](main/voxel.md)

  - [Weakly Supervised Learning](main/weakly-supervised-learning.md)

  - [Zero-Shot Learning](main/zero-shot-learning.md)

  - [Others](main/others.md)

- [Contact & Feedback](#contact--feedback)

## Contact & Feedback

If you have any suggestions about this project, feel free to contact me.

- [e-mail: yzhangcst[at]gmail.com]

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Yangzhangcst/Transformer-in-Computer-Vision

Awesome Lists containing this project

README