Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/Yangzhangcst/Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.
https://github.com/Yangzhangcst/Transformer-in-Computer-Vision

awesome computer-vision deep-learning detr papers transformer transformer-awesome transformer-cv vit

Last synced: 2 months ago
JSON representation

A paper list of some recent Transformer-based CV works.

Awesome Lists containing this project

README

        

Transformer-in-Vision[![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)

A paper list of some recent Transformer-based CV works. If you find some ignored papers, please open issues or pull requests.

!!The latest version has been updated, and you can click on the following links to view the list of papers and the codes (if available). The old version is [20240323](list_old_20240323.md).

**Last updated: 2024/11/12

## Table of Contents

- [Survey](main/survey.md)
- Recent Papers
- [Action](main/action.md)
- [Active Learning](main/active-learning.md)
- [Adversarial Attacks](main/adversarial-attacks.md)
- [Anomaly Detection](main/anomaly-detection.md)
- [Assessment](main/assessment.md)
- [Augmentation](main/augmentation.md)
- [Audio](main/audio.md)
- [Bird's-Eye-View](main/birds-eye-view.md)
- [Captioning](main/captioning.md)
- [Change Detection](main/change-detection.md)
- [Classification (Backbone)](main/classification-backbone.md)
- [Clustering](main/clustering.md)
- [Completion](main/completion.md)
- [Compression](main/compression.md)
- [Cross-view](main/cross-view.md)
- [Crowd](main/crowd.md)
- [Deblurring](main/deblurring.md)
- [Depth](main/depth.md)
- [Deepfake Detection](main/deepfake-detection.md)
- [Dehazing](main/dehazing.md)
- [Deraining](main/deraining.md)
- [Denoising](main/denoising.md)
- [Detection](main/detection.md)
- [Diffusion](main/diffusion.md)
- [Edge](main/edge.md)
- [Enhancement](main/enhancement.md)
- [Face](main/face.md)
- [Federated Learning](main/federated-learning.md)
- [Few-shot Learning](main/few-shot-learning.md)
- [Fusion](main/fusion.md)
- [Gait](main/gait.md)
- [Gaze](main/gaze.md)
- [Generative Model](main/generative-model.md)
- [Graph](main/graph.md)
- [Hand Gesture](main/hand-gesture.md)
- [High Dynamic Range Imaging](main/high-dynamic-range-imaging.md)
- [HOI](main/hoi.md)
- [Hyperspectral](main/hyperspectral.md)
- [Illumination](main/illumination.md)
- [Incremental Learning](main/incremental-learning.md)
- [In-painting](main/in-painting.md)
- [Instance Segmentation](main/instance-segmentation.md)
- [Knowledge Distillation](main/knowledge-distillation.md)
- [Lane](main/lane.md)
- [Layout](main/layout.md)
- [Lighting](main/lighting.md)
- [LLM](main/llmlvm.md)
- [Matching](main/matching.md)
- [Matting](main/matting.md)
- [Medical](main/medical.md)
- [Mesh](main/mesh.md)
- [Metric learning](main/metric-learning.md)
- [Motion](main/motion.md)
- [Multi-label](main/multi-label.md)
- [Multi-task/modal](main/multi-taskmodal.md)
- [Multi-view Stereo](main/multi-view-stereo.md)
- [NAS](main/nas.md)
- [Navigation](main/navigation.md)
- [Neural Rendering](main/neural-rendering.md)
- [OCR](main/ocr.md)
- [Octree](main/octree.md)
- [Open World](main/open-world.md)
- [Optical Flow](main/optical-flow.md)
- [Panoptic Segmentation](main/panoptic-segmentation.md)
- [Point Cloud](main/point-cloud.md)
- [Pose](main/pose.md)
- [Planning](main/planning.md)
- [Pruning & Quantization](main/pruning--quantization.md)
- [Recognition](main/recognition.md)
- [Reconstruction](main/reconstruction.md)
- [Referring](main/referring.md)
- [Registration](main/registration.md)
- [Re-identification](main/re-identification.md)
- [Remote Sensing](main/remote-sensing.md)
- [Restoration](main/restoration.md)
- [Retrieval](main/retrieval.md)
- [Robotic](main/robotic.md)
- [Salient Detection](main/salient-detection.md)
- [Scene](main/scene.md)
- [Self-supervised Learning](main/self-supervised-learning.md)
- [Semantic Segmentation](main/semantic-segmentation.md)
- [Shape](main/shape.md)
- [SLAM](main/slam.md)
- [SNN](main/snn.md)
- [Style Transfer](main/style-transfer.md)
- [Super-Resolution](main/super-resolution.md)
- [Synthesis](main/synthesis.md)
- [Text-to-Image/Video](main/text-to-imagevideo.md)
- [Texture](main/texture.md)
- [Time Series](main/time-series.md)
- [Tracking](main/tracking.md)
- [Traffic](main/traffic.md)
- [Transfer learning](main/transfer-learning.md)
- [Translation](main/translation.md)
- [Unsupervised learning](main/unsupervised-learning.md)
- [UAV](main/uav.md)
- [Video](main/video.md)
- [Visual Grounding](main/visual-grounding.md)
- [Visual Question Answering](main/visual-question-answering.md)
- [Visual Reasoning](main/visual-reasoning.md)
- [Visual Relationship Detection](main/visual-relationship-detection.md)
- [Voxel](main/voxel.md)
- [Weakly Supervised Learning](main/weakly-supervised-learning.md)
- [Zero-Shot Learning](main/zero-shot-learning.md)
- [Others](main/others.md)
- [Contact & Feedback](#contact--feedback)

## Contact & Feedback

If you have any suggestions about this project, feel free to contact me.

- [e-mail: yzhangcst[at]gmail.com]