https://github.com/52CV/CV-Surveys

Surveys related to computer vision, covering object detection, tracking, and more.
Host: GitHub
URL: https://github.com/52CV/CV-Surveys
Owner: 52CV
Created: 2021-01-05T02:58:20.000Z (over 4 years ago)
Default Branch: main
Last Pushed: 2024-10-30T02:17:41.000Z (7 months ago)
Last Synced: 2024-10-30T04:59:15.037Z (7 months ago)
Homepage:
Size: 907 KB
Stars: 1,877
Watchers: 38
Forks: 242
Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project

awesome-yolo-object-detection - 52CV/CV-Surveys: Surveys related to computer vision, covering object detection, tracking, and more. (Summary)
README

        


  



## For the 2025 survey papers, click here ↘️[2025-CV-Surveys](https://github.com/52CV/CV-Surveys)

## 2025 papers sorted by category: click here

↘️[WACV-2025-Papers](https://github.com/52CV/WACV-2025-Papers)

↘️[CVPR-2025-Papers](https://github.com/52CV/CVPR-2025-Papers)

## 2024 papers sorted by category: click here

↘️[WACV-2024-Papers](https://github.com/52CV/WACV-2024-Papers)

↘️[CVPR-2024-Papers](https://github.com/52CV/CVPR-2024-Papers)

↘️[ECCV-2024-Papers](https://github.com/52CV/ECCV-2024-Papers)

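## [2023 papers sorted by category: click here](#00000)

## [2022 papers sorted by category: click here](#0000)

## [2021 papers sorted by category: click here](#000)

## [2020 papers sorted by category: click here](#00)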

# 2025-CV-Surveys

Computer vision surveys from 2025, covering object detection, tracking, and more.

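### :green_book::green_book::green_book: Reply "CV综述" in the chat of the [【我爱计算机视觉】 WeChat official account](https://user-images.githubusercontent.com/62801906/163739684-175f0b8a-871e-4a41-b310-b549625fdcb1.png) to receive a bundled download of all the papers listed here. As of April 18, 158+2 papers have been listed.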

January: 36 papers.

February: 50 papers.

March: 45 papers.

## Table of Contents

|:cat:|:dog:|:tiger:|:wolf:|
|------|------|------|------|
|[1.Unknown (Uncategorized)](#1)|

## Machine Learning

* [Machine Learning Applications to Diffuse Reflectance Spectroscopy in Optical Diagnosis; A Systematic Review](https://arxiv.org/abs/2503.02905)
[2025-03-06]

* Reinforcement Learning

  * [Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey](https://arxiv.org/abs/2503.09956)
[2025-03-14]

* Contrastive Learning

  * [A Survey on Data Curation for Visual Contrastive Learning: Why Crafting Effective Positive and Negative Pairs Matters](https://arxiv.org/abs/2502.08134)
[2025-02-13]

* Class-Incremental Learning

  * [Latest Advancements Towards Catastrophic Forgetting under Data Scarcity: A Comprehensive Survey on Few-Shot Class Incremental Learning](https://arxiv.org/abs/2502.08181)
[2025-02-13]

* Adversarial

  * [A Survey of Adversarial Defenses in Vision-based Systems: Categorization, Methods and Challenges](https://arxiv.org/abs/2503.00384)
[2025-03-04]

## Agriculture

* [A survey of datasets for computer vision in agriculture](https://arxiv.org/abs/2502.16950)
:star:[code](https://smartfarminglab.github.io/field_dataset_survey/)
[2025-02-25]

## Biometrics

* Palmprint Recognition

  * [Deep Learning in Palmprint Recognition-A Comprehensive Survey](https://arxiv.org/abs/2501.01166)
[2025-01-03]

## Neural Radiance Fields

* [Neural Radiance Fields for the Real World: A Survey](https://arxiv.org/abs/2501.13104)
[2025-01-23]

## Robots

* [Semantic Mapping in Indoor Embodied AI – A Comprehensive Survey and Future Directions](https://arxiv.org/abs/2501.05750)
[2025-01-13]

## Industrial Defect Detection

* [Anomaly Detection for Industrial Applications, Its Challenges, Solutions, and Future Directions: A Review](https://arxiv.org/abs/2501.11310)
[2025-01-22]

* [A Survey on Industrial Anomalies Synthesis](https://arxiv.org/abs/2502.16412)
:star:[code](https://github.com/M-3LAB/awesome-anomaly-synthesis)
[2025-02-25]

* [A Survey on Foundation-Model-Based Industrial Defect Detection](https://arxiv.org/abs/2502.19106)
[2025-02-27]

## Video

* [A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems](https://arxiv.org/abs/2502.06581)
[2025-02-11]

## Action Detection

* [Action Valuation in Sports: A Survey](https://arxiv.org/abs/2504.06163)
[2025-04-09]

## Autonomous Driving

* [A Survey of World Models for Autonomous Driving](https://arxiv.org/abs/2501.11260)
[2025-01-22]

* [The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey](https://arxiv.org/abs/2502.10498)
:star:[code](https://github.com/LMD0311/Awesome-World-Model)
[2025-02-18]

* [4D mmWave Radar in Adverse Environments for Autonomous Driving: A Survey](https://arxiv.org/abs/2503.24091)
[2025-04-01]

* [Systematic Literature Review on Vehicular Collaborative Perception -- A Computer Vision Perspective](https://arxiv.org/abs/2504.04631)
[2025-04-08]

* [Adversarial Examples in Environment Perception for Automated Driving (Review)](https://arxiv.org/abs/2504.08414)
[2025-04-14]

* [Collaborative Perception Datasets for Autonomous Driving: A Review](https://arxiv.org/abs/2504.12696)
:star:[code](https://github.com/frankwnb/Collaborative-Perception-Datasets-for-Autonomous-Driving)
[2025-04-18]

* Lane Detection

  * [Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review](https://arxiv.org/abs/2504.08540)
[2025-04-14]

* Distracted Driving Detection

  * [A Review Paper of the Effects of Distinct Modalities and ML Techniques to Distracted Driving Detection](https://arxiv.org/abs/2501.11758)
[2025-01-22]

## Machine Learning

* [A Systematic Review of Machine Learning Methods for Multimodal EEG Data in Clinical Application](https://arxiv.org/abs/2501.08585)
[2025-01-16]

## Few/Zero-Shot Learning/DG/DA (Domain Generalization/Domain Adaptation)

* Non-Transferable Learning

  * [Toward Robust Non-Transferable Learning: A Survey and Benchmark](https://arxiv.org/abs/2502.13593)
[2025-02-20]

## Retrieval-Augmented Generation

* [Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook](https://arxiv.org/abs/2503.18016)
:star:[code](https://github.com/zhengxuJosh/Awesome-RAG-Vision)
[2025-03-25]

## Vision-Language

* [Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability](https://arxiv.org/abs/2501.01346)
[2025-01-03]

* [Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey](https://arxiv.org/abs/2501.02189)
:star:[code](https://github.com/zli12321/Awesome-VLM-Papers-And-Models.git)
[2025-01-07]

* [Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches](https://arxiv.org/abs/2501.03151)
[2025-01-07]

* [Visual Large Language Models for Generalized and Specialized Applications](https://arxiv.org/abs/2501.02765)
:star:[code](https://github.com/JackYFL/awesome-VLLMs)
[2025-01-07]

* [When Data Manipulation Meets Attack Goals: An In-depth Survey of Attacks for VLMs](https://arxiv.org/abs/2502.06390)
:star:[code](https://github.com/AobtDai/VLM_Attack_Paper_List)
[2025-02-11]

* [Survey on Vision-Language-Action Models](https://arxiv.org/abs/2502.06851)
[2025-02-12]

* [Vision-Language Models for Edge Networks: A Comprehensive Survey](https://arxiv.org/abs/2502.07855)
[2025-02-13]

* [Harnessing Vision Models for Time Series Analysis: A Survey](https://arxiv.org/abs/2502.08869)
[2025-02-14]

* [A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations](https://arxiv.org/abs/2502.14881)
:star:[code](https://github.com/XuankunRong/Awesome-LVLM-Safety)
[2025-02-24]

* [Multi-Modal Foundation Models for Computational Pathology: A Survey](https://arxiv.org/abs/2503.09091)
[2025-03-13]

* [Small Vision-Language Models: A Survey on Compact Architectures and Techniques](https://arxiv.org/abs/2503.10665)
[2025-03-17]

* [A Survey on Efficient Vision-Language Models](https://arxiv.org/abs/2504.09724)
:star:[code](https://github.com/MPSC-UMBC/Efficient-Vision-Language-Models-A-Survey)
[2025-04-15]

* LLM

  * [Leveraging Large Language Models For Scalable Vector Graphics Processing: A Review](https://arxiv.org/abs/2503.04983)
[2025-03-10]

  * [A Review on Large Language Models for Visual Analytics](https://arxiv.org/abs/2503.15176)
[2025-03-20]

  * [Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions](https://arxiv.org/abs/2503.16585)
[2025-03-24]

  * [How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM](https://arxiv.org/abs/2504.05786)
[2025-04-09]

* MLLM

  * [Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review](https://arxiv.org/abs/2502.16586)
[2025-02-25]

  * [Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey](https://arxiv.org/abs/2503.12605)
:star:[code](https://github.com/yaotingwangofficial/Awesome-MCoT)
[2025-03-18]

  * [Aligning Multimodal LLM with Human Preference: A Survey](https://arxiv.org/abs/2503.14504)
:star:[code](https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/tree/Alignment)
[2025-03-19]

  * [Survey of Adversarial Robustness in Multimodal Large Language Models](https://arxiv.org/abs/2503.13962)
[2025-03-19]

## GAN/Image Synthesis

* [Generative AI for Cel-Animation: A Survey](https://arxiv.org/abs/2501.06250)
:star:[code](https://github.com/yunlong10/Awesome-AI4Animation)
[2025-01-14]

* [Generative Physical AI in Vision: A Survey](https://arxiv.org/abs/2501.10928)
:star:[code](https://github.com/BestJunYu/Awesome-Physics-aware-Generation)
[2025-01-22]

* [Survey on AI-Generated Media Detection: From Non-MLLM to MLLM](https://arxiv.org/abs/2502.05240)
[2025-02-11]

* [A Survey on Text-Driven 360-Degree Panorama Generation](https://arxiv.org/abs/2502.14799)
:star:[code](https://littlewhitesea.github.io/Text-Driven-Pano-Gen/)
[2025-02-21]

* [Methods and Trends in Detecting Generated Images: A Comprehensive Review](https://arxiv.org/abs/2502.15176)
[2025-02-24]

* [Simulating the Real World: A Unified Survey of Multimodal Generative Models](https://arxiv.org/abs/2503.04641)
[2025-03-07]

* [Generative AI for Film Creation: A Survey of Recent Advances](https://arxiv.org/abs/2504.08296)
[2025-04-14]

* GAN 

  * [Image Inversion: A Survey from GANs to Diffusion and Beyond](https://arxiv.org/abs/2502.11974)
:star:[code](https://github.com/RyanChenYN/ImageInversion)
[2025-02-18]

  * [Generative Adversarial Networks with Limited Data: A Survey and Benchmarking](https://arxiv.org/abs/2504.05456)
[2025-04-09]

* Image Generation

  * [Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing](https://arxiv.org/abs/2502.07829)
[2025-02-13]

  * [Personalized Image Generation with Deep Generative Models: A Decade Survey](https://arxiv.org/abs/2502.13081)
:star:[code](https://github.com/csyxwei/Awesome-Personalized-Image-Generation)
[2025-02-19]

* AIGC

  * [Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC](https://arxiv.org/abs/2502.07007)
[2025-02-12]

* Image-to-Image Translation

  * [Unpaired Image-to-Image Translation with Content Preserving Perspective: A Review](https://arxiv.org/abs/2502.08667)
[2025-02-14]

* Text-Image

  * [A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models](https://arxiv.org/abs/2502.14896)
[2025-02-24]

  * [A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images](https://arxiv.org/abs/2502.21151)
[2025-03-03]

  * [A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety](https://arxiv.org/abs/2503.00020)
[2025-03-04]

  * [A Survey on Self-supervised Contrastive Learning for Multimodal Text-Image Analysis](https://arxiv.org/abs/2503.11101)
[2025-03-17]

  * [A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models](https://arxiv.org/abs/2503.13576)
[2025-03-19]

* Video Generation

  * [A Survey: Spatiotemporal Consistency in Video Generation](https://arxiv.org/abs/2502.17863)
[2025-02-26]

  * [Exploring the Evolution of Physics Cognition in Video Generation: A Survey](https://arxiv.org/abs/2503.21765)
:star:[code](https://github.com/minnie-lin/Awesome-Physics-Cognition-based-Video-Generation)
[2025-03-28]

* 4D Generation

  * [Advances in 4D Generation: A Survey](https://arxiv.org/abs/2503.14501)
:star:[code](https://github.com/MiaoQiaowei/Awesome-4D)
[2025-03-19]

* 3D Generation

  * [Recent Advance in 3D Object and Scene Generation: A Survey](https://arxiv.org/abs/2504.11734)
[2025-04-17]

* Vision-to-Music Generation

  * [Vision-to-Music Generation: A Survey](https://arxiv.org/abs/2503.21254)
:star:[code](https://github.com/wzk1015/Awesome-Vision-to-Music-Generation)
[2025-03-28]

## MC/KD/Pruning (Model Compression/Knowledge Distillation/Pruning)

* [A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion](https://arxiv.org/abs/2501.07451)
[2025-01-14]

* [Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies](https://arxiv.org/abs/2503.02891)
[2025-03-06]

* KD

  * [A Comprehensive Survey on Knowledge Distillation](https://arxiv.org/abs/2503.12067)
:star:[code](https://github.com/IPL-Sharif/KD_Survey)
[2025-03-18]

## Visual Question Answering

* [Visual question answering: from early developments to recent advances -- a survey](https://arxiv.org/abs/2501.03939)
[2025-01-08]

* [The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering](https://arxiv.org/abs/2501.07109)
[2025-01-14]

## Medical Image Processing

* [In the Picture: Medical Imaging Datasets, Artifacts, and their Living Review](https://arxiv.org/abs/2501.10727)
[2025-01-22]

* [Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact](https://arxiv.org/abs/2502.08333)
[2025-02-13]

* [A Survey of LLM-based Agents in Medicine: How far are we from Baymax?](https://arxiv.org/abs/2502.11211)
[2025-02-18]

* [Denoising, segmentation and volumetric rendering of optical coherence tomography angiography (OCTA) image using deep learning techniques: a review](https://arxiv.org/abs/2502.14935)
[2025-02-24]

* [The Impact of Artificial Intelligence on Emergency Medicine: A Review of Recent Advances](https://arxiv.org/abs/2503.14546)
[2025-03-20]

* [Comprehensive Review of Reinforcement Learning for Medical Ultrasound Imaging](https://arxiv.org/abs/2503.16543)
[2025-03-24]

* [Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey](https://arxiv.org/abs/2504.11588)
[2025-04-17]

* Medical Image Segmentation

  * [A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation](https://arxiv.org/abs/2502.06895)
[2025-02-12]

* Surgical Scene Understanding

  * [Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review](https://arxiv.org/abs/2502.14886)
[2025-02-24]

* Surgical Video Segmentation

  * [Deep learning approaches to surgical video segmentation and object detection: A Scoping Review](https://arxiv.org/abs/2502.16459)
[2025-02-25]

* Image Registration

  * [From Traditional to Deep Learning Approaches in Whole Slide Image Registration: A Methodological Review](https://arxiv.org/abs/2502.19123)
[2025-02-27]

* MRI Reconstruction

  * [A Survey of fMRI to Image Reconstruction](https://arxiv.org/abs/2502.16861)
[2025-02-25]

  * [A Comprehensive Survey on Magnetic Resonance Image Reconstruction](https://arxiv.org/abs/2503.07097)
[2025-03-11]

  * [A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli](https://arxiv.org/abs/2503.15978)
:star:[code](https://github.com/LpyNow/BrainDecodingImage)
[2025-03-21]

## OCR

* [Handwritten Text Recognition: A Survey](https://arxiv.org/abs/2502.08417)
[2025-02-13]

## UAV/Remote Sensing/Satellite Image

* [Advancing Earth Observation: A Survey on AI-Powered Image Processing in Satellites](https://arxiv.org/abs/2501.12030)
[2025-01-22]

* [Plantation Monitoring Using Drone Images: A Dataset and Performance Review](https://arxiv.org/abs/2502.08233)
[2025-02-13]

* [A Survey on Remote Sensing Foundation Models: From Vision to Multimodality](https://arxiv.org/abs/2503.22081)
[2025-03-31]

* [A Decade of Deep Learning for Remote Sensing Spatiotemporal Fusion: Advances, Challenges, and Opportunities](https://arxiv.org/abs/2504.00901)
:star:[code](https://github.com/yc-cui/Deep-Learning-Spatiotemporal-Fusion-Survey)
[2025-04-02]

* [MIMRS: A Survey on Masked Image Modeling in Remote Sensing](https://arxiv.org/abs/2504.03181)
[2025-04-07]

* [A comprehensive review of remote sensing in wetland classification and mapping](https://arxiv.org/abs/2504.10842)
[2025-04-16]

* Anti-UAV

  * [Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions](https://arxiv.org/abs/2504.11967)
[2025-04-17]

* Change Detection

  * [Operational Change Detection for Geographical Information: Overview and Challenges](https://arxiv.org/abs/2503.14109)
[2025-03-19]

* Ship Classification

  * [A Survey on SAR ship classification using Deep Learning](https://arxiv.org/abs/2503.11906)
[2025-03-18]

* Fire and Smoke

  * [Fire and Smoke Datasets in 20 Years: An In-depth Review](https://arxiv.org/abs/2503.14552)
[2025-03-20]

## Object Detection

* [YOLOv8 to YOLO11: A Comprehensive Architecture In-depth Comparative Review](https://arxiv.org/abs/2501.13400)
[2025-01-24]

* [Context in object detection: a systematic literature review](https://arxiv.org/abs/2503.23249)
[2025-04-01]

* [Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation](https://arxiv.org/abs/2504.09480)
:star:[code](https://github.com/better-chao/perceptual_abilities_evaluation)
[2025-04-15]

* [A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions](https://arxiv.org/abs/2504.11995)
[2025-04-17]

* Power Line Inspection

  * [Deep Learning in Automated Power Line Inspection: A Review](https://arxiv.org/abs/2502.07826)
[2025-02-13]

* Small Object Detection

  * [Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications](https://arxiv.org/abs/2503.20516)
[2025-03-27]

## HOI

* [3D Human Interaction Generation: A Survey](https://arxiv.org/abs/2503.13120)
[2025-03-18]

* [A Survey on Human Interaction Motion Generation](https://arxiv.org/abs/2503.12763)
:star:[code](https://github.com/soraproducer/Awesome-Human-Interaction-Motion-Generation)
[2025-03-18]

## Action Recognition

* [SMART-Vision: Survey of Modern Action Recognition Techniques in Vision](https://arxiv.org/abs/2501.13066)
[2025-01-23]

## Pose (Pose Estimation)

* [Survey on Hand Gesture Recognition from Visual Input](https://arxiv.org/abs/2501.11992)
[2025-01-22]

## Point Cloud

* [Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A Survey](https://arxiv.org/abs/2501.05473)
[2025-01-13]

* [Point Cloud Based Scene Segmentation: A Survey](https://arxiv.org/abs/2503.12595)
[2025-03-18]

## 3D Visual

* 3D Reconstruction

  * [Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison](https://arxiv.org/abs/2502.20154)
[2025-02-28]

  * [Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey](https://arxiv.org/abs/2503.14537)
[2025-03-20]

  * [A Survey on Event-driven 3D Reconstruction: Development under Different Categories](https://arxiv.org/abs/2503.19753)
[2025-03-26]

  * [Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A systematic literature review](https://arxiv.org/abs/2504.11349)
:star:[code](https://github.com/Bean-Young/AI4Med)
[2025-04-16]

* Depth Estimation

  * [A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision](https://arxiv.org/abs/2501.05147)
[2025-01-10]

  * [Survey on Monocular Metric Depth Estimation](https://arxiv.org/abs/2501.11841)
[2025-01-22]

## Face

* [A Survey on Facial Image Privacy Preservation in Cloud-Based Services](https://arxiv.org/abs/2501.08665)
[2025-01-16]

* [Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities](https://arxiv.org/abs/2502.06803)
[2025-02-12]

* [Face Deepfakes - A Comprehensive Review](https://arxiv.org/abs/2502.09812)
[2025-02-17]

* Sentiment Analysis

  * [Enhanced Sentiment Analysis of Iranian Restaurant Reviews Utilizing Sentiment Intensity Analyzer & Fuzzy Logic](https://arxiv.org/abs/2503.12141)
[2025-03-18]

## Image Segmentation

* [A Comparative Review of the Histogram-based Image Segmentation Methods](https://arxiv.org/abs/2502.18550)
[2025-02-27]

* [SAM2 for Image and Video Segmentation: A Comprehensive Survey](https://arxiv.org/abs/2503.12781)
[2025-03-18]

## Image Retrieval

* [A Comprehensive Survey on Composed Image Retrieval](https://arxiv.org/abs/2502.18495)
[2025-02-27]

* [Composed Multi-modal Retrieval: A Survey of Approaches and Applications](https://arxiv.org/abs/2503.01334)
[2025-03-04]

## Image Classification

* [Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh's Perspective](https://arxiv.org/abs/2501.03305)
[2025-01-08]

## Image Super-Resolution

* [State-of-the-Art Transformer Models for Image Super-Resolution: Techniques, Challenges, and Applications](https://arxiv.org/abs/2501.07855)
[2025-01-15]

## Image/Video Processing

* Image Enhancement

  * [Underwater Image Enhancement using Generative Adversarial Networks: A Survey](https://arxiv.org/abs/2501.06273)
[2025-01-14]

  * [A Comprehensive Survey on Image Signal Processing Approaches for Low-Illumination Image Enhancement](https://arxiv.org/abs/2502.05995)
[2025-02-11]

* Image Quality Assessment/Enhancement

  * [Fundus Image Quality Assessment and Enhancement: a Systematic Review](https://arxiv.org/abs/2501.11520)
[2025-01-22]

  * [A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook](https://arxiv.org/abs/2502.08540)
[2025-02-13]

* Reflection Removal

  * [Survey on Single-Image Reflection Removal using Deep Learning Techniques](https://arxiv.org/abs/2502.08836)
[2025-02-14]

## Unknown (Uncategorized)

* [Visualizing Uncertainty in Image Guided Surgery a Review](https://arxiv.org/abs/2501.06280)
[2025-01-14]

* [A Preliminary Survey of Semantic Descriptive Model for Images](https://arxiv.org/abs/2501.08352)
[2025-01-16]

* [New Fashion Products Performance Forecasting: A Survey on Evolutions, Models and Emerging Trends](https://arxiv.org/abs/2501.10324)
[2025-01-20]

* [Explainable artificial intelligence (XAI): from inherent explainability to large language models](https://arxiv.org/abs/2501.09967)
[2025-01-20]

* [Explainability for Vision Foundation Models: A Survey](https://arxiv.org/abs/2501.12203)
[2025-01-22]

* [Advanced technology in railway track monitoring using the GPR Technique: A Review](https://arxiv.org/abs/2501.11132)
[2025-01-22]

* [Reproducibility review of "Why Not Other Classes": Towards Class-Contrastive Back-Propagation Explanations](https://arxiv.org/abs/2501.11096)
[2025-01-22]

* [Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation](https://arxiv.org/abs/2502.05151)
[2025-02-10]

* [Diffusion Models for Computational Neuroimaging: A Survey](https://arxiv.org/abs/2502.06552)
:star:[code](https://github.com/JoeZhao527/dm4neuro)
[2025-02-11]

* [Safety at Scale: A Comprehensive Survey of Large Model Safety](https://arxiv.org/abs/2502.05206)
[2025-02-11]

* [Event Vision Sensor: A Review](https://arxiv.org/abs/2502.06116)
[2025-02-11]

* [A Survey on Mamba Architecture for Vision Applications](https://arxiv.org/abs/2502.07161)
[2025-02-12]

* [A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision](https://arxiv.org/abs/2502.10444)
:star:[code](https://github.com/52CV/CV-Surveys/)
[2025-02-18]

* [Event-based Solutions for Human-centered Applications: A Comprehensive Review](https://arxiv.org/abs/2502.18490)
:star:[code](https://github.com/nmirabeth/event_human)
[2025-02-27]

* [A Survey on Ordinal Regression: Applications, Advances and Prospects](https://arxiv.org/abs/2503.00952)
[2025-03-04]

* [Lossy Neural Compression for Geospatial Analytics: A Review](https://arxiv.org/abs/2503.01505)
[2025-03-04]

* [A Review on Geometry and Surface Inspection in 3D Concrete Printing](https://arxiv.org/abs/2503.07472)
[2025-03-11]

* [A Systematic Review of ECG Arrhythmia Classification: Adherence to Standards, Fair Evaluation, and Embedded Feasibility](https://arxiv.org/abs/2503.07276)
[2025-03-11]

* [A Survey on Wi-Fi Sensing Generalizability: Taxonomy, Techniques, Datasets, and Future Research Prospects](https://arxiv.org/abs/2503.08008)
[2025-03-12]

* [Challenges and Trends in Egocentric Vision: A Survey](https://arxiv.org/abs/2503.15275)
[2025-03-20]

* [A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions](https://arxiv.org/abs/2503.16546)
[2025-03-24]

* [Hybrid Multi-Stage Learning Framework for Edge Detection: A Survey](https://arxiv.org/abs/2503.21827)
[2025-03-31]

* [Towards Mobile Sensing with Event Cameras on High-mobility Resource-constrained Devices: A Survey](https://arxiv.org/abs/2503.22943)
[2025-04-01]

* [Foundation Models For Seismic Data Processing: An Extensive Review](https://arxiv.org/abs/2503.24166)
[2025-04-01]

* [A Survey of Pathology Foundation Model: Progress and Future Directions](https://arxiv.org/abs/2504.04045)
:star:[code](https://github.com/BearCleverProud/AwesomeWSI)
[2025-04-08]

* [Attention in Diffusion Model: A Survey](https://arxiv.org/abs/2504.03738)
[2025-04-08]

* [Loss Functions in Deep Learning: A Comprehensive Review](https://arxiv.org/abs/2504.04242)
[2025-04-08]

* [Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review](https://arxiv.org/abs/2504.08588)
[2025-04-14]

* [Computer-Aided Layout Generation for Building Design: A Review](https://arxiv.org/abs/2504.09694)
:star:[code](https://github.com/jcliu0428/awesome-building-layout-generation)
[2025-04-15]

* [Digital Twin Generation from Visual Data: A Survey](https://arxiv.org/abs/2504.13159)
:star:[code](https://github.com/ndrwmlnk/awesome-digital-twins)
[2025-04-18]



## 2023 papers sorted by category: click here

↘️[CVPR-2023-Papers](https://github.com/52CV/CVPR-2023-Papers)

↘️[WACV-2023-Papers](https://github.com/52CV/WACV-2023-Papers)

↘️[ICCV-2023-Papers](https://github.com/52CV/ICCV-2023-Papers)

↘️[2023-CV-Surveys](https://github.com/52CV/CV-Surveys/blob/main/2023-CV-Surveys.md)



## 2022 papers sorted by category: click here

↘️[CVPR-2022-Papers](https://github.com/52CV/CVPR-2022-Papers/blob/main/README.md)

↘️[WACV-2022-Papers](https://github.com/52CV/WACV-2022-Papers)

↘️[ECCV-2022-Papers](https://github.com/52CV/ECCV-2022-Papers/blob/main/README.md)



## 2021 papers sorted by category: click here

↘️[ICCV-2021-Papers](https://github.com/52CV/ICCV-2021-Papers)

↘️[CVPR-2021-Papers](https://github.com/52CV/CVPR-2021-Papers)



## 2020 papers sorted by category: click here

↘️[CVPR-2020-Papers](https://github.com/52CV/CVPR-2020-Papers) 

↘️[ECCV-2020-Papers](https://github.com/52CV/ECCV-2020-Papers)

## Scan the QR code to add CV君 on WeChat (mention "CV" in your request) and join the WeChat discussion group:

![image](https://user-images.githubusercontent.com/62801906/112356924-051e6700-8d0a-11eb-96dd-5c9890832fbf.png)