https://github.com/52CV/CV-Surveys
Computer vision surveys, covering object detection, tracking, and more.
- Host: GitHub
- URL: https://github.com/52CV/CV-Surveys
- Owner: 52CV
- Created: 2021-01-05T02:58:20.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2024-10-30T02:17:41.000Z (6 months ago)
- Last Synced: 2024-10-30T04:59:15.037Z (6 months ago)
- Homepage:
- Size: 907 KB
- Stars: 1,877
- Watchers: 38
- Forks: 242
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-yolo-object-detection - 52CV/CV-Surveys: Computer vision surveys, covering object detection, tracking, and more. (Summary)
README
## For the 2025 survey papers, click here ↘️[2025-CV-Surveys](https://github.com/52CV/CV-Surveys)
## 2025 papers by category, click here
↘️[WACV-2025-Papers](https://github.com/52CV/WACV-2025-Papers)
↘️[CVPR-2025-Papers](https://github.com/52CV/CVPR-2025-Papers)

## 2024 papers by category, click here
↘️[WACV-2024-Papers](https://github.com/52CV/WACV-2024-Papers)
↘️[CVPR-2024-Papers](https://github.com/52CV/CVPR-2024-Papers)
↘️[ECCV-2024-Papers](https://github.com/52CV/ECCV-2024-Papers)

## [2023 papers by category, click here](#00000)
## [2022 papers by category, click here](#0000)
## [2021 papers by category, click here](#000)
## [2020 papers by category, click here](#00)

# 2025-CV-Surveys
Computer vision surveys from 2025, covering object detection, tracking, and more.
### :green_book::green_book::green_book: Reply "CV综述" in the backend of the [我爱计算机视觉 WeChat official account](https://user-images.githubusercontent.com/62801906/163739684-175f0b8a-871e-4a41-b310-b549625fdcb1.png) to receive a bundled download of all the papers listed here. As of April 18, 158+2 papers have been published.
January: 36 papers.
February: 50 papers.
March: 45 papers.

## Contents
|:cat:|:dog:|:tiger:|:wolf:|
|------|------|------|------|
|[1.Unknown (Uncategorized)](#1)|

## Machine Learning
* [Machine Learning Applications to Diffuse Reflectance Spectroscopy in Optical Diagnosis; A Systematic Review](https://arxiv.org/abs/2503.02905)
[2025-03-06]
* Reinforcement learning
* [Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey](https://arxiv.org/abs/2503.09956)
[2025-03-14]
* Contrastive learning
* [A Survey on Data Curation for Visual Contrastive Learning: Why Crafting Effective Positive and Negative Pairs Matters](https://arxiv.org/abs/2502.08134)
[2025-02-13]
* Class-incremental learning
* [Latest Advancements Towards Catastrophic Forgetting under Data Scarcity: A Comprehensive Survey on Few-Shot Class Incremental Learning](https://arxiv.org/abs/2502.08181)
[2025-02-13]
* Adversarial
* [A Survey of Adversarial Defenses in Vision-based Systems: Categorization, Methods and Challenges](https://arxiv.org/abs/2503.00384)
[2025-03-04]

## Agriculture
* [A survey of datasets for computer vision in agriculture](https://arxiv.org/abs/2502.16950)
:star:[code](https://smartfarminglab.github.io/field_dataset_survey/)
[2025-02-25]

## Biometrics
* Palmprint recognition
* [Deep Learning in Palmprint Recognition-A Comprehensive Survey](https://arxiv.org/abs/2501.01166)
[2025-01-03]

## Neural Radiance Fields
* [Neural Radiance Fields for the Real World: A Survey](https://arxiv.org/abs/2501.13104)
[2025-01-23]

## Robots
* [Semantic Mapping in Indoor Embodied AI – A Comprehensive Survey and Future Directions](https://arxiv.org/abs/2501.05750)
[2025-01-13]

## Industrial Defect Detection
* [Anomaly Detection for Industrial Applications, Its Challenges, Solutions, and Future Directions: A Review](https://arxiv.org/abs/2501.11310)
[2025-01-22]
* [A Survey on Industrial Anomalies Synthesis](https://arxiv.org/abs/2502.16412)
:star:[code](https://github.com/M-3LAB/awesome-anomaly-synthesis)
[2025-02-25]
* [A Survey on Foundation-Model-Based Industrial Defect Detection](https://arxiv.org/abs/2502.19106)
[2025-02-27]

## Video
* [A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems](https://arxiv.org/abs/2502.06581)
[2025-02-11]

## Action Detection
* [Action Valuation in Sports: A Survey](https://arxiv.org/abs/2504.06163)
[2025-04-09]

## Autonomous Driving
* [A Survey of World Models for Autonomous Driving](https://arxiv.org/abs/2501.11260)
[2025-01-22]
* [The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey](https://arxiv.org/abs/2502.10498)
:star:[code](https://github.com/LMD0311/Awesome-World-Model)
[2025-02-18]
* [4D mmWave Radar in Adverse Environments for Autonomous Driving: A Survey](https://arxiv.org/abs/2503.24091)
[2025-04-01]
* [Systematic Literature Review on Vehicular Collaborative Perception -- A Computer Vision Perspective](https://arxiv.org/abs/2504.04631)
[2025-04-08]
* [Adversarial Examples in Environment Perception for Automated Driving (Review)](https://arxiv.org/abs/2504.08414)
[2025-04-14]
* [Collaborative Perception Datasets for Autonomous Driving: A Review](https://arxiv.org/abs/2504.12696)
:star:[code](https://github.com/frankwnb/Collaborative-Perception-Datasets-for-Autonomous-Driving)
[2025-04-18]
* Lane detection
* [Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review](https://arxiv.org/abs/2504.08540)
[2025-04-14]
* Distracted driving detection
* [A Review Paper of the Effects of Distinct Modalities and ML Techniques to Distracted Driving Detection](https://arxiv.org/abs/2501.11758)
[2025-01-22]

## Machine Learning
* [A Systematic Review of Machine Learning Methods for Multimodal EEG Data in Clinical Application](https://arxiv.org/abs/2501.08585)
[2025-01-16]

## Few/Zero-Shot Learning/DG/A (Few-/Zero-Shot Learning, Domain Generalization, Domain Adaptation)
* Non-Transferable Learning
* [Toward Robust Non-Transferable Learning: A Survey and Benchmark](https://arxiv.org/abs/2502.13593)
[2025-02-20]

## Retrieval-Augmented Generation
* [Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook](https://arxiv.org/abs/2503.18016)
:star:[code](https://github.com/zhengxuJosh/Awesome-RAG-Vision)
[2025-03-25]

## Vision-Language
* [Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability](https://arxiv.org/abs/2501.01346)
[2025-01-03]
* [Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey](https://arxiv.org/abs/2501.02189)
:star:[code](https://github.com/zli12321/Awesome-VLM-Papers-And-Models.git)
[2025-01-07]
* [Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches](https://arxiv.org/abs/2501.03151)
[2025-01-07]
* [Visual Large Language Models for Generalized and Specialized Applications](https://arxiv.org/abs/2501.02765)
:star:[code](https://github.com/JackYFL/awesome-VLLMs)
[2025-01-07]
* [When Data Manipulation Meets Attack Goals: An In-depth Survey of Attacks for VLMs](https://arxiv.org/abs/2502.06390)
:star:[code](https://github.com/AobtDai/VLM_Attack_Paper_List)
[2025-02-11]
* [Survey on Vision-Language-Action Models](https://arxiv.org/abs/2502.06851)
[2025-02-12]
* [Vision-Language Models for Edge Networks: A Comprehensive Survey](https://arxiv.org/abs/2502.07855)
[2025-02-13]
* [Harnessing Vision Models for Time Series Analysis: A Survey](https://arxiv.org/abs/2502.08869)
[2025-02-14]
* [A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations](https://arxiv.org/abs/2502.14881)
:star:[code](https://github.com/XuankunRong/Awesome-LVLM-Safety)
[2025-02-24]
* [Multi-Modal Foundation Models for Computational Pathology: A Survey](https://arxiv.org/abs/2503.09091)
[2025-03-13]
* [Small Vision-Language Models: A Survey on Compact Architectures and Techniques](https://arxiv.org/abs/2503.10665)
[2025-03-17]
* [A Survey on Efficient Vision-Language Models](https://arxiv.org/abs/2504.09724)
:star:[code](https://github.com/MPSC-UMBC/Efficient-Vision-Language-Models-A-Survey)
[2025-04-15]
* LLM
* [Leveraging Large Language Models For Scalable Vector Graphics Processing: A Review](https://arxiv.org/abs/2503.04983)
[2025-03-10]
* [A Review on Large Language Models for Visual Analytics](https://arxiv.org/abs/2503.15176)
[2025-03-20]
* [Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions](https://arxiv.org/abs/2503.16585)
[2025-03-24]
* [How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM](https://arxiv.org/abs/2504.05786)
[2025-04-09]
* MLLM
* [Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review](https://arxiv.org/abs/2502.16586)
[2025-02-25]
* [Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey](https://arxiv.org/abs/2503.12605)
:star:[code](https://github.com/yaotingwangofficial/Awesome-MCoT)
[2025-03-18]
* [Aligning Multimodal LLM with Human Preference: A Survey](https://arxiv.org/abs/2503.14504)
:star:[code](https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/tree/Alignment)
[2025-03-19]
* [Survey of Adversarial Robustness in Multimodal Large Language Models](https://arxiv.org/abs/2503.13962)
[2025-03-19]

## GAN/Image Synthesis
* [Generative AI for Cel-Animation: A Survey](https://arxiv.org/abs/2501.06250)
:star:[code](https://github.com/yunlong10/Awesome-AI4Animation)
[2025-01-14]
* [Generative Physical AI in Vision: A Survey](https://arxiv.org/abs/2501.10928)
:star:[code](https://github.com/BestJunYu/Awesome-Physics-aware-Generation)
[2025-01-22]
* [Survey on AI-Generated Media Detection: From Non-MLLM to MLLM](https://arxiv.org/abs/2502.05240)
[2025-02-11]
* [A Survey on Text-Driven 360-Degree Panorama Generation](https://arxiv.org/abs/2502.14799)
:star:[code](https://littlewhitesea.github.io/Text-Driven-Pano-Gen/)
[2025-02-21]
* [Methods and Trends in Detecting Generated Images: A Comprehensive Review](https://arxiv.org/abs/2502.15176)
[2025-02-24]
* [Simulating the Real World: A Unified Survey of Multimodal Generative Models](https://arxiv.org/abs/2503.04641)
[2025-03-07]
* [Generative AI for Film Creation: A Survey of Recent Advances](https://arxiv.org/abs/2504.08296)
[2025-04-14]
* GAN
* [Image Inversion: A Survey from GANs to Diffusion and Beyond](https://arxiv.org/abs/2502.11974)
:star:[code](https://github.com/RyanChenYN/ImageInversion)
[2025-02-18]
* [Generative Adversarial Networks with Limited Data: A Survey and Benchmarking](https://arxiv.org/abs/2504.05456)
[2025-04-09]
* Image generation
* [Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing](https://arxiv.org/abs/2502.07829)
[2025-02-13]
* [Personalized Image Generation with Deep Generative Models: A Decade Survey](https://arxiv.org/abs/2502.13081)
:star:[code](https://github.com/csyxwei/Awesome-Personalized-Image-Generation)
[2025-02-19]
* AIGC
* [Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC](https://arxiv.org/abs/2502.07007)
[2025-02-12]
* Image-to-image translation
* [Unpaired Image-to-Image Translation with Content Preserving Perspective: A Review](https://arxiv.org/abs/2502.08667)
[2025-02-14]
* Text-image
* [A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models](https://arxiv.org/abs/2502.14896)
[2025-02-24]
* [A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images](https://arxiv.org/abs/2502.21151)
[2025-03-03]
* [A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety](https://arxiv.org/abs/2503.00020)
[2025-03-04]
* [A Survey on Self-supervised Contrastive Learning for Multimodal Text-Image Analysis](https://arxiv.org/abs/2503.11101)
[2025-03-17]
* [A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models](https://arxiv.org/abs/2503.13576)
[2025-03-19]
* Video generation
* [A Survey: Spatiotemporal Consistency in Video Generation](https://arxiv.org/abs/2502.17863)
[2025-02-26]
* [Exploring the Evolution of Physics Cognition in Video Generation: A Survey](https://arxiv.org/abs/2503.21765)
:star:[code](https://github.com/minnie-lin/Awesome-Physics-Cognition-based-Video-Generation)
[2025-03-28]
* 4D generation
* [Advances in 4D Generation: A Survey](https://arxiv.org/abs/2503.14501)
:star:[code](https://github.com/MiaoQiaowei/Awesome-4D)
[2025-03-19]
* 3D generation
* [Recent Advance in 3D Object and Scene Generation: A Survey](https://arxiv.org/abs/2504.11734)
[2025-04-17]
* Vision-to-music generation
* [Vision-to-Music Generation: A Survey](https://arxiv.org/abs/2503.21254)
:star:[code](https://github.com/wzk1015/Awesome-Vision-to-Music-Generation)
[2025-03-28]

## MC/KD/Pruning (Model Compression/Knowledge Distillation/Pruning)
* [A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion](https://arxiv.org/abs/2501.07451)
[2025-01-14]
* [Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies](https://arxiv.org/abs/2503.02891)
[2025-03-06]
* KD
* [A Comprehensive Survey on Knowledge Distillation](https://arxiv.org/abs/2503.12067)
:star:[code](https://github.com/IPL-Sharif/KD_Survey)
[2025-03-18]

## Visual Question Answering
* [Visual question answering: from early developments to recent advances -- a survey](https://arxiv.org/abs/2501.03939)
[2025-01-08]
* [The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering](https://arxiv.org/abs/2501.07109)
[2025-01-14]

## Medical Image Processing
* [In the Picture: Medical Imaging Datasets, Artifacts, and their Living Review](https://arxiv.org/abs/2501.10727)
[2025-01-22]
* [Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact](https://arxiv.org/abs/2502.08333)
[2025-02-13]
* [A Survey of LLM-based Agents in Medicine: How far are we from Baymax?](https://arxiv.org/abs/2502.11211)
[2025-02-18]
* [Denoising, segmentation and volumetric rendering of optical coherence tomography angiography (OCTA) image using deep learning techniques: a review](https://arxiv.org/abs/2502.14935)
[2025-02-24]
* [The Impact of Artificial Intelligence on Emergency Medicine: A Review of Recent Advances](https://arxiv.org/abs/2503.14546)
[2025-03-20]
* [Comprehensive Review of Reinforcement Learning for Medical Ultrasound Imaging](https://arxiv.org/abs/2503.16543)
[2025-03-24]
* [Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey](https://arxiv.org/abs/2504.11588)
[2025-04-17]
* Medical image segmentation
* [A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation](https://arxiv.org/abs/2502.06895)
[2025-02-12]
* Surgical scene understanding
* [Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review](https://arxiv.org/abs/2502.14886)
[2025-02-24]
* Surgical video segmentation
* [Deep learning approaches to surgical video segmentation and object detection: A Scoping Review](https://arxiv.org/abs/2502.16459)
[2025-02-25]
* Image registration
* [From Traditional to Deep Learning Approaches in Whole Slide Image Registration: A Methodological Review](https://arxiv.org/abs/2502.19123)
[2025-02-27]
* MRI reconstruction
* [A Survey of fMRI to Image Reconstruction](https://arxiv.org/abs/2502.16861)
[2025-02-25]
* [A Comprehensive Survey on Magnetic Resonance Image Reconstruction](https://arxiv.org/abs/2503.07097)
[2025-03-11]
* [A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli](https://arxiv.org/abs/2503.15978)
:star:[code](https://github.com/LpyNow/BrainDecodingImage)
[2025-03-21]

## OCR
* [Handwritten Text Recognition: A Survey](https://arxiv.org/abs/2502.08417)
[2025-02-13]

## UAV/Remote Sensing/Satellite Image
* [Advancing Earth Observation: A Survey on AI-Powered Image Processing in Satellites](https://arxiv.org/abs/2501.12030)
[2025-01-22]
* [Plantation Monitoring Using Drone Images: A Dataset and Performance Review](https://arxiv.org/abs/2502.08233)
[2025-02-13]
* [A Survey on Remote Sensing Foundation Models: From Vision to Multimodality](https://arxiv.org/abs/2503.22081)
[2025-03-31]
* [A Decade of Deep Learning for Remote Sensing Spatiotemporal Fusion: Advances, Challenges, and Opportunities](https://arxiv.org/abs/2504.00901)
:star:[code](https://github.com/yc-cui/Deep-Learning-Spatiotemporal-Fusion-Survey)
[2025-04-02]
* [MIMRS: A Survey on Masked Image Modeling in Remote Sensing](https://arxiv.org/abs/2504.03181)
[2025-04-07]
* [A comprehensive review of remote sensing in wetland classification and mapping](https://arxiv.org/abs/2504.10842)
[2025-04-16]
* Anti-UAV
* [Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions](https://arxiv.org/abs/2504.11967)
[2025-04-17]
* Change detection
* [Operational Change Detection for Geographical Information: Overview and Challenges](https://arxiv.org/abs/2503.14109)
[2025-03-19]
* Ship classification
* [A Survey on SAR ship classification using Deep Learning](https://arxiv.org/abs/2503.11906)
[2025-03-18]
* Fire and smoke
* [Fire and Smoke Datasets in 20 Years: An In-depth Review](https://arxiv.org/abs/2503.14552)
[2025-03-20]

## Object Detection
* [YOLOv8 to YOLO11: A Comprehensive Architecture In-depth Comparative Review](https://arxiv.org/abs/2501.13400)
[2025-01-24]
* [Context in object detection: a systematic literature review](https://arxiv.org/abs/2503.23249)
[2025-04-01]
* [Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation](https://arxiv.org/abs/2504.09480)
:star:[code](https://github.com/better-chao/perceptual_abilities_evaluation)
[2025-04-15]
* [A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions](https://arxiv.org/abs/2504.11995)
[2025-04-17]
* Power line inspection
* [Deep Learning in Automated Power Line Inspection: A Review](https://arxiv.org/abs/2502.07826)
[2025-02-13]
* Small object detection
* [Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications](https://arxiv.org/abs/2503.20516)
[2025-03-27]

## HOI
* [3D Human Interaction Generation: A Survey](https://arxiv.org/abs/2503.13120)
[2025-03-18]
* [A Survey on Human Interaction Motion Generation](https://arxiv.org/abs/2503.12763)
:star:[code](https://github.com/soraproducer/Awesome-Human-Interaction-Motion-Generation)
[2025-03-18]

## Action Recognition
* [SMART-Vision: Survey of Modern Action Recognition Techniques in Vision](https://arxiv.org/abs/2501.13066)
[2025-01-23]

## Pose (Pose Estimation)
* [Survey on Hand Gesture Recognition from Visual Input](https://arxiv.org/abs/2501.11992)
[2025-01-22]

## Point Cloud
* [Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A Survey](https://arxiv.org/abs/2501.05473)
[2025-01-13]
* [Point Cloud Based Scene Segmentation: A Survey](https://arxiv.org/abs/2503.12595)
[2025-03-18]

## 3D Visual
* 3D reconstruction
* [Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison](https://arxiv.org/abs/2502.20154)
[2025-02-28]
* [Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey](https://arxiv.org/abs/2503.14537)
[2025-03-20]
* [A Survey on Event-driven 3D Reconstruction: Development under Different Categories](https://arxiv.org/abs/2503.19753)
[2025-03-26]
* [Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A systematic literature review](https://arxiv.org/abs/2504.11349)
:star:[code](https://github.com/Bean-Young/AI4Med)
[2025-04-16]
* Depth estimation
* [A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision](https://arxiv.org/abs/2501.05147)
[2025-01-10]
* [Survey on Monocular Metric Depth Estimation](https://arxiv.org/abs/2501.11841)
[2025-01-22]

## Face
* [A Survey on Facial Image Privacy Preservation in Cloud-Based Services](https://arxiv.org/abs/2501.08665)
[2025-01-16]
* [Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities](https://arxiv.org/abs/2502.06803)
[2025-02-12]
* [Face Deepfakes - A Comprehensive Review](https://arxiv.org/abs/2502.09812)
[2025-02-17]
* Sentiment analysis
* [Enhanced Sentiment Analysis of Iranian Restaurant Reviews Utilizing Sentiment Intensity Analyzer & Fuzzy Logic](https://arxiv.org/abs/2503.12141)
[2025-03-18]

## Image Segmentation
* [A Comparative Review of the Histogram-based Image Segmentation Methods](https://arxiv.org/abs/2502.18550)
[2025-02-27]
* [SAM2 for Image and Video Segmentation: A Comprehensive Survey](https://arxiv.org/abs/2503.12781)
[2025-03-18]

## Image Retrieval
* [A Comprehensive Survey on Composed Image Retrieval](https://arxiv.org/abs/2502.18495)
[2025-02-27]
* [Composed Multi-modal Retrieval: A Survey of Approaches and Applications](https://arxiv.org/abs/2503.01334)
[2025-03-04]

## Image Classification
* [Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh's Perspective](https://arxiv.org/abs/2501.03305)
[2025-01-08] Deep-learning-based plant leaf disease detection and classification

## Image Super-Resolution
* [State-of-the-Art Transformer Models for Image Super-Resolution: Techniques, Challenges, and Applications](https://arxiv.org/abs/2501.07855)
[2025-01-15]

## Image/Video Processing
* Image enhancement
* [Underwater Image Enhancement using Generative Adversarial Networks: A Survey](https://arxiv.org/abs/2501.06273)
[2025-01-14]
* [A Comprehensive Survey on Image Signal Processing Approaches for Low-Illumination Image Enhancement](https://arxiv.org/abs/2502.05995)
[2025-02-11]
* Image quality assessment/enhancement
* [Fundus Image Quality Assessment and Enhancement: a Systematic Review](https://arxiv.org/abs/2501.11520)
[2025-01-22]
* [A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook](https://arxiv.org/abs/2502.08540)
[2025-02-13]
* Reflection removal
* [Survey on Single-Image Reflection Removal using Deep Learning Techniques](https://arxiv.org/abs/2502.08836)
[2025-02-14]

## Unknown (Uncategorized)
* [Visualizing Uncertainty in Image Guided Surgery a Review](https://arxiv.org/abs/2501.06280)
[2025-01-14]
* [A Preliminary Survey of Semantic Descriptive Model for Images](https://arxiv.org/abs/2501.08352)
[2025-01-16]
* [New Fashion Products Performance Forecasting: A Survey on Evolutions, Models and Emerging Trends](https://arxiv.org/abs/2501.10324)
[2025-01-20]
* [Explainable artificial intelligence (XAI): from inherent explainability to large language models](https://arxiv.org/abs/2501.09967)
[2025-01-20]
* [Explainability for Vision Foundation Models: A Survey](https://arxiv.org/abs/2501.12203)
[2025-01-22]
* [Advanced technology in railway track monitoring using the GPR Technique: A Review](https://arxiv.org/abs/2501.11132)
[2025-01-22]
* [Reproducibility review of "Why Not Other Classes": Towards Class-Contrastive Back-Propagation Explanations](https://arxiv.org/abs/2501.11096)
[2025-01-22]
* [Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation](https://arxiv.org/abs/2502.05151)
[2025-02-10]
* [Diffusion Models for Computational Neuroimaging: A Survey](https://arxiv.org/abs/2502.06552)
:star:[code](https://github.com/JoeZhao527/dm4neuro)
[2025-02-11]
* [Safety at Scale: A Comprehensive Survey of Large Model Safety](https://arxiv.org/abs/2502.05206)
[2025-02-11]
* [Event Vision Sensor: A Review](https://arxiv.org/abs/2502.06116)
[2025-02-11]
* [A Survey on Mamba Architecture for Vision Applications](https://arxiv.org/abs/2502.07161)
[2025-02-12]
* [A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision](https://arxiv.org/abs/2502.10444)
:star:[code](https://github.com/52CV/CV-Surveys/)
[2025-02-18]
* [Event-based Solutions for Human-centered Applications: A Comprehensive Review](https://arxiv.org/abs/2502.18490)
:star:[code](https://github.com/nmirabeth/event_human)
[2025-02-27]
* [A Survey on Ordinal Regression: Applications, Advances and Prospects](https://arxiv.org/abs/2503.00952)
[2025-03-04]
* [Lossy Neural Compression for Geospatial Analytics: A Review](https://arxiv.org/abs/2503.01505)
[2025-03-04]
* [A Review on Geometry and Surface Inspection in 3D Concrete Printing](https://arxiv.org/abs/2503.07472)
[2025-03-11]
* [A Systematic Review of ECG Arrhythmia Classification: Adherence to Standards, Fair Evaluation, and Embedded Feasibility](https://arxiv.org/abs/2503.07276)
[2025-03-11]
* [A Survey on Wi-Fi Sensing Generalizability: Taxonomy, Techniques, Datasets, and Future Research Prospects](https://arxiv.org/abs/2503.08008)
[2025-03-12]
* [Challenges and Trends in Egocentric Vision: A Survey](https://arxiv.org/abs/2503.15275)
[2025-03-20]
* [A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions](https://arxiv.org/abs/2503.16546)
[2025-03-24]
* [Hybrid Multi-Stage Learning Framework for Edge Detection: A Survey](https://arxiv.org/abs/2503.21827)
[2025-03-31]
* [Towards Mobile Sensing with Event Cameras on High-mobility Resource-constrained Devices: A Survey](https://arxiv.org/abs/2503.22943)
[2025-04-01]
* [Foundation Models For Seismic Data Processing: An Extensive Review](https://arxiv.org/abs/2503.24166)
[2025-04-01]
* [A Survey of Pathology Foundation Model: Progress and Future Directions](https://arxiv.org/abs/2504.04045)
:star:[code](https://github.com/BearCleverProud/AwesomeWSI)
[2025-04-08]
* [Attention in Diffusion Model: A Survey](https://arxiv.org/abs/2504.03738)
[2025-04-08]
* [Loss Functions in Deep Learning: A Comprehensive Review](https://arxiv.org/abs/2504.04242)
[2025-04-08]
* [Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review](https://arxiv.org/abs/2504.08588)
[2025-04-14]
* [Computer-Aided Layout Generation for Building Design: A Review](https://arxiv.org/abs/2504.09694)
:star:[code](https://github.com/jcliu0428/awesome-building-layout-generation)
[2025-04-15]
* [Digital Twin Generation from Visual Data: A Survey](https://arxiv.org/abs/2504.13159)
:star:[code](https://github.com/ndrwmlnk/awesome-digital-twins)
[2025-04-18]

## 2023 papers by category, click here
↘️[CVPR-2023-Papers](https://github.com/52CV/CVPR-2023-Papers)
↘️[WACV-2023-Papers](https://github.com/52CV/WACV-2023-Papers)
↘️[ICCV-2023-Papers](https://github.com/52CV/ICCV-2023-Papers)
↘️[2023-CV-Surveys](https://github.com/52CV/CV-Surveys/blob/main/2023-CV-Surveys.md)

## 2022 papers by category, click here
↘️[CVPR-2022-Papers](https://github.com/52CV/CVPR-2022-Papers/blob/main/README.md)
↘️[WACV-2022-Papers](https://github.com/52CV/WACV-2022-Papers)
↘️[ECCV-2022-Papers](https://github.com/52CV/ECCV-2022-Papers/blob/main/README.md)

## 2021 papers by category, click here
↘️[ICCV-2021-Papers](https://github.com/52CV/ICCV-2021-Papers)
↘️[CVPR-2021-Papers](https://github.com/52CV/CVPR-2021-Papers)

## 2020 papers by category, click here
↘️[CVPR-2020-Papers](https://github.com/52CV/CVPR-2020-Papers)
↘️[ECCV-2020-Papers](https://github.com/52CV/ECCV-2020-Papers)

## Scan the QR code to add CV君 on WeChat (note: CV) and join the WeChat discussion group:
