{"id":13440693,"url":"https://github.com/amusi/ICCV2023-Papers-with-Code","last_synced_at":"2025-03-20T10:32:04.418Z","repository":{"id":43103596,"uuid":"388636965","full_name":"amusi/ICCV2023-Papers-with-Code","owner":"amusi","description":"ICCV 2023 论文和开源项目合集","archived":false,"fork":false,"pushed_at":"2023-10-01T01:06:42.000Z","size":104,"stargazers_count":2495,"open_issues_count":41,"forks_count":250,"subscribers_count":39,"default_branch":"main","last_synced_at":"2024-10-15T10:05:15.330Z","etag":null,"topics":["artificial-intelligence","computer-vision","iccv","iccv2021","iccv2023","object-detection","semantic-segmentation","transformer"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/amusi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2021-07-23T00:47:16.000Z","updated_at":"2024-10-11T08:39:04.000Z","dependencies_parsed_at":"2024-01-16T02:45:27.649Z","dependency_job_id":"222d7143-fd18-4946-a1ff-99515dec6792","html_url":"https://github.com/amusi/ICCV2023-Papers-with-Code","commit_stats":null,"previous_names":["amusi/iccv2023-papers-with-code"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2FICCV2023-Papers-with-Code","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2FICCV2023-Papers-with-Code/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2FICCV2023-Papers-with-Code/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2FICCV2023-Papers-with-Code/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/amusi","download_url":"https://codeload.github.com/amusi/ICCV2023-Papers-with-Code/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":221563089,"owners_count":16843910,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["artificial-intelligence","computer-vision","iccv","iccv2021","iccv2023","object-detection","semantic-segmentation","transformer"],"created_at":"2024-07-31T03:01:25.228Z","updated_at":"2024-10-28T00:30:27.891Z","avatar_url":"https://github.com/amusi.png","language":null,"funding_links":[],"categories":["Others","📚 3D Generation"],"sub_categories":[],"readme":"# ICCV2023-Papers-with-Code\n\n[ICCV 2023](http://iccv2023.thecvf.com/) 论文和开源项目合集(papers with code)！\n\n2160 papers accepted！\n\nICCV 2023 收录论文IDs：https://t.co/A0mCH8gbOi\n\n\u003e 注1：欢迎各位大佬提交issue，分享ICCV 2023论文和开源项目！\n\u003e\n\u003e 注2：关于往年CV顶会论文以及其他优质CV论文和大盘点，详见： https://github.com/amusi/daily-paper-computer-vision\n\u003e\n\u003e [ICCV 2021](ICCV2021-Papers-with-Code.md)\n\n如果你想了解最新最优质的的CV论文、开源项目和学习资料，欢迎扫码加入【[CVer学术交流群](https://t.zsxq.com/10OGjThDw)】！互相学习，一起进步~\n\n![](https://github.com/amusi/CVPR2023-Papers-with-Code/raw/master/CVer%E5%AD%A6%E6%9C%AF%E4%BA%A4%E6%B5%81%E7%BE%A4.png)\n\n# 【ICCV 2023 论文开源目录】\n\n- [Backbone](#Backbone)\n- [CLIP](#CLIP)\n- [MAE](#MAE)\n- [GAN](#GAN)\n- [GNN](#GNN)\n- [MLP](#MLP)\n- [NAS](#NAS)\n- [OCR](#OCR)\n- [NeRF](#NeRF)\n- [DETR](#DETR)\n- [Prompt](#Prompt)\n- [Diffusion Models(扩散模型)](#Diffusion)\n- [Prompt](#Prompt)\n- [Avatars](#Avatars)\n- [ReID(重识别)](#ReID)\n- [长尾分布(Long-Tail)](#Long-Tail)\n- [Vision Transformer](#Vision-Transformer)\n- [视觉和语言(Vision-Language)](#VL)\n- [自监督学习(Self-supervised Learning)](#SSL)\n- [数据增强(Data Augmentation)](#DA)\n- [目标检测(Object Detection)](#Object-Detection)\n- [目标跟踪(Visual Tracking)](#VT)\n- [语义分割(Semantic Segmentation)](#Semantic-Segmentation)\n- [实例分割(Instance Segmentation)](#Instance-Segmentation)\n- [全景分割(Panoptic Segmentation)](#Panoptic-Segmentation)\n- [医学图像分类(Medical Image Classfication)](#MIC)\n- [医学图像分割(Medical Image Segmentation)](#MIS)\n- [视频目标分割(Video Object Segmentation)](#VOS)\n- [视频实例分割(Video Instance Segmentation)](#VIS)\n- [参考图像分割(Referring Image Segmentation)](#RIS)\n- [图像抠图(Image Matting)](#Matting)\n- [Low-level Vision](#LLV)\n- [超分辨率(Super-Resolution)](#SR)\n- [去噪(Denoising)](#Denoising)\n- [去模糊(Deblur)](#Deblur)\n- [3D点云(3D Point Cloud)](#3D-Point-Cloud)\n- [3D目标检测(3D Object Detection)](#3DOD)\n- [3D语义分割(3D Semantic Segmentation)](#3DSS)\n- [3D目标跟踪(3D Object Tracking)](#3D-Object-Tracking)\n- [3D语义场景补全(3D Semantic Scene Completion)](#3DSSC)\n- [3D配准(3D Registration)](#3D-Registration)\n- [3D人体姿态估计(3D Human Pose Estimation)](#3D-Human-Pose-Estimation)\n- [3D人体Mesh估计(3D Human Mesh Estimation)](#3D-Human-Pose-Estimation)\n- [医学图像(Medical Image)](#Medical-Image)\n- [图像生成(Image Generation)](#Image-Generation)\n- [视频生成(Video Generation)](#Video-Generation)\n- [图像编辑(Image Editing)](#Image-Editing)\n- [视频编辑(Video Editing)](#Video-Editing)\n- [视频理解(Video Understanding)](#Video-Understanding)\n- [人体运动生成(Human Motion Generation)](#Human-Motion-Generation)\n- [低光照图像增强(Low-light Image Enhancement)](#Low-light-Image-Enhancement)\n- [场景文本识别(Scene Text Recognition)](#STR)\n- [图像检索(Image Retrieval)](#Image-Retrieval)\n- [图像融合(Image Fusion)](#Image-Fusion)\n- [轨迹预测(Trajectory Prediction) ](#Trajectory-Prediction)\n- [人群计数(Crowd Counting)](#Crowd-Counting)\n- [Video Quality Assessment(视频质量评价)](#Video-Quality-Assessment)\n- [其它(Others)](#Others)\n\n\u003ca name=\"Avatars\"\u003e\u003c/a\u003e\n\n# Avatars \n\n**Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control**\n\nPaper: https://arxiv.org/abs/2303.17606\n\nCode: https://github.com/songrise/AvatarCraft\n\n\u003ca name=\"Backbone\"\u003e\u003c/a\u003e\n\n# Backbone\n\n**Rethinking Mobile Block for Efficient Attention-based Models**\n\n- Paper: https://arxiv.org/abs/2301.01146\n- Code: https://github.com/zhangzjn/EMO \n\n\u003ca name=\"CLIP\"\u003e\u003c/a\u003e\n\n# CLIP\n\n**PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization**\n\n- Paper: https://arxiv.org/abs/2307.15199\n- Code: [https://PromptStyler.github.io/](https://promptstyler.github.io/)\n\n**CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation**\n\n- Paper: https://arxiv.org/abs/2308.15226\n- Code: http://www.github.com/devaansh100/CLIPTrans\n\n\u003ca name=\"NeRF\"\u003e\u003c/a\u003e\n\n# NeRF\n\n**IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis**\n\n- Homepage: https://zju3dv.github.io/intrinsic_nerf/\n- Paper: https://arxiv.org/abs/2210.00647\n- Code: https://github.com/zju3dv/IntrinsicNeRF\n\n**Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control**\n\n- Paper: https://arxiv.org/abs/2303.17606\n\n- Code: https://github.com/songrise/AvatarCraft\n\n**FlipNeRF: Flipped Reflection Rays for Few-shot Novel View Synthesis**\n\n- Homepage: https://shawn615.github.io/flipnerf/\n- Code: https://github.com/shawn615/FlipNeRF\n- Paper: https://arxiv.org/abs/2306.17723\n\n**Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields**\n\n- Homepage: https://wbhu.github.io/projects/Tri-MipRF\n\n- Paper: https://arxiv.org/abs/2307.11335\n- Code: https://github.com/wbhu/Tri-MipRF\n\n\u003ca name=\"Diffusion\"\u003e\u003c/a\u003e\n\n# Diffusion Models(扩散模型)\n\n**PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment**\n\n- Paper: https://arxiv.org/abs/2306.15667\n- Code: https://github.com/facebookresearch/PoseDiffusion\n\n**FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model**\n\n- Paper: https://arxiv.org/abs/2303.09833\n- Code: https://github.com/vvictoryuki/FreeDoM\n\n**BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion**\n\n- Paper: https://arxiv.org/abs/2307.10816\n- Code: https://github.com/Sierkinhane/BoxDiff\n\n**BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction**\n\n- Paper: https://arxiv.org/abs/2211.14304\n- Code: https://github.com/BarqueroGerman/BeLFusion\n\n**DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion**\n\n- Paper: https://arxiv.org/abs/2303.06840\n- Code: https://github.com/Zhaozixiang1228/MMIF-DDFM\n\n**DIRE for Diffusion-Generated Image Detection**\n\n- Paper: https://arxiv.org/abs/2303.09295\n- Code: https://github.com/ZhendongWang6/DIRE\n\n\u003ca name=\"Prompt\"\u003e\u003c/a\u003e\n\n# Prompt\n\n**Read-only Prompt Optimization for Vision-Language Few-shot Learning** \n\n- Paper: https://arxiv.org/abs/2308.14960\n- Code: https://github.com/mlvlab/RPO\n\n**Introducing Language Guidance in Prompt-based Continual Learning**\n\n- Paper: https://arxiv.org/abs/2308.15827\n- Code: None\n\n\u003ca name=\"VL\"\u003e\u003c/a\u003e\n\n# 视觉和语言(Vision-Language)\n\n**Read-only Prompt Optimization for Vision-Language Few-shot Learning** \n\n- Paper: https://arxiv.org/abs/2308.14960\n- Code: https://github.com/mlvlab/RPO\n\n\u003ca name=\"Object-Detection\"\u003e\u003c/a\u003e\n\n# 目标检测(Object Detection)\n\n**Femtodet: an object detection baseline for energy versus performance tradeoffs**\n\n- Paper: https://arxiv.org/abs/2301.06719\n- Code: https://github.com/yh-pengtu/FemtoDet\n\n**Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment**\n\n- Paper: https://arxiv.org/abs/2207.13085\n- Code: https://github.com/Atten4Vis/GroupDETR\n\n**Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection**\n\n- Paper: https://arxiv.org/abs/2205.09613\n- Code: https://github.com/LiewFeng/imTED\n\n**ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation**\n\n- Paper: https://arxiv.org/abs/2308.09242\n- Code: https://github.com/iSEE-Laboratory/ASAG\n\n\u003ca name=\"VT\"\u003e\u003c/a\u003e\n\n# 目标跟踪(Visual Tracking)\n\n**Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers**\n\n- Paper: https://arxiv.org/abs/2307.04129\n- Code: https://github.com/ZHU-Zhiyu/High-Rank_RGB-Event_Tracker \n\n\u003ca name=\"Semantic-Segmentation\"\u003e\u003c/a\u003e\n\n# 语义分割(Semantic Segmentation)\n\n**Segment Anything**\n\n- Homepage: https://segment-anything.com/\n- Paper: https://arxiv.org/abs/2304.02643\n- Code: https://github.com/facebookresearch/segment-anything\n\n**MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation**\n\n- Paper: https://arxiv.org/abs/2304.09913\n- Code: https://github.com/shjo-april/MARS\n\n**FreeCOS: Self-Supervised Learning from Fractals and Unlabeled Images for Curvilinear Object Segmentation**\n\n- Paper: https://arxiv.org/abs/2307.07245\n- Code: https://github.com/TY-Shi/FreeCOS\n\n**Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation**\n\n- Paper: https://arxiv.org/abs/2211.14512\n- Code: https://github.com/yyliu01\n\n**Disentangle then Parse:Night-time Semantic Segmentation with Illumination Disentanglement**\n\n- Paper: https://arxiv.org/abs/2307.09362\n- Code: https://github.com/w1oves/DTP\n\n\u003ca name=\"VOS\"\u003e\u003c/a\u003e\n\n# 视频目标分割(Video Object Segmentation)\n\n**Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus**\n\n- Paper: https://arxiv.org/abs/2207.01203 \n\n- Code: https://github.com/lxa9867/R2VOS\n\n\u003ca name=\"VIS\"\u003e\u003c/a\u003e\n\n# 视频实例分割(Video Instance Segmentation)\n\n**DVIS: Decoupled Video Instance Segmentation Framework**\n\n- Paper: https://arxiv.org/abs/2306.03413\n- Code: https://github.com/zhang-tao-whu/DVIS\n\n\u003ca name=\"MIC\"\u003e\u003c/a\u003e\n\n# 医学图像分类\n\n**BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification**\n\n- Paper: https://arxiv.org/abs/2203.01937\n\n- Code: https://github.com/cyh-0/BoMD\n\n\u003ca name=\"MIS\"\u003e\u003c/a\u003e\n\n# 医学图像分割\n\n**CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection**\n\n- Paper: https://arxiv.org/abs/2301.00785\n- Code: https://github.com/ljwztc/CLIP-Driven-Universal-Model\n\n\u003ca name=\"LLV\"\u003e\u003c/a\u003e\n\n# Low-level Vision\n\n**Self-supervised Learning to Bring Dual Reversed Rolling Shutter Images Alive**\n\n- Paper: https://arxiv.org/abs/2305.19862\n- Code: https://github.com/shangwei5/SelfDRSC \n\n\u003ca name=\"SR\"\u003e\u003c/a\u003e\n\n# 超分辨率(Super-Resolution)\n\n**Spherical Space Feature Decomposition for Guided Depth Map Super-Resolution.**\n\n- Paper: https://arxiv.org/abs/2303.08942\n- Code: https://github.com/Zhaozixiang1228/GDSR-SSDNet \n\n\u003ca name=\"3D-Point-Cloud\"\u003e\u003c/a\u003e\n\n# 3D点云(3D Point Cloud)\n\n**Robo3D: Towards Robust and Reliable 3D Perception against Corruptions**\n\n- Homepage: https://ldkong.com/Robo3D\n- Paper: https://arxiv.org/abs/2303.17597\n- Code: https://github.com/ldkong1205/Robo3D\n\n**Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models**\n\n- Paper: https://arxiv.org/abs/2304.07221\n- Code: https://github.com/zyh16143998882/ICCV23-IDPT\n\n**Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos**\n\n- Paper: https://arxiv.org/abs/2308.09247\n- Code: None\n\n\u003ca name=\"3DOD\"\u003e\u003c/a\u003e\n\n# 3D目标检测(3D Object Detection)\n\n**PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images**\n\n- Paper: https://arxiv.org/abs/2206.01256\n- Code: https://github.com/megvii-research/PETR\n\n**DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection**\n\n- Paper: https://arxiv.org/abs/2304.13031\n- Code: https://github.com/AIR-DISCOVER/DQS3D\n\n**SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection**\n\n- Paper: https://arxiv.org/abs/2304.14340\n- Code: https://github.com/yichen928/SparseFusion\n\n**StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection**\n\n- Paper: https://arxiv.org/abs/2303.11926\n- Code: https://github.com/exiawsh/StreamPETR.git\n\n**Cross Modal Transformer: Towards Fast and Robust 3D Object Detection**\n\n- Paper: https://arxiv.org/abs/2301.01283\n- Code: https://github.com/junjie18/CMT.git\n\n**MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation**\n\n- Paper: https://arxiv.org/abs/2304.09801\n- Project: https://chongjiange.github.io/metabev.html\n- Code: https://github.com/ChongjianGE/MetaBEV\n\n**Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling**\n\n- Paper: https://arxiv.org/abs/2307.07944\n- Code: https://github.com/zhuoxiao-chen/ReDB-DA-3Ddet\n\n**SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection**\n\n- Paper: https://arxiv.org/abs/2307.11477\n- Code: https://github.com/mengtan00/SA-BEV\n\n\u003ca name=\"3DSS\"\u003e\u003c/a\u003e\n\n# 3D语义分割(3D Semantic Segmentation)\n\n**Rethinking Range View Representation for LiDAR Segmentation**\n\n- Homepage: https://ldkong.com/RangeFormer\n- Paper: https://arxiv.org/abs/2303.05367\n- Code: None\n\n\u003ca name=\"3D-Object-Tracking\"\u003e\u003c/a\u003e\n\n# 3D目标跟踪(3D Object Tracking)\n\n**MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors**\n\n- Paper: https://arxiv.org/abs/2303.05071\n- Code : https://github.com/slothfulxtx/MBPTrack3D\n\n\u003ca name=\"Video-Understanding\"\u003e\u003c/a\u003e\n\n# 视频理解(Video Understanding)\n\n**Unmasked Teacher: Towards Training-Efficient Video Foundation Models**\n\n- Paper: https://arxiv.org/abs/2303.16058\n\n- Code: https://github.com/OpenGVLab/unmasked_teacher\n\n\u003ca name=\"Image-Generation\"\u003e\u003c/a\u003e\n\n# 图像生成(Image Generation)\n\n**FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model**\n\n- Paper: https://arxiv.org/abs/2303.09833\n- Code: https://github.com/vvictoryuki/FreeDoM\n\n**BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion**\n\n- Paper: https://arxiv.org/abs/2307.10816\n- Code: https://github.com/Sierkinhane/BoxDiff \n\n\u003ca name=\"Video-Generation\"\u003e\u003c/a\u003e\n\n# 视频生成(Video Generation)\n\n**Simulating Fluids in Real-World Still Images**\n\n- Homepage: https://slr-sfs.github.io/ \n- Paper: https://arxiv.org/abs/2204.11335\n- Code: https://github.com/simon3dv/SLR-SFS\n\n\u003ca name=\"Image-Editing\"\u003e\u003c/a\u003e\n\n# 图像编辑(Image Editing)\n\n**Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing**\n\n- Paper: https://arxiv.org/abs/2304.02051\n- Code: https://github.com/aimagelab/multimodal-garment-designer \n\n\u003ca name=\"Video-Editing\"\u003e\u003c/a\u003e\n\n# 视频编辑(Video Editing)\n\n**FateZero: Fusing Attentions for Zero-shot Text-based Video Editing**\n\n- Project: https://fate-zero-edit.github.io/ \n- Paper: https://arxiv.org/abs/2303.09535\n- Code: https://github.com/ChenyangQiQi/FateZero \n\n\u003ca name=\"Human-Motion-Generation\"\u003e\u003c/a\u003e\n\n# 人体运动生成(Human Motion Generation)\n\n**BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction**\n\n- Paper: https://arxiv.org/abs/2211.14304\n- Code: https://github.com/BarqueroGerman/BeLFusion \n\n\u003ca name=\"Low-light-Image-Enhancement\"\u003e\u003c/a\u003e\n\n# 低光照图像增强(Low-light Image Enhancement)\n\n**Implicit Neural Representation for Cooperative Low-light Image Enhancement**\n\n- Paper: https://arxiv.org/abs/2303.11722\n- Code: https://github.com/Ysz2022/NeRCo\n\n\u003ca name=\"STD\"\u003e\u003c/a\u003e\n\n# 场景文本检测(Scene Text Detection)\n\n\n\n\u003ca name=\"STR\"\u003e\u003c/a\u003e\n\n# 场景文本识别(Scene Text Recognition)\n\n**Self-supervised Character-to-Character Distillation for Text Recognition**\n\n- Paper: https://arxiv.org/abs/2211.00288\n- Code: https://github.com/TongkunGuan/CCD\n\n**MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition**\n\n- Paper: https://arxiv.org/abs/2305.14758\n- Code: https://github.com/simplify23/MRN\n- 中文解读：https://zhuanlan.zhihu.com/p/643948935 \n\n\u003ca name=\"Image-Retrieval\"\u003e\u003c/a\u003e\n\n# 图像检索(Image Retrieval)\n\n**Zero-Shot Composed Image Retrieval with Textual Inversion**\n\n- Paper: https://arxiv.org/abs/2303.15247\n- Code: https://github.com/miccunifi/SEARLE \n\n\u003ca name=\"Image-Fusion\"\u003e\u003c/a\u003e\n\n# 图像融合(Image Fusion)\n\n**DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion**\n\n- Paper: https://arxiv.org/abs/2303.06840\n- Code: https://github.com/Zhaozixiang1228/MMIF-DDFM\n\n\u003ca name=\"Trajectory-Prediction\"\u003e\u003c/a\u003e\n\n# 轨迹预测(Trajectory Prediction)\n\n**EigenTrajectory: Low-Rank Descriptors for Multi-Modal Trajectory Forecasting**\n\n- Homepage: https://inhwanbae.github.io/publication/eigentrajectory/\n\n- Paper: https://arxiv.org/abs/2307.09306 \n- Code: https://github.com/InhwanBae/EigenTrajectory\n\n\u003ca name=\"Crowd-Counting\"\u003e\u003c/a\u003e\n\n# 人群计数(Crowd Counting)\n\n**Point-Query Quadtree for Crowd Counting, Localization, and More**\n\n- Paper: https://arxiv.org/abs/2308.13814\n- Code: https://github.com/cxliu0/PET\n\n\u003ca name=\"Video-Quality-Assessment\"\u003e\u003c/a\u003e\n\n# Video Quality Assessment(视频质量评价)\n\n**Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives**\n\n- Paper: https://arxiv.org/abs/2211.04894\n- Code: https://github.com/VQAssessment/DOVER\n\n\u003ca name=\"Others\"\u003e\u003c/a\u003e\n\n# 其它(Others)\n\n**MotionBERT: A Unified Perspective on Learning Human Motion Representations**\n\n- Homepage: https://motionbert.github.io/\n- Paper: https://arxiv.org/abs/2210.06551\n- Code: https://github.com/Walter0807/MotionBERT \n\n**Graph Matching with Bi-level Noisy Correspondence**\n\n- Paper: https://arxiv.org/pdf/2212.04085.pdf\n- Code: https://github.com/Lin-Yijie/Graph-Matching-Networks/tree/main/COMMON \n\n**LDL: Line Distance Functions for Panoramic Localization**\n\n- Paper: https://arxiv.org/abs/2308.13989\n- Code: https://github.com/82magnolia/panoramic-localization\n\n**Active Neural Mapping**\n\n- Homepage: https://zikeyan.github.io/active-INR/index.html\n- Paper: https://arxiv.org/abs/2308.16246\n- Code: https://zikeyan.github.io/active-INR/index.html#\n\n**Reconstructing Groups of People with Hypergraph Relational Reasoning**\n\n- Paper: https://arxiv.org/abs/2308.15844\n- Code: https://github.com/boycehbz/GroupRec","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famusi%2FICCV2023-Papers-with-Code","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Famusi%2FICCV2023-Papers-with-Code","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famusi%2FICCV2023-Papers-with-Code/lists"}