Projects in Awesome Lists by TencentARC
A curated list of projects in awesome lists by TencentARC.
https://github.com/tencentarc/gfpgan
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
deep-learning face-restoration gan gfpgan image-restoration pytorch super-resolution
Last synced: 09 Sep 2025
https://github.com/TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
deep-learning face-restoration gan gfpgan image-restoration pytorch super-resolution
Last synced: 14 Mar 2025
https://github.com/TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Last synced: 21 Jul 2025
https://github.com/tencentarc/brushnet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 14 May 2025
https://github.com/tencentarc/motionctrl
Official Code for MotionCtrl [SIGGRAPH 2024]
Last synced: 13 Apr 2025
https://github.com/TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
Last synced: 28 Mar 2025
https://github.com/TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image
Last synced: 28 Mar 2025
https://github.com/TencentARC/Pixal3D
[SIGGRAPH 2026] Pixal3D: Pixel-Aligned 3D Generation from Images
Last synced: 21 May 2026
https://github.com/tencentarc/seed-voken
SEED-Voken: A Series of Powerful Visual Tokenizers
Last synced: 15 May 2025
https://github.com/tencentarc/seed-story
SEED-Story: Multimodal Long Story Generation with Large Language Model
Last synced: 16 May 2025
https://github.com/tencentarc/instantmesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Last synced: 12 Apr 2025
https://github.com/TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
Last synced: 27 Mar 2025
https://github.com/tencentarc/masactrl
[ICCV 2023] Consistent Image Synthesis and Editing
Last synced: 12 Apr 2025
https://github.com/TencentARC/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
Last synced: 27 Mar 2025
https://github.com/TencentARC/SEED-Voken
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Last synced: 22 Jul 2025
https://github.com/tencentarc/brushedit
[TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
diffusion-models image-editing image-inpainting
Last synced: 25 Jun 2025
https://github.com/tencentarc/llama-pro
[ACL 2024] Progressive LLaMA with Block Expansion.
Last synced: 05 Apr 2025
https://github.com/tencentarc/mix-of-show
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Last synced: 05 Apr 2025
https://github.com/TencentARC/Mix-of-Show
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Last synced: 27 Mar 2025
https://github.com/tencentarc/colorflow
The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"
Last synced: 25 Jun 2025
https://github.com/tencentarc/animesr
Codes for "AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos"
Last synced: 06 Apr 2025
https://github.com/tencentarc/vqfr
ECCV 2022, Oral, VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder
face-restoration vector-quantization
Last synced: 16 Aug 2025
https://github.com/tencentarc/animegamer
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
Last synced: 10 Oct 2025
https://github.com/tencentarc/smartedit
Official code of SmartEdit [CVPR-2024 Highlight]
Last synced: 06 Apr 2025
https://github.com/TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
Last synced: 27 Mar 2025
https://github.com/tencentarc/videopainter
Any-length Video Inpainting and Editing with Plug-and-Play Context Control
video video-dataset video-editing video-inpainting
Last synced: 25 Jun 2025
https://github.com/TencentARC/VideoPainter
Any-length Video Inpainting and Editing with Plug-and-Play Context Control
video video-dataset video-editing video-inpainting
Last synced: 01 Apr 2025
https://github.com/tencentarc/geometrycrafter
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Last synced: 25 Jun 2025
https://github.com/tencentarc/ditctrl
[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
Last synced: 25 Jun 2025
https://github.com/tencentarc/umt
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
Last synced: 21 Jul 2025
https://github.com/tencentarc/vit-lens
[CVPR 2024] ViT-Lens: Towards Omni-modal Representations
Last synced: 04 Apr 2025
https://github.com/tencentarc/mm-realsr
Codes for "Metric Learning based Interactive Modulation for Real-World Super-Resolution"
Last synced: 05 Apr 2025
https://github.com/tencentarc/st-llm
[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
large-language-models video-language-model video-understanding
Last synced: 08 Oct 2025
https://github.com/tencentarc/mcq
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
Last synced: 05 Apr 2025
https://github.com/tencentarc/stereocrafter
A framework to convert any 2D videos to immersive stereoscopic 3D
Last synced: 25 Jun 2025
https://github.com/tencentarc/faig
NeurIPS 2021, Spotlight, Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution
Last synced: 05 Apr 2025
https://github.com/TencentARC/ArcNerf
Nerf and extensions in all
3d deep-learning graphics instant-ngp nerf neural-rendering pytorch reconstruction rendering
Last synced: 02 May 2025
https://github.com/tencentarc/arcnerf
Nerf and extensions in all
3d deep-learning graphics instant-ngp nerf neural-rendering pytorch reconstruction rendering
Last synced: 05 Apr 2025
https://github.com/tencentarc/moto
Latent Motion Token as the Bridging Language for Robot Manipulation
Last synced: 11 Oct 2025
https://github.com/tencentarc/mllm-npu
mllm-npu: training multimodal large language models on Ascend NPUs
Last synced: 17 Jun 2025
https://github.com/tencentarc/blobctrl
[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Last synced: 25 Jun 2025
https://github.com/tencentarc/surfelnerf
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes
Last synced: 22 Jan 2026
https://github.com/tencentarc/repsr
Codes for "RepSR: Training Efficient VGG-style Super-Resolution Networks with Structural Re-Parameterization and Batch Normalization"
Last synced: 16 Mar 2026
https://github.com/tencentarc/di-pcg
Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
Last synced: 25 Jun 2025
https://github.com/tencentarc/hosnerf
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
Last synced: 05 Apr 2025
https://github.com/tencentarc/divot
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
Last synced: 25 Jun 2025
https://github.com/tencentarc/fastrealvsr
Codes for "Mitigating Artifacts in Real-World Video Super-Resolution Models"
Last synced: 02 Feb 2026
https://github.com/tencentarc/gvt
Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".
Last synced: 05 Apr 2025
https://github.com/tencentarc/freesplatter
FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
Last synced: 25 Jun 2025
https://github.com/tencentarc/video-holmes
Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
Last synced: 25 Jun 2025
https://github.com/tencentarc/bebr
Official code for "Binary embedding based retrieval at Tencent"
Last synced: 03 Sep 2025
https://github.com/tencentarc/pi-tuning
Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
Last synced: 05 Apr 2025
https://github.com/tencentarc/flm
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
language-modeling vision-language-pretraining
Last synced: 05 Apr 2025
https://github.com/tencentarc/bts
BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
Last synced: 08 Feb 2026
https://github.com/tencentarc/efficient-vsr-training
Codes for "Accelerating the Training of Video Super-Resolution"
Last synced: 09 Apr 2025
https://github.com/tencentarc/sgat4pass
This is the official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation (IJCAI 2023)
Last synced: 05 Apr 2025
https://github.com/tencentarc/dtn
Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.
Last synced: 05 Apr 2025
https://github.com/tencentarc/opencompatible
OpenCompatible provides a standard compatible training benchmark, covering practical training scenarios.
Last synced: 03 Sep 2025
https://github.com/tencentarc/taca
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
Last synced: 21 Jan 2026
https://github.com/tencentarc/common_trainer
Common template for PyTorch projects. Easy to extend and modify for new projects.
computer-vision deep-learning machine-learning pytorch
Last synced: 05 Apr 2025
https://github.com/tencentarc/transfusion
The code repo for the ACM MM paper: TransFusion: Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding.
Last synced: 22 Jan 2026
https://github.com/tencentarc/arcvis
Visualization of 3d and 2d components interactively.
3d numpy plotly pytorch visualization
Last synced: 22 Jul 2025