An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by TencentARC

A curated list of projects in awesome lists by TencentARC .

https://github.com/tencentarc/gfpgan

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

deep-learning face-restoration gan gfpgan image-restoration pytorch super-resolution

Last synced: 09 Sep 2025

https://github.com/TencentARC/GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

deep-learning face-restoration gan gfpgan image-restoration pytorch super-resolution

Last synced: 14 Mar 2025

https://github.com/tencentarc/photomaker

PhotoMaker [CVPR 2024]

Last synced: 14 May 2025

https://github.com/TencentARC/PhotoMaker

PhotoMaker [CVPR 2024]

Last synced: 27 Mar 2025

https://github.com/TencentARC/InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Last synced: 21 Jul 2025

https://github.com/tencentarc/t2i-adapter

T2I-Adapter

Last synced: 14 May 2025

https://github.com/TencentARC/T2I-Adapter

T2I-Adapter

Last synced: 28 Mar 2025

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 14 May 2025

https://github.com/tencentarc/motionctrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Last synced: 13 Apr 2025

https://github.com/TencentARC/MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Last synced: 28 Mar 2025

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 28 Mar 2025

https://github.com/TencentARC/Pixal3D

[SIGGRAPH 2026] Pixal3D: Pixel-Aligned 3D Generation from Images

Last synced: 21 May 2026

https://github.com/tencentarc/seed-voken

SEED-Voken: A Series of Powerful Visual Tokenizers

Last synced: 15 May 2025

https://github.com/tencentarc/seed-story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Last synced: 16 May 2025

https://github.com/tencentarc/instantmesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Last synced: 12 Apr 2025

https://github.com/TencentARC/SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Last synced: 27 Mar 2025

https://github.com/tencentarc/masactrl

[ICCV 2023] Consistent Image Synthesis and Editing

Last synced: 12 Apr 2025

https://github.com/TencentARC/MasaCtrl

[ICCV 2023] Consistent Image Synthesis and Editing

Last synced: 27 Mar 2025

https://github.com/TencentARC/SEED-Voken

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Last synced: 22 Jul 2025

https://github.com/tencentarc/brushedit

[TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"

diffusion-models image-editing image-inpainting

Last synced: 25 Jun 2025

https://github.com/tencentarc/llama-pro

[ACL 2024] Progressive LLaMA with Block Expansion.

llama llama2 llm

Last synced: 05 Apr 2025

https://github.com/tencentarc/mix-of-show

NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

Last synced: 05 Apr 2025

https://github.com/TencentARC/Mix-of-Show

NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

Last synced: 27 Mar 2025

https://github.com/tencentarc/colorflow

The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"

Last synced: 25 Jun 2025

https://github.com/tencentarc/animesr

Codes for "AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos"

Last synced: 06 Apr 2025

https://github.com/tencentarc/vqfr

ECCV 2022, Oral, VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

face-restoration vector-quantization

Last synced: 16 Aug 2025

https://github.com/tencentarc/animegamer

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Last synced: 10 Oct 2025

https://github.com/tencentarc/smartedit

Official code of SmartEdit [CVPR-2024 Highlight]

Last synced: 06 Apr 2025

https://github.com/TencentARC/SmartEdit

Official code of SmartEdit [CVPR-2024 Highlight]

Last synced: 27 Mar 2025

https://github.com/tencentarc/videopainter

Any-length Video Inpainting and Editing with Plug-and-Play Context Control

video video-dataset video-editing video-inpainting

Last synced: 25 Jun 2025

https://github.com/TencentARC/VideoPainter

Any-length Video Inpainting and Editing with Plug-and-Play Context Control

video video-dataset video-editing video-inpainting

Last synced: 01 Apr 2025

https://github.com/tencentarc/geometrycrafter

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

depth-estimation video-to-4d

Last synced: 25 Jun 2025

https://github.com/tencentarc/ditctrl

[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"

Last synced: 25 Jun 2025

https://github.com/tencentarc/umt

UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.

Last synced: 21 Jul 2025

https://github.com/tencentarc/vit-lens

[CVPR 2024] ViT-Lens: Towards Omni-modal Representations

multimodal-learning

Last synced: 04 Apr 2025

https://github.com/tencentarc/mm-realsr

Codes for "Metric Learning based Interactive Modulation for Real-World Super-Resolution"

Last synced: 05 Apr 2025

https://github.com/tencentarc/st-llm

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

large-language-models video-language-model video-understanding

Last synced: 08 Oct 2025

https://github.com/tencentarc/mcq

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

Last synced: 05 Apr 2025

https://github.com/tencentarc/desra

Official codes for DeSRA (ICML 2023)

Last synced: 05 Apr 2025

https://github.com/tencentarc/stereocrafter

A framework to convert any 2D videos to immersive stereoscopic 3D

Last synced: 25 Jun 2025

https://github.com/tencentarc/faig

NeurIPS 2021, Spotlight, Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution

Last synced: 05 Apr 2025

https://github.com/tencentarc/moto

Latent Motion Token as the Bridging Language for Robot Manipulation

Last synced: 11 Oct 2025

https://github.com/tencentarc/mllm-npu

mllm-npu: training multimodal large language models on Ascend NPUs

Last synced: 17 Jun 2025

https://github.com/tencentarc/blobctrl

[Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

aigc image-editing

Last synced: 25 Jun 2025

https://github.com/tencentarc/surfelnerf

SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes

Last synced: 22 Jan 2026

https://github.com/tencentarc/repsr

Codes for "RepSR: Training Efficient VGG-style Super-Resolution Networks with Structural Re-Parameterization and Batch Normalization"

Last synced: 16 Mar 2026

https://github.com/tencentarc/di-pcg

Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".

Last synced: 25 Jun 2025

https://github.com/tencentarc/hosnerf

HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video

Last synced: 05 Apr 2025

https://github.com/tencentarc/divot

Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)

Last synced: 25 Jun 2025

https://github.com/tencentarc/fastrealvsr

Codes for "Mitigating Artifacts in Real-World Video Super-Resolution Models"

Last synced: 02 Feb 2026

https://github.com/tencentarc/conmim

Official codes for ConMIM (ICLR 2023)

Last synced: 05 Apr 2025

https://github.com/tencentarc/gvt

Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".

Last synced: 05 Apr 2025

https://github.com/tencentarc/freesplatter

FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction

Last synced: 25 Jun 2025

https://github.com/tencentarc/video-holmes

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Last synced: 25 Jun 2025

https://github.com/tencentarc/tvts

Turning to Video for Transcript Sorting

Last synced: 05 Apr 2025

https://github.com/tencentarc/bebr

Official code for "Binary embedding based retrieval at Tencent"

Last synced: 03 Sep 2025

https://github.com/tencentarc/mindomni

Last synced: 25 Jun 2025

https://github.com/tencentarc/visft

Last synced: 09 Mar 2026

https://github.com/tencentarc/pi-tuning

Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.

Last synced: 05 Apr 2025

https://github.com/tencentarc/flm

Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)

language-modeling vision-language-pretraining

Last synced: 05 Apr 2025

https://github.com/tencentarc/bts

BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild

Last synced: 08 Feb 2026

https://github.com/tencentarc/efficient-vsr-training

Codes for "Accelerating the Training of Video Super-Resolution"

Last synced: 09 Apr 2025

https://github.com/tencentarc/sgat4pass

This is the official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation (IJCAI 2023)

Last synced: 05 Apr 2025

https://github.com/tencentarc/dtn

Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.

Last synced: 05 Apr 2025

https://github.com/tencentarc/opencompatible

OpenCompatible provides a standard compatible training benchmark, covering practical training scenarios.

Last synced: 03 Sep 2025

https://github.com/tencentarc/sfda

Last synced: 05 Apr 2025

https://github.com/tencentarc/taca

Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".

Last synced: 21 Jan 2026

https://github.com/tencentarc/fluxkits

Last synced: 25 Jun 2025

https://github.com/tencentarc/common_trainer

Common template for pytorch project. Easy to extent and modify for new project.

computer-vision deep-learning machine-learning pytorch

Last synced: 05 Apr 2025

https://github.com/tencentarc/transfusion

The code repo for the ACM MM paper: TransFusion: Multi-Modal Fusion for Video Tag Inference viaTranslation-based Knowledge Embedding.

Last synced: 22 Jan 2026

https://github.com/tencentarc/arcvis

Visualization of 3d and 2d components interactively.

3d numpy plotly pytorch visualization

Last synced: 22 Jul 2025

https://github.com/tencentarc/vtlayout

Last synced: 19 Mar 2026