Projects in Awesome Lists by dvlab-research

https://github.com/dvlab-research/MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

generation large-language-models vision-language-model

Last synced: 14 Aug 2024

https://github.com/dvlab-research/mgm

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

generation large-language-models vision-language-model

Last synced: 14 Oct 2024

https://github.com/dvlab-research/LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

fine-tuning-llm large-language-models llm long-context lora

Last synced: 27 Oct 2024

https://github.com/dvlab-research/longlora

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

fine-tuning-llm large-language-models llm long-context lora

Last synced: 14 Oct 2024

https://github.com/dvlab-research/LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

large-language-model llm multi-modal segmentation

Last synced: 04 Nov 2024

https://github.com/dvlab-research/lisa

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

large-language-model llm multi-modal segmentation

Last synced: 15 Oct 2024

https://github.com/dvlab-research/controlnext

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Last synced: 15 Oct 2024

https://github.com/dvlab-research/VoxelNeXt

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

3d-multi-object-tracking 3d-object-detection argoverse autonomous-driving lidar nuscenes waymo-open-dataset

Last synced: 28 Oct 2024

https://github.com/dvlab-research/ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Last synced: 16 Aug 2024

https://github.com/dvlab-research/3D-Box-Segment-Anything

We extend Segment Anything to 3D perception by combining it with VoxelNeXt.

3d autonomous-driving segment-anything

Last synced: 27 Oct 2024

https://github.com/dvlab-research/PointGroup

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Last synced: 28 Oct 2024

https://github.com/dvlab-research/3DSSD

3DSSD: Point-based 3D Single Stage Object Detector (CVPR 2020)

3d-detection

Last synced: 28 Oct 2024

https://github.com/dvlab-research/FocalsConv

Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)

3d-object-detection autonomous-driving kitti nuscenes sparse-convolution

Last synced: 28 Oct 2024

https://github.com/dvlab-research/Stratified-Transformer

Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)

cvpr2022 point-cloud semantic-segmentation transformer

Last synced: 28 Oct 2024

https://github.com/dvlab-research/Video-P2P

Video-P2P: Video Editing with Cross-attention Control

generative-model image-editing stable-diffusion text-driven-editing video-editing

Last synced: 31 Oct 2024

https://github.com/dvlab-research/DSGN

DSGN: Deep Stereo Geometry Network for 3D Object Detection (CVPR 2020)

3d-detection cvpr2020 depth-estimation stereo-vision

Last synced: 28 Oct 2024

https://github.com/dvlab-research/SphereFormer

The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).

3d-object-detection 3d-semantic-segmentation cvpr2023 lidar-point-cloud nuscenes semantickitti transformer waymo

Last synced: 28 Oct 2024

https://github.com/dvlab-research/ReviewKD

Distilling Knowledge via Knowledge Review, CVPR 2021

Last synced: 03 Aug 2024

https://github.com/dvlab-research/UVTR

Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)

3d-detection multi-modality pytorch

Last synced: 28 Oct 2024

https://github.com/dvlab-research/Parametric-Contrastive-Learning

Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

class-imbalance contrastive-learning iccv2021 image-classification imagenet imbalanced-data imbalanced-learning long-tailed-recognition parametric-contrastive-learning pytorch supervised-contrastive-learning supervised-learning tpami

Last synced: 03 Aug 2024

https://github.com/dvlab-research/LargeKernel3D

LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs (CVPR 2023)

3d nuscenes object-detection scannet semantic-segmentation

Last synced: 28 Oct 2024

https://github.com/dvlab-research/ECCV22-P3AFormer-Tracking-Objects-as-Pixel-wise-Distributions

The official code for our ECCV22 oral paper: tracking objects as pixel-wise distributions.

mot multiple-object-tracking p3aformer tracking transformer

Last synced: 28 Oct 2024

https://github.com/dvlab-research/Context-Aware-Consistency

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

cvpr2021 semantic-segmentation semi-supervised-learning semi-supervised-segmentation

Last synced: 09 Nov 2024

https://github.com/dvlab-research/spconv-plus

Last synced: 28 Oct 2024

https://github.com/dvlab-research/SparseTransformer

A fast and memory-efficient libarary for sparse transformer with varying token numbers (e.g., 3D point cloud).

3d-point-cloud cuda sparse-transformer transformer

Last synced: 28 Oct 2024

https://github.com/dvlab-research/EfficientNeRF

The official code for "Efficient Neural Radiance Fields" in CVPR2022.

Last synced: 07 Nov 2024

https://github.com/dvlab-research/MiSLAS

Improving Calibration for Long-Tailed Recognition (CVPR2021)

confidence-calibration long-tailed-recognition

Last synced: 05 Nov 2024

https://github.com/dvlab-research/RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain

diffusion-models image-variations style-transfer text-to-image

Last synced: 31 Oct 2024

https://github.com/dvlab-research/DeepVision3D

DeepVision3D is an open source toolbox for point-cloud understanding.

Last synced: 28 Oct 2024

https://github.com/dvlab-research/Imbalanced-Learning

Imbalanced learning tool for imbalanced recognition and segmentation

Last synced: 28 Oct 2024

https://github.com/dvlab-research/JigsawClustering

This is the code for CVPR 2021 oral paper: Jigsaw Clustering for Unsupervised Visual Representation Learning

Last synced: 03 Aug 2024

https://github.com/dvlab-research/AttenNorm

Attentive Normalization for Conditional Image Generation

Last synced: 03 Aug 2024

https://github.com/dvlab-research/GFS-Seg

The official implementation of Generalized Few-shot Semantic Segmentation (CVPR 2022)

Last synced: 03 Aug 2024

https://github.com/dvlab-research/SDSD

Seeing Dynamic Scene in the Dark: High-Quality Video Dataset with Mechatronic Alignment (ICCV2021)

dynamic-video-dataset low-light-video-enhancement

Last synced: 03 Nov 2024