An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with vit

A curated list of projects in awesome lists tagged with vit .

https://github.com/open-compass/vlmevalkit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 13 May 2025

https://github.com/hila-chefer/transformer-explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

attention-matrix attention-visualization bert bert-model cvpr2021 deep-learning explainability perturbation transformer-interpretability vision-transformer visualize-classifications vit

Last synced: 15 May 2025

https://github.com/hila-chefer/Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

attention-matrix attention-visualization bert bert-model cvpr2021 deep-learning explainability perturbation transformer-interpretability vision-transformer visualize-classifications vit

Last synced: 27 Mar 2025

https://github.com/thu-ml/sageattention

Quantized Attention achieves speedup of 2-3x and 3-5x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

attention cuda efficient-attention inference-acceleration llm llm-infra mlsys quantization triton video-generate video-generation vit

Last synced: 14 May 2025

https://github.com/open-compass/VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 20 Jul 2025

https://github.com/br-idl/paddlevit

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

classification computer-vision cv deep-learning detection encoder-decoder gan mlp object-detection paddlepaddle segmentation semantic-segmentation transformer vit

Last synced: 14 Apr 2025

https://github.com/yitu-opensource/T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

t2t-transformer vision-transformer vit

Last synced: 05 May 2025

https://github.com/v-iashin/video_features

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

audio-features clip feature-extraction i3d ig65m laion multi-gpu optical-flow parallel pytorch r2plus1d raft resnet s3d swin timm vggish video-features visual-features vit

Last synced: 02 Apr 2025

https://github.com/paddlepaddle/passl

PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision Transformer,DEiT,Swin Transformer,CvT,T2T-ViT,MLP-Mixer,XCiT,ConvNeXt,PVTv2 等基础视觉算法

beit clip convnext cvt deep-learning deit mae moco moco-v2 paddle pixpro pvt self-supervised-learning simclr swav swin-transformer vision-transformer vit xcit

Last synced: 04 Apr 2025

https://github.com/megvii-research/RevCol

Official Code of Paper "Reversible Column Networks" "RevColv2"

cnn computer-vision iclr2023 mae pytorch transformer vit

Last synced: 20 Mar 2025

https://github.com/kyegomez/navit

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

attention-mechanism clip gpt4 multimodal multimodal-deep-learning multimodal-learning multimodality vit

Last synced: 16 May 2025

https://github.com/skyworkai/moh

MoH: Multi-Head Attention as Mixture-of-Head Attention

attention dit llms mixture-of-experts moe transformer vit

Last synced: 04 Apr 2025

https://github.com/kyegomez/vit-rgts

Open source implementation of "Vision Transformers Need Registers"

attention-mechanism gpt4 vision-api vision-transformer vit

Last synced: 12 Apr 2025

https://github.com/paddlepaddle/plsc

Paddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.

arcface cait convmae cosface data-parallel deit distributed-training face-recognition facevit hight-speed large-scale mae moco-v3 model-parallel paddle paddlepaddle partial-fc resnet swin-transformer vit

Last synced: 05 Mar 2026

https://github.com/vitjs/vit

🚀 React application framework inspired by UmiJS / 类 UmiJS 的 React 应用框架

mock-data react react-framework umi umijs vit vite vite-plugin vite-plugin-react vitjs

Last synced: 26 Oct 2025

https://github.com/kamalkraj/vision-transformer

Vision Transformer using TensorFlow 2.0

image-classification tensorflow transformer vit

Last synced: 12 Sep 2025

https://github.com/rasbt/pytorch-memory-optim

This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog post.

deep-learning llm memory-optimization pytorch vision vit

Last synced: 21 Aug 2025

https://github.com/hunto/image_classification_sota

Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.

cifar image-classification imagenet kd nas pruning pytorch rep transformer vit

Last synced: 03 Feb 2026

https://github.com/szq0214/SReT

Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"

efficient-neural-networks efficient-transformers transformer-architecture vision-transformer vit

Last synced: 20 Mar 2025

https://github.com/shunk031/simple-aesthetics-predictor

CLIP-based aesthetics predictor inspired by the interface of 🤗 huggingface transformers.

aesthetic-predictor clip huggingface huggingface-transformers vit

Last synced: 30 Apr 2025

https://github.com/pwb97/deepfake-detection

Learning a Deep Dual-level Network for Robust DeepFake Detection

auc-loss capsulenet dctnet deep-learning deepfake-detection facexray fwa mesonet pytorch tensorflow transformer vit xception

Last synced: 13 May 2025

https://github.com/mv-lab/ViT-FGVC8

"Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8

computer-vision cub200-2011 cvpr deep-learning fgvc fine-grained-classification stanford-dogs-dataset transformers vit

Last synced: 04 Apr 2025

https://github.com/shreyxnsh/vtop

VTOP is the application version of Vellore Institute of Technology, Bhopal's website. It consists of various features like faculty database, event gallery, notices section, e-books , cgpa calculator,etc.

android college college-app faculty-dashboard firebase firebase-realtime-database java vit

Last synced: 17 Aug 2025

https://github.com/kyegomez/mc-vit

Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"

ai multi-modal multi-modal-transformers multi-modality open-source transformer transformers vit

Last synced: 17 Aug 2025

https://github.com/bryanlimy/v1t

Code for "V1T: Large-scale mouse V1 response prediction using a Vision Transformer"

deep-learning neural-response pytorch vision-transformer vit

Last synced: 17 Oct 2025

https://github.com/zer0int/clip-xai-gui

CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models

attention attention-visualization clip game gradient-ascent gui image-to-text vision-transformer vit xai

Last synced: 24 Oct 2025

https://github.com/ekaansharora/bruteforce-selenium

A simple program to bruteforce a Wi-Fi login page using selenium and python. Used to bruteforce VOLSBB wifi @ VIT.

brute-force bruteforce python python3 selenium selenium-webdriver vit wifi

Last synced: 23 Apr 2025

https://github.com/njmarko/graph-transformer-psiml

Transformer implemented with graph attention network (GAT) layers from PyTorch Geometric

attention gat gnn graph-neural-networks pytorch-geometric transformer vision-transformer vit

Last synced: 28 Oct 2025

https://github.com/chaudharydeepanshu/mini_vtop

Mini VTOP is an unofficial VIT Bhopal University VTOP website app that uses the power of WebView to create a user-friendly app.

flutter flutter-inappwebview vellore-institute-of-technology vit vit-app vit-bhopal vit-captcha vit-session-timeout webview

Last synced: 09 Apr 2025

https://github.com/kyegomez/visionllama

Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta

ai deep-learning multi-modal vision-models vision-transformers vit

Last synced: 23 Jul 2025

https://github.com/kyegomez/audiomamba

Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch

ai audio audio-transformers mamba ml patch transformers vit

Last synced: 07 May 2025

https://github.com/jain131102/neocolab

CSE3004_Design and Analysis of Algorithms Lab_ NEOCOLAB

daa java neocolab vit

Last synced: 02 Aug 2025

https://github.com/sayannath/vit-tf-hub-application

Build and fine-tune your Image Classifier using a Vision Transformer Model from TensorFlow Hub

fine-tuning jax keras tensorflow tf2 tfhub transformers vision-transformer vit

Last synced: 12 May 2025

https://github.com/zer0int/clip-vit-visualization

What do CLIP Vision Transformers learn? Feature Visualization can show you!

activation clip feature neuron vision-transformer visualization vit xai

Last synced: 16 Mar 2026

https://github.com/qnguyen3/vit_pytorch

A PyTorch Implementation of ViT (Vision Transformer)

deep-learning pytorch transformer vision-transformer vit

Last synced: 16 May 2025

https://github.com/kyegomez/clipq

A simple implementation of a CLIP that splits up an image into quandrants and then gets the embeddings for each quandrant

artificial-intelligence clip computer-vision gpt4 multimodal vision-transformer vit

Last synced: 07 May 2025

https://github.com/jiauzhang/nnm

Neural Network Models

diffusion gan llm transformer vae vit

Last synced: 12 May 2025

https://github.com/andreped/dss

:vibration_mode: From training of transformers to real-time development in cross-platform mobile apps!

android cnn data-visualization deep-learning dss flutter internet-of-things ios iot mobile real-time recording rnn sensor tensorboard tensorflow tf2 tflite vision-transformer vit

Last synced: 13 Apr 2025

https://github.com/liaoyanqing666/pvt_v2_video_frame_interpolation

使用PVT_v2作为编码器的视频插帧程序,A program using PVT_v2 as the encoder of video frame interpolation, VFI, pytorch

pvt pvtv2 pyramid-vision-transformer pytorch vfi video-frame-interpolation vit

Last synced: 23 Mar 2025

https://github.com/rishav394/vit-vellore-free-wifi-passwords

Just for educational purposes. Pls don't misuse. Don't want to make this repo private

vit vit-university

Last synced: 02 Sep 2025

https://github.com/JiauZhang/nnm

Neural Network Models

diffusion gan llm transformer vae vit

Last synced: 08 Apr 2025

https://github.com/rishav394/vit-proxy-chrome-extension

A zero configuration chrome extension for accessing VIT's proxy server.

chrome-extension proxy-server vit vit-university vit-vellore volsbb

Last synced: 29 Oct 2025

https://github.com/jman4162/pytorch-vision-transformers-vit

Explore fine-tuning the Vision Transformer (ViT) model for object recognition in robotics using PyTorch. This tutorial covers setup, training, and evaluation processes, achieving impressive accuracy with practical resource constraints. Ideal for learners in AI and robotics.

ai-tutorial computer-vision deep-learning engineering image-classification machine-learning object-recognition pytorch pytorch-tutorial robotics vision-transformer vision-transformer-image-classification vit

Last synced: 10 Apr 2026

https://github.com/zer0int/clip-text-image-interpretability

Get CLIP ViT text tokens about an image, visualize attention as a heatmap.

attention-visualization clip gradient-ascent heatmap image-to-text vit xai

Last synced: 07 Sep 2025

https://github.com/shreydan/visiongpt2

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.

gpt image-captioning multimodal pytorch transformers vit

Last synced: 15 May 2025

https://github.com/sanjay7178/vrevamp

A chrome extension for Enhancing VIT AP University Student portal experience in VTOP

chrome-extension chrome-plugin html js opensource vit vitap vrevamp vtop-ap

Last synced: 27 Mar 2026

https://github.com/gaganreddyin/complete-placement-resources

comprehensive repository designed to assist students for Placements_Vellore Institute of Technology (VIT)

dsa interview-preparation placement placement-preparation technical-interview-questions vit

Last synced: 01 Apr 2025

https://github.com/autodistill/autodistill-vit

ViT module for use with autodistill.

autodistill computer-vision vision-transformer vit

Last synced: 14 Apr 2025

https://github.com/ishivanshgoel/cse-1002-problem-solving-and-object-oriented-programming

This repository contains my solutions to problems given for practice in course- CSE-1002 Problem Solving and Object-Oriented Programming(Winter Semester 2019-20)

c cpp cse1002 oops solutions vit vpropel

Last synced: 08 Sep 2025

https://github.com/kyegomez/open-moonvit

This is an ultra-simple, single-file PyTorch implementation of MoonViT, the native-resolution vision encoder from Kimi-VL.

ai attention ml pytorch torch torchvision transformers vision-transformer vit vl

Last synced: 04 Jun 2026

https://github.com/rishav394/vit-scraper

Scrap the shit out of new vtop

vit vit-university vtop vtopbeta

Last synced: 19 Apr 2026

https://github.com/paradoxzw/cosattention2d

a 2D cosine attention module inspired by cosFormer: Rethinking Softmax in Attention(https://arxiv.org/abs/2202.08791)

cosformer iclr2020 pytorch transformer vit

Last synced: 24 Apr 2026

https://github.com/andreped/vit-explainer

🔥 Demonstrating Explainable AI with Vision Transformer in web app

ai explainable-ai gradio huggingface huggingface-spaces saliency-map transformers vision-transformer vit xai

Last synced: 13 Apr 2025

https://github.com/akshatvg/exc-vtop

Portal that displays a student's approved achievements with different features like approval, rejection, adding details, etc in addition to all EXC details for the clubs and chapters.

achievments faculty hacktoberfest open-source-programming osp php portal proctor students vachieveit vit vtop

Last synced: 11 Apr 2025

https://github.com/ppmpreetham/vfriend

👩🏻‍🤝‍👨🏻A Cross-platform app written in tauri for VIT Students to check which of your friends are free right now

tauri tauri-app tauri-apps vit vitchennai

Last synced: 24 Jul 2025

https://github.com/wozeparrot/tinygrad-image-models

tinygrad image models, inference, and (soon) training

cnn imagenet tinygrad vit

Last synced: 03 Apr 2025

https://github.com/qanastek/french-automatic-number-plate-recognition-system

Easy to use receipe to make a French Automatic Number Plate Recognition System (ANPR)

coco coco-dataset huggingface hugsvision object-detection vit

Last synced: 13 Feb 2026

https://github.com/tugot17/vision-transformer-presentation

Presentation on An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

vision-transformer vit

Last synced: 02 Jan 2026

https://github.com/akshatvg/devsoc-20

The official website for DEVSOC 2020 by CodeChef-VIT.

akshat-gupta akshatvg codechef-vit codechefvit developers-sprint-of-code devsoc hackathon vit website

Last synced: 11 Apr 2025

https://github.com/asherk7/neurovit

AI-powered app for MRI brain tumor classification and medical Q&A using Vision Transformers and RAG-enhanced LLMs.

aws docker faiss fastapi huggingface langchain ml openai pytorch rag transformers vit vllm

Last synced: 14 Apr 2026

https://github.com/ishan-gupt/styletransfer-pytorch

Implementation of image style transfer in PyTorch

deep-learning pytorch soft-computing vit vitap

Last synced: 21 May 2026

https://github.com/brandonroberts/tanstack-react-router-analog-nitro

An example application using TanStack React Router with AnalogJS + Nitro

react ssr tanstack vit

Last synced: 19 May 2026

https://github.com/dgcnz/repr

Code for "Relative Position and Scale Regression for Self-Supervised Pretraining"

cv ssl vit

Last synced: 19 Aug 2025

https://github.com/agora-lab-ai/omegavit

OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space modeling, and mixture of experts to achieve superior performance across various computer vision tasks.

agora agoralab ai ml open-ai ssm transformer vit

Last synced: 31 Aug 2025

https://github.com/tonywu71/vision-transformer

Tensorflow implementation of Image Classification with Vision Transformer on the MNIST dataset.

computer-vision mnist tensorflow vision-transformer vit

Last synced: 05 May 2026

https://github.com/mohammad95labbaf/brain-tumor-transferlearning

The Brain Tumor MRI Dataset from Kaggle is employed for automated brain tumor detection and classification research. Investigated methods include using pre-trained models (VGG16, ResNet50, and ViT). 🧠🔍

cnn cnn-keras deep-learning deep-neural-networks deeplearning kaggle-dataset keras keras-tensorflow neural-network neural-networks pretrained-models resnet-50 transfer-learning tumor-classification tumor-detection vgg16 vision vision-transformer vit

Last synced: 29 Apr 2026

https://github.com/akshatvg/yaas-bot

Official Website for YAAS Bot.

bot chapters clubs insta-bot team-yaas vit yaas-bot

Last synced: 30 Jun 2025

https://github.com/songquanpeng/vitgan

[WIP] Unofficial PyTorch implementation for "ViTGAN: Training GANs with Vision Transformers".

gan iclr2022 vit vitgan

Last synced: 07 May 2025

https://github.com/shivakantkurmi/dr-g-viswanathan-coding-challenge

51 days of code + additional 50 days from 46 days as an initiative for mother's and father's day #DRGViswanathan challenge

codingchallenge drgviswanathan vit vit-bhopal

Last synced: 30 Jun 2025

https://github.com/sreeeswaran/multi-modal-sentiment-analysis-with-transformers

This project leverages the power of transformer models to perform sentiment analysis on both text and images. It uses BERT for text sentiment analysis and a pre-trained vision transformer (ViT) for image sentiment analysis.

bert bert-model image-sentiment-analysis sentiment-analysis sentimental-analysis text-sentiment-analysis transformer-models transformers vision-transformer vit

Last synced: 01 Sep 2025

https://github.com/kimrass/vivit

PyTorch implementation of 'ViViT' (Arnab et al., 2021) from scratch.

timm vit vivit

Last synced: 21 Aug 2025

https://github.com/semihdervis/vit-image-classification-with-any-images

Training (fine-tuning) and evaluating Vision Transformer (ViT) models on any image dataset

computer-vision image-classification vision-transformer vision-transformer-image-classification vit

Last synced: 25 Mar 2025