Projects in Awesome Lists tagged with vit

https://lukas-blecher.github.io/LaTeX-OCR/

pix2tex: Using a ViT to convert images of equations into LaTeX code.

dataset deep-learning im2latex im2markup im2text image-processing image2text latex latex-ocr machine-learning math-ocr ocr python pytorch transformer vision-transformer vit

Last synced: 19 Nov 2025

https://github.com/lukas-blecher/latex-ocr

pix2tex: Using a ViT to convert images of equations into LaTeX code.

dataset deep-learning im2latex im2markup im2text image-processing image2text latex latex-ocr machine-learning math-ocr ocr python pytorch transformer vision-transformer vit

Last synced: 13 May 2025

https://github.com/lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

dataset deep-learning im2latex im2markup im2text image-processing image2text latex latex-ocr machine-learning math-ocr ocr python pytorch transformer vision-transformer vit

Last synced: 26 Mar 2025

https://github.com/towhee-io/towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

computer-vision convolutional-networks embedding-vectors embeddings feature-extraction feature-vector image-processing image-retrieval llm machine-learning milvus pipeline towhee transformer unstructured-data video-processing vision-transformer vit

Last synced: 13 May 2025

https://github.com/open-compass/vlmevalkit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 13 May 2025

https://github.com/roboflow/inference

Turn any computer or edge device into a command center for your computer vision projects.

agents classification computer-vision deployment docker inference inference-api inference-server instance-segmentation jetson machine-learning object-detection onnx python tensorrt vit yolo11 yolov12 yolov5 yolov8

Last synced: 10 Apr 2026

https://github.com/hila-chefer/transformer-explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

attention-matrix attention-visualization bert bert-model cvpr2021 deep-learning explainability perturbation transformer-interpretability vision-transformer visualize-classifications vit

Last synced: 15 May 2025

https://github.com/hila-chefer/Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

attention-matrix attention-visualization bert bert-model cvpr2021 deep-learning explainability perturbation transformer-interpretability vision-transformer visualize-classifications vit

Last synced: 27 Mar 2025

https://github.com/thu-ml/sageattention

Quantized Attention achieves speedups of 2-3x and 3-5x compared to FlashAttention and xformers, respectively, without losing end-to-end metrics across language, image, and video models.

attention cuda efficient-attention inference-acceleration llm llm-infra mlsys quantization triton video-generate video-generation vit

Last synced: 14 May 2025

https://github.com/Yangzhangcst/Transformer-in-Computer-Vision

A paper list of some recent Transformer-based CV works.

awesome computer-vision deep-learning detr papers transformer transformer-awesome transformer-cv vit

Last synced: 06 May 2025

https://github.com/open-compass/VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 20 Jul 2025

https://github.com/br-idl/paddlevit

🤖 PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

classification computer-vision cv deep-learning detection encoder-decoder gan mlp object-detection paddlepaddle segmentation semantic-segmentation transformer vit

Last synced: 14 Apr 2025

https://github.com/yitu-opensource/T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

t2t-transformer vision-transformer vit

Last synced: 05 May 2025

https://github.com/sail-sg/adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

adan artificial-intelligence bert-model convnext cuda-programming deep-learning diffusion dreamfusion fairseq gpt2 llm-training llms mae moe optimizer pytorch resnet timm transformer-xl vit

Last synced: 07 Jul 2025

https://github.com/sail-sg/Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

adan artificial-intelligence bert-model convnext cuda-programming deep-learning diffusion dreamfusion fairseq gpt2 llm-training llms mae moe optimizer pytorch resnet timm transformer-xl vit

Last synced: 05 Apr 2025

https://github.com/v-iashin/video_features

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

audio-features clip feature-extraction i3d ig65m laion multi-gpu optical-flow parallel pytorch r2plus1d raft resnet s3d swin timm vggish video-features visual-features vit

Last synced: 02 Apr 2025

https://github.com/zgcr/SimpleAICV_pytorch_training_examples

SimpleAICV: PyTorch training and testing examples.

centernet darknet deeplabv3plus dino fcos kd mae pytorch regnetx repvgg resnet retinanet sam segment-anything solov2 ttfnet u2net vit yolact

Last synced: 21 Apr 2025

https://github.com/paddlepaddle/passl

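PASSL includes self-supervised image algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, SimSiam, SwAV, BEiT, and MAE, as well as foundational vision algorithms such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.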

beit clip convnext cvt deep-learning deit mae moco moco-v2 paddle pixpro pvt self-supervised-learning simclr swav swin-transformer vision-transformer vit xcit

Last synced: 04 Apr 2025

https://github.com/megvii-research/RevCol

Official code for the paper "Reversible Column Networks" and for "RevColv2"

cnn computer-vision iclr2023 mae pytorch transformer vit

Last synced: 20 Mar 2025

https://github.com/kyegomez/navit

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

attention-mechanism clip gpt4 multimodal multimodal-deep-learning multimodal-learning multimodality vit

Last synced: 16 May 2025

https://github.com/skyworkai/moh

MoH: Multi-Head Attention as Mixture-of-Head Attention

attention dit llms mixture-of-experts moe transformer vit

Last synced: 04 Apr 2025

https://github.com/qanastek/hugsvision

HugsVision is an easy-to-use huggingface wrapper for state-of-the-art computer vision

bert computer-vision deep-learning deit detr huggingface image-classification image-generation machine-learning object-detection pretrained-models pythorch pytorch pytorch-transformers semantic-segmentation state-of-the-art torchvision transformers vit yolo

Last synced: 14 Jun 2025

https://github.com/qanastek/HugsVision

HugsVision is an easy-to-use huggingface wrapper for state-of-the-art computer vision

bert computer-vision deep-learning deit detr huggingface image-classification image-generation machine-learning object-detection pretrained-models pythorch pytorch pytorch-transformers semantic-segmentation state-of-the-art torchvision transformers vit yolo

Last synced: 08 May 2025

https://github.com/kyegomez/vit-rgts

Open source implementation of "Vision Transformers Need Registers"

attention-mechanism gpt4 vision-api vision-transformer vit

Last synced: 12 Apr 2025

https://github.com/paddlepaddle/plsc

Paddle Large Scale Classification Tools; supports ArcFace, CosFace, PartialFC, and Data Parallel + Model Parallel training. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, and CAE.

arcface cait convmae cosface data-parallel deit distributed-training face-recognition facevit hight-speed large-scale mae moco-v3 model-parallel paddle paddlepaddle partial-fc resnet swin-transformer vit

Last synced: 05 Mar 2026

https://github.com/vitjs/vit

🚀 React application framework inspired by UmiJS

mock-data react react-framework umi umijs vit vite vite-plugin vite-plugin-react vitjs

Last synced: 26 Oct 2025

https://github.com/kamalkraj/vision-transformer

Vision Transformer using TensorFlow 2.0

image-classification tensorflow transformer vit

Last synced: 12 Sep 2025

https://github.com/zubair-irshad/nerf-mae

[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

3d 3d-deep-learning 3d-detection 3d-unet differentiable-rendering feature-pyramid-network instant-ngp masked-autoencoder multi-view nerf neural-radiance-fields neural-rendering region-proposal-network representation-learning self-supervised-learning semantic-segmantation super-resoluion transformers vision-transformers vit

Last synced: 07 Apr 2025

https://github.com/rasbt/pytorch-memory-optim

This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog post.

deep-learning llm memory-optimization pytorch vision vit

Last synced: 21 Aug 2025

https://github.com/hunto/image_classification_sota

Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.

cifar image-classification imagenet kd nas pruning pytorch rep transformer vit

Last synced: 03 Feb 2026

https://github.com/szq0214/SReT

Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"

efficient-neural-networks efficient-transformers transformer-architecture vision-transformer vit

Last synced: 20 Mar 2025

https://github.com/shunk031/simple-aesthetics-predictor

CLIP-based aesthetics predictor inspired by the interface of 🤗 huggingface transformers.

aesthetic-predictor clip huggingface huggingface-transformers vit

Last synced: 30 Apr 2025

https://github.com/pwb97/deepfake-detection

Learning a Deep Dual-level Network for Robust DeepFake Detection

auc-loss capsulenet dctnet deep-learning deepfake-detection facexray fwa mesonet pytorch tensorflow transformer vit xception

Last synced: 13 May 2025

https://github.com/mv-lab/ViT-FGVC8

"Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8

computer-vision cub200-2011 cvpr deep-learning fgvc fine-grained-classification stanford-dogs-dataset transformers vit

Last synced: 04 Apr 2025

https://github.com/stefanheng/ecg-representation-learning

Self-supervised pre-training for ECG representation with inspiration from transformers & computer vision

12-lead-ecg attention bert clustering dino ecg nlp pre-training representation-learning self-supervised-learning symbolic-representation transformer vision-transformer vit word2vec

Last synced: 13 Apr 2025

https://github.com/shreyxnsh/vtop

VTOP is the application version of the Vellore Institute of Technology, Bhopal website. It includes features such as a faculty database, event gallery, notices section, e-books, and a CGPA calculator.

android college college-app faculty-dashboard firebase firebase-realtime-database java vit

Last synced: 17 Aug 2025

https://github.com/kyegomez/mc-vit

Implementation of the MC-ViT model from the paper "Memory Consolidation Enables Long-Context Video Understanding"

ai multi-modal multi-modal-transformers multi-modality open-source transformer transformers vit

Last synced: 17 Aug 2025

https://github.com/zer0int/clip-xai-gui

CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models

attention attention-visualization clip game gradient-ascent gui image-to-text vision-transformer vit xai

Last synced: 24 Oct 2025

https://github.com/bryanlimy/v1t

Code for "V1T: Large-scale mouse V1 response prediction using a Vision Transformer"

deep-learning neural-response pytorch vision-transformer vit

Last synced: 17 Oct 2025

https://github.com/ekaansharora/bruteforce-selenium

A simple program to bruteforce a Wi-Fi login page using selenium and python. Used to bruteforce VOLSBB wifi @ VIT.

brute-force bruteforce python python3 selenium selenium-webdriver vit wifi

Last synced: 23 Apr 2025

https://github.com/chaudharydeepanshu/mini_vtop

Mini VTOP is an unofficial VIT Bhopal University VTOP website app that uses the power of WebView to create a user-friendly app.

flutter flutter-inappwebview vellore-institute-of-technology vit vit-app vit-bhopal vit-captcha vit-session-timeout webview

Last synced: 09 Apr 2025

https://github.com/akshatvg/github-themed-portfolio

GitHub themed portfolio website.

akshat akshatvg gupta institute of online personal portfolio resume technology vellore vit website

Last synced: 15 Apr 2025

https://github.com/njmarko/graph-transformer-psiml

Transformer implemented with graph attention network (GAT) layers from PyTorch Geometric

attention gat gnn graph-neural-networks pytorch-geometric transformer vision-transformer vit

Last synced: 28 Oct 2025

https://github.com/kyegomez/visionllama

Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta

ai deep-learning multi-modal vision-models vision-transformers vit

Last synced: 23 Jul 2025

https://github.com/zer0int/clip-sae-finetune

Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.

adversarial-attacks adversarial-learning clip fine-tune fine-tuning finetune sae sparse-autoencoder vision-transformer vit

Last synced: 03 Jul 2025

https://github.com/nikhilroxtomar/vision-transformer-vit-in-tensorflow

Vision Transformer Implementation in TensorFlow

transformer transformer-architecture transformer-encoder vision-transformer vit

Last synced: 14 Apr 2025

https://github.com/kyegomez/audiomamba

Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch

ai audio audio-transformers mamba ml patch transformers vit

Last synced: 07 May 2025

https://github.com/searchformyusername/Duplicate-Images-Detection-using-Parallel-Processing

Detect Duplicate Images Blazingly Fast

parallel-computing pdc vellore vit

Last synced: 17 Sep 2025

https://github.com/1UC1F3R616/Duplicate-Images-Detection-using-Parallel-Processing

Detect Duplicate Images Blazingly Fast

parallel-computing pdc vellore vit

Last synced: 31 Mar 2025

https://github.com/jain131102/neocolab

CSE3004_Design and Analysis of Algorithms Lab_ NEOCOLAB

daa java neocolab vit

Last synced: 02 Aug 2025

https://github.com/sayannath/vit-tf-hub-application

Build and fine-tune your Image Classifier using a Vision Transformer Model from TensorFlow Hub

fine-tuning jax keras tensorflow tf2 tfhub transformers vision-transformer vit

Last synced: 12 May 2025

https://github.com/zer0int/clip-vit-visualization

What do CLIP Vision Transformers learn? Feature Visualization can show you!

activation clip feature neuron vision-transformer visualization vit xai

Last synced: 16 Mar 2026

https://github.com/qnguyen3/vit_pytorch

A PyTorch Implementation of ViT (Vision Transformer)

deep-learning pytorch transformer vision-transformer vit

Last synced: 16 May 2025

https://github.com/kyegomez/clipq

A simple implementation of CLIP that splits an image into quadrants and then gets the embeddings for each quadrant

artificial-intelligence clip computer-vision gpt4 multimodal vision-transformer vit

Last synced: 07 May 2025

https://github.com/andreped/dss

📳 From training of transformers to real-time development in cross-platform mobile apps!

android cnn data-visualization deep-learning dss flutter internet-of-things ios iot mobile real-time recording rnn sensor tensorboard tensorflow tf2 tflite vision-transformer vit

Last synced: 13 Apr 2025

https://github.com/jiauzhang/nnm

Neural Network Models

diffusion gan llm transformer vae vit

Last synced: 12 May 2025

https://github.com/rishav394/vit-vellore-free-wifi-passwords

Just for educational purposes. Pls don't misuse. Don't want to make this repo private

vit vit-university

Last synced: 02 Sep 2025

https://github.com/liaoyanqing666/pvt_v2_video_frame_interpolation

A video frame interpolation (VFI) program in PyTorch that uses PVT_v2 as the encoder

pvt pvtv2 pyramid-vision-transformer pytorch vfi video-frame-interpolation vit

Last synced: 23 Mar 2025

https://github.com/sayannath/image-scene-classification

Image-Scene-Classification with 30 different classes.

big-transfer bit image-classification keras mobilenetv3 state-of-the-art-models tensrflow tflite visual-transformer vit

Last synced: 12 May 2025

https://github.com/rishav394/vit-proxy-chrome-extension

A zero configuration chrome extension for accessing VIT's proxy server.

chrome-extension proxy-server vit vit-university vit-vellore volsbb

Last synced: 29 Oct 2025

https://github.com/JiauZhang/nnm

Neural Network Models

diffusion gan llm transformer vae vit

Last synced: 08 Apr 2025

https://github.com/jman4162/pytorch-vision-transformers-vit

Explore fine-tuning the Vision Transformer (ViT) model for object recognition in robotics using PyTorch. This tutorial covers setup, training, and evaluation processes, achieving impressive accuracy with practical resource constraints. Ideal for learners in AI and robotics.

ai-tutorial computer-vision deep-learning engineering image-classification machine-learning object-recognition pytorch pytorch-tutorial robotics vision-transformer vision-transformer-image-classification vit

Last synced: 10 Apr 2026

https://github.com/zer0int/clip-text-image-interpretability

Get CLIP ViT text tokens about an image, visualize attention as a heatmap.

attention-visualization clip gradient-ascent heatmap image-to-text vit xai

Last synced: 07 Sep 2025

https://github.com/shreydan/visiongpt2

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.

gpt image-captioning multimodal pytorch transformers vit

Last synced: 15 May 2025

https://github.com/sanjay7178/vrevamp

A Chrome extension for enhancing the VIT AP University student portal experience in VTOP

chrome-extension chrome-plugin html js opensource vit vitap vrevamp vtop-ap

Last synced: 27 Mar 2026

https://github.com/autodistill/autodistill-vit

ViT module for use with autodistill.

autodistill computer-vision vision-transformer vit

Last synced: 14 Apr 2025

https://github.com/gulabpatel/cnns

ann cnn cnn-explainer inceptionv3 keract mobilenetv2 multitasking resnet unet-image-segmentation vgg visiontransfoemer vit

Last synced: 26 Jun 2025

https://github.com/gaganreddyin/complete-placement-resources

Comprehensive repository designed to assist students with placements at Vellore Institute of Technology (VIT)

dsa interview-preparation placement placement-preparation technical-interview-questions vit

Last synced: 01 Apr 2025

https://github.com/ishivanshgoel/cse-1002-problem-solving-and-object-oriented-programming

This repository contains my solutions to practice problems from the course CSE-1002 Problem Solving and Object-Oriented Programming (Winter Semester 2019-20)

c cpp cse1002 oops solutions vit vpropel

Last synced: 08 Sep 2025

https://github.com/zer0int/clip-deepdream

Deep Dreaming with CLIP Vision Transformers

ai aiart clip deep deepdream deepdream-maker deepdreamgenerator dream explainable-ai vision-transformer vit xai

Last synced: 28 Apr 2025

https://github.com/kyegomez/open-moonvit

This is an ultra-simple, single-file PyTorch implementation of MoonViT, the native-resolution vision encoder from Kimi-VL.

ai attention ml pytorch torch torchvision transformers vision-transformer vit vl

Last synced: 04 Jun 2026

https://github.com/akshatvg/exc-vtop

Portal that displays a student's approved achievements, with features such as approval, rejection, and adding details, in addition to all EXC details for the clubs and chapters.

achievments faculty hacktoberfest open-source-programming osp php portal proctor students vachieveit vit vtop

Last synced: 11 Apr 2025

https://github.com/afondiel/self-driving-cars-perception-and-deep-learning-free-course-freecodecamp

Notes and key takeaways of the Self-Driving Cars Perception applied Deep Learning Free Course from freeCodeCamp.org

2d-object-detection 3d-data-visualization 3d-object-detection bird-eye-view camera camera-calibration compter-vision data-visualization freecodecamp kitti-dataset lidar object-tracking perception robotics self-driving-car-engineer self-driving-cars vision-transformer vit yolo

Last synced: 09 May 2026

https://github.com/ppmpreetham/vfriend

👩🏻‍🤝‍👨🏻 A cross-platform app written in Tauri for VIT students to check which of their friends are free right now

tauri tauri-app tauri-apps vit vitchennai

Last synced: 24 Jul 2025

https://github.com/andreped/vit-explainer

🔥 Demonstrating Explainable AI with Vision Transformer in web app

ai explainable-ai gradio huggingface huggingface-spaces saliency-map transformers vision-transformer vit xai

Last synced: 13 Apr 2025

https://github.com/rishav394/vit-scraper

Scrap the shit out of new vtop

vit vit-university vtop vtopbeta

Last synced: 19 Apr 2026

https://github.com/paradoxzw/cosattention2d

A 2D cosine attention module inspired by "cosFormer: Rethinking Softmax in Attention" (https://arxiv.org/abs/2202.08791)

cosformer iclr2020 pytorch transformer vit

Last synced: 24 Apr 2026

https://github.com/akshatvg/akshatvg.github.io

My personal portfolio website to showcase my work.

agora akshat-gupta akshatvg joshtalks legalforce nearcast online personal portfolio resume technology trademarkia vit website

Last synced: 11 Apr 2025

https://github.com/tugot17/vision-transformer-presentation

Presentation on "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale"

vision-transformer vit

Last synced: 02 Jan 2026

https://github.com/qanastek/french-automatic-number-plate-recognition-system

Easy-to-use recipe for building a French Automatic Number Plate Recognition (ANPR) system

coco coco-dataset huggingface hugsvision object-detection vit

Last synced: 13 Feb 2026

https://github.com/asherk7/neurovit

AI-powered app for MRI brain tumor classification and medical Q&A using Vision Transformers and RAG-enhanced LLMs.

aws docker faiss fastapi huggingface langchain ml openai pytorch rag transformers vit vllm

Last synced: 14 Apr 2026

https://github.com/mjahmadee/vision_transformers

Vision Transformers

cait image-classification transformer vision-transformer vision-transformers vit

Last synced: 09 Jun 2026

https://github.com/akshatvg/devsoc-20

The official website for DEVSOC 2020 by CodeChef-VIT.

akshat-gupta akshatvg codechef-vit codechefvit developers-sprint-of-code devsoc hackathon vit website

Last synced: 11 Apr 2025

https://github.com/crepopcorn/cat-vs-dog-vit-project

dog-vs-cat image-classification machine-learning python transformer vit

Last synced: 29 Apr 2026

https://github.com/wozeparrot/tinygrad-image-models

tinygrad image models, inference, and (soon) training

cnn imagenet tinygrad vit

Last synced: 03 Apr 2025

https://github.com/brandonroberts/tanstack-react-router-analog-nitro

An example application using TanStack React Router with AnalogJS + Nitro

react ssr tanstack vit

Last synced: 19 May 2026

https://github.com/dgcnz/repr

Code for "Relative Position and Scale Regression for Self-Supervised Pretraining"

cv ssl vit

Last synced: 19 Aug 2025

https://github.com/ishan-gupt/styletransfer-pytorch

Implementation of image style transfer in PyTorch

deep-learning pytorch soft-computing vit vitap

Last synced: 21 May 2026

https://github.com/afondiel/prompt-engineering-for-vision-models-deeplearningai

These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.

cnn computer-vision convnets diffusion-models fine-tuning generative-models image-processing large-vision-language-models large-vision-models meta-sam prompt-engineering video-processing vision-language-model vision-model-prompting vision-models visual-prompting vit

Last synced: 25 Aug 2025

https://github.com/agora-lab-ai/omegavit

OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space modeling, and mixture of experts to achieve superior performance across various computer vision tasks.

agora agoralab ai ml open-ai ssm transformer vit

Last synced: 31 Aug 2025

https://github.com/tonywu71/vision-transformer

TensorFlow implementation of image classification with a Vision Transformer on the MNIST dataset.

computer-vision mnist tensorflow vision-transformer vit

Last synced: 05 May 2026

https://github.com/mohammad95labbaf/brain-tumor-transferlearning

The Brain Tumor MRI Dataset from Kaggle is employed for automated brain tumor detection and classification research. Investigated methods include using pre-trained models (VGG16, ResNet50, and ViT). 🧠🔍

cnn cnn-keras deep-learning deep-neural-networks deeplearning kaggle-dataset keras keras-tensorflow neural-network neural-networks pretrained-models resnet-50 transfer-learning tumor-classification tumor-detection vgg16 vision vision-transformer vit

Last synced: 29 Apr 2026

https://github.com/akshatvg/yaas-bot

Official Website for YAAS Bot.

bot chapters clubs insta-bot team-yaas vit yaas-bot

Last synced: 30 Jun 2025

https://github.com/shivakantkurmi/dr-g-viswanathan-coding-challenge

51 days of code + additional 50 days from 46 days as an initiative for mother's and father's day #DRGViswanathan challenge

codingchallenge drgviswanathan vit vit-bhopal

Last synced: 30 Jun 2025

https://github.com/andreped/inf1600-ai-workshop

🔥 Workshop in AI Deployment (INF-1600, UiT)

artificial-intelligence demonstration github github-actions gradio huggingface huggingface-spaces image-classification transformers vision-transformers vit web-application

Last synced: 13 Apr 2025

https://github.com/sreeeswaran/multi-modal-sentiment-analysis-with-transformers

This project leverages the power of transformer models to perform sentiment analysis on both text and images. It uses BERT for text sentiment analysis and a pre-trained vision transformer (ViT) for image sentiment analysis.

bert bert-model image-sentiment-analysis sentiment-analysis sentimental-analysis text-sentiment-analysis transformer-models transformers vision-transformer vit