Projects in Awesome Lists tagged with transformer-models
A curated list of projects in awesome lists tagged with transformer-models.
https://github.com/kyegomez/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
agents ai artificial-intelligence attention-mechanism chatgpt gpt4 gpt4all huggingface langchain langchain-python machine-learning multi-modal-imaging multi-modality multimodal prompt-engineering prompt-toolkit prompting swarms transformer-models tree-of-thoughts
Last synced: 23 Oct 2025
https://github.com/opennmt/ctranslate2
Fast inference engine for Transformer models
avx avx2 cpp cuda deep-learning deep-neural-networks gemm inference intrinsics machine-translation mkl neon neural-machine-translation onednn openmp opennmt parallel-computing quantization thrust transformer-models
Last synced: 08 Oct 2025
https://github.com/sovrasov/flops-counter.pytorch
FLOPs counter for neural networks in the PyTorch framework
deep-neural-networks deeplearning flops-counter pytorch pytorch-cnn pytorch-utils transformer transformer-models
Last synced: 12 May 2025
https://github.com/VITA-Group/TransGAN
[NeurIPS 2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
gan pytorch transformer transformer-encoder transformer-models
Last synced: 08 May 2025
https://github.com/vturrisi/solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by PyTorch Lightning
barlow-twins byol contrastive-learning deepcluster dino mae masked-input-prediction moco nnclr nvidia-dali pytorch pytorch-lightning ressl self-supervised-learning simclr simsiam swav transformer-models vibcreg vicreg
Last synced: 06 Oct 2025
https://github.com/harleyszhang/llm_note
LLM notes covering model inference, transformer model structure, and LLM framework code analysis.
cuda-programming kv-cache llm llm-inference transformer-models triton-kernels vllm
Last synced: 23 Aug 2025
https://github.com/daiquocnguyen/Graph-Transformer
Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022) (PyTorch and TensorFlow)
graph-classification graph-deep-learning graph-embeddings graph-machine-learning graph-neural-networks graph-representation-learning graph-transformer node-embeddings self-attention text-classification transformer transformer-models
Last synced: 27 Mar 2025
https://github.com/philipturner/metal-flash-attention
FlashAttention (Metal Port)
artificial-intelligence attention-mechanism high-performance-computing metal software-engineering stable-diffusion transformer-models
Last synced: 28 Dec 2025
https://github.com/cuiziteng/Illumination-Adaptive-Transformer
[BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low-light enhancement; runs in 0.004 seconds, so try it for pre-processing.
bmvc exposure-correction image-enhancement image-reconstruction image-restoration low-level-vision low-light-enhance low-light-image-enhancement pytorch transformer-architecture transformer-models
Last synced: 07 Apr 2025
https://github.com/HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
albert automatic-summarization bert distilbert extractive-summarization machine-learning pytorch-lightning roberta summarization summarization-dataset text-summarization transformer-models
Last synced: 29 Apr 2025
https://github.com/usefulsensors/useful-transformers
Efficient Inference of Transformer models
cpp neural-networks npu openai-whisper rockchip transformer-models
Last synced: 13 Mar 2025
https://github.com/RetroCirce/HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
audio-classification music-information-retrieval python sound-event-detection transformer-models
Last synced: 14 Jul 2025
https://github.com/audeering/w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
arousal deep-learning dominance msp-podcast onnx speech-emotion-recognition transformer-models valence wav2vec2
Last synced: 10 Jun 2025
https://github.com/dpressel/mint
MinT: Minimal Transformer Library and Tutorials
bart bert gpt gpt2 opt pytorch roberta sentence-transformers t5 transformer transformer-models transformers tutorials
Last synced: 07 Sep 2025
https://github.com/yizhongw/tk-instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
cross-task-generalization few-shot-learning instruction transformer-models zero-shot-learning
Last synced: 21 Aug 2025
https://github.com/RetroCirce/Zero_Shot_Audio_Source_Separation
The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022
audio-source-separation music-information-retrieval python query-based-learning transformer-models zero-shot-learning
Last synced: 14 Jul 2025
https://github.com/csinva/imodelsX
Interpret text data using LLMs (scikit-learn compatible).
ai deep-learning explainability huggingface interpretability language-model machine-learning ml natural-language-processing natural-language-understanding neural-network pytorch scikit-learn text text-classification transformer-models xai
Last synced: 05 May 2025
https://github.com/openmachine-ai/transformer-tricks
A collection of tricks and tools to speed up transformer models
ai arxiv arxiv-papers llm llm-inference llmops machine-learning python transformer transformer-models transformer-pytorch
Last synced: 16 May 2025
https://github.com/kyegomez/algorithm-of-thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
ai-reasoning artificial-intelligence gpt4 gpt4-api gpt4all prompt-engineering swarms transformer-architecture transformer-models
Last synced: 21 Aug 2025
https://github.com/csinva/imodelsx
Scikit-learn friendly library to interpret, and prompt-engineer text datasets using large language models.
ai deep-learning explainability huggingface interpretability language-model machine-learning ml natural-language-processing natural-language-understanding neural-network pytorch scikit-learn text text-classification transformer-models xai
Last synced: 16 May 2025
https://github.com/sea-snell/grokking
Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
artificial-intelligence deep-learning grokking neural-network python pytorch transformer transformer-models
Last synced: 02 Oct 2025
https://github.com/voidful/tfkit
🤖📇 Handling multiple NLP tasks in one pipeline
multi-label-classification multi-task nlp tagger tagging text-classification text-generation text-processing transformer-models transformers
Last synced: 04 Sep 2025
https://github.com/sovit-123/vision_transformers
Vision Transformers for image classification, image segmentation, and object detection.
attention computer-vision transformer-models transformers vision-transformer
Last synced: 12 Sep 2025
https://github.com/wq2012/speakerrecognitionfromscratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
attention-mechanism deep-learning librispeech lstm neural-network pytorch speaker-recognition speaker-recognition-systems transformer transformer-models
Last synced: 12 Apr 2025
https://github.com/julienkay/com.doji.transformers
A Unity package to run pretrained transformer models with Unity Sentis
ai clip machine-learning sentis tokenization tokenizer transformer-models transformers unity
Last synced: 10 Apr 2025
https://github.com/mohd-faizy/06p_sentiment-analysis-with-deep-learning-using-bert
Finetuning BERT in PyTorch for sentiment analysis.
attention-mechanism bert-embeddings bert-model bert-models encoder gpt natural-language-processing pytorch sentiment-classification transformer-architecture transformer-models transformers
Last synced: 28 Oct 2025
https://github.com/kyegomez/gpt3
An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners"
artificial-intelligence attention-mechanism gpt3 transformer-architecture transformer-models
Last synced: 07 May 2025
https://github.com/rbitr/ferrite
Simple, lightweight transformers in Fortran
embedding-models embeddings sentence-embeddings sentence-transformers transformer-models transformers
Last synced: 08 Apr 2025
https://github.com/firojalam/crisis_datasets_benchmarks
Crisis Dataset for Benchmarks Experiments
bert bert-fine-tuning crisis-computing crisis-informatics disaster-response roberta social-media transformer-models tweet-text-classification
Last synced: 31 Aug 2025
https://github.com/rahul-jha98/justjoking.ai
Using a Transformer to learn a language model and generate short jokes
gpt-2 joke jokegenerator language-model nlg nlp tensorflow2 transformer-models
Last synced: 14 May 2025
https://github.com/kyegomez/shallowff
Zeta implementation of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"
artificial-intelligence attention attention-is-all-you-need attention-mechanism attention-mechanisms feedforward transformer transformer-encoder transformer-models transformers-models
Last synced: 28 Jun 2025
https://github.com/mohankrishnagr/infosys_text-summarization
This repository contains the implementation of a Transformer-based model for abstractive text summarization and a rule-based approach for extractive text summarization.
automatic-summarization bart deep-learning pytorch-nlp summarization-dataset text-summarization transformer-models
Last synced: 15 Oct 2025
https://github.com/kyegomez/longvit
A simplistic PyTorch implementation of LongVit using my previous implementation of LongNet as a foundation.
ai artificial-intelligence attention attention-is-all-you-need attention-mechanism gpt3 gpt4 ml transformer-architecture transformer-models
Last synced: 09 Aug 2025
https://github.com/alphagov/govuk-content-metadata
GovNER: an encoder-based language model (RoBERTa) fine-tuned to perform Named Entity Recognition (NER) on GOV.UK content
cpto data-products-team gcp govuk govuk-content metadata-extraction named-entity-recognition nlp semantic-metadata transformer-models
Last synced: 08 May 2025
https://github.com/abhaskumarsinha/keras-implementation-of-transformer-architecture
This repository presents a Python-based implementation of the Transformer architecture using the Keras TensorFlow library, as proposed by Vaswani et al. in their 2017 paper "Attention is all you need".
bert bert-model keras machine-translation natural-language-generation natural-language-processing natural-language-understanding nlp nlp-machine-learning nlp-parsing nmt nmt-model tensorflow tensorflow-tutorials tensorflow2 transformer transformer-architecture transformer-models translation
Last synced: 24 Oct 2025
https://github.com/szczyglis-dev/gpt3-py
[Python] "Bring-Your-Own-Key" terminal-based application allowing interaction with OpenAI's GPT-3 artificial intelligence. It provides a chat mode and code generation in Python, C++, C#, Java, JavaScript, TypeScript, PHP, Assembly, SQL, Bash, Ruby, Go, Perl, R, Matlab, Q# and more.
ai api-client artificial-intelligence bot chatbot code code-generation codeanalysis command-line davinci deep-learning gpt gpt-3 machine-learning natural-language-generation natural-language-processing openai-api python3 terminal-based transformer-models
Last synced: 11 Apr 2025
https://github.com/lazerlambda/promptzl
Turn LLMs into zero-shot PyTorch classifiers!
classification few-shot huggingface large-language-model large-language-models llama llm machine-learning ml prompt prompt-engineering prompt-toolkit pytorch qwen transformer-models transformers transformers-library zero-shot
Last synced: 19 Apr 2025
https://github.com/kwokhing/ai-planet-llm-bootcamp-challenge
An LLM challenge to (i) fine-tune a pre-trained Hugging Face transformer model to build a code generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
embeddings-model fine-tuning langchain language-model llm mistral-7b ocra-mini-3b qlora retrieval-augmented-generation sentence-embeddings supervised-finetuning transformer-models
Last synced: 13 Oct 2025
https://github.com/retkowsky/many_models_image_classification
Many-model image classification using Transformers models
computer-vision python transformer-models
Last synced: 02 Apr 2025
https://github.com/ksm26/embedding-models-from-architecture-to-implementation
Understand and build embedding models, focusing on word and sentence embeddings, dual encoder architectures. Learn to train embedding models using contrastive loss, implement them in semantic search and RAG systems.
ai-applications ai-architecture bert bert-embeddings bert-fine-tuning bert-model contrastive-loss dual-encoder embedding-models machine-learning model-training question-answer-retrieval rag-systems semantic-search sentence-embeddings transformer-models word-embeddings word2vec
Last synced: 25 Jul 2025
https://github.com/kreasof-ai/lawful-diffusion
Lawful Diffusion: an ethical way to address copyright violation in text-to-image generative models.
artificial-intelligence deep-learning diffusion-models generative-ai machine-learning python pytorch stable-diffusion transformer-models
Last synced: 12 Apr 2025
https://github.com/definetlynotai/llm_class
Simple Python class file that can determine toxicity, retrieve Yahoo stocks, and complete text for you, with a high degree of customisation
class complex llm python stocks text-completion toxicity-classification transformer-models transformers yahoo
Last synced: 21 Jul 2025
https://github.com/sergio11/llm_finetuning_and_evaluation
The LLM FineTuning and Evaluation project 🚀 enhances FLAN-T5 models for tasks like summarizing Spanish news articles 🇪🇸📰. It features detailed notebooks 📚 on fine-tuning and evaluating models to optimize performance for specific applications. 🔍✨
ethical-ai fine-tuning flan-t5 llm machine-learning natural-language-processing nlp parameter-efficient-tuning ppo prompt-engineering python pytorch qlora rlhf text-generation tinyllama transformer-models
Last synced: 17 Apr 2025
https://github.com/nikk0001/text-generation-by-using-gpt-2
Text Generation By Using GPT-2 Model
generative-adversarial-network generative-ai gpt gpt-2 textgeneration transformer-models transformers
Last synced: 20 Jun 2025
https://github.com/mkearney/infoquality
Information Quality
nlp nlp-machine-learning transformer-models
Last synced: 11 Jun 2025
https://github.com/aitor-alvarez/acoustic-transformer-models
Acoustic Transformer Models for Audio Classification
acoustic classification hubert pytorch-lightning transformer-models wav2vec2 wavlm
Last synced: 20 Mar 2025
https://github.com/ivanbongiorni/shakespeare-gpt
How to build a custom GPT for text generation, based on TensorFlow 2.x and Maximal. Trained on the Shakespeare corpus.
deep-learning deeplearning generative-ai gpt machine-learning machinlearning maximal natural-language-generation natural-language-processing nlg nlp python tensorflow tensorflow2 transformer transformer-architecture transformer-models
Last synced: 30 Mar 2025
https://github.com/md-emon-hasan/generative-ai-with-langchain-and-huggingface
An industry-focused guide for building, deploying, and optimizing generative AI applications, incorporating advanced techniques such as RAG, model fine-tuning, and scalable cloud/on-premise deployment strategies.
ai-content-creation ai-ethics ai-powered-applications chatbot-development deep-learning docker document-summarization generative-ai hugging-face langchain model-fine-tuning nlp nlp-machine-learning pruning-models quantization rag text-generation transformer-models transformers transformers-library
Last synced: 31 Dec 2025
https://github.com/anas-farooq8/predicting-and-generating-video-sequences
A deep learning project to predict and generate future video frames using models like ConvLSTM, PredRNN, and Transformers, leveraging the UCF101 dataset. The repository includes preprocessing, model training, video generation, an interactive UI, and evaluation metrics to compare model performance in video synthesis and temporal prediction tasks.
computer-vision conv-lstm deep-learning future-frame-predicition predrnn pytorch spatio-temporal streamlit transformer-models ucf101-dataset video-generation video-prediction
Last synced: 23 Mar 2025
https://github.com/mkearney/aimlabs
AI Message Labels: Packaging and pipelines for deep learning text classification models
ai artificial-intelligence classification deep-learning distilbert machine-learning model-training modeling natural-language-processing neural-network nlp text-classification training-pipeline transformer transformer-models
Last synced: 04 Mar 2025
https://github.com/sreeeswaran/image-captioning-transformer
This project demonstrates an image captioning model using a Transformer architecture. The model takes an image as input and generates a descriptive caption. We use the COCO dataset for training and evaluation.
coco coco-dataset image-caption-generator image-captioning model neural-networks transformer transformer-models transformers
Last synced: 16 Mar 2025
https://github.com/sreeeswaran/multi-modal-sentiment-analysis-with-transformers
This project leverages the power of transformer models to perform sentiment analysis on both text and images. It uses BERT for text sentiment analysis and a pre-trained vision transformer (ViT) for image sentiment analysis.
bert bert-model image-sentiment-analysis sentiment-analysis sentimental-analysis text-sentiment-analysis transformer-models transformers vision-transformer vit
Last synced: 01 Sep 2025
https://github.com/jelhamm/article-mathematical-modeling-of-the-short-circuit-mode-of-a-voltage-transformer
Simulations for the paper "Mathematical Modeling of the Short Circuit Mode of a Voltage Transformer"
electrical-circuits mathematical-modeling matlab matlab-programming matlab-project paper short-circuit short-circuit-analysis short-circuiting simulations transformer-models transformers
Last synced: 04 Oct 2025
https://github.com/soumilgit/ml-personal-notes
Contains short notes with diagrams on machine learning, focused on the underlying math and the practical perspective.
machine-learning-algorithms math mcp-server transformer-models
Last synced: 06 Sep 2025
https://github.com/arpanpramanik2003/object-detection-resnet50
This repository contains a deep learning project for CIFAR-10 image classification using the ResNet50 pre-trained model. The project includes data preprocessing, model training, evaluation, and visualization of results. Achieved high accuracy by fine-tuning the model and optimizing hyperparameters.
cifar-10 cifar10 cnn deep-learning keras machine-learning model-evaluation object-detection opencv pre-trained-model python regression-models resnet-50 streamlit tensorflow2 transformer-models
Last synced: 31 Dec 2025
https://github.com/asifdotexe/natural-langwiz
Natural LangWiz is a repository for exploring Natural Language Processing (NLP) techniques through Jupyter notebooks. It covers everything from text preprocessing and sentiment analysis to advanced transformer models. Dive in to see how we turn raw text into actionable insights with a touch of NLP wizardry!
api-integration data-preprocessing-and-cleaning data-visualization emojification grammar-checker machine-learning named-entity-recognition natural-language-preprocessing python sentiment-analysis spam-detection text-analysis text-generation text-summarization topic-modeling transformer-models translation vectorization web-scraping wordcloud
Last synced: 05 Mar 2025
https://github.com/sabasyed/latex-to-code
Python code generation from LaTeX expressions, using a synthetic dataset and the CodeT5-base model.
fine-tuning inferencing large-language-models latex-to-python llm llm-deployment llm-models postprocessing synthetic-data t5-base t5-model transformer-models transformers
Last synced: 03 Mar 2025
https://github.com/wondermongering/linguisticperturber
Probing linguistic robustness in transformers: a quantum-inspired approach to AI interpretability
adversarial-examples ai-interpretability ai-safety computational-linguistics language-model-analysis machine-learning natural-language-processing perturbation-analysis probabilistic-models transformer-models word-embeddings
Last synced: 16 Mar 2025
https://github.com/shibam120302/image_captioning_using_transformers
PyTorch implementation of image captioning using a transformer-based model.
image-captioning pytorch transformer transformer-models transformer-pytorch
Last synced: 13 Oct 2025
https://github.com/raul23/simple-transformer-tts
This project offers a deeper exploration of tttzof351's "Simple Transformer TTS" codebase, enhanced with insights from Gemini, Google AI's advanced language model.
educational language-models pytorch text-to-speech transformer-models
Last synced: 30 Jun 2025
https://github.com/shuddha2021/memorized-q-a-simulator
🤖 A toy Transformer Q&A model simulator demonstrating core concepts of large language models through memorized Q&A pairs. Educational demo with interactive web interface.
artificial-intelligence javascript learning-tool nlp nlp-machine-learning python qa-system transformer transformer-architecture transformer-models web-interface
Last synced: 26 Dec 2025
https://github.com/estnafinema0/russian-jokes-generator
Transformer Models for Humorous Text Generation. Fine-tuned on a Russian jokes dataset with ALiBi, RoPE, GQA, and SwiGLU, plus a custom byte-level BPE tokenizer.
alibi bpe-tokenizer grouped-query-attention nlp pytorch rotary-position-embedding swiglu transformer-models
Last synced: 11 Mar 2025
https://github.com/shayne-fletcher/possum
Do "things"
huggingface-transformers llms transformer-models
Last synced: 02 Apr 2025
https://github.com/mokira3d48/i18net
Automatic machine translation built using GPT transformers and the multi-head self-attention mechanism.
deep-learning machine-learning machine-translation multihead-self-attention nlp transformer-models
Last synced: 04 Apr 2025
https://github.com/kunalgarglibra/personality-prediction-project
Applying different ML & Neural Network algorithms to analyze MBTI Dataset.
bert-model jupyter-notebook lstm-neural-networks python transformer-models
Last synced: 13 May 2025
https://github.com/denizetkar/transformer_models
Attention Is All You Need
machine-translation pytorch seq2seq tensorflow transformer-models
Last synced: 16 Mar 2025
https://github.com/jman4162/deep-time-series-forecasting
Comprehensive guide to time series forecasting using deep learning techniques, with practical examples and tutorials.
art chronos data-science deep-learning forecasting forecasting-models gluonts gluonts-deep-learning machine- machine-learning-algorithms neural-networks patch-tst pytorch time-series time-series-forecasting time-series-prediction transformer-models transformers
Last synced: 03 Mar 2025
https://github.com/hs094/sciatica
Sciatica is a powerful semantic search engine designed for academic literature exploration. This tool leverages cutting-edge transformer models to deliver precise and contextually relevant search results.
albert-model bert-model deep-learning information-retrieval machine-learning nlp research-tools roberta-model specter-model streamlit-webapp transformer-models
Last synced: 03 Jan 2026
https://github.com/esmail-ibraheem/dali
👨‍🎨 From-scratch PyTorch implementations of the DDPM and "High-Resolution Image Synthesis with Latent Diffusion Models" papers.
diffusion-models generative-model machine-learning paper-implementations pytorch transformer-models
Last synced: 02 Mar 2025
https://github.com/loccx78vn/transformer
A tutorial on building a Transformer model for time series forecasting in R
deep-learning torch transformer-models
Last synced: 20 Nov 2025
https://github.com/joreag/easy_bake_ai
A GUI-based system that lets you build your own AI on your own machine
agents ai artificial-intelligence transformer-architecture transformer-models
Last synced: 23 Nov 2025
https://github.com/vathsan08/mental-health-sentiment-analysis-using-deep-learning
Mental Health Sentiment Analysis using Deep Learning. This project leverages deep learning to classify mental health-related sentiments from text into seven categories: Anxiety, Bipolar, Depression, Normal, Personality Disorder, Stress, and Suicidal. By utilizing advanced NLP techniques, we aim to enhance understanding and support for mental well
audio-classification eda emotion-detection l2-regularization logistic-regression machine-learning meachine-learning mental-health ngrams nltk pytorch roberta sentiment-analysis spacy speech-emotion-recognition speech-recognition transformer-models urdu-language-processing
Last synced: 10 Jun 2025
https://github.com/rawenchilada/dataassistant-thesis
A finetuned AI model that can generate GraphQL queries from natural language questions.
Last synced: 05 Apr 2025