An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with transformer-models

A curated list of projects in awesome lists tagged with transformer-models .

https://github.com/vita-group/transgan

[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang

gan pytorch transformer transformer-encoder transformer-models

Last synced: 08 Apr 2025

https://github.com/VITA-Group/TransGAN

[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang

gan pytorch transformer transformer-encoder transformer-models

Last synced: 08 May 2025

https://github.com/vturrisi/solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning

barlow-twins byol contrastive-learning deepcluster dino mae masked-input-prediction moco nnclr nvidia-dali pytorch pytorch-lightning ressl self-supervised-learning simclr simsiam swav transformer-models vibcreg vicreg

Last synced: 06 Oct 2025

https://github.com/harleyszhang/llm_note

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

cuda-programming kv-cache llm llm-inference transformer-models triton-kernels vllm

Last synced: 23 Aug 2025

https://github.com/cuiziteng/Illumination-Adaptive-Transformer

[BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low light enhancement, 0.004 seconds try this for pre-processing.

bmvc exposure-correction image-enhancement image-reconstruction image-restoration low-level-vision low-light-enhance low-light-image-enhancement pytorch transformer-architecture transformer-models

Last synced: 07 Apr 2025

https://github.com/HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.

albert automatic-summarization bert distilbert extractive-summarization machine-learning pytorch-lightning roberta summarization summarization-dataset text-summarization transformer-models

Last synced: 29 Apr 2025

https://github.com/RetroCirce/HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

audio-classification music-information-retrieval python sound-event-detection transformer-models

Last synced: 14 Jul 2025

https://github.com/yizhongw/tk-instruct

Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.

cross-task-generalization few-shot-learning instruction transformer-models zero-shot-learning

Last synced: 21 Aug 2025

https://github.com/RetroCirce/Zero_Shot_Audio_Source_Separation

The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022

audio-source-separation music-information-retrieval python query-based-learning transformer-models zero-shot-learning

Last synced: 14 Jul 2025

https://github.com/kyegomez/algorithm-of-thoughts

My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"

ai-reasoning artificial-intelligence gpt4 gpt4-api gpt4all prompt-engineering swarms transformer-architecture transformer-models

Last synced: 21 Aug 2025

https://github.com/sea-snell/grokking

unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

artificial-intelligence deep-learning grokking neural-network python pytorch transformer transformer-models

Last synced: 02 Oct 2025

https://github.com/sovit-123/vision_transformers

Vision Transformers for image classification, image segmentation, and object detection.

attention computer-vision transformer-models transformers vision-transformer

Last synced: 12 Sep 2025

https://github.com/wq2012/speakerrecognitionfromscratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

attention-mechanism deep-learning librispeech lstm neural-network pytorch speaker-recognition speaker-recognition-systems transformer transformer-models

Last synced: 12 Apr 2025

https://github.com/julienkay/com.doji.transformers

A Unity package to run pretrained transformer models with Unity Sentis

ai clip machine-learning sentis tokenization tokenizer transformer-models transformers unity

Last synced: 10 Apr 2025

https://github.com/kyegomez/gpt3

An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"

artificial-intelligence attention-mechanism gpt3 transformer-architecture transformer-models

Last synced: 07 May 2025

https://github.com/rahul-jha98/justjoking.ai

Using a Transformer for learning the Language Model and Generate Short Jokes

gpt-2 joke jokegenerator language-model nlg nlp tensorflow2 transformer-models

Last synced: 14 May 2025

https://github.com/kyegomez/shallowff

Zeta implemantion of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"

artificial-intelligence attention attention-is-all-you-need attention-mechanism attention-mechanisms feedforward transformer transformer-encoder transformer-models transformers-models

Last synced: 28 Jun 2025

https://github.com/mohankrishnagr/infosys_text-summarization

This repository contains the implementation of a Transformer-based model for abstractive text summarization and a rule-based approach for extractive text summarization.

automatic-summarization bart deep-learning pytorch-nlp summarization-dataset text-summarization transformer-models

Last synced: 15 Oct 2025

https://github.com/kyegomez/longvit

A simplistic pytorch implementation of LongVit using my previous implementation of LongNet as a foundation.

ai artificial-intelligence attention attention-is-all-you-need attention-mechanism gpt3 gpt4 ml transformer-architecture transformer-models

Last synced: 09 Aug 2025

https://github.com/alphagov/govuk-content-metadata

GovNER: an encoder-based language model (RoBERTa) fine-tuned to perform Named Entity Recognition (NER) on GOV.UK content

cpto data-products-team gcp govuk govuk-content metadata-extraction named-entity-recognition nlp semantic-metadata transformer-models

Last synced: 08 May 2025

https://github.com/abhaskumarsinha/keras-implementation-of-transformer-architecture

This repository presents a Python-based implementation of the Transformer architecture on Keras TensorFlow library, as proposed by Vaswani et al. in their 2017 paper "Attention is all you need."

bert bert-model keras machine-translation natural-language-generation natural-language-processing natural-language-understanding nlp nlp-machine-learning nlp-parsing nmt nmt-model tensorflow tensorflow-tutorials tensorflow2 transformer transformer-architecture transformer-models translation

Last synced: 24 Oct 2025

https://github.com/szczyglis-dev/gpt3-py

[Python] "Bring-Your-Own-Key" terminal based application allowing interaction with the OpenAI's GPT-3 artificial intelligence. It provides a chat mode, code generation in Python, C++, C#, Java, Javascript, TypeScript, PHP, Assembly, SQL, Bash, Ruby, Go, Perl, R, Matlab, Q# and more.

ai api-client artificial-intelligence bot chatbot code code-generation codeanalysis command-line davinci deep-learning gpt gpt-3 machine-learning natural-language-generation natural-language-processing openai-api python3 terminal-based transformer-models

Last synced: 11 Apr 2025

https://github.com/kwokhing/ai-planet-llm-bootcamp-challenge

An LLM challenge to (i) fine-tune pre-trained HuggingFace transformer model to build a Code Generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain

embeddings-model fine-tuning langchain language-model llm mistral-7b ocra-mini-3b qlora retrieval-augmented-generation sentence-embeddings supervised-finetuning transformer-models

Last synced: 13 Oct 2025

https://github.com/retkowsky/many_models_image_classification

Many models image classification using Transformers models

computer-vision python transformer-models

Last synced: 02 Apr 2025

https://github.com/ksm26/embedding-models-from-architecture-to-implementation

Understand and build embedding models, focusing on word and sentence embeddings, dual encoder architectures. Learn to train embedding models using contrastive loss, implement them in semantic search and RAG systems.

ai-applications ai-architecture bert bert-embeddings bert-fine-tuning bert-model contrastive-loss dual-encoder embedding-models machine-learning model-training question-answer-retrieval rag-systems semantic-search sentence-embeddings transformer-models word-embeddings word2vec

Last synced: 25 Jul 2025

https://github.com/kreasof-ai/lawful-diffusion

Lawful Diffusion, ethical way to address copyright violation in text-to-image generative model.

artificial-intelligence deep-learning diffusion-models generative-ai machine-learning python pytorch stable-diffusion transformer-models

Last synced: 12 Apr 2025

https://github.com/definetlynotai/llm_class

Simple class python file that can determine toxicity, retrieve yahoo stocks and complete text for you, with super high customisation

class complex llm python stocks text-completion toxicity-classification transformer-models transformers yahoo

Last synced: 21 Jul 2025

https://github.com/sergio11/llm_finetuning_and_evaluation

The LLM FineTuning and Evaluation project 🚀 enhances FLAN-T5 models for tasks like summarizing Spanish news articles 🇪🇸📰. It features detailed notebooks 📚 on fine-tuning and evaluating models to optimize performance for specific applications. 🔍✨

ethical-ai fine-tuning flan-t5 llm machine-learning natural-language-processing nlp parameter-efficient-tuning ppo prompt-engineering python pytorch qlora rlhf text-generation tinyllama transformer-models

Last synced: 17 Apr 2025

https://github.com/md-emon-hasan/generative-ai-with-langchain-and-huggingface

An industry-focused guide for building, deploying, and optimizing generative AI applications, incorporating advanced techniques such as RAG, model fine-tuning, and scalable cloud/on-premise deployment strategies.

ai-content-creation ai-ethics ai-powered-applications chatbot-development deep-learning docker document-summarization generative-ai hugging-face langchain model-fine-tuning nlp nlp-machine-learning pruning-models quantization rag text-generation transformer-models transformers transformers-library

Last synced: 31 Dec 2025

https://github.com/anas-farooq8/predicting-and-generating-video-sequences

A deep learning project to predict and generate future video frames using models like ConvLSTM, PredRNN, and Transformers, leveraging the UCF101 dataset. The repository includes preprocessing, model training, video generation, an interactive UI, and evaluation metrics to compare model performance in video synthesis and temporal prediction tasks.

computer-vision conv-lstm deep-learning future-frame-predicition predrnn pytorch spatio-temporal streamlit transformer-models ucf101-dataset video-generation video-prediction

Last synced: 23 Mar 2025

https://github.com/sreeeswaran/image-captioning-transformer

This project demonstrates an image captioning model using a Transformer architecture. The model takes an image as input and generates a descriptive caption. We use the COCO dataset for training and evaluation.

coco coco-dataset image-caption-generator image-captioning model neural-networks transformer transformer-models transformers

Last synced: 16 Mar 2025

https://github.com/sreeeswaran/multi-modal-sentiment-analysis-with-transformers

This project leverages the power of transformer models to perform sentiment analysis on both text and images. It uses BERT for text sentiment analysis and a pre-trained vision transformer (ViT) for image sentiment analysis.

bert bert-model image-sentiment-analysis sentiment-analysis sentimental-analysis text-sentiment-analysis transformer-models transformers vision-transformer vit

Last synced: 01 Sep 2025

https://github.com/soumilgit/ml-personal-notes

Contains short notes, with diagrams for Machine Learning, specifically focussed on the Math behind and practical perspective.

machine-learning-algorithms math mcp-server transformer-models

Last synced: 06 Sep 2025

https://github.com/arpanpramanik2003/object-detection-resnet50

This repository contains a deep learning project for CIFAR-10 image classification using the ResNet50 pre-trained model. The project includes data preprocessing, model training, evaluation, and visualization of results. Achieved high accuracy by fine-tuning the model and optimizing hyperparameters.

cifar-10 cifar10 cnn deep-learning keras machine-learning model-evaluation object-detection opencv pre-trained-model python regression-models resnet-50 streamlit tensorflow2 transformer-models

Last synced: 31 Dec 2025

https://github.com/asifdotexe/natural-langwiz

Natural LangWiz is a repository for exploring Natural Language Processing (NLP) techniques through Jupyter notebooks. It covers everything from text preprocessing and sentiment analysis to advanced transformer models. Dive in to see how we turn raw text into actionable insights with a touch of NLP wizardry!

api-integration data-preprocessing-and-cleaning data-visualization emojification grammar-checker machine-learning named-entity-recognition natural-language-preprocessing python sentiment-analysis spam-detection text-analysis text-generation text-summarization topic-modeling transformer-models translation vectorization web-scraping wordcloud

Last synced: 05 Mar 2025

https://github.com/sabasyed/latex-to-code

Python codes generation from latex expressions. Using synthetic dataset and CodeT5-base model.

fine-tuning inferencing large-language-models latex-to-python llm llm-deployment llm-models postprocessing synthetic-data t5-base t5-model transformer-models transformers

Last synced: 03 Mar 2025

https://github.com/shibam120302/image_captioning_using_transformers

Pytorch implementation of image captioning using transformer-based model.

image-captioning pytorch transformer transformer-models transformer-pytorch

Last synced: 13 Oct 2025

https://github.com/raul23/simple-transformer-tts

This project offers a deeper exploration of tttzof351's "Simple Transformer TTS" codebase, enhanced with insights from Gemini, Google AI's advanced language model.

educational language-models pytorch text-to-speech transformer-models

Last synced: 30 Jun 2025

https://github.com/shuddha2021/memorized-q-a-simulator

🤖 A toy Transformer Q&A model simulator demonstrating core concepts of large language models through memorized Q&A pairs. Educational demo with interactive web interface.

artificial-intelligence javascript learning-tool nlp nlp-machine-learning python qa-system transformer transformer-architecture transformer-models web-interface

Last synced: 26 Dec 2025

https://github.com/estnafinema0/russian-jokes-generator

Transformer Models for Humorous Text Generation. Fine-tuned on Russian jokes dataset with ALiBi, RoPE, GQA, and SwiGLU.Plus a custom Byte-level BPE tokenizer.

alibi bpe-tokenizer grouped-query-attention nlp pytorch rotary-position-embedding swiglu transformer-models

Last synced: 11 Mar 2025

https://github.com/mokira3d48/i18net

Automatic machine translation built using GPT transformers and the multihead self attention mechanism.

deep-learning machine-learning machine-translation multihead-self-attention nlp transformer-models

Last synced: 04 Apr 2025

https://github.com/kunalgarglibra/personality-prediction-project

Applying different ML & Neural Network algorithms to analyze MBTI Dataset.

bert-model jupyter-notebook lstm-neural-networks python transformer-models

Last synced: 13 May 2025

https://github.com/hs094/sciatica

Sciatica is a powerful semantic search engine designed for academic literature exploration. This tool leverages cutting-edge transformer models to deliver precise and contextually relevant search results.

albert-model bert-model deep-learning information-retrieval machine-learning nlp research-tools roberta-model specter-model streamlit-webapp transformer-models

Last synced: 03 Jan 2026

https://github.com/esmail-ibraheem/dali

👨‍🎨 DDPM, and High-Resolution Image Synthesis with Latent Diffusion Models, papers implementation from scratch using pytorch.

diffusion-models generative-model machine-learning paper-implementations pytorch transformer-models

Last synced: 02 Mar 2025

https://github.com/loccx78vn/transformer

This is a tutorial to build the Transformer model to deal with time series forecasting task in R

deep-learning torch transformer-models

Last synced: 20 Nov 2025

https://github.com/joreag/easy_bake_ai

A GUI Based System to allow you to build your own AI on your own machine

agents ai artificial-intelligence transformer-architecture transformer-models

Last synced: 23 Nov 2025

https://github.com/vathsan08/mental-health-sentiment-analysis-using-deep-learning

# Mental Health Sentiment Analysis using Deep LearningThis project leverages deep learning to classify mental health-related sentiments from text into seven categories: Anxiety, Bipolar, Depression, Normal, Personality Disorder, Stress, and Suicidal. By utilizing advanced NLP techniques, we aim to enhance understanding and support for mental well

audio-classification eda emotion-detection l2-regularization logistic-regression machine-learning meachine-learning mental-health ngrams nltk pytorch roberta sentiment-analysis spacy speech-emotion-recognition speech-recognition transformer-models urdu-language-processing

Last synced: 10 Jun 2025

https://github.com/rawenchilada/dataassistant-thesis

A finetuned AI model that can generate GraphQL queries from natural language questions.

ai graphql transformer-models

Last synced: 05 Apr 2025