An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with transformer

A curated list of projects in awesome lists tagged with transformer.

https://github.com/labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

attention deep-learning deep-learning-tutorial gan literate-programming lora machine-learning neural-networks optimizers pytorch reinforcement-learning transformer transformers

Last synced: 17 Nov 2025

https://github.com/rasbt/llms-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

ai artificial-intelligence chatgpt gpt language-model large-language-models llm python pytorch transformer

Last synced: 09 Sep 2025

https://github.com/vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

amd cuda deepseek gpt hpu inference inferentia llama llm llm-serving llmops mlops model-serving pytorch qwen rocm tpu trainium transformer xpu

Last synced: 12 May 2025

https://github.com/ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference openai speech-recognition speech-to-text transformer whisper

Last synced: 24 Dec 2025

https://github.com/ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference openai speech-recognition speech-to-text transformer whisper

Last synced: 01 Apr 2025

https://github.com/fishaudio/fish-speech

SOTA Open Source TTS

llama transformer tts valle vits vqgan vqvae

Last synced: 13 May 2025

https://github.com/datawhalechina/leedl-tutorial

Hung-yi Lee Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial

Last synced: 14 May 2025

https://github.com/graykode/nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

attention bert natural-language-processing nlp paper pytorch tensorflow transformer tutorial

Last synced: 13 May 2025

https://github.com/sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

cuda deepseek deepseek-llm deepseek-r1 deepseek-r1-zero deepseek-v3 inference llama llama3 llama3-1 llava llm llm-serving moe pytorch transformer vlm

Last synced: 12 May 2025

https://github.com/blinkdl/rwkv-lm

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

attention-mechanism chatgpt deep-learning gpt gpt-2 gpt-3 language-model linear-attention lstm pytorch rnn rwkv transformer transformers

Last synced: 09 May 2025

https://github.com/BlinkDL/RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

attention-mechanism chatgpt deep-learning gpt gpt-2 gpt-3 language-model linear-attention lstm pytorch rnn rwkv transformer transformers

Last synced: 20 Mar 2025

https://github.com/alibaba/mnn

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App: MNN-LLM-Android (./apps/Android/MnnLlmChat/README.md). MNN TaoAvatar Android - Local 3D Avatar Intelligence: apps/Android/Mnn3dAvatar/README.md

arm convolution deep-learning embedded-devices llm machine-learning ml mnn transformer vulkan winograd-algorithm

Last synced: 09 Sep 2025

https://github.com/cfug/dio

A powerful HTTP client for Dart and Flutter, which supports global settings, Interceptors, FormData, aborting and canceling a request, files uploading and downloading, requests timeout, custom adapters, etc.

adapter cancellable dart dio flutter http interceptor middleware network timeout transformer

Last synced: 12 May 2025

https://github.com/paddlepaddle/paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 12 May 2025

https://github.com/PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 24 Mar 2025

https://github.com/huggingface/text-generation-inference

Large Language Model Text Generation Inference

bloom deep-learning falcon gpt inference nlp pytorch starcoder transformer

Last synced: 13 May 2025

https://github.com/paddlepaddle/paddleseg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

autonomous-driving human-segmentation image-matting image-segmentation interactive-segmentation lane-detection medical-image-segmentation panoptic-segmentation semantic-segmentation transformer video-segmentation

Last synced: 13 May 2025

https://github.com/eleutherai/lm-evaluation-harness

A framework for few-shot evaluation of language models.

evaluation-framework language-model transformer

Last synced: 09 Sep 2025

https://github.com/PaddlePaddle/PaddleSeg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

autonomous-driving human-segmentation image-matting image-segmentation interactive-segmentation lane-detection medical-image-segmentation panoptic-segmentation semantic-segmentation transformer video-segmentation

Last synced: 29 Apr 2025

https://github.com/optimalscale/lmflow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

chatgpt deep-learning instruction-following language-model pretrained-models pytorch transformer

Last synced: 14 May 2025

https://optimalscale.github.io/LMFlow/

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

chatgpt deep-learning instruction-following language-model pretrained-models pytorch transformer

Last synced: 25 Sep 2025

https://github.com/OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

chatgpt deep-learning instruction-following language-model pretrained-models pytorch transformer

Last synced: 28 Mar 2025

https://github.com/openai/jukebox

Code for the paper "Jukebox: A Generative Model for Music"

audio generative-model music paper pytorch transformer vq-vae

Last synced: 14 May 2025

https://github.com/morizeyao/gpt2-chinese

Chinese version of GPT2 training code, using BERT tokenizer.

chinese gpt-2 nlp text-generation transformer

Last synced: 14 May 2025

https://github.com/Morizeyao/GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

chinese gpt-2 nlp text-generation transformer

Last synced: 24 Mar 2025

https://github.com/jessevig/bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

bert gpt2 machine-learning natural-language-processing neural-network nlp pytorch roberta transformer transformers visualization

Last synced: 13 May 2025

https://github.com/EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

evaluation-framework language-model transformer

Last synced: 23 Mar 2025

https://github.com/codertimo/bert-pytorch

Google AI 2018 BERT pytorch implementation

bert language-model nlp pytorch transformer

Last synced: 08 Oct 2025

https://github.com/codertimo/BERT-pytorch

Google AI 2018 BERT pytorch implementation

bert language-model nlp pytorch transformer

Last synced: 02 Apr 2025

https://github.com/InternLM/MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

ai-search-engine gpt llm llms multi-agent-systems perplexity-ai search searchgpt transformer web-search

Last synced: 06 May 2025

https://github.com/internlm/mindsearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

ai-search-engine gpt llm llms multi-agent-systems perplexity-ai search searchgpt transformer web-search

Last synced: 25 Apr 2025

https://github.com/nvidia/fastertransformer

Transformer related optimization, including BERT, GPT

bert gpt pytorch transformer

Last synced: 13 May 2025

https://github.com/NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

bert gpt pytorch transformer

Last synced: 16 Mar 2025

https://github.com/zhouhaoyi/informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

deep-learning forecasting pytorch self-attention time-series transformer

Last synced: 14 May 2025

https://github.com/zhouhaoyi/Informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

deep-learning forecasting pytorch self-attention time-series transformer

Last synced: 01 Apr 2025

https://github.com/timeseriesai/tsai

Time series Timeseries Deep Learning Machine Learning Python Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai

classification cnn deep-learning fastai forecasting inceptiontime machine-learning python pytorch regression rnn rocket self-supervised sequential state-of-the-art time-series time-series-analysis time-series-classification timeseries transformer

Last synced: 14 May 2025

https://github.com/timeseriesAI/tsai

Time series Timeseries Deep Learning Machine Learning Python Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai

classification cnn deep-learning fastai forecasting inceptiontime machine-learning python pytorch regression rnn rocket self-supervised sequential state-of-the-art time-series time-series-analysis time-series-classification timeseries transformer

Last synced: 26 Mar 2025

https://github.com/autogptq/autogptq

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

deep-learning inference large-language-models llms nlp pytorch quantization transformer transformers

Last synced: 08 Apr 2025

https://github.com/AutoGPTQ/AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

deep-learning inference large-language-models llms nlp pytorch quantization transformer transformers

Last synced: 14 Mar 2025

https://github.com/OpenGenesisLab/beat-ai

🚀 Beat AI Briefing: continuously sharing key developments in the AI field to help you conquer AI. Just beat it! Star to subscribe.

ai ai-learning engineering llm neural-network transformer

Last synced: 10 Dec 2025

https://github.com/kyubyong/transformer

A TensorFlow Implementation of the Transformer: Attention Is All You Need

attention-is-all-you-need attention-mechanism implementation transformer translation

Last synced: 14 May 2025

https://github.com/Kyubyong/transformer

A TensorFlow Implementation of the Transformer: Attention Is All You Need

attention-is-all-you-need attention-mechanism implementation transformer translation

Last synced: 09 Apr 2025

https://github.com/wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

asr automatic-speech-recognition conformer e2e-models production-ready pytorch speech-recognition transformer whisper

Last synced: 13 May 2025

https://github.com/sunface/beat-ai

Continuously sharing and translating excellent content from the AI field to help you conquer AI. Just beat it! Star to subscribe.

ai engineering llm neural-network transformer

Last synced: 20 Jun 2025

https://github.com/ibeatai/beat-ai

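Beat AI, also titled 零生万物 ("All Things from Zero"), is an AI primer written specifically for software engineers that walks you through writing AI hands-on. It goes from neural networks to large models, from high-level design to low-level principles, and from engineering implementation to algorithms. After finishing it, you will find that AI is not as unreachable or unbeatable as it seems. Just beat it!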

ai engineering llm neural-network transformer

Last synced: 29 Apr 2025

https://github.com/hyunwoongko/transformer

Transformer: PyTorch Implementation of "Attention Is All You Need"

attention dataset pytorch transformer

Last synced: 14 May 2025

https://github.com/bytedance/lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

accelerate bart beam-search bert cuda diverse-decoding gpt inference multilingual-nmt sampling training transformer

Last synced: 14 May 2025

https://github.com/hrnet/hrnet-semantic-segmentation

The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919

cityscapes high-resolution high-resolution-net hrnets lip pascal-context segmentation segmentation-transformer semantic-segmentation transformer

Last synced: 15 May 2025

https://github.com/HRNet/HRNet-Semantic-Segmentation

The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919

cityscapes high-resolution high-resolution-net hrnets lip pascal-context segmentation segmentation-transformer semantic-segmentation transformer

Last synced: 20 Mar 2025

https://github.com/xmuSistone/ViewpagerTransition

viewpager with parallax pages, together with vertical sliding (or click) and activity transition

parallax transformer transition viewpager

Last synced: 20 Apr 2025

https://github.com/xmusistone/viewpagertransition

viewpager with parallax pages, together with vertical sliding (or click) and activity transition

parallax transformer transition viewpager

Last synced: 15 May 2025

https://github.com/yangjianxin1/gpt2-chitchat

GPT2 model for Chinese chitchat (implements the MMI idea from DialoGPT)

chichat dialogpt dialogue-model gpt-2 gpt2 nlp text-generation transformer

Last synced: 15 May 2025

https://github.com/yangjianxin1/GPT2-chitchat

GPT2 model for Chinese chitchat (implements the MMI idea from DialoGPT)

chichat dialogpt dialogue-model gpt-2 gpt2 nlp text-generation transformer

Last synced: 27 Apr 2025

https://github.com/posthtml/posthtml

PostHTML is a tool to transform HTML/XML with JS plugins

html html-parser parser posthtml transformer xml xml-parser

Last synced: 13 May 2025

https://github.com/nvlabs/segformer

Official PyTorch implementation of SegFormer

ade20k cityscapes semantic-segmentation transformer

Last synced: 15 May 2025

https://github.com/guillaume-be/rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

bart bert deep-learning electra gpt gpt-2 language-generation machine-learning ner nlp question-answering roberta rust rust-lang sentiment-analysis transformer translation

Last synced: 14 May 2025

https://github.com/datawhalechina/learn-nlp-with-transformers

We want to create a repo to illustrate the usage of transformers in Chinese.

bert nlp transformer

Last synced: 15 May 2025

https://github.com/NVlabs/SegFormer

Official PyTorch implementation of SegFormer

ade20k cityscapes semantic-segmentation transformer

Last synced: 20 Mar 2025

https://github.com/charmve/computer-vision-in-action

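A computer vision closed-loop learning platform where code can be run interactively online. A closed learning loop for the Chinese e-book "Computer Vision in Action: Algorithms and Applications", with source code and a reader community (continuously updated ...) 📘 Online e-book: https://charmve.github.io/computer-vision-in-action/ 👇 Project homepage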

books charmve colab-notebook computer-vision computer-vision-algorithms deep-learning deep-learning-tutorial handbook in-action ipynb jupyter-notebook machine-learning neural-network notebook pytorch transformer tutorial

Last synced: 15 May 2025

https://github.com/Charmve/computer-vision-in-action

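A computer vision closed-loop learning platform where code can be run interactively online. A closed learning loop for the Chinese e-book "Computer Vision in Action: Algorithms and Applications", with source code and a reader community (continuously updated ...) 📘 Online e-book: https://charmve.github.io/computer-vision-in-action/ 👇 Project homepage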

books charmve colab-notebook computer-vision computer-vision-algorithms deep-learning deep-learning-tutorial handbook in-action ipynb jupyter-notebook machine-learning neural-network notebook pytorch transformer tutorial

Last synced: 14 Apr 2025

https://github.com/iscyy/ultralyticspro

🔥🔥🔥 Focused on improved models for YOLO11, YOLOv8, YOLOv12, YOLOv10, RT-DETR, YOLOv7, and YOLOv5. Supports improving the backbone, neck, head, loss, IoU, NMS, and other modules 🚀

backbone deep-learning pytorch rt-detr transformer yolo yolo11 yoloair yolov10 yolov3 yolov4 yolov5 yolov6 yolov7 yolov8

Last synced: 13 Apr 2025

https://github.com/km1994/nlp-interview-notes

This repository mainly collects interview questions for NLP algorithm engineers.

bert deel-learning ner nlp transformer

Last synced: 23 Mar 2025

https://github.com/ddz16/TSFpaper

This repository contains a reading list of papers on Time Series Forecasting/Prediction (TSF) and Spatio-Temporal Forecasting/Prediction (STF). These papers are mainly categorized according to the type of model.

deep-learning deep-neural-networks paper-lists rnn spatial-temporal-forecasting spatio-temporal spatio-temporal-data spatio-temporal-prediction tcn time-series time-series-analysis time-series-forecasting time-series-models time-series-prediction transformer

Last synced: 01 May 2025

https://github.com/young-geng/easylm

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

chatbot deep-learning flax jax language-model large-language-models llama natural-language-processing transformer

Last synced: 15 May 2025

https://github.com/km1994/NLP-Interview-Notes

This repository mainly collects interview questions for NLP algorithm engineers.

bert deel-learning ner nlp transformer

Last synced: 09 Apr 2025

https://github.com/young-geng/EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

chatbot deep-learning flax jax language-model large-language-models llama natural-language-processing transformer

Last synced: 14 Mar 2025

https://github.com/curiousily/getting-things-done-with-pytorch

Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BER

anomaly-detection bert computer-vision coronavirus deep-learning face-detection face-recognition lstm machine-learning nlp object-detection pytorch sentiment-analysis time-series time-series-anomaly-detection time-series-forecasting transfer-learning transformer tutorial yolo

Last synced: 15 May 2025

https://github.com/curiousily/Getting-Things-Done-with-Pytorch

Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BER

anomaly-detection bert computer-vision coronavirus deep-learning face-detection face-recognition lstm machine-learning nlp object-detection pytorch sentiment-analysis time-series time-series-anomaly-detection time-series-forecasting transfer-learning transformer tutorial yolo

Last synced: 06 Apr 2025