An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with transformers-models

A curated list of projects in awesome lists tagged with transformers-models.

https://github.com/internlm/internevo

InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.

910b deepspeed-ulysses flash-attention gemma internlm internlm2 llama3 llava llm-framework llm-training multi-modal pipeline-parallelism pytorch ring-attention sequence-parallelism tensor-parallelism transformers-models zero3

Last synced: 07 Oct 2025

https://neuralcarver.github.io/michelangelo/

[NeurIPS 2023] Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

alignment-before-generation image-to-3d michelangelo shape-generation text-to-3d transformers-models

Last synced: 26 Mar 2025

https://github.com/InternLM/InternEvo

InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.

910b deepspeed-ulysses flash-attention gemma internlm internlm2 llama3 llava llm-framework llm-training multi-modal pipeline-parallelism pytorch ring-attention sequence-parallelism tensor-parallelism transformers-models zero3

Last synced: 27 Mar 2025

https://github.com/DC-research/TEMPO

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting tasks (v1.0 version).

forecasting forecasting-models forecasting-time-series foundation-models gpt pretrained-language-model pretrained-models time-series time-series-analysis transformer transformers transformers-models

Last synced: 26 Apr 2026

https://github.com/esnya/hf-rvc

Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.

rvc torch transformers-models voice-conversion

Last synced: 02 Apr 2026

https://github.com/sinanw/llm-security-prompt-injection

This project investigates the security of large language models by performing binary classification of a set of input prompts to discover malicious prompts. Several approaches have been analyzed using classical ML algorithms, a trained LLM model, and a fine-tuned LLM model.

cybersecurity llm-prompting llm-security prompt-injection transformers-models

Last synced: 18 Jul 2025

https://github.com/The-Swarm-Corporation/Multi-Agent-Template-App

A radically simple, reliable, and high-performance template that lets you quickly get set up building multi-agent applications

agent-framework agentic agentops agents autogen crewai huggingface langchain llms models multi-agent swarms testing transformers transformers-models

Last synced: 21 Jul 2025

https://github.com/kyegomez/differentialtransformer

An open source community implementation of the model from "DIFFERENTIAL TRANSFORMER" paper by Microsoft.

ai attention ml rnns ssm transformers transformers-library transformers-models

Last synced: 07 May 2025

https://github.com/The-Swarm-Corporation/NewsAgent

NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.

agents ai llm llm-agent-swarms llm-agents models multi-agent multi-agent-collaboration news-agent swarms swarms-agents transformers transformers-models

Last synced: 21 Jul 2025

https://github.com/contrebande-labs/charred

CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell

bert canine character-aware controlnet diffusion diffusion-models fonts stable-diffusion tokenization-free transformers transformers-models typography unicode utf-16 utf-8

Last synced: 27 Jun 2025

https://github.com/nicolay-r/thor-ecac

The official fork of THoR Chain-of-Thought framework, enhanced and adapted for Emotion Cause Analysis (ECAC-2024)

chainofthought colab-notebook emotion-analysis emotion-cause-pair-extraction fine-tuning flan-t5 framework llm notebook-jupyter reasoning semeval semeval-2024 transformers-models

Last synced: 09 Mar 2026

https://github.com/the-swarm-corporation/multi-agent-template-app

A radically simple, reliable, and high-performance template that lets you quickly get set up building multi-agent applications

agent-framework agentic agentops agents autogen crewai huggingface langchain llms models multi-agent swarms testing transformers transformers-models

Last synced: 27 Jul 2025

https://github.com/kyegomez/shallowff

Zeta implementation of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"

artificial-intelligence attention attention-is-all-you-need attention-mechanism attention-mechanisms feedforward transformer transformer-encoder transformer-models transformers-models

Last synced: 28 Jun 2025

https://github.com/Esmail-ibraheem/Transformer-pytorch

Language-to-language Transformer model built from scratch in pure PyTorch, which I used for a translation task. Based on the 2017 paper "Attention Is All You Need".

llm machine-translation paper-implementations pytorch transformers-models

Last synced: 05 May 2025

https://github.com/the-swarm-corporation/newsagent

NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.

agents ai llm llm-agent-swarms llm-agents models multi-agent multi-agent-collaboration news-agent swarms swarms-agents transformers transformers-models

Last synced: 27 Jul 2025

https://github.com/bramvanroy/mai-simplification-nl-2023

Sentence-Level Text Simplification for Dutch

dutch nlp text-simplification transformers-models

Last synced: 12 Apr 2025

https://github.com/techn0man1ac/toxiccommentclassification

This project aims to develop a model capable of identifying and classifying different levels of toxicity in comments, using BERT (Bidirectional Encoder Representations from Transformers) for text analysis.

analysis bert-model classifying data-science docker machine-learning python streamlit text-classification transformers-models

Last synced: 18 Aug 2025

https://github.com/gmihaila/fintech_patents

Contains work done on the fintech patents classification project. The goal of this project is to build a model that can detect whether a patent is fintech or not based on its text content. If a patent is fintech, we then want to know which kind of fintech patent it is from our defined fintech categories.

fintech fintech-patents-classification patents transformers-models

Last synced: 16 Oct 2025

https://github.com/nicolay-r/Reasoning-for-Sentiment-Analysis-Framework

The official code for CoT / ZSL reasoning framework 🧠, utilized in paper: "Large Language Models in Targeted Sentiment Analysis in Russian"

chainofthought cot engine framework generative-model gpt gpt-35 gpt-4 large-language-models llm openai-api sentiment-analysis target-sentiment-classification transformers-models

Last synced: 19 Aug 2025

https://github.com/xcollab/huggingface

This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.

bert bert-model gpt gpt-models huggingface huggingface-transformers llm llms models pretrained-language-model pretrained-models python transformer transformers-models

Last synced: 10 Apr 2025

https://github.com/docsallover/flask-chatbot

This is a simple chatbot application made using Flask and Microsoft DialoGPT. The application allows users to chat with the chatbot using natural language and receive human-like responses.

chatbot dialogpt dialogpt-medium flask machine-learning microsoft python python3 pytorch transformers-models

Last synced: 30 Apr 2026

https://github.com/nicolay-r/reasoning-for-sentiment-analysis-framework

The official code for CoT / ZSL reasoning framework 🧠, utilized in paper: "Large Language Models in Targeted Sentiment Analysis in Russian"

chainofthought cot engine framework generative-model gpt gpt-35 gpt-4 large-language-models llm openai-api sentiment-analysis target-sentiment-classification transformers-models

Last synced: 19 Aug 2025

https://github.com/anonym0uswork1221/jaraconverse-transformersbased

This JaraConverse model is a cutting-edge Transformer-based supervised Language Model (LLM) specifically designed for generating Python code snippets.

ai code-generator conversational-ai deep-neural-networks keras keras-nlp large-language-model llm machine-learning optimized python scratch tensoflow transformers transformers-models

Last synced: 27 Feb 2026

https://github.com/hrolive/deep-learning-week

This 5-day online course, co-organised by LRZ and NVIDIA Deep Learning Institute (DLI), combined lectures on Fundamentals of Deep Learning for Single and for Multi-GPUs, Building Transformer-Based Natural Language Processing Applications, and Deep Learning on LRZ systems.

bert deep-learning gpu-acceleration high-performance-computing large-language-models llm machine-learning natural-language-processing nvidia python transformers transformers-models

Last synced: 10 Apr 2025

https://github.com/artzaragozagithub/nlp--p6_sentiment_analysis_and_summarization_of_stock_news

An NLP-driven sentiment analysis system that automatically processes and analyzes news articles to gauge market sentiment and summarizes the news at a weekly level, with the aim of enhancing the accuracy of stock price predictions and optimizing investment strategies.

classifier-training confusion-matrix decisiontreeclassifier eda glove-embeddings gridsearchcv keyedvectors llama mistral-7b myplot nlp-keywords-extraction numpy-library pandas-library prompt-engineering sentiment-analysis sklearn-library text-processing text-summarization transformers-models word2vec

Last synced: 02 May 2026

https://github.com/gyakobo/local-language-model

This project is meant to generate a Local Language Model based on textual input.

ai anaconda bigram-model language-model machine-learning python3 pytorch tensorflow transformers-models virtual-environment

Last synced: 19 Apr 2026

https://github.com/codewithdark-git/titans-transformer

This repository contains an experimental implementation of the Titans Transformer architecture for sequence modeling tasks. The code is a personal exploration and may include errors or inefficiencies as I am currently in the learning stage. It is inspired by the ideas presented in the original

deep-learning deep-neural-networks inference llm ml neural-networks new nn paper python research-paper test titans transformer transformers-models

Last synced: 11 Apr 2026

https://github.com/ArtZaragozaGitHub/NLP--P6_Sentiment_Analysis_and_Summarization_of_Stock_News

An NLP-driven sentiment analysis system that automatically processes and analyzes news articles to gauge market sentiment and summarizes the news at a weekly level, with the aim of enhancing the accuracy of stock price predictions and optimizing investment strategies.

classifier-training confusion-matrix decisiontreeclassifier eda glove-embeddings gridsearchcv keyedvectors llama mistral-7b myplot nlp-keywords-extraction numpy-library pandas-library prompt-engineering sentiment-analysis sklearn-library text-processing text-summarization transformers-models word2vec

Last synced: 27 Oct 2025

https://github.com/blacksujit/deep-learning-specialization-repo

This repo contains my neural network learnings with TensorFlow, covering the high-level deep learning concepts I am studying, along with project implementations

deep deep-layers deep-learning deep-neural-networks embeddings-word2vec llvm network-embeddings neural-network transformers-layers transformers-models vision-language-model

Last synced: 26 Apr 2026

https://github.com/shane-reaume/llm-finetuning-sentiment-analysis

A beginner-friendly project for fine-tuning, testing, and deploying language models for sentiment analysis with a strong emphasis on quality assurance and testing methodologies.

distilbert functional-testing huggingface imdb-dataset llm-evaluation memory-test metrics-gathering performance-testing python3 sentiment-analysis testing trainings transformers-models unit-testing weights-and-biases wsl-ubuntu

Last synced: 14 Feb 2026

https://github.com/miozilla/transformer

transformer :robot::dna::hugs: : HuggingFace # Transformer # AWS Sagemaker Studio Lab

aws bert huggingface-transformers model multiple-sequence pipeline pytorch sagemaker-studio-lab tokenizer transformers-models

Last synced: 20 Apr 2026

https://github.com/musty-ess/masked-language-model-using-bert

This project implements a Masked Language Model using BERT, a transformer-based model developed by Google, to predict masked words in text sequences.

ai artificial-intelligence bert bert-model language-model masked-language-models masked-word-prediction natural-language-processing nlp python tensorflow transformers transformers-models visualization

Last synced: 14 May 2026

https://github.com/laniw/censoredcrawledconversation

Text-To-Text Textbots to Demonstrate Output Differences Between Models Trained on Filtered/Unfiltered Datasets for HSS4 - The Modern Context: Select Figures and Topics

c4 flan-t5 google-colab-notebook python t5 t5-small transformers-models

Last synced: 27 Apr 2026

https://github.com/abhay-kanwasi/sentiment-dashboard

Comprehensive solution for sentiment analysis, combining a FastAPI backend with a React frontend. The application allows users to upload CSV files containing product reviews, analyze the sentiment using a pre-trained DistilBERT model, and visualize the results through an interactive dashboard

fastapi nlp sentimentanalyzer transformers-models

Last synced: 27 Apr 2026

https://github.com/morka17/ml-projects

Basic lessons and projects for building an LLM and AI assistance

gpt-2 language-model llm-training python pytorch pytorch-rnn pytorch-tutorial transformers-models

Last synced: 02 May 2026

https://github.com/felix221123/promptcart--fastapi-architecture

This project folder includes the architectural design of how PromptCart implements product recommendation using a vector database (semantic search) and an AI-powered personalised chatbot for questions and answers using different APIs and LLMs

fastapi google-search-api openai transformers-models vector-database

Last synced: 07 May 2026

https://github.com/kriss024/transformers

Python program to translate Polish text into English using a transformer model from Hugging Face.

huggingface huggingface-transformers python3 transformers transformers-models

Last synced: 18 May 2026

https://github.com/codealchemyml/translaite

A translator that translates text of any language to English.

generative-ai huggingface-transformers transformers-models

Last synced: 28 Jul 2025

https://github.com/Blacksujit/Deep-Learning-Specialization-Repo

This repo contains my neural network learnings with TensorFlow, covering the high-level deep learning concepts I am studying, along with project implementations

deep deep-layers deep-learning deep-neural-networks embeddings-word2vec llvm network-embeddings neural-network transformers-layers transformers-models vision-language-model

Last synced: 08 May 2025

https://github.com/prakashy003/mindscope-ai

An NLP-powered system for detecting and classifying mental health conditions from text using machine learning and transformer models

fine-tuning llms mental-health nlp prompt-engineering transformers-models

Last synced: 06 May 2026

https://github.com/aitor-alvarez/podcasts-speech

Podcast extraction and summarization

nlp-summarization summarization transformers-models

Last synced: 20 Mar 2025

https://github.com/jgurakuqi/ai-powered-web-library

The goal here is to develop a simple mock web app book library for demonstrating a possible use of AI for aiding users

ai-powered-recommendations bert-fine-tuning bert-model express-js machine-learning mysql node-js pytorch transformers-models website xampp-server zero-shot-classification

Last synced: 12 Apr 2026

https://github.com/aniket2021448/fakenewspredictionapp

The web app uses logistic regression on a dataset of 20,000 news articles, achieving 96% accuracy. It employs NLTK for text preprocessing and TF-IDF for feature extraction.

huggingface ml news nltk-python numpy pandas-python streamlit-webapp transformers-models

Last synced: 23 Feb 2025

https://github.com/kazooki123/spaceai-models

SpaceAI models for text-to-text generation, image generation, image classification and voice cloning. Uses: Huggingface models, PyTorch and Python.

deep-learning huggingface machine-learning neural-networks python pytorch transformers-models

Last synced: 09 May 2026

https://github.com/jennynzhuang/llm_scaling_laws

Presentation on Scaling Laws for Neural Language Models

machine-learning neural-language-model scaling-laws transformers-models

Last synced: 26 Mar 2025

https://github.com/codewithdark-git/transformers

The Transformers repository provides a comprehensive implementation of the Transformer architecture, a groundbreaking model that has revolutionized both Natural Language Processing (NLP) and Computer Vision tasks. Introduced in the seminal paper "Attention is All You Need" by Vaswani et al.

deep-learning machine-learning-algorithms nlp nlp-machine-learning nn python self-attention transformer transformer-architecture transformers-models vision vision-transformer

Last synced: 19 Apr 2026

https://github.com/abhi227070/image-to-text-gradio

A web-based application that generates descriptive captions for uploaded images using Hugging Face’s "Salesforce/blip-image-captioning-large" model. Built with Gradio and deployed on Hugging Face Spaces, the app provides a simple interface for transforming images into meaningful text descriptions.

deep deep-learning deeplearning-ai gradio-interface gradio-python-llm python transformers transformers-library transformers-models

Last synced: 09 Oct 2025