An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with language-models

A curated list of projects in awesome lists tagged with language-models .

https://github.com/argosopentech/argos-translate

Open-source offline translation library written in Python

language-models linux machine-translation nlp open-source python transformers translation

Last synced: 14 May 2025

https://github.com/facebookresearch/large_concept_model

Large Concept Models: Language modeling in a sentence representation space

language-models nlp pytorch seq2seq sequence-to-sequence

Last synced: 14 May 2025

https://github.com/jalammar/ecco

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).

explorables language-models natural-language-processing nlp pytorch visualization

Last synced: 10 Apr 2025

https://github.com/deepset-ai/farm

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 11 Apr 2025

https://github.com/deepset-ai/FARM

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 03 Apr 2025

https://github.com/curiousily/get-things-done-with-prompt-engineering-and-langchain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python

Last synced: 08 Apr 2025

https://github.com/curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python

Last synced: 12 Mar 2025

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 16 May 2025

https://github.com/princeton-nlp/lm-bff

[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723

few-shot-learning language-models lm-bff

Last synced: 09 Oct 2025

https://github.com/princeton-nlp/LM-BFF

[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723

few-shot-learning language-models lm-bff

Last synced: 09 Apr 2025

https://github.com/hazyresearch/hyena-dna

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

foundation-models genomics language-models

Last synced: 13 Apr 2025

https://github.com/cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

analysis deep-learning language-model language-models machine-learning nlp transformers

Last synced: 17 Jul 2025

https://github.com/cedrickchee/chatgpt-universe

ChatGPT Universe is fleeting notes on ChatGPT, GPT, and large language models (LLMs)

chatgpt generative-model gpt language-models resource-list

Last synced: 09 Apr 2025

https://github.com/neurocult/agency

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.

agents ai artificial-general-intelligence artificial-intelligence artificial-neural-networks autonomous-agents chatgpt generative-ai go golang gpt language-models llm llmops machine-learning neural-network nlp openai rag vector-database

Last synced: 14 Jan 2026

https://github.com/petals-infra/chat.petals.dev

💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client

api bloom chatbot distributed-systems gpt guanaco language-models large-language-models llama llama2 transformer volunteer-computing

Last synced: 05 Apr 2025

https://github.com/extreme-bert/extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.

bert deep-learning language-model language-models machine-learning natural-language-processing nlp python pytorch transformer

Last synced: 09 May 2025

https://github.com/agencyenterprise/promptinject

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022

adversarial-attacks agi agi-alignment ai-alignment ai-safety chain-of-thought gpt-3 language-models large-language-models machine-learning ml-safety prompt-engineering

Last synced: 05 Apr 2025

https://github.com/agencyenterprise/PromptInject

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022

adversarial-attacks agi agi-alignment ai-alignment ai-safety chain-of-thought gpt-3 language-models large-language-models machine-learning ml-safety prompt-engineering

Last synced: 28 Mar 2025

https://github.com/neulab/knn-transformers

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

huggingface knn knn-lm knn-mt knn-transformers knnlm knnmt language language-models machine models nearest nearest-neighbor neighbor neuro-symbolic pytorch retomaton transformers translation

Last synced: 03 Apr 2025

https://github.com/bhattbhavesh91/voice-assistant-whisper-chatgpt

This repository will guide you to create your own Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.

chatgpt chatgpt-api google-assistant gpt-3 gradio huggingface language-model language-models openapi virtual-assistant voice-assistant whisper

Last synced: 09 Apr 2025

https://github.com/epfl-dlab/aiflows

🤖🌊 aiFlows: The building blocks of your collaborative AI

agent agents ai ai-framework ai-frameworks chatgpt copilot gpt language-model language-models llm llms open-source oss python

Last synced: 17 Mar 2026

https://github.com/sea-snell/jaxseq

Train very large language models in Jax.

deep-learning flax gpt2 gpt3 huggingface jax language-models opt

Last synced: 07 May 2025

https://github.com/tomekkorbak/pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

ai-alignment ai-safety decision-transformers gpt language-models pretraining reinforcement-learning rlhf

Last synced: 07 May 2025

https://github.com/nayjest/lm-proxy

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton

Last synced: 18 Jun 2026

https://github.com/zjunlp/DART

[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

dart few-shot-learning iclr iclr2022 language-models pre-trained-language-models prompt prompt-learning prompt-tuning pytorch

Last synced: 21 Jun 2025

https://github.com/flairnlp/transformer-ranker

Efficiently find the best-suited language model (LM) for your NLP task

language-models transferability transferability-estimation

Last synced: 04 Apr 2025

https://github.com/Nayjest/lm-proxy

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton

Last synced: 09 Jun 2026

https://github.com/zjunlp/dart

Code for the ICLR2022 paper "Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners"

dart few-shot-learning iclr iclr2022 language-models pre-trained-language-models prompt prompt-learning prompt-tuning pytorch

Last synced: 13 Jun 2025

https://github.com/dmitryryumin/emnlp-2023-papers

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!

bert computational-linguistics emnlp emnlp2023 gpt language-models llms machine-learning machine-translation multilingual-nlp named-entity-recognition natural-language-processing ner nlp nlp-applications sentiment-analysis syntax-and-semantics text-mining transformers word-embeddings

Last synced: 12 Apr 2025

https://github.com/quanta-quest/quanta-quest

AI-powered universal search for all your personal data, tailored just for you. Goal:The world's first product with "edge-side LLMs + consumer data localization" as its core development direction.

agent ai anthropic bert chatgpt claude edge-computing gpt huggingface knowledgebase language-models llm nextjs nlp personal-ass rag semantic-vector-search transformers universal-search workflow

Last synced: 07 Apr 2025

https://github.com/loodos/turkish-language-models

Transformer based Turkish language models

language-models natural-language-processing nlp turkish

Last synced: 26 Feb 2026

https://github.com/Loodos/turkish-language-models

Transformer based Turkish language models

language-models natural-language-processing nlp turkish

Last synced: 03 May 2025

https://github.com/pbloem/language-models

Keras implementations of three language models: character-level RNN, word-level RNN and Sentence VAE (Bowman, Vilnis et al 2016).

bowman keras language-models rnn-language-model vae

Last synced: 10 Apr 2025

https://github.com/christian-doucette/tolkein_text

Neural Network Language Model that generates text based off Lord of the Rings. Built with Pytorch.

language-models lord-of-the-rings machine-learning nlp pytorch

Last synced: 26 Sep 2025

https://github.com/bhattbhavesh91/diffusion-chatgpt

This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.

chatgpt chatgpt-api google-assistant gpt-3 gradio gradio-interface language-model language-models openai stable-diffusion stable-diffusion-diffusers stable-diffusion-v2 whisper

Last synced: 17 Apr 2025

https://github.com/kdunee/intentguard

A Python library for verifying code properties using natural language assertions.

ai-testing code-quality code-verification language-models llm natural-language pytest test-automation testing unittest

Last synced: 13 Dec 2025

https://github.com/alexandra-chron/ntua-slp-wassa-iest2018

Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.

deep-learning deep-neural-networks emotion-analysis language-models lstm python pytorch sentiment-analysis transfer-learning twitter

Last synced: 06 Apr 2025

https://github.com/alan-turing-institute/robots-in-disguise

Information and materials for the Turing's "robots-in-disguise" reading group on fundamental AI research.

deep-learning diffusion-models foundation-model hut23 language-models large-language-models machine-learning nlp transformers

Last synced: 21 Aug 2025

https://github.com/cmungall/semantic-llama

A knowledge extraction tool that uses a large language model to extract semantic information from text

ai knowledge-extraction language-models linkml oaklib obofoundry

Last synced: 05 May 2025

https://github.com/yueyuel/reliablelm4code

Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; LLM4Code

code-generation code-intelligence language-models llm4code lm4se reliability software-

Last synced: 31 Jan 2026

https://github.com/lucidrains/nim-tokenizer

Implementation of a simple BPE tokenizer, but in Nim

artificial-intelligence deep-learning language-models nim tokenizer

Last synced: 09 Apr 2025

https://github.com/ai4sd/number-token-loss

PyPI package for number token loss.

language-models llm llm-training reasoning

Last synced: 08 Apr 2026

https://github.com/jonsafari/lt1

Course on Language Technologies and NLP

course graduate-course language-models language-technology neural-networks

Last synced: 11 Oct 2025

https://github.com/haozhg/lmd

Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

bert deep-learning language-models multilingual-bert natural-language-processing nlp pretrained-models python pytorch roberta transformers xlm-roberta

Last synced: 17 Jan 2026

https://github.com/nicolay-r/rusentrel-leaderboard

This is an official Leaderboard for the RuSentRel-1.1 dataset originally described in paper (arxiv:1808.08932)

attention attention-mechanism benchmark bert-model bilstm chatgpt classifiers cnn language-models leaderboard low-resource-nlp neural-networks relation-extraction sentiment-analysis

Last synced: 11 Aug 2025

https://github.com/butlerlabs/docai

DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications

computer-vision information-extraction information-retrieval language-models machine-learning machine-learning-library natural-language-processing nlp nlp-library ocr ocr-python pretrained-models python

Last synced: 01 Mar 2026

https://github.com/vgherard/kgrams

k-grams, Language Models, and All That

language-models n-grams natural-language-processing

Last synced: 30 Apr 2025

https://github.com/colthreepv/llm-context

A CLI tool that helps you generate context files for Large Language Models (LLMs).

ai cli context-builder language-models llm nlp prompt-engineering

Last synced: 16 Feb 2026

https://github.com/linhaowei1/molretrieval

This repo implements many methods to retrieve molecules that are similar to a target molecule from a large molecule corpus.

ai4science biology computational-biology language-models molecule rag retreival retrieval-augmented-generation

Last synced: 23 Oct 2025

https://github.com/uziellujan/dl-textgen-textclass

Deep learning project on practical implementation of text generation and text classification pipelines with PyTorch and Hugging Face using RNNs, LSTMs, GRUs, and Transformers.

deep-learning gru huggingface language-models lstm nlp-machine-learning pytorch rnn text-classification text-generation transformers

Last synced: 08 Oct 2025

https://github.com/slava-vishnyakov/rag_engine

Python package for implementing Retrieval-Augmented Generation (RAG) using OpenAI's embeddings and a SQLite database with vector search capabilities

ai chatbot embeddings information-retrieval language-models machine-learning natural-language-processing nlp openai python rag retrieval-augmented-generation semantic-search sqlite vector-search

Last synced: 27 Dec 2025

https://github.com/abdouaziz/autocomplet

N-grams to build an autocomplet

angular flask frontend language-models projet

Last synced: 13 May 2026

https://github.com/medoidai/givebackgpt

An early version of a system that credits creators based on the similarity of their content to an LLM response. Giving back to creators is the only way for fair, sustainable AI economies that lead to true growth.

ai-ethics bootstrap chatbot css embeddings generative-ai html intellectual-property javascript language-models open-source responsive-web-design sustainable-ai web-search

Last synced: 05 May 2025

https://github.com/centre-for-humanities-computing/danish-ner-bias

Investigating bias in Danish language models in Named Entity Recognition (NER). Code from the paper titled "Detecting intersectionality in NER models: A data-driven approach."

language-models named-entity-recognition nlp

Last synced: 13 Jul 2025

https://github.com/torrinworx/bitorch

A plan for building a distributed system to run AI models BitTorrent style with a secure compensation mechanism.

distributed-systems language-models pytorch

Last synced: 17 May 2026

https://github.com/joel-beck/readnext

Hybrid Recommender System for Computer Science Papers | Master's Thesis Project 2023

citation-analysis hybrid-recommender-system language-models python recommender-system

Last synced: 29 Jun 2026

https://github.com/quanta-quest/quanta-quest-app

AI-powered universal search for all your personal data, tailored just for you. Goal:The world's first product with "edge-side LLMs + consumer data localization" as its core development direction.

agent ai bert claude edge-computing gpt huggingface knowledgebase language-models nextjs nlp rag transformers wails workflow

Last synced: 04 Feb 2026

https://github.com/spongeengine/lmsharp

A unified .NET client library for running LLMs (Large Language Models) locally. LocalAI.NET provides a single, consistent API for interacting with popular local LLM providers like KoboldCpp, Ollama, LM Studio, and Text Generation WebUI.

ai ai-client csharp dotnet koboldcpp language-models llm llm-client lm-studio local-llm offline-ai ollama openai-compatible-api self-hosted-ai text-generation-webui

Last synced: 07 Sep 2025

https://github.com/leomsgit/personal-lib---ai-ml-nlp-cv

Collection of Notes, Guides, and Examples for Artificial Intelligence, Machine Learning, Natural Language Processing and Computer Vision

ai attention-mechanism deep-learning huggingface huggingface-transformers language-models machine-learning nlp python tokenization transformer transformers

Last synced: 04 Feb 2026

https://github.com/lukexyz/language-models

:earth_africa::book::speech_balloon: Sentiment analysis and text generation using BERT and ULMFiT (2018)

bert language-models transformer ulm-fit

Last synced: 07 Apr 2025

https://github.com/temilaj/nlp-coronavirus-wiki-twitter-perplexity

Natural language processing project to visualize word choice patterns from coronavirus (and related) articles, and compute the average perplexity scores of language models generated from these articles when used with tweets about the subject matter

coronavirus covid-19 language-models n-grams natural-language-processing nlp perplexity-scores

Last synced: 02 Sep 2025

https://github.com/tomekkorbak/kl-gpt3

A modular library for evaluating KL between a Huggingface Transformers models and GPT3

gpt3 language-models

Last synced: 04 Apr 2025

https://github.com/infinitode/duplipy

DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.

ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting

Last synced: 28 Jun 2026