An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with language-models

A curated list of projects in awesome lists tagged with language-models .

https://github.com/argosopentech/argos-translate

Open-source offline translation library written in Python

language-models linux machine-translation nlp open-source python transformers translation

Last synced: 14 May 2025

https://github.com/facebookresearch/large_concept_model

Large Concept Models: Language modeling in a sentence representation space

language-models nlp pytorch seq2seq sequence-to-sequence

Last synced: 14 May 2025

https://github.com/jalammar/ecco

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).

explorables language-models natural-language-processing nlp pytorch visualization

Last synced: 10 Apr 2025

https://github.com/deepset-ai/farm

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 11 Apr 2025

https://github.com/deepset-ai/FARM

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 03 Apr 2025

https://github.com/atfortes/llm-reasoning-papers

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

awesome chain-of-thought chatgpt cot gpt gpt-4 in-context-learning language-models mllm multimodal papers prompt prompt-engineering question-answering reasoning vllm

Last synced: 06 Feb 2025

https://github.com/curiousily/get-things-done-with-prompt-engineering-and-langchain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python

Last synced: 08 Apr 2025

https://github.com/curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python

Last synced: 12 Mar 2025

https://github.com/declare-lab/tango

A family of diffusion models for text-to-audio generation.

audio-generation diffusion diffusion-models language-models large-language-models text-to-audio

Last synced: 16 May 2025

https://github.com/princeton-nlp/lm-bff

[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723

few-shot-learning language-models lm-bff

Last synced: 04 Apr 2025

https://github.com/princeton-nlp/LM-BFF

[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723

few-shot-learning language-models lm-bff

Last synced: 09 Apr 2025

https://github.com/hazyresearch/hyena-dna

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

foundation-models genomics language-models

Last synced: 13 Apr 2025

https://github.com/cedrickchee/chatgpt-universe

ChatGPT Universe is fleeting notes on ChatGPT, GPT, and large language models (LLMs)

chatgpt generative-model gpt language-models resource-list

Last synced: 09 Apr 2025

https://github.com/neurocult/agency

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.

agents ai artificial-general-intelligence artificial-intelligence artificial-neural-networks autonomous-agents chatgpt generative-ai go golang gpt language-models llm llmops machine-learning neural-network nlp openai rag vector-database

Last synced: 09 Apr 2025

https://github.com/cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

analysis deep-learning language-model language-models machine-learning nlp transformers

Last synced: 24 Nov 2024

https://github.com/petals-infra/chat.petals.dev

💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client

api bloom chatbot distributed-systems gpt guanaco language-models large-language-models llama llama2 transformer volunteer-computing

Last synced: 05 Apr 2025

https://github.com/extreme-bert/extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.

bert deep-learning language-model language-models machine-learning natural-language-processing nlp python pytorch transformer

Last synced: 09 May 2025

https://github.com/agencyenterprise/PromptInject

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022

adversarial-attacks agi agi-alignment ai-alignment ai-safety chain-of-thought gpt-3 language-models large-language-models machine-learning ml-safety prompt-engineering

Last synced: 28 Mar 2025

https://github.com/agencyenterprise/promptinject

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022

adversarial-attacks agi agi-alignment ai-alignment ai-safety chain-of-thought gpt-3 language-models large-language-models machine-learning ml-safety prompt-engineering

Last synced: 05 Apr 2025

https://github.com/neulab/knn-transformers

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

huggingface knn knn-lm knn-mt knn-transformers knnlm knnmt language language-models machine models nearest nearest-neighbor neighbor neuro-symbolic pytorch retomaton transformers translation

Last synced: 03 Apr 2025

https://github.com/bhattbhavesh91/voice-assistant-whisper-chatgpt

This repository will guide you to create your own Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.

chatgpt chatgpt-api google-assistant gpt-3 gradio huggingface language-model language-models openapi virtual-assistant voice-assistant whisper

Last synced: 09 Apr 2025

https://github.com/epfl-dlab/aiflows

🤖🌊 aiFlows: The building blocks of your collaborative AI

agent agents ai ai-framework ai-frameworks chatgpt copilot gpt language-model language-models llm llms open-source oss python

Last synced: 06 Apr 2025

https://github.com/sea-snell/jaxseq

Train very large language models in Jax.

deep-learning flax gpt2 gpt3 huggingface jax language-models opt

Last synced: 07 May 2025

https://github.com/tomekkorbak/pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

ai-alignment ai-safety decision-transformers gpt language-models pretraining reinforcement-learning rlhf

Last synced: 07 May 2025

https://github.com/flairnlp/transformer-ranker

Efficiently find the best-suited language model (LM) for your NLP task

language-models transferability transferability-estimation

Last synced: 04 Apr 2025

https://github.com/dmitryryumin/emnlp-2023-papers

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!

bert computational-linguistics emnlp emnlp2023 gpt language-models llms machine-learning machine-translation multilingual-nlp named-entity-recognition natural-language-processing ner nlp nlp-applications sentiment-analysis syntax-and-semantics text-mining transformers word-embeddings

Last synced: 12 Apr 2025

https://github.com/quanta-quest/quanta-quest

AI-powered universal search for all your personal data, tailored just for you. Goal:The world's first product with "edge-side LLMs + consumer data localization" as its core development direction.

agent ai anthropic bert chatgpt claude edge-computing gpt huggingface knowledgebase language-models llm nextjs nlp personal-ass rag semantic-vector-search transformers universal-search workflow

Last synced: 07 Apr 2025

https://github.com/Loodos/turkish-language-models

Transformer based Turkish language models

language-models natural-language-processing nlp turkish

Last synced: 03 May 2025

https://github.com/pbloem/language-models

Keras implementations of three language models: character-level RNN, word-level RNN and Sentence VAE (Bowman, Vilnis et al 2016).

bowman keras language-models rnn-language-model vae

Last synced: 10 Apr 2025

https://github.com/christian-doucette/tolkein_text

Neural Network Language Model that generates text based off Lord of the Rings. Built with Pytorch.

language-models lord-of-the-rings machine-learning nlp pytorch

Last synced: 17 Jan 2025

https://github.com/bhattbhavesh91/diffusion-chatgpt

This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.

chatgpt chatgpt-api google-assistant gpt-3 gradio gradio-interface language-model language-models openai stable-diffusion stable-diffusion-diffusers stable-diffusion-v2 whisper

Last synced: 17 Apr 2025

https://github.com/alan-turing-institute/robots-in-disguise

Information and materials for the Turing's "robots-in-disguise" reading group on fundamental AI research.

deep-learning diffusion-models foundation-model hut23 language-models large-language-models machine-learning nlp transformers

Last synced: 19 Dec 2024

https://github.com/alexandra-chron/ntua-slp-wassa-iest2018

Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.

deep-learning deep-neural-networks emotion-analysis language-models lstm python pytorch sentiment-analysis transfer-learning twitter

Last synced: 06 Apr 2025

https://github.com/cmungall/semantic-llama

A knowledge extraction tool that uses a large language model to extract semantic information from text

ai knowledge-extraction language-models linkml oaklib obofoundry

Last synced: 05 May 2025

https://github.com/yueyuel/reliablelm4code

Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; LLM4Code

code-generation code-intelligence language-models llm4code lm4se reliability software-

Last synced: 27 Feb 2025

https://github.com/lucidrains/nim-tokenizer

Implementation of a simple BPE tokenizer, but in Nim

artificial-intelligence deep-learning language-models nim tokenizer

Last synced: 09 Apr 2025

https://github.com/jonsafari/lt1

Course on Language Technologies and NLP

course graduate-course language-models language-technology neural-networks

Last synced: 18 Feb 2025

https://github.com/nicolay-r/rusentrel-leaderboard

This is an official Leaderboard for the RuSentRel-1.1 dataset originally described in paper (arxiv:1808.08932)

attention attention-mechanism benchmark bert-model bilstm chatgpt classifiers cnn language-models leaderboard low-resource-nlp neural-networks relation-extraction sentiment-analysis

Last synced: 19 Dec 2024

https://github.com/vgherard/kgrams

k-grams, Language Models, and All That

language-models n-grams natural-language-processing

Last synced: 30 Apr 2025

https://github.com/kdunee/intentguard

A Python library for verifying code properties using natural language assertions.

ai-testing code-quality code-verification language-models llm natural-language pytest test-automation testing unittest

Last synced: 24 Mar 2025

https://github.com/linhaowei1/molretrieval

This repo implements many methods to retrieve molecules that are similar to a target molecule from a large molecule corpus.

ai4science biology computational-biology language-models molecule rag retreival retrieval-augmented-generation

Last synced: 08 Feb 2025

https://github.com/abdouaziz/autocomplet

N-grams to build an autocomplet

angular flask frontend language-models projet

Last synced: 24 Mar 2025

https://github.com/spongeengine/llmsharp

Unified .NET client for interacting with popular local LLM providers like KoboldCpp, Ollama, LM Studio, and Oobabooga.

ai ai-client csharp dotnet koboldcpp language-models llm llm-client lm-studio local-llm offline-ai ollama openai-compatible-api self-hosted-ai text-generation-webui

Last synced: 24 Jan 2025

https://github.com/quanta-quest/quanta-quest-app

AI-powered universal search for all your personal data, tailored just for you. Goal:The world's first product with "edge-side LLMs + consumer data localization" as its core development direction.

agent ai bert claude edge-computing gpt huggingface knowledgebase language-models nextjs nlp rag transformers wails workflow

Last synced: 06 May 2025

https://github.com/medoidai/givebackgpt

An early version of a system that credits creators based on the similarity of their content to an LLM response. Giving back to creators is the only way for fair, sustainable AI economies that lead to true growth.

ai-ethics bootstrap chatbot css embeddings generative-ai html intellectual-property javascript language-models open-source responsive-web-design sustainable-ai web-search

Last synced: 05 May 2025

https://github.com/torrinworx/bitorch

A plan for building a distributed system to run AI models BitTorrent style with a secure compensation mechanism.

distributed-systems language-models pytorch

Last synced: 30 Mar 2025

https://github.com/joel-beck/readnext

Hybrid Recommender System for Computer Science Papers | Master's Thesis Project 2023

citation-analysis hybrid-recommender-system language-models python recommender-system

Last synced: 09 Apr 2025

https://github.com/centre-for-humanities-computing/danish-ner-bias

Investigating bias in Danish language models in Named Entity Recognition (NER). Code from the paper titled "Detecting intersectionality in NER models: A data-driven approach."

language-models named-entity-recognition nlp

Last synced: 22 Feb 2025

https://github.com/tomekkorbak/kl-gpt3

A modular library for evaluating KL between a Huggingface Transformers models and GPT3

gpt3 language-models

Last synced: 04 Apr 2025

https://github.com/lukexyz/language-models

:earth_africa::book::speech_balloon: Sentiment analysis and text generation using BERT and ULMFiT (2018)

bert language-models transformer ulm-fit

Last synced: 07 Apr 2025

https://github.com/yash-kavaiya/30-days-llm-mastery-course

30-Days-LLM-Mastery-Course: A comprehensive, hands-on course diving deep into Large Language Models (LLMs). From foundational concepts to advanced techniques, learn to build, train, and deploy state-of-the-art language models.

attention-mechanism fine-tuning language-models llm model-deployment nlp pytorch transformers

Last synced: 21 Apr 2025

https://github.com/divanvisagie/ratatoskr-prototype

Experiments with ChatGPT, Notion and telegram

ai chatgpt language-models llm

Last synced: 31 Mar 2025

https://github.com/temilaj/nlp-coronavirus-wiki-twitter-perplexity

Natural language processing project to visualize word choice patterns from coronavirus (and related) articles, and compute the average perplexity scores of language models generated from these articles when used with tweets about the subject matter

coronavirus covid-19 language-models n-grams natural-language-processing nlp perplexity-scores

Last synced: 01 Mar 2025

https://github.com/spongeengine/lmsharp

A unified .NET client library for running LLMs (Large Language Models) locally. LocalAI.NET provides a single, consistent API for interacting with popular local LLM providers like KoboldCpp, Ollama, LM Studio, and Text Generation WebUI.

ai ai-client csharp dotnet koboldcpp language-models llm llm-client lm-studio local-llm offline-ai ollama openai-compatible-api self-hosted-ai text-generation-webui

Last synced: 02 Jan 2025

https://github.com/SpongeEngine/LMSharp

A unified .NET client library for running LLMs (Large Language Models) locally. LocalAI.NET provides a single, consistent API for interacting with popular local LLM providers like KoboldCpp, Ollama, LM Studio, and Text Generation WebUI.

ai ai-client csharp dotnet koboldcpp language-models llm llm-client lm-studio local-llm offline-ai ollama openai-compatible-api self-hosted-ai text-generation-webui

Last synced: 12 Jan 2025

https://github.com/infinitode/duplipy

DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.

ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting

Last synced: 21 Feb 2025

https://github.com/oelin/implicit-language-models

Elucidation of implicit language models from common data compression algorithms.

beam-search data-compression data-science generative-models language-models machine-learning

Last synced: 12 Mar 2025

https://github.com/raul23/simple-transformer-tts

This project offers a deeper exploration of tttzof351's "Simple Transformer TTS" codebase, enhanced with insights from Gemini, Google AI's advanced language model.

educational language-models pytorch text-to-speech transformer-models

Last synced: 03 Mar 2025

https://github.com/SpongeEngine/OobaboogaSharp

C# client for interacting with Oobabooga's text-generation-webui through its OpenAI-compatible API endpoints.

ai ai-client csharp dotnet language-models llm llm-client local-llm local-llm-integration local-llms offline-ai oobabooga openai-compatible-api self-hosted-ai text-generation-webui

Last synced: 12 Jan 2025

https://github.com/sharp119/deepseek_report

🧠 Research repository exploring DeepSeek AI's model evolution and architectures (2023-2025). Analyzes language, code, math, and vision models using HuggingFace collections. 📚 A personal learning journey into understanding these advanced AI systems.

code-generation deepseek-ai deepseek-models language-models math-models model-evolution model-research moe-architecture technical-analysis vision-language

Last synced: 01 Mar 2025

https://github.com/bjornmelin/nlp-engineering-hub

📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤

cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers

Last synced: 20 Mar 2025

https://github.com/terilios/automated_data_scientist

Automated Data Scientist: An intelligent, adaptive data analysis tool that leverages AI-driven automation to dynamically plan, execute, and refine data science workflows. Automatically handles data preparation, analysis planning, code generation, and result interpretation using advanced language models.

adaptive-analytics ai-driven-analytics ai-powered-data-tools api-integration automated-data-science automation data-insights data-preparation data-science-workflow data-visualization dynamic-analysis-planning exploratory-data-analysis intelligent-data-processing language-models machine-learning ml-ops openai-gpt python scalable-data-analysis

Last synced: 27 Feb 2025

https://github.com/colthreepv/llm-context

A CLI tool that helps you generate context files for Large Language Models (LLMs).

ai cli context-builder language-models llm nlp prompt-engineering

Last synced: 17 Jan 2025

https://github.com/tural00a1568/llm-chat-indexer

The LLM Chat Indexer is a clever tool designed to transform chaotic chat files into organized, searchable insights—ideal for anyone overwhelmed by digital conversations.

aibot gpt4 huggingface-embeddings langchain language-models large-language-models llm-generated-text open-source openai openai-api python self service tiktoken

Last synced: 28 Mar 2025