Projects in Awesome Lists tagged with language-models
A curated list of projects in awesome lists tagged with language-models.
https://github.com/bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
bloom chatbot deep-learning distributed-systems falcon gpt guanaco language-models large-language-models llama machine-learning mixtral neural-networks nlp pipeline-parallelism pretrained-models pytorch tensor-parallelism transformer volunteer-computing
Last synced: 13 May 2025
https://github.com/argosopentech/argos-translate
Open-source offline translation library written in Python
language-models linux machine-translation nlp open-source python transformers translation
Last synced: 14 May 2025
https://github.com/facebookresearch/large_concept_model
Large Concept Models: Language modeling in a sentence representation space
language-models nlp pytorch seq2seq sequence-to-sequence
Last synced: 14 May 2025
https://github.com/jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).
explorables language-models natural-language-processing nlp pytorch visualization
Last synced: 10 Apr 2025
https://github.com/deepset-ai/farm
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch
Last synced: 11 Apr 2025
https://github.com/deepset-ai/FARM
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch
Last synced: 03 Apr 2025
https://github.com/curiousily/get-things-done-with-prompt-engineering-and-langchain
LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files and tweet sentiment analysis.
artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python
Last synced: 08 Apr 2025
https://github.com/curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain
LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files and tweet sentiment analysis.
artificial-intelligence chatgpt deep-learning gpt-4 gpt4 langchain language-models large-language-models llama2 openai prompt-engineering python
Last synced: 12 Mar 2025
https://github.com/declare-lab/tango
A family of diffusion models for text-to-audio generation.
audio-generation diffusion diffusion-models language-models large-language-models text-to-audio
Last synced: 16 May 2025
https://github.com/zjunlp/prompt4reasoningpapers
[ACL 2023] Reasoning with Language Model Prompting: A Survey
arithmetic-reasoning artificial-intelligence awsome-list chain-of-thought chatgpt commonsense-reasoning datasets gpt-3 language-models large-language-models llm logical-reasoning natural-language-processing nlp paper-list prompt prompt-engineering reasoning survey symbolic-reasoning
Last synced: 01 Feb 2026
https://github.com/kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
c-transformers chatgpt cpu cpu-inference deep-learning document-qa faiss langchain language-models large-language-models llama llama-2 llm machine-learning natural-language-processing nlp open-source-llm python sentence-transformers transformers
Last synced: 09 May 2025
https://github.com/kennethleungty/llama-2-open-source-llm-cpu-inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
c-transformers chatgpt cpu cpu-inference deep-learning document-qa faiss langchain language-models large-language-models llama llama-2 llm machine-learning natural-language-processing nlp open-source-llm python sentence-transformers transformers
Last synced: 16 May 2025
https://github.com/zjunlp/Prompt4ReasoningPapers
[ACL 2023] Reasoning with Language Model Prompting: A Survey
arithmetic-reasoning artificial-intelligence awsome-list chain-of-thought chatgpt commonsense-reasoning datasets gpt-3 language-models large-language-models llm logical-reasoning natural-language-processing nlp paper-list prompt prompt-engineering reasoning survey symbolic-reasoning
Last synced: 12 Mar 2025
https://github.com/princeton-nlp/lm-bff
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
few-shot-learning language-models lm-bff
Last synced: 09 Oct 2025
https://github.com/princeton-nlp/LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
few-shot-learning language-models lm-bff
Last synced: 09 Apr 2025
https://github.com/hazyresearch/hyena-dna
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
foundation-models genomics language-models
Last synced: 13 Apr 2025
https://github.com/VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
bert bert-embeddings deep-learning fairseq language-models named-entity-recognition natural-language-inference ner nli part-of-speech-tagging phobert pos-tagging python3 rdrsegmenter roberta transformers transformers-library vietnamese vietnamese-nlp vncorenlp
Last synced: 03 Apr 2025
https://github.com/webis-de/small-text
Active Learning for Text Classification in Python
active-learning deep-learning language-models looking-for-contributors machine-learning natural-language-processing nlp python pytorch small-language-models text-classification transformers
Last synced: 14 May 2025
https://github.com/monarch-initiative/ontogpt
LLM-based ontological extraction tools, including SPIRES
ai chat-gpt data-modeling gpt-3 information-extraction language-models large-language-models linkml llm monarchinitiative named-entity-recognition ner nlp oaklib obofoundry relation-extraction
Last synced: 08 Apr 2025
https://github.com/EricFillion/happy-transformer
Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
ai artificial-intelligence bert deep-learning language-models machine-learning natural-language-processing nlp python question-answering roberta text-classification transformers
Last synced: 30 Aug 2025
https://github.com/bigscience-workshop/xmtf
Crosslingual Generalization through Multitask Finetuning
bloom bloomz instruction-tuning language-models large-language-models mt0 multilingual-nlp multitask-learning t5 zero-shot-learning
Last synced: 09 Apr 2025
https://github.com/cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference
analysis deep-learning language-model language-models machine-learning nlp transformers
Last synced: 17 Jul 2025
https://github.com/cedrickchee/chatgpt-universe
ChatGPT Universe is fleeting notes on ChatGPT, GPT, and large language models (LLMs)
chatgpt generative-model gpt language-models resource-list
Last synced: 09 Apr 2025
https://github.com/neurocult/agency
🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.
agents ai artificial-general-intelligence artificial-intelligence artificial-neural-networks autonomous-agents chatgpt generative-ai go golang gpt language-models llm llmops machine-learning neural-network nlp openai rag vector-database
Last synced: 14 Jan 2026
https://github.com/huggingface/datablations
Scaling Data-Constrained Language Models
gpt high-performance-computing language-models large-language-models llms scaling-laws
Last synced: 14 Oct 2025
https://github.com/petals-infra/chat.petals.dev
💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
api bloom chatbot distributed-systems gpt guanaco language-models large-language-models llama llama2 transformer volunteer-computing
Last synced: 05 Apr 2025
https://github.com/extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
bert deep-learning language-model language-models machine-learning natural-language-processing nlp python pytorch transformer
Last synced: 09 May 2025
https://github.com/agencyenterprise/promptinject
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
adversarial-attacks agi agi-alignment ai-alignment ai-safety chain-of-thought gpt-3 language-models large-language-models machine-learning ml-safety prompt-engineering
Last synced: 05 Apr 2025
https://github.com/agencyenterprise/PromptInject
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
adversarial-attacks agi agi-alignment ai-alignment ai-safety chain-of-thought gpt-3 language-models large-language-models machine-learning ml-safety prompt-engineering
Last synced: 28 Mar 2025
https://github.com/neulab/knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT
huggingface knn knn-lm knn-mt knn-transformers knnlm knnmt language language-models machine models nearest nearest-neighbor neighbor neuro-symbolic pytorch retomaton transformers translation
Last synced: 03 Apr 2025
https://github.com/hooshvare/parsbert
🤗 ParsBERT: Transformer-based Model for Persian Language Understanding
bert downstream-tasks language-models named-entity-recognition ner nlp nlu parsbert persian-bert persian-language persianber sentiment-analysis text-classification transformer
Last synced: 29 Jun 2026
https://github.com/bhattbhavesh91/voice-assistant-whisper-chatgpt
This repository guides you through creating your own smart virtual assistant, like Google Assistant, using OpenAI's ChatGPT and Whisper. The entire solution is built with Python and Gradio.
chatgpt chatgpt-api google-assistant gpt-3 gradio huggingface language-model language-models openapi virtual-assistant voice-assistant whisper
Last synced: 09 Apr 2025
https://github.com/picovoice/picollm
On-device LLM Inference Powered by X-Bit Quantization
compression efficient-inference gemma generative-ai language-model language-models large-language-model llama llama2 llama3 llm llm-inference llms mistral mixtral model-compression natural-language-processing quantization self-hosted
Last synced: 23 Oct 2025
https://github.com/epfl-dlab/aiflows
🤖🌊 aiFlows: The building blocks of your collaborative AI
agent agents ai ai-framework ai-frameworks chatgpt copilot gpt language-model language-models llm llms open-source oss python
Last synced: 17 Mar 2026
https://github.com/sea-snell/jaxseq
Train very large language models in Jax.
deep-learning flax gpt2 gpt3 huggingface jax language-models opt
Last synced: 07 May 2025
https://github.com/tomekkorbak/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
ai-alignment ai-safety decision-transformers gpt language-models pretraining reinforcement-learning rlhf
Last synced: 07 May 2025
https://github.com/ksm26/langchain-chat-with-your-data
Explore LangChain and build powerful chatbots that interact with your own data. Gain insights into document loading, splitting, retrieval, question answering, and more.
chat-with-your-dat chatbot-development contextual-chatbots conversational-agents conversational-ai deep-learning document-loading document-splitting embeddings information-retrieval langchain language-models llms machine-learning natural-language-processing python question-answering sentiment-analysis vector-stores
Last synced: 24 Feb 2026
https://github.com/sign-language-translator/sign-language-translator
Python library & framework to build custom translators for the hearing-impaired and translate between Sign Language & Text using Artificial Intelligence.
artificial-intelligence attention-mechanism computer-vision deep-learning encoder-decoder language-models machine-learning motion-transfer nlp pose-estimation python pytorch rule-based-nlp sign-language sign-language-recognition sign-language-translation sign-to-text text-to-sign transformers translation
Last synced: 06 Apr 2025
https://github.com/nayjest/lm-proxy
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.
ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton
Last synced: 18 Jun 2026
https://github.com/zjunlp/DART
[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
dart few-shot-learning iclr iclr2022 language-models pre-trained-language-models prompt prompt-learning prompt-tuning pytorch
Last synced: 21 Jun 2025
https://github.com/flairnlp/transformer-ranker
Efficiently find the best-suited language model (LM) for your NLP task
language-models transferability transferability-estimation
Last synced: 04 Apr 2025
https://github.com/Nayjest/lm-proxy
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.
ai anthropic api-proxy fastapi google-ai language-models llm llm-api llm-gateway llm-inference llm-proxy openai openai-api proxy proxy-server pyton
Last synced: 09 Jun 2026
https://github.com/zjunlp/dart
Code for the ICLR2022 paper "Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners"
dart few-shot-learning iclr iclr2022 language-models pre-trained-language-models prompt prompt-learning prompt-tuning pytorch
Last synced: 13 Jun 2025
https://github.com/dmitryryumin/emnlp-2023-papers
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. ⭐ Support NLP!
bert computational-linguistics emnlp emnlp2023 gpt language-models llms machine-learning machine-translation multilingual-nlp named-entity-recognition natural-language-processing ner nlp nlp-applications sentiment-analysis syntax-and-semantics text-mining transformers word-embeddings
Last synced: 12 Apr 2025
https://github.com/quanta-quest/quanta-quest
AI-powered universal search for all your personal data, tailored just for you. Goal: the world's first product with "edge-side LLMs + consumer data localization" as its core development direction.
agent ai anthropic bert chatgpt claude edge-computing gpt huggingface knowledgebase language-models llm nextjs nlp personal-ass rag semantic-vector-search transformers universal-search workflow
Last synced: 07 Apr 2025
https://github.com/bilel-bj/ROSGPT_Vision
Commanding robots using only Language Models' prompts
chatgpt language-models language-models-are-next large-language-models llm prompt-engineering prompting-robotic-modalities robotic-design-patterns robotic-vision robotics ros2 visual-language-models
Last synced: 24 Mar 2025
https://github.com/disi-unibo-nlp/nlg-metricverse
[COLING22] An End-to-End Library for Evaluating Natural Language Generation
language-models metrics natural-language-generation natural-language-processing nlg-evaluation python pytorch visualization
Last synced: 07 Apr 2026
https://github.com/loodos/turkish-language-models
Transformer-based Turkish language models
language-models natural-language-processing nlp turkish
Last synced: 26 Feb 2026
https://github.com/Loodos/turkish-language-models
Transformer-based Turkish language models
language-models natural-language-processing nlp turkish
Last synced: 03 May 2025
https://github.com/naver/disco
A Toolkit for Distributional Control of Generative Models
ai alignment distributional-policy-gradients fine-tuning generative-models human-preferences language-models machine-learning monte-carlo-sampling
Last synced: 14 Apr 2025
https://github.com/retarfi/language-pretraining
Pre-training Language Models for Japanese
bert electra implementation japanese language-model language-models natural-language-processing nlp pytorch transformer transformers
Last synced: 13 Apr 2025
https://github.com/anyks/alm
Smart Language Model
alm arpa cpp language-models tokenization tokenizer vocab-pruning
Last synced: 28 Apr 2025
https://github.com/prismworks-ai/prism-mcp-rs
Enterprise-grade Rust implementation of Anthropic's MCP protocol
agentic agents ai ai-agents anthropic api assistant claude cursor enterprise framework integration language-models llm mcp model-context-protocol plugin-system production rust sdk
Last synced: 08 Apr 2026
https://github.com/pbloem/language-models
Keras implementations of three language models: character-level RNN, word-level RNN and Sentence VAE (Bowman, Vilnis et al 2016).
bowman keras language-models rnn-language-model vae
Last synced: 10 Apr 2025
https://github.com/christian-doucette/tolkein_text
Neural network language model that generates text based on The Lord of the Rings. Built with PyTorch.
language-models lord-of-the-rings machine-learning nlp pytorch
Last synced: 26 Sep 2025
https://github.com/bhattbhavesh91/diffusion-chatgpt
This repository guides you through creating images via Stable Diffusion using a smart virtual assistant, like Google Assistant, built on OpenAI's ChatGPT and Whisper. The entire solution is built with Python and Gradio.
chatgpt chatgpt-api google-assistant gpt-3 gradio gradio-interface language-model language-models openai stable-diffusion stable-diffusion-diffusers stable-diffusion-v2 whisper
Last synced: 17 Apr 2025
https://github.com/csinva/tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
ai artificial-intelligence classification controllability decision-tree huggingface interpretability language-model language-models llm llm-inference machine-learning prompt-engineering prompting scikit-learn
Last synced: 29 Apr 2025
https://github.com/clarifai/clarifai-python
Experience the power of Clarifai’s AI platform with the Python SDK. 🌟 Star to support our work!
artifical-intelligence clarifai clarifai-python computer-vision deep-learning image-classification language-models llm machine-learning natural-language-processing object-detection pretrained-models python rag retrieval-augmented-generation text-classification text-generation text-summarization visual-search
Last synced: 11 Mar 2026
https://github.com/kdunee/intentguard
A Python library for verifying code properties using natural language assertions.
ai-testing code-quality code-verification language-models llm natural-language pytest test-automation testing unittest
Last synced: 13 Dec 2025
https://github.com/adrianbzg/llama-multimodal-vqa
Multimodal Instruction Tuning for Llama 3
chatbot chatgpt gpt-4 huggingface instruction-tuning language-models llama llama2 llama3 multimodal multimodal-instruction-tuning visual-language-learning visual-question-answering vqa
Last synced: 25 Oct 2025
https://github.com/alexandra-chron/ntua-slp-wassa-iest2018
Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.
deep-learning deep-neural-networks emotion-analysis language-models lstm python pytorch sentiment-analysis transfer-learning twitter
Last synced: 06 Apr 2025
https://github.com/alan-turing-institute/robots-in-disguise
Information and materials for the Turing's "robots-in-disguise" reading group on fundamental AI research.
deep-learning diffusion-models foundation-model hut23 language-models large-language-models machine-learning nlp transformers
Last synced: 21 Aug 2025
https://github.com/cmungall/semantic-llama
A knowledge extraction tool that uses a large language model to extract semantic information from text
ai knowledge-extraction language-models linkml oaklib obofoundry
Last synced: 05 May 2025
https://github.com/yueyuel/reliablelm4code
Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; LLM4Code
code-generation code-intelligence language-models llm4code lm4se reliability software-
Last synced: 31 Jan 2026
https://github.com/lucidrains/nim-tokenizer
Implementation of a simple BPE tokenizer, but in Nim
artificial-intelligence deep-learning language-models nim tokenizer
Last synced: 09 Apr 2025
https://github.com/ancatmara/data-science-nlp
NLP Section of the Data Science course, NRU HSE
classification clustering data-analysis data-science dimensionality-reduction embeddings fnn language-models morphological-analysis natural-language-processing nlp python regex russian-nlp syntactic-parsing topic-modelling tutorials
Last synced: 11 Jul 2025
https://github.com/clarifai/clarifai-nodejs
Experience the power of Clarifai’s AI platform with the Node.js SDK. 🌟 Star to support our work!
artificial-intelligence clarifai clarifai-api clarifai-javascript computer-vision deep-learning image-classification language-models llm machine-learning natural-language-processing nextjs nodejs object-detection pretrained-models rag text-classification text-generation text-summarization visual-search
Last synced: 13 Apr 2025
https://github.com/ai4sd/number-token-loss
PyPI package for number token loss.
language-models llm llm-training reasoning
Last synced: 08 Apr 2026
https://github.com/jonsafari/lt1
Course on Language Technologies and NLP
course graduate-course language-models language-technology neural-networks
Last synced: 11 Oct 2025
https://github.com/psychbruce/fmat
😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.
ai artificial-intelligence bert bert-model bert-models contextualized-representation fill-in-the-blank fill-mask huggingface language-model language-models large-language-models masked-language-models natural-language-processing natural-language-understanding nlp pretrained-models transformer transformers
Last synced: 22 Oct 2025
https://github.com/ksm26/prompt-engineering-with-llama-2
The course provides guidance on best practices for prompting and building applications with the powerful open commercial license models of Llama 2.
advanced-prompting-models ai-applications chain-of-thought-prompting code-llama few-shot-prompting industry-standards language-models llama-2 llama-guard llm-interaction meta-llama-2-chat natural-language-processing open-commercial-license prompt-engineering prompt-engineering-models responsible-ai responsible-ai-techniques safe-ai
Last synced: 10 Sep 2025
https://github.com/qewertyy/sdwaifurobot
An AI-related Telegram utility bot that can be deployed on Vercel
aiart bot diffusion-models image-generation language-models latent-diffusion llms reverse-image-search stable-diffusion
Last synced: 31 Aug 2025
https://github.com/haozhg/lmd
Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models
bert deep-learning language-models multilingual-bert natural-language-processing nlp pretrained-models python pytorch roberta transformers xlm-roberta
Last synced: 17 Jan 2026
https://github.com/nicolay-r/rusentrel-leaderboard
This is an official Leaderboard for the RuSentRel-1.1 dataset originally described in paper (arxiv:1808.08932)
attention attention-mechanism benchmark bert-model bilstm chatgpt classifiers cnn language-models leaderboard low-resource-nlp neural-networks relation-extraction sentiment-analysis
Last synced: 11 Aug 2025
https://github.com/dillondaudert/pssp_lstm
Recurrent neural network implementations for protein secondary structure prediction and language models
amino-acid-sequence deep-learning deep-neural-networks jupyter-notebook language-models lstm paper prediction pretrained-models protein python3 recurrent-neural-networks rnn secondary structure structure-prediction tensorflow unsupervised-learning
Last synced: 16 Jun 2025
https://github.com/butlerlabs/docai
DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications
computer-vision information-extraction information-retrieval language-models machine-learning machine-learning-library natural-language-processing nlp nlp-library ocr ocr-python pretrained-models python
Last synced: 01 Mar 2026
https://github.com/vgherard/kgrams
k-grams, Language Models, and All That
language-models n-grams natural-language-processing
Last synced: 30 Apr 2025
https://github.com/pro-genai/auto-trendy-keywords
Real-time AI-driven Trending keyword generation for SEO
ai artificial-intelligence arxiv gen-ai genai generative-ai generativeai language-models large-language-models llm llms prompt-engineering python research research-paper research-project seo seo-friendly seo-optimization seo-tools
Last synced: 13 Jun 2025
https://github.com/colthreepv/llm-context
A CLI tool that helps you generate context files for Large Language Models (LLMs).
ai cli context-builder language-models llm nlp prompt-engineering
Last synced: 16 Feb 2026
https://github.com/skblaz/attviz
Dissecting Transformers via attention visualization
attention-is-all-you-need attention-mechanism interactive language-model language-models machine-learning node-js nodejs python visualization visualizations
Last synced: 18 Aug 2025
https://github.com/spongeengine/koboldsharp
C# client for KoboldCpp.
ai ai-client csharp dotnet kobold-cpp koboldai koboldcpp language-models llm llm-client local-llm local-llm-integration local-llms offline-ai openai-compatible-api self-hosted-ai text-generation-webui
Last synced: 24 Apr 2025
https://github.com/neurodivergent-dev/zscore
A sophisticated web application for text analysis and Shannon Entropy calculation.
ai-assisted-development cognitive-ui cursor-ai developer-philosophy eslint information-theory language-models lexical-analysis netlify react recursive-prompting semantic-analysis shannon-entropy software-engineering tailwindcss textual-entropy thought-quantification typescript vite zscore
Last synced: 28 Feb 2026
https://github.com/mlane/llm-engineering-cheatsheet
Timeless principles and best practices for working with language models — tooling-agnostic, future-proof, and clear.
ai-best-practices ai-cheatsheet ai-patterns ai-reference anthropic chatgpt context-management few-shot-learning generative-ai langchain language-models llm llm-engineering openai prompt-design prompt-engineering python python3 system-prompts zero-shot
Last synced: 30 Oct 2025
https://github.com/linhaowei1/molretrieval
This repo implements many methods to retrieve molecules that are similar to a target molecule from a large molecule corpus.
ai4science biology computational-biology language-models molecule rag retreival retrieval-augmented-generation
Last synced: 23 Oct 2025
https://github.com/uziellujan/dl-textgen-textclass
Deep learning project on practical implementation of text generation and text classification pipelines with PyTorch and Hugging Face using RNNs, LSTMs, GRUs, and Transformers.
deep-learning gru huggingface language-models lstm nlp-machine-learning pytorch rnn text-classification text-generation transformers
Last synced: 08 Oct 2025
https://github.com/spongeengine/lmstudiosharp
C# client for LM Studio.
ai ai-client csharp dotnet language-models llm llm-client lm-studio lmstudio local-llm local-llm-integration local-llms offline-ai openai-compatible-api self-hosted-ai
Last synced: 24 Feb 2025
https://github.com/slava-vishnyakov/rag_engine
Python package for implementing Retrieval-Augmented Generation (RAG) using OpenAI's embeddings and a SQLite database with vector search capabilities
ai chatbot embeddings information-retrieval language-models machine-learning natural-language-processing nlp openai python rag retrieval-augmented-generation semantic-search sqlite vector-search
Last synced: 27 Dec 2025
https://github.com/vaasudevans/natural-language-processing-assignments
UNB Fall-2018 NLP Assignments 💬
baseline bigrams hidden-markov-model information-retrieval-based-chatbot language-models nlp python27 sentiment-analysis unb unigram
Last synced: 31 Mar 2025
https://github.com/abdouaziz/autocomplet
N-grams to build an autocomplete
angular flask frontend language-models projet
Last synced: 13 May 2026
https://github.com/medoidai/givebackgpt
An early version of a system that credits creators based on the similarity of their content to an LLM response. Giving back to creators is the only way for fair, sustainable AI economies that lead to true growth.
ai-ethics bootstrap chatbot css embeddings generative-ai html intellectual-property javascript language-models open-source responsive-web-design sustainable-ai web-search
Last synced: 05 May 2025
https://github.com/centre-for-humanities-computing/danish-ner-bias
Investigating bias in Danish language models in Named Entity Recognition (NER). Code from the paper titled "Detecting intersectionality in NER models: A data-driven approach."
language-models named-entity-recognition nlp
Last synced: 13 Jul 2025
https://github.com/torrinworx/bitorch
A plan for building a distributed system to run AI models BitTorrent style with a secure compensation mechanism.
distributed-systems language-models pytorch
Last synced: 17 May 2026
https://github.com/joel-beck/readnext
Hybrid Recommender System for Computer Science Papers | Master's Thesis Project 2023
citation-analysis hybrid-recommender-system language-models python recommender-system
Last synced: 29 Jun 2026
https://github.com/quanta-quest/quanta-quest-app
AI-powered universal search for all your personal data, tailored just for you. Goal: the world's first product with "edge-side LLMs + consumer data localization" as its core development direction.
agent ai bert claude edge-computing gpt huggingface knowledgebase language-models nextjs nlp rag transformers wails workflow
Last synced: 04 Feb 2026
https://github.com/spongeengine/lmsharp
A unified .NET client library for running LLMs (Large Language Models) locally. LocalAI.NET provides a single, consistent API for interacting with popular local LLM providers like KoboldCpp, Ollama, LM Studio, and Text Generation WebUI.
ai ai-client csharp dotnet koboldcpp language-models llm llm-client lm-studio local-llm offline-ai ollama openai-compatible-api self-hosted-ai text-generation-webui
Last synced: 07 Sep 2025
https://github.com/leomsgit/personal-lib---ai-ml-nlp-cv
Collection of Notes, Guides, and Examples for Artificial Intelligence, Machine Learning, Natural Language Processing and Computer Vision
ai attention-mechanism deep-learning huggingface huggingface-transformers language-models machine-learning nlp python tokenization transformer transformers
Last synced: 04 Feb 2026
https://github.com/lukexyz/language-models
🌍📖💬 Sentiment analysis and text generation using BERT and ULMFiT (2018)
bert language-models transformer ulm-fit
Last synced: 07 Apr 2025
https://github.com/temilaj/nlp-coronavirus-wiki-twitter-perplexity
Natural language processing project to visualize word choice patterns from coronavirus (and related) articles, and compute the average perplexity scores of language models generated from these articles when used with tweets about the subject matter
coronavirus covid-19 language-models n-grams natural-language-processing nlp perplexity-scores
Last synced: 02 Sep 2025
https://github.com/tomekkorbak/kl-gpt3
A modular library for evaluating KL divergence between a Hugging Face Transformers model and GPT-3
Last synced: 04 Apr 2025
https://github.com/infinitode/duplipy
DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.
ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting
Last synced: 28 Jun 2026