An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with language-modeling

A curated list of projects in awesome lists tagged with language-modeling .

https://github.com/quark0/darts

Differentiable architecture search for convolutional and recurrent networks

automl convolutional-networks deep-learning image-classification language-modeling neural-architecture-search pytorch recurrent-networks

Last synced: 15 May 2025

https://github.com/egoalpha/prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

chain-of-thought chatbot chatgpt chatgpt-api cot in-context-learning language-modeling language-understanding large-language-model llm pre-training prompt prompt-based-learning prompt-design prompt-engineering prompt-learning prompt-toolkit prompt-tuning

Last synced: 14 May 2025

https://github.com/EgoAlpha/prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

chain-of-thought chatbot chatgpt chatgpt-api cot in-context-learning language-modeling language-understanding large-language-model llm pre-training prompt prompt-based-learning prompt-design prompt-engineering prompt-learning prompt-toolkit prompt-tuning

Last synced: 12 Mar 2025

https://github.com/uber-research/pplm

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

deep-learning language-modeling machine-learning natural-language-generation natural-language-processing nlp

Last synced: 16 May 2025

https://github.com/uber-research/PPLM

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

deep-learning language-modeling machine-learning natural-language-generation natural-language-processing nlp

Last synced: 14 May 2025

https://github.com/Separius/BERT-keras

Keras implementation of BERT with pre-trained weights

keras language-modeling nlp pretrained-models tensorflow theano transfer-learning transformer

Last synced: 02 Apr 2025

https://github.com/dmitryryumin/interspeech-2023-24-papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission

Last synced: 24 Jan 2026

https://github.com/songlab-cal/tape

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.

benchmark dataset deep-learning language-modeling protein-sequences protein-structure pytorch semi-supervised-learning

Last synced: 09 Apr 2025

https://github.com/google-deepmind/long-form-factuality

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

benchmark dataset evaluation factuality language language-modeling large-language-models metrics

Last synced: 17 Jun 2025

https://github.com/yxuansu/simctg

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

contrastive-learning decode deeplearning language-modeling languagemodel nlp textgeneration

Last synced: 14 Jan 2026

https://github.com/dmitryryumin/icassp-2023-24-papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

asr denoising domain-adaptation face-recognition generative-models icassp icassp2023 icassp2024 image-generation keyword-spotting language-modeling multimodal-learning music-generation self-supervised-learning semantic-segmentation signal-processing signal-restoration speech-recognition spoken-language-understanding vad

Last synced: 08 Apr 2025

https://github.com/DmitryRyumin/ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

asr denoising domain-adaptation face-recognition generative-models icassp icassp2023 icassp2024 image-generation keyword-spotting language-modeling multimodal-learning music-generation self-supervised-learning semantic-segmentation signal-processing signal-restoration speech-recognition spoken-language-understanding vad

Last synced: 14 Jul 2025

https://github.com/shmsw25/FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

emnlp2023 evaluation factuality language language-modeling

Last synced: 16 Oct 2025

https://github.com/muditbhargava66/PyxLSTM

Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.

language-modeling lstm sequence-modeling xlstm

Last synced: 07 Aug 2025

https://github.com/l0sg/relational-rnn-pytorch

An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.

deep-learning deepmind language-model language-modeling pytorch recurrent-neural-networks self-attention transformer word-language-model

Last synced: 10 Mar 2026

https://github.com/L0SG/relational-rnn-pytorch

An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.

deep-learning deepmind language-model language-modeling pytorch recurrent-neural-networks self-attention transformer word-language-model

Last synced: 19 Jul 2025

https://github.com/tonybeltramelli/deep-lyrics

Lyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network

deep-learning language-modeling natural-language-processing recurrent-neural-networks tensorflow

Last synced: 26 Jun 2025

https://github.com/sunnydreamrain/indrnn_pytorch

Independently Recurrent Neural Networks (IndRNN) implemented in pytorch.

action indrnn language-modeling rnn skeleton

Last synced: 27 Jun 2025

https://github.com/suriyadeepan/rnn-from-scratch

Use tensorflow's tf.scan to build vanilla, GRU and LSTM RNNs

language-modeling recurrent-neural-networks rnn tensorflow

Last synced: 24 Jan 2026

https://github.com/songlab-cal/tape-neurips2019

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (DEPRECATED)

benchmark dataset deep-learning language-modeling protein-sequences protein-structure semi-supervised-learning

Last synced: 07 May 2025

https://github.com/lucidrains/gated-state-spaces-pytorch

Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch

artificial-intelligence deep-learning language-modeling state-spaces

Last synced: 20 Aug 2025

https://github.com/euclaise/slimtrainer

Full finetuning of large language models without large memory requirements

language-modeling llama transformers

Last synced: 22 Aug 2025

https://github.com/freon4dsl/freon4dsl

Web Native language Workbench with Projectional Web Editor

domain-specific-language dsl language-modeling projectional-editor svelte typescript

Last synced: 12 Jan 2026

https://github.com/shaoxiongji/fed-att

Attentive Federated Learning for Private NLM

attention-mechanism deep-learning federated-learning language-modeling pytorch

Last synced: 22 Apr 2025

https://github.com/microsoft/codemixed-text-generator

This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.

code-mixing code-switching data-generation language-modeling linguistics natural-language-processing python3 synthetic-data-generation

Last synced: 29 May 2026

https://github.com/idsia/recurrent-fwp

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)

fast-weight-programmers fast-weights language-modeling pytorch recurrent-neural-networks reinforcement-learning torchbeast transformers

Last synced: 27 Oct 2025

https://github.com/google-research/mozolm

MozoLM: A language model (LM) serving library

assistive-technology augmentative-communication language-modeling

Last synced: 24 Apr 2025

https://github.com/mit-han-lab/neurips-micronet

[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion

efficient-model knowledge-distillation language-modeling natural-language-processing pruning quantization

Last synced: 07 Jul 2025

https://github.com/BESSER-PEARL/BESSER

A Python-based low-modeling low-code platform by BESSER

ai language-modeling metamodel modeling python uml

Last synced: 26 Oct 2025

https://github.com/tencentarc/flm

Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)

language-modeling vision-language-pretraining

Last synced: 05 Apr 2025

https://github.com/andstor/verified-smart-contracts

:page_facing_up: Verified Ethereum Smart Contract dataset

dataset ethereum etherscan huggingface language-modeling smart-contracts text-generation

Last synced: 04 May 2025

https://github.com/cynthia/kosentences

Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling)

corpus datasets korean language-modeling nlp

Last synced: 26 Jul 2025

https://github.com/suryatejreddy/memeify

Code and Dataset for Memeify: A Large-scale Meme Generation System

deep-learning language-modeling memes tensorflow

Last synced: 12 Sep 2025

https://github.com/clovaai/group-transformer

Official code for Group-Transformer (Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model, COLING-2020).

character-level-language-model language-modeling lightweight pytorch

Last synced: 06 Oct 2025

https://github.com/gidim/babler

Data Collection System For NLP/Speech Recognition

blogs data-collection forums language-modeling machine-learning nlp scraping

Last synced: 16 May 2025

https://github.com/apple/ml-interspeech2022-phi_rtn

Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" by Van Gysel et al.

language-modeling machine-learning speech-recognition virtual-assistants

Last synced: 02 Aug 2025

https://github.com/arrrrrmin/albert-guide

Understanding "A Lite BERT". An Transformer approach for learning self-supervised Language Models.

albert-guide albert-models guide language-modeling nlp pretrain pretraining

Last synced: 05 Oct 2025

https://github.com/sabaurgup/language_detector

With this program, you can find the frequencies of defined biagrams and trigrams in a text entered, and as a result, you can see whether the language of this text is German or English.

c computational-linguistics english-language filter frequencies german-language language-detection language-modeling

Last synced: 29 Oct 2025

https://github.com/arianhosseini/memarchs-in-rnnlm

attempt at implementing "Memory Architectures in Recurrent Neural Network Language Models" as a part of the ICLR 2018 reproducibility challenge

deep-learning language-modeling rnn-language-model self-attentive-rnn stack-rnn

Last synced: 09 Oct 2025

https://github.com/andstor/verified-smart-contracts-audit

:bug: Verified smart contract dataset with vulnerability labeling

dataset ethereum etherscan huggingface language-modeling security smart-contracts text-generation

Last synced: 19 Sep 2025

https://github.com/madjakul/halvesting

Harvests open research papers from HAL (Hyper Articles en Ligne).

dataset-generation language-modeling natural-language-processing

Last synced: 14 Feb 2026

https://github.com/bitcoinchatgpt/deserializesignature-vulnerability-algorithm

Learn about the DeserializeSignature vulnerability in Bitcoin's ECDSA signature algorithm and its potential impact on the security of Bitcoin transactions. Discover how the vulnerability can be exploited and what steps are being taken to mitigate the risk. Stay informed on the latest developments in Bitcoin security.

ai bitcoin bitcoin-wallet chatgpt colab-notebook language-modeling openai pytorch

Last synced: 30 Jan 2026

https://github.com/aquatiko/language-model-shakespere-generator

Character level LSTM based language generator based on Shakespere corpus.

language-modeling lstm natural-language-processing shakespeare

Last synced: 14 Mar 2025

https://github.com/pharo-ai/ngram

N-gram functionality for Pharo

language-modeling natural-language-processing ngrams nlp pharo

Last synced: 11 May 2025

https://github.com/dellison/wikitext.jl

Julia interface to the WikiText dataset.

corpus dataset julia language-modeling natural-language-processing nlp

Last synced: 23 Mar 2025

https://github.com/bitcoinchatgpt/gauss-jacobi-method-algorithm

To use a pre-trained Bitcoin ChatGPT AI model to learn this method, you would first need to provide the model with a clear and concise description of the algorithm, including its purpose, prerequisites, and the mathematical principles behind it. How To Get PrivateKey of Bitcoin Wallet Address.

ai bitcoin bitcoin-wallet chatgpt colab-notebook language-modeling openai pytorch

Last synced: 02 Mar 2026

https://github.com/md-emon-hasan/translatica

Spanish literary translation system, fine-tuned using PEFT with LoRA for high accuracy and minimal compute cost. Built with an intuitive, premium-grade web interface with FastAPI, Bootstrap andcustom animations

ai-app artificial-intelligence bilingual-translation data-science datascience dockerized-app english-to-spanish fine-tuning fine-tuning-llm github-actions huggingface language-model language-modeling machine-learning multilingual-nlp nlp-pipeline nlp-projects opus-mt pytorch tokenizer

Last synced: 13 Apr 2026

https://github.com/chen0040/java-plsa

Package provides the java implementation of probabilistic latent semantic analysis (pLSA)

classification-algorithm language-modeling natural-language-processing statistical-methods topic-modeling

Last synced: 03 Apr 2025

https://github.com/coding-enthusiast9857/gemini_llm_application

It is an innovative repository housing a sophisticated Large Language Model (LLM) project, showcasing the intersection of advanced natural language processing and cutting-edge artificial intelligence. This repository serves as a comprehensive platform for the development, experimentation, and application of state-of-the-art language models.

ai dl gemini gemini-pro generative-models image-description language-modeling llm llm-model ml nlp open-api python question-answering-system streamlit text-generation

Last synced: 07 Feb 2026

https://github.com/bjam24/agh-natural-language-processing

This respository contains projects made for the NLP course at the AGH UST in 2024 / 2025. They received maximum grade 5.0.

agh elasticsearch language-modeling language-modelling levenshtein llm ner neural-search nlp prompt-enginering question-answering rag regex spacy text-classificaiton text-classification

Last synced: 16 Apr 2026

https://github.com/kalifou/ri_tme1

Information retrieval - assignments for course at UPMC - Paris 6

bm25 evaluation-metrics hits-algorithm information-retrieval language-model language-modeling pagerank-algorithm python

Last synced: 28 Apr 2026

https://github.com/bitcoinchatgpt/jacobian-curve-vulnerability-algorithm

Discover the implications of the Jacobian Curve vulnerability in elliptic curve cryptography, particularly its impact on the Elliptic Curve Digital Signature Algorithm (ECDSA). This article explores how attackers can exploit this flaw to generate fraudulent transactions, create fake signatures, and compromise the integrity of blockchain systems.

ai bitcoin bitcoin-wallet chatgpt chatgpt-api colab-notebook language-modeling openai pytorch

Last synced: 19 Jan 2026

https://github.com/fcakyon/gpt2-shakespeare

A tutorial on GPT2 language model training with texts from Shakespeare

colab colab-notebook datasets gpt2 language-modeling nlp notebook python pytorch shakespeare text-generation transformers turorial

Last synced: 11 May 2026

https://github.com/daskol/lsp-lm

Language Model as a Language Server

bertology deep-learning language-modeling lsp neovim-plugin nlp

Last synced: 19 Jul 2025

https://github.com/dlpbc/gru-language-modeling

Language modeling using nairaland featured links as dataset

gru language-modeling nairaland rnn theano

Last synced: 12 May 2026

https://github.com/dobladov/kibana-fieldformats-locale

Kibana plugin to provide the language or location given a ISO_3166-1 code

kibana landing language-modeling locale plugin

Last synced: 10 Sep 2025

https://github.com/mydarapy/gpt-1-from-scratch

Rewriting and pretraining GPT-1 from scratch. Implementing Multihead Attention (MHA) in pyTorch from the original paper Improving Language Understanding by Generative Pre-Training (https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)

deep-learning language-modeling llms machine-learning natural-language-processing pretraining

Last synced: 10 Jun 2025

https://github.com/abdulvahapmutlu/mhc-slm

Official PyTorch implementation of mHC-SSM: manifold-constrained multi-stream residual mixing for State Space Language Models, extended with stream-specialized adapters for improved WikiText-2 perplexity.

adapters deep-learning hyper-connections language-modeling machine-learning mhc python pytorch sinkhorn ssm state-space-models

Last synced: 28 May 2026

https://github.com/stathwang/language-model

Generating Movie Reviews in Korean with Trigram Language Model

language-modeling markov-model n-grams natural-language-processing python

Last synced: 02 Apr 2026

https://github.com/lochbrunner/chop-specs

Chop Language Specifications

language-modeling llvm-ir specification

Last synced: 06 Mar 2026

https://github.com/dimits-ts/text_analytics

Language Modelling (text generation, spell correction) and Sentiment Analysis / POS Tagging with MLP, RNN, CNN and BERT models and LLM prompting

baseline-model bert cnn fasttext-embeddings language-modeling llm mlp-classifier pos-tagging prompting pytorch rnn sentiment-analysis sklearn spell-correction tensorflow text-generation transformers transformers-library

Last synced: 09 Apr 2026

https://github.com/nachiket273/aora-light

Agreement-Aware Orthogonal Routing Attention (AORA-Light): lightweight dual-branch transformer attention for robustness to non-consensus signals.

attention-mechanism deep-learning experimental-ml language-modeling neural-network pytorch research robust-ml scientific-machine-learning transformer

Last synced: 30 May 2026

https://github.com/ahmadrazacdx/seq-modeling-from-scratch

From Scratch RNN, LSTM, GRU, and Seq2Seq architectures for language modeling and educational purposes.

attention-mechanism deep-learning educational from-scratch gru language-modeling lstm rnn seq2seq

Last synced: 31 May 2026

https://github.com/bitcoinchatgpt/fuzzing-vulnerability-algorithm

Learn about the Fuzzing vulnerability in Bitcoin's ECDSA signature algorithm and its potential impact on the security of Bitcoin transactions. Discover how the vulnerability can be exploited and what steps are being taken to mitigate the risk. Stay informed on the latest developments in Bitcoin security.

ai bitcoin bitcoin-wallet chatgpt colab-notebook language-modeling openai pytorch

Last synced: 08 Mar 2026

https://github.com/milankinen/las-docker

Dockerized SeCo Language Analysis Services

docker dockerfile language-modeling nlp rest-api seco

Last synced: 14 Apr 2026