Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with language-model

A curated list of projects in awesome lists tagged with language-model .

https://github.com/huggingface/pytorch-openai-transformer-lm

🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI

language-model neural-networks openai pytorch transformer

Last synced: 21 Dec 2024

https://github.com/OpenMotionLab/MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

3d-generation chatgpt gpt language-model motion motion-generation motiongpt multi-modal text-driven text-to-motion

Last synced: 24 Oct 2024

https://github.com/rwkv/rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

deep-learning ggml language-model llm machine-learning quantization rwkv

Last synced: 19 Dec 2024

https://github.com/RWKV/rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

deep-learning ggml language-model llm machine-learning quantization rwkv

Last synced: 08 Nov 2024

https://github.com/ymcui/chinese-electra

Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)

bert chinese chinese-electra electra language-model nlp pre-trained-model pytorch tensorflow

Last synced: 15 Dec 2024

https://github.com/ymcui/Chinese-ELECTRA

Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)

bert chinese chinese-electra electra language-model nlp pre-trained-model pytorch tensorflow

Last synced: 07 Nov 2024

https://github.com/bupticybee/chineseaidungeonchatgpt

中文版的ai地牢,直接使用的openai的ChatGPT api作为讲故事的模型。

ai aidungeon chatgpt gpt-3 language language-model

Last synced: 16 Dec 2024

https://github.com/bupticybee/ChineseAiDungeonChatGPT

中文版的ai地牢,直接使用的openai的ChatGPT api作为讲故事的模型。

ai aidungeon chatgpt gpt-3 language language-model

Last synced: 03 Nov 2024

https://github.com/nvlabs/prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

image-captioning language-model multi-modal-learning multi-task-learning vision-and-language vision-language-model vqa

Last synced: 15 Dec 2024

https://github.com/hyperonym/basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

generative gpt huggingface language-model llama llm model natural-language-processing nlp openai-api python text-generation transformers

Last synced: 27 Sep 2024

https://github.com/xlang-ai/osworld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

agent artificial-intelligence benchmark cli code-generation gui language-model large-action-model llm multimodal natural-language-processing reinforcement-learning rpa vlm

Last synced: 19 Dec 2024

https://github.com/xlang-ai/OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

agent artificial-intelligence benchmark cli code-generation gui language-model large-action-model llm multimodal natural-language-processing reinforcement-learning rpa vlm

Last synced: 09 Nov 2024

https://github.com/JonasGeiping/cramming

Cramming the training of a (BERT-type) language model into limited compute.

english-language language-model machine-learning

Last synced: 11 Nov 2024

https://github.com/SKTBrain/KoBERT

Korean BERT pre-trained cased (KoBERT)

bert korean-nlp language-model nlp pytorch transformers

Last synced: 09 Nov 2024

https://github.com/charent/chatlm-mini-chinese

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

chatbot language-model t5-model text-generation

Last synced: 15 Dec 2024

https://github.com/charent/ChatLM-mini-Chinese

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

chatbot language-model t5-model text-generation

Last synced: 28 Oct 2024

https://github.com/rahulschand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 20 Dec 2024

https://github.com/RahulSChand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ggml gpu huggingface language-model llama llama2 llamacpp llm pytorch quantization

Last synced: 08 Nov 2024

https://github.com/nvidia-merlin/transformers4rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

bert gtp huggingface language-model nlp pytorch recommender-system recsys seq2seq session-based-recommendation tabular-data transformer xlnet

Last synced: 18 Dec 2024

https://github.com/NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

bert gtp huggingface language-model nlp pytorch recommender-system recsys seq2seq session-based-recommendation tabular-data transformer xlnet

Last synced: 05 Nov 2024

https://github.com/mihail911/nlp-library

curated collection of papers for the nlp practitioner 📖👩‍🔬

deep-learning dialogue language-model machine-learning neural-machine-translation neural-network nlp nlp-datasets

Last synced: 04 Nov 2024

https://github.com/modal-labs/quillman

A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.

ai language-model python serverless speech-recognition speech-to-text

Last synced: 27 Oct 2024

https://github.com/microsoft/tora

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

autonomous-agents language-model llm mathematical-reasoning tool-learning

Last synced: 16 Dec 2024

https://github.com/microsoft/ToRA

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

autonomous-agents language-model llm mathematical-reasoning tool-learning

Last synced: 05 Nov 2024

https://github.com/horseee/llm-pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

baichuan bloom chatglm compression language-model llama llama-2 llama3 llm neurips-2023 pruning pruning-algorithms vicuna

Last synced: 20 Dec 2024

https://github.com/stanford-oval/WikiChat

WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.

chatbot emnlp2023 factuality language-model natural-language-processing nlp

Last synced: 05 Nov 2024

https://github.com/stanford-oval/wikichat

WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.

chatbot emnlp2023 factuality language-model natural-language-processing nlp

Last synced: 16 Dec 2024

https://github.com/liyuanlucasliu/lm-lstm-crf

Empower Sequence Labeling with Task-Aware Language Model

crf language-model ner pytorch sequence-labeling

Last synced: 18 Dec 2024

https://github.com/githubharald/ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

beam-search best-path ctc ctc-loss handwriting-recognition language-model loss opencl prefix-search python recurrent-neural-networks speech-recognition token-passing

Last synced: 20 Dec 2024

https://github.com/litanlitudan/skyagi

SkyAGI: Emerging human-behavior simulation capability in LLM

ai-agent aigc langchain language-model llm

Last synced: 21 Dec 2024

https://github.com/salesforce/xgen

Salesforce open-source LLMs with 8k sequence length.

language-model large-language-models llm nlp

Last synced: 20 Dec 2024

https://github.com/salesforce/xGen

Salesforce open-source LLMs with 8k sequence length.

language-model large-language-models llm nlp

Last synced: 03 Nov 2024

https://github.com/lonepatient/albert_pytorch

A Lite Bert For Self-Supervised Learning Language Representations

albert bert language-model mask ngram nlp pytorch

Last synced: 21 Dec 2024

https://github.com/lonePatient/albert_pytorch

A Lite Bert For Self-Supervised Learning Language Representations

albert bert language-model mask ngram nlp pytorch

Last synced: 02 Nov 2024

https://github.com/google-research/prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

flax jax language-model machine-learning nlp prompt-tuning

Last synced: 18 Dec 2024

https://github.com/codekansas/keras-language-modeling

:book: Some language modeling tools for Keras

answer-vector keras-language-modeling language-model

Last synced: 02 Nov 2024

https://github.com/michaelthwan/searchGPT

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

ai chatgpt grounded-api grounded-bot language-model llm machine-learning nlp nlp-machine-learning openai python retrieval retrieval-model

Last synced: 11 Nov 2024

https://github.com/ymcui/macbert

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

bert language-model macbert nlp pytorch tensorflow transformers

Last synced: 21 Dec 2024

https://github.com/ymcui/MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

bert language-model macbert nlp pytorch tensorflow transformers

Last synced: 16 Nov 2024

https://github.com/michiyasunaga/qagnn

[NAACL 2021] QAGNN: Question Answering using Language Models and Knowledge Graphs 🤖

biomedical-applications commonsense-reasoning graph-neural-networks knowledge-graph language-model question-answering

Last synced: 20 Dec 2024

https://github.com/salesforce/progen

Official release of the ProGen models

generative-model language-model protein

Last synced: 20 Dec 2024

https://github.com/michaelthwan/searchgpt

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

ai chatgpt grounded-api grounded-bot language-model llm machine-learning nlp nlp-machine-learning openai python retrieval retrieval-model

Last synced: 09 Nov 2024

https://github.com/louisfb01/start-llms

A complete guide to start and improve your LLM skills in 2024 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

ai fine-tuning gpt gpt-4 language-model large-language-models llama llm llms rag retrieval-augmented-generation

Last synced: 20 Dec 2024

https://github.com/ucinlp/autoprompt

AutoPrompt: Automatic Prompt Construction for Masked Language Models.

evaluation language-model nlp

Last synced: 19 Nov 2024

https://ucinlp.github.io/autoprompt/

AutoPrompt: Automatic Prompt Construction for Masked Language Models.

evaluation language-model nlp

Last synced: 19 Nov 2024

https://github.com/kreeben/resin

Vector space index based search engine that's available as a HTTP service or as an embedded library.

information-retrieval language-model machine-learning nlu nlu-engine resin search search-algorithms search-engine vector-space vector-space-model

Last synced: 27 Oct 2024

https://github.com/githubharald/ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

ctc decoder handwritten-text-recognition language-model recurrent-neural-networks speech-recognition text-recognition

Last synced: 20 Dec 2024

https://github.com/voidful/textrl

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

chatgpt controlled-nlg gpt-2 gpt-3 language-model nlg nlp pytorch reinforcement-learning rlhf

Last synced: 09 Nov 2024

https://github.com/voidful/TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

chatgpt controlled-nlg gpt-2 gpt-3 language-model nlg nlp pytorch reinforcement-learning rlhf

Last synced: 31 Oct 2024

https://github.com/ncbi-nlp/bluebert

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).

bert bert-model language-model mimic-iii natural-language-processing pubmed pubmed-abstracts

Last synced: 12 Nov 2024

https://github.com/OpenLemur/Lemur

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

code-generation language-model machine-learning natural-language-processing nlp text-reasoning

Last synced: 14 Nov 2024

https://github.com/Shark-NLP/OpenICL

OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.

in-context-learning language-model nlp

Last synced: 18 Nov 2024

https://github.com/SKT-AI/KoGPT2

Korean GPT-2 pretrained cased (KoGPT2)

korean-nlp language-model

Last synced: 04 Nov 2024

https://github.com/oughtinc/ice

Interactive Composition Explorer: a debugger for compositional language model programs

debugging gpt-3 language-model python

Last synced: 17 Nov 2024

https://github.com/charent/phi2-mini-chinese

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

language-model phi-2 retrieval-augmented-generation text-generation

Last synced: 16 Dec 2024

https://github.com/AkshitIreddy/Interactive-LLM-Powered-NPCs

Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-player characters (NPCs) in any game! 🎮🤖🚀

ai artificial-intelligence autonomous-agents computer-vision langchain language-model llm-agent python video-game

Last synced: 14 Nov 2024

https://github.com/salesforce/dialogstudio

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

conversational-ai dataset dialog instruction-tuning language-model natural-language-generation natural-language-understanding open-domain-dialog open-source question-answering

Last synced: 21 Dec 2024

https://github.com/salesforce/DialogStudio

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

conversational-ai dataset dialog instruction-tuning language-model natural-language-generation natural-language-understanding open-domain-dialog open-source question-answering

Last synced: 16 Nov 2024

https://github.com/zhenbench/z-bench

Z-Bench 1.0 by 真格基金:一个麻瓜的大语言模型中文测试集。Z-Bench is a LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team in Zhenfund.

benchmark chinese language-model

Last synced: 25 Oct 2024

https://github.com/choosewhatulike/trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

agent character language-model large-language-models llm natural-language-processing roleplay sft

Last synced: 15 Dec 2024

https://github.com/zetavg/llama-lora-tuner

UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.

ai alpaca alpaca-lora google-colab gpt gpt-j language-model llama lora machine-learning peft

Last synced: 22 Dec 2024

https://github.com/gooofy/zamia-speech

Open tools and data for cloudless automatic speech recognition

asr cmu-sphinx kaldi language-model lexicon sequitur speech-corpora speech-recognition voxforge

Last synced: 16 Dec 2024

https://github.com/mv-lab/InstructIR

[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR

computer-vision deblurring deep-learning dehazing denoising image-enhancement image-restoration inverse-problems language-model low-light-image-enhancement multi-task multimodal neural-network photography prompt pytorch super-resolution

Last synced: 03 Nov 2024

https://github.com/zetavg/LLaMA-LoRA-Tuner

UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.

ai alpaca alpaca-lora google-colab gpt gpt-j language-model llama lora machine-learning peft

Last synced: 10 Nov 2024

https://github.com/charent/Phi2-mini-Chinese

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

language-model phi-2 retrieval-augmented-generation text-generation

Last synced: 09 Nov 2024

https://github.com/instadeepai/nucleotide-transformer

🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

deep-learning dna genomics language-model nucleotide transformer

Last synced: 01 Nov 2024

https://github.com/langchain-ai/langsmith-sdk

LangSmith Client SDK Implementations

evaluation language-model observability

Last synced: 17 Dec 2024

https://github.com/modal-labs/devlooper

A program synthesis agent that autonomously fixes its output by running tests!

agent gpt-4 language-model llm modal python

Last synced: 25 Nov 2024

https://github.com/michiyasunaga/linkbert

[ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links

biomedical-applications graph-machine-learning knowledge language-model pretrained-models pretraining question-answering transformer

Last synced: 16 Dec 2024

https://github.com/CambioML/pykoi-rlhf-finetuned-transformers

pykoi: Active learning in one unified interface

ai chatbot feedback language-model llm machine-learning rlhf

Last synced: 05 Nov 2024