Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/SKTBrain/KoBERT

Korean BERT pre-trained cased (KoBERT)

bert korean-nlp language-model nlp pytorch transformers

Last synced: 02 Jul 2024

https://github.com/PR-Pilot-AI/pr-pilot

A text-to-task automation platform that enables GitHub developers to trigger AI-driven development tasks in their repositories from anywhere.

ai bot chatgpt collaboration generative-ai gpt-4 language-model llm openai

Last synced: 02 Jul 2024

https://github.com/OpenShapeLab/ShapeGPT

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model, a unified and user-friendly shape-language model

3d-generation caption-generation chatgpt gpt language-model multi-modal shape unified

Last synced: 01 Jul 2024

https://github.com/ThuCCSLab/Awesome-LM-SSP

A reading list for large model safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

adversarial-attacks awesome-list diffusion-models jailbreak language-model llm nlp privacy safety security vlm

Last synced: 01 Jul 2024

https://github.com/GAIR-NLP/MathPile

Generative AI for Math: MathPile

corpus language-model large-language-models math pre-training

Last synced: 27 Jun 2024

https://github.com/pooya-mohammadi/persian-spell-checker-kenlm

Complete instructions for training a Persian spell checker and a language model, based on SymSpell and KenLM respectively, using a Wikipedia dataset.

bash kenlm language-model nlp persian python spellcheck spellchecker symspell

Last synced: 27 Jun 2024

https://github.com/Lukas-Justen/Law-OMNI-BERT-Project

Directly applying advancements in transfer learning from BERT results in poor accuracy in domain-specific areas like law because of a word distribution shift from general domain corpora to domain-specific corpora. In our project, we will demonstrate how the pre-trained language model BERT can be adapted to additional domains, such as contract law or court judgments.

bert bert-model contracts language-model law legal-texts statistical-linguistics

Last synced: 26 Jun 2024

https://github.com/InternLM/InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

chatgpt foundation gpt gpt-4 instruction-tuning language-model large-language-model large-vision-language-model llm mllm multi-modality multimodal supervised-finetuning vision-language-model vision-transformer visual-language-learning

Last synced: 25 Jun 2024

https://github.com/Shark-NLP/OpenICL

OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.

in-context-learning language-model nlp

Last synced: 25 Jun 2024

https://github.com/aiwaves-cn/agents

An Open-source Framework for Autonomous Language Agents

autonomous-agents language-model llm

Last synced: 24 Jun 2024

https://github.com/txsun1997/CoLAKE

COLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding

deep-learning knowledge-embedding knowledge-graph language-model natural-language-processing

Last synced: 23 Jun 2024

https://github.com/michiyasunaga/qagnn

[NAACL 2021] QAGNN: Question Answering using Language Models and Knowledge Graphs 🤖

biomedical-applications commonsense-reasoning graph-neural-networks knowledge-graph language-model question-answering

Last synced: 23 Jun 2024

https://github.com/OpenLemur/Lemur

Lemur: Open Foundation Models for Language Agents

code-generation language-model machine-learning natural-language-processing nlp text-reasoning

Last synced: 23 Jun 2024

https://github.com/NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

bert gtp huggingface language-model nlp pytorch recommender-system recsys seq2seq session-based-recommendation tabular-data transformer xlnet

Last synced: 22 Jun 2024

https://optimalscale.github.io/LMFlow/

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

chatgpt deep-learning instruction-following language-model pretrained-models pytorch transformer

Last synced: 21 Jun 2024

https://github.com/hiyouga/ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT

alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers

Last synced: 20 Jun 2024

https://github.com/oxpig/AbLang

AbLang: A language model for antibodies

antibodies language-model protein-sequences semantic

Last synced: 19 Jun 2024

https://github.com/CraftJarvis/JARVIS-1

JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

agent language-model minecraft

Last synced: 17 Jun 2024

https://github.com/CraftJarvis/MC-Planner

Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"

language-model minecraft

Last synced: 17 Jun 2024

https://github.com/kampersanda/tongrams-rs

Rust library providing fast language model queries in compressed space

compression elias-fano language-model ngrams nlp trie

Last synced: 17 Jun 2024

https://github.com/yumeng5/LOTClass

[EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach

language-model text-classification weakly-supervised-learning

Last synced: 16 Jun 2024

https://github.com/snap-stanford/GreaseLM

[ICLR 2022 spotlight] GreaseLM: Graph REASoning Enhanced Language Models for Question Answering

biomedical-ques commonsense-reasoning graph-neural-networks knowledge-graph language-model question-answering

Last synced: 16 Jun 2024

https://github.com/michaelthwan/searchGPT

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

ai chatgpt grounded-api grounded-bot language-model llm machine-learning nlp nlp-machine-learning openai python retrieval retrieval-model

Last synced: 16 Jun 2024

https://github.com/princeton-nlp/SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

benchmark language-model software-engineering

Last synced: 16 Jun 2024

https://github.com/zhenbench/z-bench

Z-Bench 1.0 by ZhenFund: a Chinese-language test set for large language models, made for "Muggles" (non-experts). Z-Bench is an LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team at ZhenFund.

benchmark chinese language-model

Last synced: 16 Jun 2024

https://github.com/FranxYao/Language-Model-Pretraining-for-Text-Generation

LM pretraining for generation, reading list, resources, conference mappings.

bert bert-model gpt language-generation language-model pretrained-models text-generation

Last synced: 15 Jun 2024

https://github.com/claravania/subword-lstm-lm

LSTM Language Model with Subword Units Input Representations

language-model

Last synced: 15 Jun 2024

https://github.com/openmotionlab/motiongpt

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

3d-generation chatgpt gpt language-model motion motion-generation motiongpt multi-modal text-driven text-to-motion

Last synced: 14 Jun 2024

https://github.com/cvi-szu/linly

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction-tuning datasets

bert chatbot chatgpt chinese chinese-nlp gpt-3 language-model llama nlp zero-shot-learning

Last synced: 14 Jun 2024

https://github.com/kevin-free/chatgpt-prompt-engineering-for-developers

Chinese-English edition of Andrew Ng's course "ChatGPT Prompt Engineering for Developers"

chatgpt deep-learning language-model openai prompt-engineering

Last synced: 14 Jun 2024

https://github.com/coreylowman/llama-dfdx

LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed!

cuda deep-learning inference language-model llama neural-network rust rust-lang

Last synced: 14 Jun 2024

https://github.com/microsoft/lmops

General technology for enabling AI capabilities w/ LLMs and MLLMs

agi gpt language-model llm lm lmops nlp pretraining prompt promptist x-prompt

Last synced: 14 Jun 2024

https://github.com/wuwenjie1992/StarryDivineSky

A curated selection of more than 5,000 projects, covering machine learning, deep learning, NLP, GNN, recommendation systems, biomedicine, machine vision, front-end and back-end development, and more. Let more excellent projects be discovered. Continuously updated! Welcome to star!

awesome awesome-list biomedicine cv data-science deep-learning hacker language-model large-language-models machine-learning nlp web

Last synced: 13 Jun 2024

https://github.com/salesforce/DialogStudio

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

conversational-ai dataset dialog instruction-tuning language-model natural-language-generation natural-language-understanding open-domain-dialog open-source question-answering

Last synced: 13 Jun 2024

https://github.com/ymcui/Chinese-ELECTRA

Pre-trained Chinese ELECTRA models

bert chinese chinese-electra electra language-model nlp pre-trained-model pytorch tensorflow

Last synced: 13 Jun 2024

https://github.com/CyberZHG/keras-xlnet

Implementation of XLNet that can load pretrained checkpoints

glue keras language-model nlp xlnet

Last synced: 13 Jun 2024

https://github.com/extreme-bert/extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.

bert deep-learning language-model language-models machine-learning natural-language-processing nlp python pytorch transformer

Last synced: 13 Jun 2024

https://github.com/ymcui/MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

bert language-model macbert nlp pytorch tensorflow transformers

Last synced: 13 Jun 2024

https://github.com/cambridgeltl/sapbert

[NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.

acl2021 bert bionlp contrastive-learning language-model lexical-semantics machine-learning metric-learning naacl2021 nlp representation-learning

Last synced: 11 Jun 2024

https://github.com/lyuchenyang/Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

deep-learning language-model machine-learning multi-modal-learning natural-language-processing neural-networks

Last synced: 11 Jun 2024

https://github.com/THUDM/CogVLM

A state-of-the-art-level open visual language model | Multimodal pre-trained model

cross-modality language-model multi-modal pretrained-models visual-language-models

Last synced: 11 Jun 2024

https://github.com/microsoft/generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

ai azure chatgpt dall-e generative-ai generativeai gpt language-model llms openai prompt-engineering semantic-search transformers

Last synced: 11 Jun 2024

https://github.com/modal-labs/quillman

A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.

ai language-model python serverless speech-recognition speech-to-text

Last synced: 10 Jun 2024

https://github.com/pd3f/pd3f

🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based

extract-text language-model machine-learning ocr parsr pd3f pdf pdf-to-text pipeline python text-extraction

Last synced: 09 Jun 2024

https://github.com/louisfb01/start-llms

A complete guide to start and improve your LLM skills in 2024 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

ai fine-tuning gpt gpt-4 language-model large-language-models llama llm llms rag retrieval-augmented-generation

Last synced: 08 Jun 2024

https://github.com/microsoft/ToRA

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

autonomous-agents language-model llm mathematical-reasoning tool-learning

Last synced: 08 Jun 2024

https://github.com/Proteusiq/luga

Blazing fast language detection using fastText model

detection language language-model languages machine-learning

Last synced: 07 Jun 2024

https://github.com/allenai/ScienceWorld

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

language-model machine-learning reinforcement-learning text-based-game text-based-game-framework

Last synced: 07 Jun 2024

https://github.com/intelligentnode/IntelliNode

Access the latest AI models like ChatGPT, LLaMA, Diffusion, Gemini, Hugging Face, and beyond through a unified prompt layer and performance evaluation

anthropic chatbot chatgpt claude dall-e embeddings gemini google-ai gpt-4 hugging-face image-generation language-model mistralai nodejs openai prompt-engineering semantic-search speech-synthesis vectors

Last synced: 07 Jun 2024

https://github.com/mihail911/nlp-library

curated collection of papers for the nlp practitioner 📖👩‍🔬

deep-learning dialogue language-model machine-learning neural-machine-translation neural-network nlp nlp-datasets

Last synced: 07 Jun 2024

https://github.com/SKT-AI/KoGPT2

Korean GPT-2 pretrained cased (KoGPT2)

korean-nlp language-model

Last synced: 07 Jun 2024

https://github.com/xu-song/bert-as-language-model

BERT as language model, fork from https://github.com/google-research/bert

bert language-model tensorflow

Last synced: 06 Jun 2024

https://github.com/cedrickchee/awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.

attention-mechanism awesome awesome-list bert chatgpt gpt-2 gpt-3 gpt-4 language-model llama natural-language-processing neural-networks nlp pre-trained-language-models transfer-learning transformer xlnet

Last synced: 06 Jun 2024

https://github.com/Microsoft/AzureML-BERT

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

azure-machine-learning azureml-bert bert bert-model finetuning language-model nlp pretrained-models pretraining pytorch tuning

Last synced: 06 Jun 2024

https://github.com/lonePatient/albert_pytorch

A Lite BERT for Self-Supervised Learning of Language Representations

albert bert language-model mask ngram nlp pytorch

Last synced: 06 Jun 2024

https://github.com/brightmart/xlnet_zh

Pre-trained Chinese XLNet model: XLNet_Large

bert language-model pre-train roberta xlnet

Last synced: 06 Jun 2024

https://github.com/SteveKGYang/MentalLLaMA

This repository introduces MentaLLaMA, the first open-source instruction-following large language model for interpretable mental health analysis.

chatgpt gpt4 interpretability language-model large-language-models llama2 mental-health natural-language-processing natural-language-understanding social-media

Last synced: 04 Jun 2024

https://github.com/yizhongw/self-instruct

Aligning pretrained language models with instruction data generated by themselves.

general-purpose-model instruction-tuning language-model

Last synced: 02 Jun 2024

https://github.com/young-geng/EasyLM

Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

chatbot deep-learning flax jax language-model large-language-models llama natural-language-processing transformer

Last synced: 02 Jun 2024

https://github.com/yanyongyu/operagents

Dynamic, highly customizable language agents framework

agent crewai gpt langgraph language-model sop

Last synced: 02 Jun 2024

https://github.com/instadeepai/nucleotide-transformer

🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

deep-learning dna genomics language-model nucleotide transformer

Last synced: 02 Jun 2024

https://github.com/mlc-ai/web-llm

Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.

chatgpt deep-learning language-model llm tvm webgpu webml

Last synced: 01 Jun 2024

https://github.com/zahidkhawaja/rusty

AI-powered CLI tool to help you remember bash commands.

gpt-3 hacktoberfest language-model machine-learning openai rust rust-lang

Last synced: 31 May 2024

https://github.com/salesforce/CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

code-generation code-intelligence code-understanding language-model large-language-models

Last synced: 31 May 2024

https://github.com/yaodongC/awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

awsome-lists datasets gpt-3 gpt-4 instruction-following instruction-tuning language-model llama

Last synced: 30 May 2024

https://github.com/maraoz/gpt-scrolls

A collaborative collection of open-source safe GPT-3 prompts that work well

generator gpt-3 language-model openai safety transformer

Last synced: 30 May 2024

https://github.com/THUDM/CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

cogvlm language-model multi-modal pretrained-models

Last synced: 28 May 2024

https://github.com/mewmix/llama-index-flask-demo

A Flask Server Demo Application showing off some llama-index LLM prompt magic, including file upload and parsing :)

ai chat-gpt flask language-model llama-index llm open-ai prompt-engineering python server

Last synced: 27 May 2024

https://github.com/Bradley-Butcher/Conformers

Unofficial implementation of Conformal Language Modeling by Quach et al

alignment conformal-prediction language-model transformers

Last synced: 25 May 2024

https://github.com/nathanlesage/local-chat

LocalChat is a ChatGPT-like chat that runs on your computer

chat chatbot chatgpt electron huggingface language-model llama llamacpp llm local privacy transformers

Last synced: 25 May 2024

https://github.com/stochasticai/xturing

Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our Discord community: https://discord.gg/TgHXuSJEk6

adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization

Last synced: 25 May 2024

https://github.com/hyperonym/basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

generative gpt huggingface language-model llama llm model natural-language-processing nlp openai-api python text-generation transformers

Last synced: 25 May 2024

https://github.com/tatsu-lab/stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

deep-learning instruction-following language-model

Last synced: 25 May 2024