Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-latest-LLM
最新LLMの一覧を作成します
https://github.com/stardust-coder/awesome-latest-LLM
Last synced: 6 days ago
JSON representation
-
English-centric
- Mixtral-8x7B - 8x7B-Instruct-v0.1) | 8x7B | apache-2.0 |||MoE, [offloading](https://github.com/dvmazur/mixtral-offloading)|
- Grok-1
- LongNet(Microsoft) - | apache-2.0 | [MAGNETO](https://arxiv.org/pdf/2210.06423.pdf)| input 1B token| |
- gigaGPT(Cerebras) - 2.0 | | |
- Mamba - spaces/mamba-2.8b) | 2.8B | apache-2.0 | based on state space model| |
- QWen(Alibaba) - 72B) | 72B | [license](https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT)| 3T tokens | | beats Llama2 |
- Self-RAG - 2.0 | 13B | | | critic model |
- TinyLlama - 1.1B-intermediate-step-1431k-3T) | apache-2.0 | 1.1B | based on Llama, 3T token | | |
- Xwin-LM - LM/Xwin-LM-70B-V0.1) | 70B | Llama2 |based on Llama2| also codes and math|
- Amber - 2.0 | Llama|| totally open|
- Phi-1.5(Microsoft) - 1_5) | 1.3B| MSRA-license||textbooks| -->
- BTX(Meta)
- Reka Flash
- BTX(Meta)
- Gemma(Google)
- Miqu - 1-70b/tree/main) | 70B | none ||| leaked from Mistral |
- Aya(Cohere) - 101) | 13B | apache-2.0 | || multilingual |
- Mixtral-8x22B(Mistral) - community/Mixtral-8x22B-v0.1) | 8x22B | apache-2.0 | || MoE |
- Llama2(Meta) - llama) | 70B | Llama2 | 2T tokens| chat-hf seems the best|
- Command-R(Cohere) - command-r-v01) | 35B | non commercial | || RAG capability |
- Phi-3(Microsoft) - 3-medium-128k-instruct) | 3.8B, 13B | MIT | Phi-3 datasets | - | |
-
Japanese-centric
- awesome-japanese-llm - llm.github.io/evaluation/about.ja.html)
- Swallow(東工大) - llm) | 70B | | Llama2-70Bベース |
- ELYZA-japanese-Llama-2-13b - japanese-Llama-2-13b) | 13B | | Llama-2-13b-chatベース |
- StableLM(StabilityAI) - stablelm-base-beta-70b) | 70B | | Llama2-70Bベース |
- LLM-jp - jp) | 13B | DPO追加あり |
- KARAKURI 70B - ai/karakuri-lm-70b-v0.1) | 70B | cc-by-sa-4.0 | Llama2-70Bベース | | [note](https://note.com/ngc_shj/n/n46ced665b378?sub_rt=share_h)|
- LLama3ELYZA-JP-8B - 3-ELYZA-JP-8B) | 8B | Llama3 | Llama3 | | 70B not open |
- KARAKURI LM 8x7B - ai/karakuri-lm-8x7b-chat-v0.1) | 8x7B | Apache-2.0 | | | MoE |
-
Model
- Apollo - 7B) | ~7B | | | | | | multilingual |
- Meditron(EPFL) - llm/meditron-70B) | 70B | Llama2 | Llama2 | GAP-Replay(48.1B) | [dataset](img/meditron-testdata.png),[score](img/meditron-eval2.png) | |
- BioMedGPT(Luo et al.)
- PMC-LLaMa
- Med-Flamingo
- LLaVa-Med(Microsoft) - med-7b-delta) | 13B | - | LLaVa| medical dataset | VAQ-RAD, SLAKE, PathVQA |multi-modal|
- Awesome-Healthcare-Foundation-Models
- AMIE(Google) - | - | based on PaLM 2 | | | EHR|
- Med-PaLM2(Google) - | PaLM2 | | |
- Med-PaLM M(Google) - | PaLM2 | | |multi-modal|
- Med-PaLM(Google) - | PaLM | | | |
- UltraMedical(TsinghuaC3I) - | Llama3 | | | |
- MedLLMsPracticalGuide
- 医療分野に特化したLLM紹介
- HF - 2.0 | Llama3 | 100,000+ data, [ORPO](https://huggingface.co/blog/mlabonne/orpo-llama-3) | | |
- Health-LLM(Rutgersなど)
- JMedLoRA(UTokyo) - CVM-utokyohospital/llama2-jmedlora-3000) | 70B | none | none | QLoRA | IgakuQA | Japanese, insufficient quality |
- Almanac(Stanford) - davinci-003 | | | RAG |
- AdaptLLM(Microsoft Research) - LLM) | 7B, 13B | | reading comprehensive corpora | | | | ICLR2024 |
- BioMistral - | | | | |
- AMIE(Google) - | - | based on PaLM 2 | | | EHR|
- Hippocrates
- AdaptLLM(Microsoft Research) - LLM-13B) | 7B, 13B | | reading comprehensive corpora | | | | ICLR2024 |
- Med-Gemini(Google) - | Gemini | | |multimodal|
- BiMediX - commercial | 8x7B | mixtral8x7B | | | MoE |
- Meditron(EPFL) - | 8B | - | Llama3 | | MedQA, MedMCQA, PubmedQA | SOTA |
- Health-LLM(Rutgersなど)
- BioMistral - | | | | |
- Almanac(Stanford) - davinci-003 | | | RAG |
-
Evaluation
-
Dataset
-
Only Text
-
- JMMLU - translated version of MMLU
- IgakuQA(Japanese National Medical License Exam)
- He et al.(2023)
- Apollo Corpus JP
- JMedData4LLM
- J-ResearchCorpus
- MIMIC-ECG-IV - caption dataset
-
Image + Text / Multimodal
-
-
Uncategorized
-
Uncategorized
-
Programming Languages
Sub Categories
Keywords
llm
4
large-language-models
3
multimodal
2
natural-language-processing
2
llm-japanese
1
large-language-model
1
language-models
1
language-model
1
japanese-llm
1
japanese-language-model
1
japanese-language
1
japanese
1
generative-models
1
generative-model
1
generative-ai
1
foundation-models
1
pretrained-models
1
mistral
1
flash-attention
1
chinese
1
translation
1
transformer
1
speech-processing
1
pretrained-language-model
1
machine-learning
1
computer-vision
1
moe
1
question-answering
1
ptb-xl
1
mimic-iv-ecg
1
ekg
1
ecg-qa
1
ecg
1
survey
1
medical-large-language-models
1
clinical-ai
1
ai-in-medicine
1
transfer-learning
1
muti-task
1
gpt-3
1
few-shot-learning
1
public-health
1
fake-news-detection
1
fake-news
1
fact-checking
1
explainable-ml
1
explainable-ai
1
emnlp2020
1
open-source
1
medical
1