awesome-latest-LLM

最新LLMの一覧を作成します
https://github.com/stardust-coder/awesome-latest-LLM

Last synced: 3 days ago
JSON representation

English-centric
- Grok-1
- LongNet(Microsoft) - | apache-2.0 | [MAGNETO](https://arxiv.org/pdf/2210.06423.pdf)| input 1B token| |
- gigaGPT(Cerebras) - 2.0 | | |
- Mamba - spaces/mamba-2.8b) | 2.8B | apache-2.0 | based on state space model| |
- QWen(Alibaba) - 72B) | 72B | [license](https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT)| 3T tokens | | beats Llama2 |
- Self-RAG - 2.0 | 13B | | | critic model |
- TinyLlama - 1.1B-intermediate-step-1431k-3T) | apache-2.0 | 1.1B | based on Llama, 3T token | | |
- Xwin-LM - LM/Xwin-LM-70B-V0.1) | 70B | Llama2 |based on Llama2| also codes and math|
- Amber - 2.0 | Llama|| totally open|
- Phi-1.5(Microsoft) - 1_5) | 1.3B| MSRA-license||textbooks| -->
- BTX(Meta)
- Reka Flash
- BTX(Meta)
- Gemma(Google)
- Miqu - 1-70b/tree/main) | 70B | none ||| leaked from Mistral |
- Aya(Cohere) - 101) | 13B | apache-2.0 | || multilingual |
- Mixtral-8x7B - 8x7B-Instruct-v0.1) | 8x7B | apache-2.0 |||MoE, [offloading](https://github.com/dvmazur/mixtral-offloading)|
- Mixtral-8x22B(Mistral) - community/Mixtral-8x22B-v0.1) | 8x22B | apache-2.0 | || MoE |
- Command-R(Cohere) - command-r-v01) | 35B | non commercial | || RAG capability |
- Llama4 (Meta) - llama/llama-4-67f0c30d9fe03840bc9d0164)|17B|llama4|30T token||10M token|
- Phi-3(Microsoft) - 3-medium-128k-instruct) | 3.8B, 13B | MIT | Phi-3 datasets | - | |
- DeepSeek-V3 - ai/DeepSeek-V3) | 671B | [link](https://github.com/deepseek-ai/DeepSeek-V3/blob/main/LICENSE-MODEL) | 14.8T | sft, RL | MoE |
- Phi-4 (Microsoft) - 4) | 14B | msrla | | | small, sft, dpo |
- Minimax-01 - Text-01) | [Minimax](https://github.com/MiniMax-AI/MiniMax-01?tab=License-1-ov-file) | 456(45.9)B | 1M token context length | | MoE, 4M token window |
- Miqu - 1-70b/tree/main) | 70B | none ||| leaked from Mistral |
- DeepSeek-R1 - ai/DeepSeek-R1)| 671B | MIT | | |
- Llama 3(Meta) - llama/Meta-Llama-3-70B-Instruct) | 70B | [META LLAMA3](https://llama.meta.com/llama3/license/) | || [extended to 120B](https://huggingface.co/mlabonne/Meta-Llama-3-120B-Instruct) |
- Awesome-LLM
Japanese-centric
- awesome-japanese-llm - llm.github.io/evaluation/about.ja.html)
- Swallow(東工大) - llm) | 70B | | Llama2-70Bベース |
- ELYZA-japanese-Llama-2-13b - japanese-Llama-2-13b) | 13B | | Llama-2-13b-chatベース |
- StableLM(StabilityAI) - stablelm-base-beta-70b) | 70B | | Llama2-70Bベース |
- LLM-jp - jp) | 13B | DPO追加あり | -->
- KARAKURI 70B - ai/karakuri-lm-70b-v0.1) | 70B | cc-by-sa-4.0 | Llama2-70Bベース | | [note](https://note.com/ngc_shj/n/n46ced665b378?sub_rt=share_h)|
- LLama3ELYZA-JP-8B - 3-ELYZA-JP-8B) | 8B | Llama3 | Llama3 | | 70B not open |
- KARAKURI LM 8x7B - ai/karakuri-lm-8x7b-chat-v0.1) | 8x7B | Apache-2.0 | | | MoE |
- PlaMo 2 (PFN) - 2-8b) | 8B | [plamo](https://tech.preferred.jp/ja/blog/plamo-community-license/) ||| Samba |
Model
- Apollo - 7B) | ~7B | | | | | | multilingual |
- Meditron(EPFL) - llm/meditron-70B) | 70B | Llama2 | Llama2 | GAP-Replay(48.1B) | [dataset](img/meditron-testdata.png),[score](img/meditron-eval2.png) | |
- BioMedGPT(Luo et al.)
- PMC-LLaMa
- Med-Flamingo
- LLaVa-Med(Microsoft) - med-7b-delta) | 13B | - | LLaVa| medical dataset | VAQ-RAD, SLAKE, PathVQA |multi-modal|
- Awesome-Healthcare-Foundation-Models
- AMIE(Google) - | - | based on PaLM 2 | | | EHR|
- Med-PaLM2(Google) - | PaLM2 | | |
- Med-PaLM M(Google) - | PaLM2 | | |multi-modal|
- Med-PaLM(Google) - | PaLM | | | |
- UltraMedical(TsinghuaC3I) - | Llama3 | | | |
- MedLLMsPracticalGuide
- 医療分野に特化したLLM紹介
- HF - 2.0 | Llama3 | 100,000+ data, [ORPO](https://huggingface.co/blog/mlabonne/orpo-llama-3) | | |
- Health-LLM(Rutgersなど)
- JMedLoRA(UTokyo) - CVM-utokyohospital/llama2-jmedlora-3000) | 70B | none | none | QLoRA | IgakuQA | Japanese, insufficient quality |
- Almanac(Stanford) - davinci-003 | | | RAG |
- AdaptLLM(Microsoft Research) - LLM) | 7B, 13B | | reading comprehensive corpora | | | | ICLR2024 |
- BioMistral - | | | | |
- AMIE(Google) - | - | based on PaLM 2 | | | EHR|
- Hippocrates
- AdaptLLM(Microsoft Research) - LLM-13B) | 7B, 13B | | reading comprehensive corpora | | | | ICLR2024 |
- Med-Gemini(Google) - | Gemini | | |multimodal|
- BiMediX - commercial | 8x7B | mixtral8x7B | | | MoE |
- Meditron(EPFL) - | 8B | - | Llama3 | | MedQA, MedMCQA, PubmedQA | SOTA |
- BioMistral - | | | | |
- Huatuo-o1 - o1-72B) | 72B | apache-2.0 |
- Almanac(Stanford) - davinci-003 | | | RAG |
- Health-LLM(Rutgersなど)
- Meditron(EPFL) - | 8B | - | Llama3 | | MedQA, MedMCQA, PubmedQA | SOTA |
- OpenMeditron - 70B) | 7~70B | |||MedQA etc. |
- Awesome-Medical-Large-Language-Models
- Awesome-Medical-LLM
Evaluation benchmarks
Evaluation
- Japanese Medical Language Model Evaluation Harness
- 医療ドメイン特化LLMの性能はどうやって評価する？
Dataset
- Only Text
  - MedQA (USMLE)
  - PubHealth
  - MMLU
  - K-Q&A
  - MedMCQA
  - PubMedQA
  - MedDistractQA
  - EquityMedQA - ended Q&A for equity and bias mitigation.
  - HeadQA
  - LongHealth
  - Medical Eval Sphere
  - MedCalcBench - Bench-v1.0)
  - PMC Patients
  - MedQA-Calc
  - MedS-Bench
  - MedQuAD
  - TJH Dataset
  - **OnDeviceMedNotes**
  - synthetic-medical-conversations-deepseek-v3
  - **FreedomIntelligence**
  - Medical O1 Reasoning
  - Medical O1 Verifiable Problem
  - Disease Database
- - JMMLU - translated version of MMLU
  - IgakuQA（Japanese National Medical License Exam）
  - He et al.(2023)
  - Apollo Corpus JP
  - JMedData4LLM
  - J-ResearchCorpus
  - MIMIC-ECG-IV - caption dataset
- Image + Text
  - He et al.(2023)
  - VQA-RAD
  - MedICaT
  - MedVTE
  - MedEval
  - Clinical NLP 2023
  - MedTrinity
  - MedAlign(Stanford)
  - ECG-QA
  - MedLLMsPracticalGuide
  - Medical datasets for LLMs (collection)
  - MIMIC-ECG-IV - caption dataset
  - OmniMedVQA
Uncategorized
- Uncategorized
Small language models (SLM)
- PLaMo 2 2B - 2.0| | | pruning, tested on HumanEval+ |
- Sarashina2.2 - tasks=3.75 |
- Phi-4 mini
- SmolLM(huggingface) - 6723884218bcda64b34d7db9)| 135M~1.7B| apache-2.0 | |
- OLMo-2 - 2-0425-1B-Instruct) | 1B | | | |

Programming Languages

Python 29 TypeScript 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

awesome-latest-LLM

English-centric

Japanese-centric

Model

Evaluation benchmarks

Evaluation

Dataset

Only Text

Image + Text

Uncategorized

Uncategorized

Small language models (SLM)