Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
fine-tuning-llm large-language-models llm long-context lora
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/dvlab-research/LongLoRA
- Owner: dvlab-research
- License: apache-2.0
- Created: 2023-09-21T13:10:38.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-08-14T12:48:54.000Z (5 months ago)
- Last Synced: 2024-10-15T09:21:55.744Z (3 months ago)
- Topics: fine-tuning-llm, large-language-models, llm, long-context, lora
- Language: Python
- Homepage: http://arxiv.org/abs/2309.12307
- Size: 11.1 MB
- Stars: 2,614
- Watchers: 12
- Forks: 270
- Open Issues: 50
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- Awesome-Reasoning-Foundation-Models - [code
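- StarryDivineSky - dvlab-research/LongLoRA - Attention-compatible and not required during inference. We released all our models, from 7B to 70B, with context lengths from 8k to 100k. We built a long-context instruction-following dataset, LongAlpaca-12k. We released the corresponding LongAlpaca-7B, LongAlpaca-13B, and LongAlpaca-70B models. (A01_Text Generation_Text Dialogue / Large Language Dialogue Models and Data)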