https://github.com/gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
https://github.com/gururise/AlpacaDataCleaned
Last synced: about 1 year ago
JSON representation
Alpaca dataset from Stanford, cleaned and curated
- Host: GitHub
- URL: https://github.com/gururise/AlpacaDataCleaned
- Owner: gururise
- License: apache-2.0
- Created: 2023-03-21T16:30:07.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2023-04-14T17:57:27.000Z (about 3 years ago)
- Last Synced: 2024-10-29T17:51:56.574Z (over 1 year ago)
- Language: Python
- Size: 77.8 MB
- Stars: 1,511
- Watchers: 27
- Forks: 151
- Open Issues: 11
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-instruction-dataset - (gururise/Cleaned Alpaca)|52K|EN|MT|SI
- awesome-instruction-datasets - AlpacaDataCleaned - cleaned](https://huggingface.co/datasets/yahma/alpaca-cleaned) | yahma | 52k | EN | MT | SI | general instruct | text-davinci-003 | [download](https://huggingface.co/datasets/QingyiSi/Alpaca-CoT/tree/main/alpaca) | (Statistics)
- awesome-ai-coding-agent-tools - Cleaned Alpaca Dataset - Cleaned Alpaca instruction dataset for fine-tuning LLMs. (Learning Resources / Datasets)
- awesome-chatgpt-dataset - Alpaca Data Cleaned - | (Dataset Detail)
- awesome-prompt-engineering - AlpacaDataCleaned
- StarryDivineSky - gururise/AlpacaDataCleaned
- Awesome-Machine-Generated-Text - [repo - cleaned) | (Detection / Datasets)