https://github.com/ssbuild/aigc_data

Shared data, prompt data, and pretraining data

aigc-data data instruct llm open open-data pretraining prompt



## aigc_data space (data sharing space)

## Data for LLMs

If you like the project, please show your support by leaving a star ⭐.

| No. | Project | Description | Access key |
|-----|:----------------------------------------------------------------------------------------------:|:-----------:|:------:|
| 1 | [WuDao - 200G](https://data.baai.ac.cn/details/WuDaoCorporaText) | | Not required |
| 2 | [The Pile (English data) - 1.3T](https://pile.eleuther.ai/) | Needs heavy cleaning | Not required |
| 3 | [Tigerbot open-source Chinese pretraining set - 55G](https://huggingface.co/datasets/TigerResearch/pretrain_zh/tree/main) | | Not required |

...
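The Pile entry above is marked as needing heavy cleaning. As a rough illustration only, here is a minimal sketch of what a first cleaning pass might look like. It is not this project's pipeline. It assumes each record is a dict with a `text` field; the names `normalize`, `clean_corpus`, and the `MIN_CHARS` threshold are hypothetical and chosen for the example.

```python
import hashlib
import re
import unicodedata

# Hypothetical threshold (an assumption, not from the project); tune per corpus.
MIN_CHARS = 20

# ASCII control characters other than tab, newline, and carriage return.
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]")
_SPACE_RUN = re.compile(r"[ \t]+")


def normalize(text: str) -> str:
    """Apply NFC normalization, drop control characters, collapse spaces."""
    text = unicodedata.normalize("NFC", text)
    text = _CONTROL_CHARS.sub("", text)
    lines = [_SPACE_RUN.sub(" ", line).strip() for line in text.splitlines()]
    # Drop lines that are empty after stripping.
    return "\n".join(line for line in lines if line)


def clean_corpus(records, min_chars=MIN_CHARS):
    """Yield normalized documents, skipping short ones and exact duplicates.

    Assumes each record is a dict with a "text" key (an assumption).
    """
    seen = set()
    for record in records:
        text = normalize(record.get("text", ""))
        if len(text) < min_chars:
            continue  # too short to be useful
        digest = hashlib.sha1(text.encode("utf-8")).hexdigest()
        if digest in seen:
            continue  # exact duplicate after normalization
        seen.add(digest)
        yield {"text": text}
```

Because `clean_corpus` is a generator, it can be fed records one at a time from a large file without loading the whole corpus into memory. Exact-hash deduplication only catches identical documents; near-duplicate detection and quality filtering would need additional steps.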

The GitHub repository is not updated often. For more data, see [Data Sharing](http://124.70.99.221:8080).

Six invitation codes have been released:
```text
68812a3a4d1c48e39626aeb47a3f4052KhKC0CLbNQv6KIAgm4i4d0Zj4XiGssLWrcA7TlvjwBg8vydB22S6XEbUwDEOfuFkHrQAilImXCQC5tgMU0TJ9eI9tdP2F3Ni
7c07076a2ded4fa6aa16bc484b7192a3ZajgaQrTvbSjDegTCyKVg6iKd1hwwLM6onwxew386vCyUh8Ey1E9CsKQdkIv5vFLL6LTRX8bsV7lA9TZ4csbHDKecyVcllk5
c8a340f435be43ac8e72f265764c987fnRXB9E7hiarIPOrEy3aC5lDzfjArBmjvQP7L3EfBvBnQj7fCpDn1wLKP8dq96sLBw6X5U7Hazkv4MUQ8w9BNqWfyEs5T9WTH
2df70a0d62d542d8b8919fd63603dc33OMpvf0kOPSkOtRzl0jdv0NO3x5MGccGItIc3WRCLqWA7kWIOTUnkzqFfr3so8AtgpyI2UYlDbNp7H6nUtBNTcr4IwN2gGVe6
5b461015581c4b46aa979060b899ea94eBumlAmhsA6ZfFMGuXax6L8tjFHGANOnVNTnuOvHuTUXF2HbkNc7jfJXWUMzcAwP8GWBPz2cqlzDL0N5L0Z6Vg2p3Jll1S3M
67f806e9bd95467889acacc75ff1e3aaDLnjPYcI54tS1YACGQB3t1v5Qcfaua1PPCZQjCRID29XZaFHCTLCtSVXL9jQplzERxg0MBAJsdwESiwGZ6jAGTYAnV04FMXg
```




## Join us
QQ group 185144988