An open API service indexing awesome lists of open source software.

https://github.com/saucam/llmdataforge

A framework for generating synthetic datasets tailored for large language models (LLMs)
https://github.com/saucam/llmdataforge

Last synced: 3 months ago
JSON representation

A framework for generating synthetic datasets tailored for large language models (LLMs)

Awesome Lists containing this project

README

          

![](docs/assets/llmdataforge.png)

# LLMDataForge
LLMDataForge is a framework for generating synthetic datasets tailored for large language models (LLMs).
The framework is optimized for:
- Ease of getting started
- Running locally with minimum hardware requirements
- High-Quality Output
- Minimum Dependencies
- Reasonably Fast Generation
- Full Data Ownership