https://github.com/saucam/llmdataforge
A framework for generating synthetic datasets tailored for large language models (LLMs)
https://github.com/saucam/llmdataforge
Last synced: 3 months ago
JSON representation
A framework for generating synthetic datasets tailored for large language models (LLMs)
- Host: GitHub
- URL: https://github.com/saucam/llmdataforge
- Owner: saucam
- License: mit
- Created: 2024-05-31T02:36:45.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-05-31T14:23:12.000Z (over 1 year ago)
- Last Synced: 2025-03-17T10:11:47.250Z (9 months ago)
- Homepage:
- Size: 2.46 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome_ai_agents - Llmdataforge - A framework for generating synthetic datasets tailored for large language models (LLMs) (Building / Datasets)
README

# LLMDataForge
LLMDataForge is a framework for generating synthetic datasets tailored for large language models (LLMs).
The framework is optimized for:
- Ease of getting started
- Running locally with minimum hardware requirements
- High-Quality Output
- Minimum Dependencies
- Reasonably Fast Generation
- Full Data Ownership