https://github.com/iamdyeus/synthetica
codebase and pipeline to generate high-quality synthetic datasets using Llama 3.1 and an innovative constitutional ruleset framework.
https://github.com/iamdyeus/synthetica
dataset-generation indianisms llama3-1 llm
Last synced: 7 months ago
JSON representation
codebase and pipeline to generate high-quality synthetic datasets using Llama 3.1 and an innovative constitutional ruleset framework.
- Host: GitHub
- URL: https://github.com/iamdyeus/synthetica
- Owner: iamDyeus
- Created: 2024-11-17T15:09:33.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2025-01-10T21:21:47.000Z (9 months ago)
- Last Synced: 2025-01-10T22:31:26.383Z (9 months ago)
- Topics: dataset-generation, indianisms, llama3-1, llm
- Language: Python
- Homepage:
- Size: 591 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Synthetic Dataset Creation
pipeline to generate high-quality "Indianisms" synthetic datasets using Llama 3.1:8B and an innovative constitutionalAI like ruleset framework.## Workflow
