An open API service indexing awesome lists of open source software.

https://github.com/iamdyeus/synthetica

codebase and pipeline to generate high-quality synthetic datasets using Llama 3.1 and an innovative constitutional ruleset framework.
https://github.com/iamdyeus/synthetica

dataset-generation indianisms llama3-1 llm

Last synced: 7 months ago
JSON representation

codebase and pipeline to generate high-quality synthetic datasets using Llama 3.1 and an innovative constitutional ruleset framework.

Awesome Lists containing this project

README

          

# Synthetic Dataset Creation
pipeline to generate high-quality "Indianisms" synthetic datasets using Llama 3.1:8B and an innovative constitutionalAI like ruleset framework.

## Workflow
![Pipeline Workflow](./pipeline_diagram.png)