Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/kolhesamiksha/nemo_curator

This repository contains a sample text data-preparation code using Nemo Curator for pre-training or synthetic data generation
https://github.com/kolhesamiksha/nemo_curator

curator data-preprocessing-pipelines finetuning-llms generative-ai nemo nvidia synthetic-dataset-generation

Last synced: about 1 month ago
JSON representation

This repository contains a sample text data-preparation code using Nemo Curator for pre-training or synthetic data generation

Awesome Lists containing this project

README

        

# Nemo_Curator
This repository contains a sample text data-preparation code using Nemo Curator for pre-training or synthetic data generation

![image](https://github.com/user-attachments/assets/b28fe3ee-fe06-4c51-a2ef-d6cc702cc4ae)