An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-generator

A curated list of projects in awesome lists tagged with data-generator .

https://github.com/bchavez/bogus

:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

bogus c-sharp csharp data data-access-layer data-generator database dotnet fake faker generator poco seed test-data

Last synced: 03 Nov 2025

https://github.com/easy-mock/easy-mock

A persistent service that generates mock data quickly and provids visualization view.

data-generator easy-mock javascript mock swagger vue

Last synced: 14 May 2025

https://github.com/bchavez/Bogus

:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

bogus c-sharp csharp data data-access-layer data-generator database dotnet fake faker generator poco seed test-data

Last synced: 16 Mar 2025

https://github.com/boo1ean/casual

Fake data generator for javascript

data-generator faker generator javascript

Last synced: 13 May 2025

https://github.com/hitsz-ids/synthetic-data-generator

SDG is a specialized framework designed to generate high-quality structured tabular data.

agent data-generator deep-learning gan generative-ai llm machine-learning privacy synthetic-data tabular-data

Last synced: 13 May 2025

https://github.com/instancio/instancio

A library that creates fully populated objects for your unit tests.

data-generator java junit junit-jupiter random random-generation test-automation test-data-generator testing unit-testing

Last synced: 11 Jan 2026

https://github.com/afshinea/keras-data-generator

Template for data generator in Keras

data-generator keras

Last synced: 09 Apr 2025

https://github.com/drewhamilton/poko

A Kotlin compiler plugin that generates equals, hashCode, and toString for plain old Kotlin objects in public APIs.

data-api-generator data-class data-generator extra-care ir kotlin-compiler-plugin

Last synced: 06 Apr 2025

https://github.com/finos/datahelix

The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation

data-engineering data-generation data-generator java test-data-generator

Last synced: 27 Feb 2025

https://github.com/aleksandarskrbic/khaos

Kafka data generator and load testing tool - generate fake messages, simulate producers and consumers, test broker failures, and run chaos engineering scenarios

apache-kafka chaos-engineering data-engineering data-generator devops devops-tools fake-data fault-injection kafka load-testing python stream-processing stress-testing

Last synced: 13 Jan 2026

https://github.com/rom-rb/rom-factory

Data generator with support for persistence backends

data-generator rom-rb sql

Last synced: 08 Apr 2025

https://github.com/Tynamix/ObjectFiller.NET

The .NET ObjectFiller fills the properties of your .NET objects with random data

c-sharp data-generator test-automation unittest

Last synced: 18 Apr 2025

https://github.com/nathanchapman/mayonnaise.js

🎺 Fake data generator for JavaScript, courtesy of Patrick Star

casual data-generator faker generator javascript lorem-ipsum mocking placeholder placeholder-text spongebob

Last synced: 11 Apr 2025

https://github.com/smartcat-labs/ranger

Ranger is contextual data generator used to make sensible data for integration tests or to play with it in the database

contextual-data data-generation data-generator test-data

Last synced: 12 Aug 2025

https://github.com/edyan/neuralyzer

Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)

anonymisation anonymization anonymize data-generation data-generator data-privacy database dgpr private-life rgpd

Last synced: 04 Apr 2025

https://github.com/doktormike/dammmdatagen

Marketing Mix Modeling Data Generator

benchmark data data-generator marketing-mix-modeling

Last synced: 29 Jul 2025

https://github.com/DoktorMike/dammmdatagen

Marketing Mix Modeling Data Generator

benchmark data data-generator marketing-mix-modeling

Last synced: 06 May 2025

https://github.com/cliffano/datagen

Multi-process test data files generator

cli data-generator nodejs

Last synced: 22 Sep 2025

https://github.com/navchandar/python-random-name-generator

Python data provider module that returns random people names, addresses, state names, country names as output. Useful for unit testing and automation.

data-generator python-random random randomdatagenerator sampledataset samples test-data test-data-generator testing

Last synced: 21 Aug 2025

https://github.com/xushiyan/kafka-connect-datagen

A Kafka Connect source connector that generates data for tests

data-generator etl etl-pipeline integration-test java kafka kafka-connect performance-test

Last synced: 22 Mar 2025

https://github.com/travvy88/documentgenerator_doge

Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, paragraphs with different formatting and fonts. Can be used in OCR, document transformers pretraining, text detection and more other tasks.

ai bounding-boxes data-generator dataset document document-generation document-generator ocr synthetic-data synthetic-dataset-generation

Last synced: 04 Oct 2025

https://github.com/Travvy88/DocumentGenerator_DoGe

Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, paragraphs with different formatting and fonts. Can be used in OCR, document transformers pretraining, text detection and more other tasks.

ai bounding-boxes data-generator dataset document document-generation document-generator ocr synthetic-data synthetic-dataset-generation

Last synced: 25 Jul 2025

https://github.com/jibsen/lzdatagen

LZ data generator

c compression data-generator

Last synced: 13 May 2025

https://github.com/yuja201/here-is-dummy

A desktop app that analyzes your database schema and generates high-quality dummy data automatically.

ai-generator automation data-generator data-tools database-schema database-testing desktop-app dummy-data electron faker mysql nodejs open-source postgresql react sql-generator typescript

Last synced: 09 Jun 2026

https://github.com/cisco-open/test-telemetry-generator

OpenTelemetry data generator to simplify testing of your platform

data-generator opentelemetry testing-library

Last synced: 11 Sep 2025

https://github.com/synthesized-io/tdk-demo

This is a collection of TDK demo projects that use different databases and options

data-generation data-generator db2 mysql oracle postgresql synthetic-data synthetic-dataset-generation test-data-generator vault

Last synced: 24 Jun 2025

https://github.com/crocs-muni/cryptostreams

Tool for generation of data from cryptoprimitives (block and stream ciphers, hash functions). Cryptoprimitives are round-reduced and the data can be configured for multiple testing scenarios.

block-ciphers cryptography data-generator hash-functions stream-ciphers

Last synced: 31 Jan 2026

https://github.com/famouswolf/randomdata

TYPO3 extensions to generate new random data or replace existing data with random data

anonymization data-generator extension faker random random-generator typo3 typo3-extension

Last synced: 30 Jun 2025

https://github.com/lycantropos/hypothesis_geometry

`hypothesis` strategies for geometries

data-generator geometry hypothesis quickcheck testing

Last synced: 30 Apr 2025

https://github.com/marcovisibelli/synthetico

general-purpose synthetic data generators to enable data science experiments

ai data-generator learning machine syntetic

Last synced: 11 May 2025

https://github.com/chenanton/ai-rubiks-cube-solver

A program which generates a sequence of cube rotations learned from a deep neural network, solving a scrambled Rubik's Cube.

data-generator keras machine-learning matplotlib-pyplot neural-network python rubiks-cube rubiks-cube-solver tensorflow2

Last synced: 10 Oct 2025

https://github.com/marcosrivasr/json-generator

Tool for generate random JSON data to test your apps

api-sample data-generator json typescript

Last synced: 20 Jul 2025

https://github.com/imshubhamsingh/test-data-generator

A simple fake Data Generator web app to be used by developers | As of now API is functional

api data-generator expressjs nodejs vuejs2

Last synced: 04 Mar 2026

https://github.com/shemmjunior/bandia

Tanzania fake random data generator

data-generator faker-generator mock-data tanzania

Last synced: 10 Jun 2026

https://github.com/ntdls/datarandomizer-sql-clr

Easily generate random human readable data using SQL Server’s SQL CLR.

data-generator sql sqlclr

Last synced: 14 Apr 2025

https://github.com/hrolive/unreal-engine-for-remote-visualization-and-machine-learning

In-depth training to using Unreal Engine as a data generator and integrat it in a simple ML workflow, in one of the leading supercomputing centres.

data-generator hpc machine-learning synthetic-data synthetic-dataset-generation unreal-engine webrtc

Last synced: 27 Jun 2025

https://github.com/ruivieira/timeseries-mock

A flexible data simulator for Kafka and OpenShift using state-space models

data-generator kafka openshift random simulator state-space-model streaming

Last synced: 21 Jul 2025

https://github.com/samir-araujo/faker-es6

A heavily inspired lib to generate massive amounts of realistic fake data. This lib was inspired by what I would like to see in Marak/faker.js plus what I thought could be a good exercise

data data-generator es6 fake fake-content faker faker-es6 generator javascript jest mock mocking typescript

Last synced: 25 Dec 2025

https://github.com/ableneo/liferay-db-setup-core

OSGi bundle to generate Liferay Portal data (permissions, roles, sites, pages etc.). Takes schemed XML declaration as an input and creates database entries accordingly.

data-generator db-migration declarative liferay liferay-7 liferay-73 liferay-dxp liferay-portal liferay-portlet liferay7 liferay71 liferay73 xml-configuration xml-schema

Last synced: 30 Apr 2025

https://github.com/gabrielcrackpro/fake-identity-generator

Web application that provides a fake identity with all the needed info

data-generator identity-generator webapp

Last synced: 03 Jan 2026

https://github.com/kg-construct/krown

KROWN 👑: A Benchmark for RDF Graph Materialization

benchmark data-generator execution-framework materialization rdf rml

Last synced: 12 Jan 2026

https://github.com/EDS-APHP-legacy/pySyntheticDatasetGenerator

Generate relational fictive dataset from a simple yaml description

data-generator database faker generator synthetic-data

Last synced: 02 May 2025

https://github.com/edde746/random-ip-generator

Python package to randomly generate IP by country

data-generator ip-address python

Last synced: 14 Mar 2025

https://github.com/vsfedorenko/kotidgy

Kotidgy aka "Kotlin Text Indexed Data Generator" is an index-based text data generator written in Kotlin

bash data-generator kotlin library text-generator

Last synced: 27 Mar 2025

https://github.com/olafwrieden/telemetry-data-generator

A repository for simulating device telemetry with an Azure Function and feeding it into an Azure Event Hub.

azure data-generator iot ot telemetry

Last synced: 01 May 2026

https://github.com/randomgamingdev/mc_block_color_mapper

Python scripts & libraries for generating and mapping the average colors for each of the Minecraft blocks

average average-calculator cli data data-generator documented-api extract extract-data extractor fast minecraft python3 simple small texture texture-pack textures

Last synced: 22 May 2026

https://github.com/tomzx/handwriting-recognition-data-generator

A data generator of images to train a program to do handwriting recognition.

data-generator glyphs handwriting-recognition

Last synced: 23 Feb 2025

https://github.com/r3dhulk/lets-fake-it

combination of tools using python faker module

data-generator fake fake-data faker python python-3 python-script python3

Last synced: 25 Oct 2025

https://github.com/kaos599/apollo-synthetic-data-generator

Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.

data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui

Last synced: 22 Jul 2025

https://github.com/sayjava/graphql-sample

Zero Coding, ⚡ Rapid GraphQL Sample Data Generator and API

data-generator faker graphql prototype sample-data

Last synced: 28 Apr 2026

https://github.com/abuzar-alvi/generate-dummy-data-in-mongo-db

This back-end web project can help beginners to improve their web development skills.

backend css data-generator dummy-data ejs javascript mangodb nodejs

Last synced: 07 May 2026

https://github.com/andredobbss/fakedatamaker

Fake Data Maker é uma aplicação Blazor WebAssembly para geração de dados falsos, rápida e altamente customizável. Suporta múltiplos idiomas e setores econômicos, com exportação para .csv, .xlsx e .sql. Ideal para cenários de testes, prototipação e simulações de dados.

blazor blazor-webassembly bogus closedxml csharp csvhelper data-generator dotnet fake-data faker mock-data mocking mudblazor multilanguage sqlserver ui-generator wasm

Last synced: 04 Mar 2026

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

This Python project is made by me, Python project for improving python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/cobluestars/dataherd-raika

"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."

big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience

Last synced: 02 Jan 2026

https://github.com/natlee/face-wall-generator-tool

This is a tool for generating face wall to verify some AI models.

data-generator face-recognition python3

Last synced: 15 Mar 2025

https://github.com/victor-antoniassi/day-1_sales_data_generator

Generate daily sales data for the Chinook database. Perfect for building data engineering portfolios with realistic, continuously updating transactional data. Supports D-1 batch processing patterns used in production.

batch-processing chinook-database data-engineering data-generator neon neondb postgresql sample-database synthetic-data

Last synced: 16 May 2026

https://github.com/gaizkiaadeline/rock-paper-scissor-image-classification

This image classification project focuses on classifying images of rock, paper, and scissors gestures using machine learning techniques. The model achieved an impressive validation accuracy of 98.86%, indicating its effectiveness in accurately classifying hand gestures.

callback data-generator image-augmentation image-classification sequential-models

Last synced: 02 Aug 2025

https://github.com/sim98b/tabulardatageneration

Synthetic Data Generation: Tabular & Medical Imaging. A comprehensive project focused on generating synthetic data for tabular datasets and medical imaging.

ai-research artificial-intelligence brain-tumor breast-cancer data-generator data-science dataset-augmentation deep-learning gan generative-model github-projects machine-learning medical-imaging open-source pytorch synthetic-data tabular-data

Last synced: 18 May 2026

https://github.com/kevindeyne/vardogr

Vardøgr is a CLI that can push production-like data to test environments securely and at scale

cli data-generation data-generator database mariadb mysql postgresql scrambled-data

Last synced: 12 Apr 2026

https://github.com/jesufemi-o/fake-coy-api

Dummy api to explore dlt rest api. has authentication, pagination and filtering enabled

data-engineering data-generator dlthub fastapi

Last synced: 17 May 2026

https://github.com/arunkmr08/figma-data-generator-plugin

Populate selected text layers with realistic names, specs, and attributes across FMCG, healthcare, farming, and clothing—without leaving Figma. Keep fonts and styles intact, choose a category, and generate in one click.

company company-details-generator data-generator figma plugin

Last synced: 15 May 2026

https://github.com/nicolasbizzozzero/datagenerator

Randomly generate various commonly used data

data data-generation data-generator data-science

Last synced: 18 Oct 2025

https://github.com/tijmenwierenga/bogusfixturesbundle

A Symfony Bundle for the tijmen-wierenga/bogus library

data-generator dummy-data fixtures symfony-bundle

Last synced: 29 Apr 2026

https://github.com/vesko-vujovic/dummy-data-rust

Data generation writen in rust. This generator will generate users, transaction, payment providers and user adresses.

data-generation data-generator rust

Last synced: 29 Apr 2026

https://github.com/nitor-infotech-oss/bulk-data-creation-accelerator

A component for test data generation using command line arguments that can be dynamically generated using custom excel file, default excel file or command line arguments. Extremely flexible and easy to use with limitless application use.

data-generator faker python3

Last synced: 27 Mar 2025

https://github.com/petitatelier/data-generators

A collection of data generators, to play with in visualization experiments

data-generator data-visualization

Last synced: 13 Oct 2025