Projects in Awesome Lists tagged with data-generator
A curated list of projects in awesome lists tagged with data-generator.
https://github.com/bchavez/bogus
:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.
bogus c-sharp csharp data data-access-layer data-generator database dotnet fake faker generator poco seed test-data
Last synced: 03 Nov 2025
https://github.com/easy-mock/easy-mock
A persistent service that generates mock data quickly and provides a visualization view.
data-generator easy-mock javascript mock swagger vue
Last synced: 14 May 2025
https://github.com/bchavez/Bogus
:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.
bogus c-sharp csharp data data-access-layer data-generator database dotnet fake faker generator poco seed test-data
Last synced: 16 Mar 2025
https://github.com/boo1ean/casual
Fake data generator for javascript
data-generator faker generator javascript
Last synced: 13 May 2025
https://github.com/hitsz-ids/synthetic-data-generator
SDG is a specialized framework designed to generate high-quality structured tabular data.
agent data-generator deep-learning gan generative-ai llm machine-learning privacy synthetic-data tabular-data
Last synced: 13 May 2025
https://github.com/benkeen/generatedata
A powerful, feature-rich, random test data generator.
data data-generation data-generator data-generators human-data json random random-generation randomization rest-api test-data test-data-generator testing
Last synced: 13 May 2025
https://github.com/robotwin-Platform/robotwin
RoboTwin 2.0 Official Repo
benchmark data-generator embodied-ai robotics
Last synced: 07 May 2026
https://github.com/elixirs/faker
Faker is a pure Elixir library for generating fake data.
data data-generator database developer-tools dummy elixir fake-content faker generator hacktoberfest phoenix qa seed seeding test testing testing-tools
Last synced: 12 May 2025
https://github.com/instancio/instancio
A library that creates fully populated objects for your unit tests.
data-generator java junit junit-jupiter random random-generation test-automation test-data-generator testing unit-testing
Last synced: 11 Jan 2026
https://github.com/nomemory/mockneat
MockNeat - the modern faker lib.
arbitrary-data big-data csv data-generation data-generator fake-data faker faker-generator faker-library java java-8 lorem-ipsum mocking random-generation random-number-generators randomization randomizer sample-data sample-data-generator sql-insert
Last synced: 13 Apr 2025
https://github.com/afshinea/keras-data-generator
Template for data generator in Keras
Last synced: 09 Apr 2025
https://github.com/drewhamilton/poko
A Kotlin compiler plugin that generates equals, hashCode, and toString for plain old Kotlin objects in public APIs.
data-api-generator data-class data-generator extra-care ir kotlin-compiler-plugin
Last synced: 06 Apr 2025
https://github.com/lucapette/fakedata
CLI utility for fake data generation
cli-utilities data-generator fake-data fakedata test-data test-data-generator testing-tools
Last synced: 22 Jul 2025
https://github.com/finos/datahelix
The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
data-engineering data-generation data-generator java test-data-generator
Last synced: 27 Feb 2025
https://github.com/tinybirdco/mockingbird
Mockingbird is a mock streaming data generator
data-generation data-generator fakerjs generator http kafka streaming-data tinybird typescript
Last synced: 16 May 2025
https://github.com/aleksandarskrbic/khaos
Kafka data generator and load testing tool - generate fake messages, simulate producers and consumers, test broker failures, and run chaos engineering scenarios
apache-kafka chaos-engineering data-engineering data-generator devops devops-tools fake-data fault-injection kafka load-testing python stream-processing stress-testing
Last synced: 13 Jan 2026
https://github.com/pierluigiferrari/data_generator_object_detection_2d
A data generator for 2D object detection
data-augmentation data-generator image-transformations object-detection
Last synced: 13 May 2025
https://github.com/rom-rb/rom-factory
Data generator with support for persistence backends
Last synced: 08 Apr 2025
https://github.com/Tynamix/ObjectFiller.NET
The .NET ObjectFiller fills the properties of your .NET objects with random data
c-sharp data-generator test-automation unittest
Last synced: 18 Apr 2025
https://github.com/nathanchapman/mayonnaise.js
🎺 Fake data generator for JavaScript, courtesy of Patrick Star
casual data-generator faker generator javascript lorem-ipsum mocking placeholder placeholder-text spongebob
Last synced: 11 Apr 2025
https://github.com/dmey/synthia
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
augmentation climate copula data-augmentation data-generation data-generator data-modelling data-science dependency-analysis dependency-modeling finance fpca functional-data machine-learning oversampling principal-component-analysis statistics synthetic-data weather xarray
Last synced: 01 Feb 2026
https://github.com/smartcat-labs/ranger
Ranger is a contextual data generator for producing sensible data for integration tests or for experimenting with in a database
contextual-data data-generation data-generator test-data
Last synced: 12 Aug 2025
https://github.com/edyan/neuralyzer
Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)
anonymisation anonymization anonymize data-generation data-generator data-privacy database dgpr private-life rgpd
Last synced: 04 Apr 2025
https://github.com/doktormike/dammmdatagen
Marketing Mix Modeling Data Generator
benchmark data data-generator marketing-mix-modeling
Last synced: 29 Jul 2025
https://github.com/DoktorMike/dammmdatagen
Marketing Mix Modeling Data Generator
benchmark data data-generator marketing-mix-modeling
Last synced: 06 May 2025
https://github.com/bhdicaire/datalossprevention
Data Loss Prevention (DLP) Sample Data Files
data-exfiltration data-generator data-loss-prevention data-structures dlp fake fake-data faker generator mock-data mock-data-generator test-data
Last synced: 27 Jan 2026
https://github.com/cliffano/datagen
Multi-process test data files generator
Last synced: 22 Sep 2025
https://github.com/yuriyivon/databasebenchmark
A universal database query benchmark tool
benchmark benchmarking data-generation data-generator database-benchmarking
Last synced: 15 Apr 2025
https://github.com/navchandar/python-random-name-generator
Python data provider module that returns random people names, addresses, state names, country names as output. Useful for unit testing and automation.
data-generator python-random random randomdatagenerator sampledataset samples test-data test-data-generator testing
Last synced: 21 Aug 2025
https://github.com/matheusfelipeog/fordev
Generate and validate random data with fordev 🎲
4devs 4devs-api 4devs-module api data-generator data-manipulation data-validation fake-data fake-data-generator fordev fourthdev python random-data scrapping
Last synced: 29 Jun 2025
https://github.com/huda-lab/synner
Generating Realistic Synthetic Data
angularjs chi d3 data-generator gui hci research research-paper research-project sketches user-experience user-interface visualization
Last synced: 02 May 2025
https://github.com/xushiyan/kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
data-generator etl etl-pipeline integration-test java kafka kafka-connect performance-test
Last synced: 22 Mar 2025
https://github.com/travvy88/documentgenerator_doge
Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, and paragraphs with different formatting and fonts. Can be used for OCR, document transformer pretraining, text detection, and other tasks.
ai bounding-boxes data-generator dataset document document-generation document-generator ocr synthetic-data synthetic-dataset-generation
Last synced: 04 Oct 2025
https://github.com/Travvy88/DocumentGenerator_DoGe
Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, and paragraphs with different formatting and fonts. Can be used for OCR, document transformer pretraining, text detection, and other tasks.
ai bounding-boxes data-generator dataset document document-generation document-generator ocr synthetic-data synthetic-dataset-generation
Last synced: 25 Jul 2025
https://github.com/yuja201/here-is-dummy
A desktop app that analyzes your database schema and generates high-quality dummy data automatically.
ai-generator automation data-generator data-tools database-schema database-testing desktop-app dummy-data electron faker mysql nodejs open-source postgresql react sql-generator typescript
Last synced: 09 Jun 2026
https://github.com/cisco-open/test-telemetry-generator
OpenTelemetry data generator to simplify testing of your platform
data-generator opentelemetry testing-library
Last synced: 11 Sep 2025
https://github.com/synthesized-io/tdk-demo
This is a collection of TDK demo projects that use different databases and options
data-generation data-generator db2 mysql oracle postgresql synthetic-data synthetic-dataset-generation test-data-generator vault
Last synced: 24 Jun 2025
https://github.com/tarantool/sdvg
Synthetic Data Values Generator
csv-generator data data-generation data-generator generation generator http-generator parquet-generator random-data random-data-generation synthetic-data synthetic-data-generation synthetic-dataset-generation test-data test-data-generator
Last synced: 12 Jan 2026
https://github.com/crocs-muni/cryptostreams
Tool for generation of data from cryptoprimitives (block and stream ciphers, hash functions). Cryptoprimitives are round-reduced and the data can be configured for multiple testing scenarios.
block-ciphers cryptography data-generator hash-functions stream-ciphers
Last synced: 31 Jan 2026
https://github.com/famouswolf/randomdata
TYPO3 extensions to generate new random data or replace existing data with random data
anonymization data-generator extension faker random random-generator typo3 typo3-extension
Last synced: 30 Jun 2025
https://github.com/ma7555/kerasgen
A Keras/Tensorflow compatible image data generator for TripletLoss
data-generation data-generator data-generators data-science keras keras-tensorflow tensorflow triplet triplet-loss triplet-neural-network
Last synced: 11 Mar 2025
https://github.com/lycantropos/hypothesis_geometry
`hypothesis` strategies for geometries
data-generator geometry hypothesis quickcheck testing
Last synced: 30 Apr 2025
https://github.com/marcovisibelli/synthetico
general-purpose synthetic data generators to enable data science experiments
ai data-generator learning machine syntetic
Last synced: 11 May 2025
https://github.com/benedekrozemberczki/hullcoverconditionedunitdiskgraph
A generator for unit disk graphs conditioned on concave hull cover.
data data-generator data-science data-visualization deep-learning fun funny graph graph-clustering graph-embedding graph-visualization hull-cover joke machine-learning network-visualization networkx node-embedding non-planar-graph synthetic unit-disk-graph
Last synced: 06 Jul 2025
https://github.com/chenanton/ai-rubiks-cube-solver
A program which generates a sequence of cube rotations learned from a deep neural network, solving a scrambled Rubik's Cube.
data-generator keras machine-learning matplotlib-pyplot neural-network python rubiks-cube rubiks-cube-solver tensorflow2
Last synced: 10 Oct 2025
https://github.com/thatsinewave/discord-identity
Discord profile identity generator that enables users to generate up to 100 quadrillion unique Discord profiles
data-generator discord discord-bot discord-data discord-info discord-profile discordbot fun github-pages github-pages-site good-first-contribution good-first-issue good-first-issues good-first-pr-first-contribution good-first-project javascript one-page-website profile-generator profilegenerator thatsinewave
Last synced: 02 Feb 2026
https://github.com/marcosrivasr/json-generator
Tool for generating random JSON data to test your apps
api-sample data-generator json typescript
Last synced: 20 Jul 2025
https://github.com/imshubhamsingh/test-data-generator
A simple fake Data Generator web app to be used by developers | As of now API is functional
api data-generator expressjs nodejs vuejs2
Last synced: 04 Mar 2026
https://github.com/ashtonav/sydney-personal-details-generator
App that can create a large amount of randomly generated personal details, including: address, full name, phone number, date of birth, gender, and email.
address-generator csv data-generator dummy-data dummy-data-generator fake-data fake-data-generator identity-generator phone-number-generator random-data random-data-generation random-data-generator sample-data sample-data-generator sql sydney test-data test-data-generator
Last synced: 02 Oct 2025
https://github.com/shemmjunior/bandia
Tanzania fake random data generator
data-generator faker-generator mock-data tanzania
Last synced: 10 Jun 2026
https://github.com/ntdls/datarandomizer-sql-clr
Easily generate random human readable data using SQL Server’s SQL CLR.
Last synced: 14 Apr 2025
https://github.com/hrolive/unreal-engine-for-remote-visualization-and-machine-learning
In-depth training on using Unreal Engine as a data generator and integrating it into a simple ML workflow, at one of the leading supercomputing centres.
data-generator hpc machine-learning synthetic-data synthetic-dataset-generation unreal-engine webrtc
Last synced: 27 Jun 2025
https://github.com/kevingimbel/fakedata_generator
🦀 Rust library to generate fake data
data-generator fake-data fakedata generator library random rust test-data-generator testing-tool
Last synced: 22 Apr 2025
https://github.com/ruivieira/timeseries-mock
A flexible data simulator for Kafka and OpenShift using state-space models
data-generator kafka openshift random simulator state-space-model streaming
Last synced: 21 Jul 2025
https://github.com/samir-araujo/faker-es6
A heavily inspired lib to generate massive amounts of realistic fake data. This lib was inspired by what I would like to see in Marak/faker.js plus what I thought could be a good exercise
data data-generator es6 fake fake-content faker faker-es6 generator javascript jest mock mocking typescript
Last synced: 25 Dec 2025
https://github.com/datahappy1/dummy_file_generator
Dummy csv, flat text or json files generator written in Python 3.7
csv data-generation data-generator dummy-csv dummy-json dummy-text file-generator flat-file json python python3 txt-files
Last synced: 12 Apr 2025
https://github.com/ableneo/liferay-db-setup-core
OSGi bundle to generate Liferay Portal data (permissions, roles, sites, pages etc.). Takes schemed XML declaration as an input and creates database entries accordingly.
data-generator db-migration declarative liferay liferay-7 liferay-73 liferay-dxp liferay-portal liferay-portlet liferay7 liferay71 liferay73 xml-configuration xml-schema
Last synced: 30 Apr 2025
https://github.com/gabrielcrackpro/fake-identity-generator
Web application that provides a fake identity with all the needed info
data-generator identity-generator webapp
Last synced: 03 Jan 2026
https://github.com/kg-construct/krown
KROWN 👑: A Benchmark for RDF Graph Materialization
benchmark data-generator execution-framework materialization rdf rml
Last synced: 12 Jan 2026
https://github.com/EDS-APHP-legacy/pySyntheticDatasetGenerator
Generate a fictitious relational dataset from a simple YAML description
data-generator database faker generator synthetic-data
Last synced: 02 May 2025
https://github.com/edde746/random-ip-generator
Python package to randomly generate IP by country
data-generator ip-address python
Last synced: 14 Mar 2025
https://github.com/fpt-thaituan/transfer-learning-use-inception-v3-for-image-classification
Transfer Learning uses Inception v3 to classify human and horse images with 99.39% accuracy.
classification-image cnn data-generator deep-learning human-and-horse inceptionv3 tensorflow transfer-learning
Last synced: 02 Aug 2025
https://github.com/vsfedorenko/kotidgy
Kotidgy aka "Kotlin Text Indexed Data Generator" is an index-based text data generator written in Kotlin
bash data-generator kotlin library text-generator
Last synced: 27 Mar 2025
https://github.com/olafwrieden/telemetry-data-generator
A repository for simulating device telemetry with an Azure Function and feeding it into an Azure Event Hub.
azure data-generator iot ot telemetry
Last synced: 01 May 2026
https://github.com/psfried/dgen
Generate evil test data
csv data data-generation data-generator language testing-tools
Last synced: 18 Mar 2025
https://github.com/av/dtg
Data generation tool
data data-engineering data-generator dtg generator node nodejs npm-module npm-package random-generation utility
Last synced: 10 Apr 2026
https://github.com/randomgamingdev/mc_block_color_mapper
Python scripts & libraries for generating and mapping the average colors for each of the Minecraft blocks
average average-calculator cli data data-generator documented-api extract extract-data extractor fast minecraft python3 simple small texture texture-pack textures
Last synced: 22 May 2026
https://github.com/programandoconro/chess-random-moves
Generate random positions and random moves by piece in Chess
agent-based-simulation app chess chess-move chess-position data-generator game javascript random random-generation random-positions-chess randomization reactjs
Last synced: 16 Apr 2025
https://github.com/wklee610/datapush
MySQL Data Generator
automatic data data-generator database dataset db dynamic generator mysql test testing-tools
Last synced: 25 Jan 2026
https://github.com/tomzx/handwriting-recognition-data-generator
A data generator of images to train a program to do handwriting recognition.
data-generator glyphs handwriting-recognition
Last synced: 23 Feb 2025
https://github.com/jongirard/unique_names_generator
A Unique Names Generator built in Elixir
data data-generator elixir elixir-lang fake-data name-generator phoenix seed
Last synced: 21 Oct 2025
https://github.com/r3dhulk/lets-fake-it
combination of tools using python faker module
data-generator fake fake-data faker python python-3 python-script python3
Last synced: 25 Oct 2025
https://github.com/kaos599/apollo-synthetic-data-generator
Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.
data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui
Last synced: 22 Jul 2025
https://github.com/sayjava/graphql-sample
Zero Coding, ⚡ Rapid GraphQL Sample Data Generator and API
data-generator faker graphql prototype sample-data
Last synced: 28 Apr 2026
https://github.com/abuzar-alvi/generate-dummy-data-in-mongo-db
This back-end web project can help beginners to improve their web development skills.
backend css data-generator dummy-data ejs javascript mangodb nodejs
Last synced: 07 May 2026
https://github.com/kevcui/fakergen
:tophat: Generate JSON/YAML/XML... mock data with a structured template
data-generator devops devops-tools fake-generator faker faker-generator faker-providers json json-data json-generator json-schema json-template mock mock-data mock-json testing testing-tool testing-tools xml yaml
Last synced: 05 May 2026
https://github.com/Lightning-Chart/xydata
A data generator library.
data-generator javascript npm-package random-generation typescript
Last synced: 12 May 2025
https://github.com/andredobbss/fakedatamaker
Fake Data Maker is a Blazor WebAssembly application for fast, highly customizable fake data generation. It supports multiple languages and economic sectors, with export to .csv, .xlsx, and .sql. Ideal for testing, prototyping, and data simulation scenarios.
blazor blazor-webassembly bogus closedxml csharp csvhelper data-generator dotnet fake-data faker mock-data mocking mudblazor multilanguage sqlserver ui-generator wasm
Last synced: 04 Mar 2026
https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python
A Python project made by the author to improve Python skills.
card data data-generator employee python
Last synced: 03 Feb 2026
https://github.com/devathul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 09 May 2026
https://github.com/jimbuck/stuffd
Like a turkey. :turkey:
cli data-generator hacktoberfest node-module random typescript
Last synced: 18 Feb 2026
https://github.com/cobluestars/dataherd-raika
"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."
big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience
Last synced: 02 Jan 2026
https://github.com/natlee/face-wall-generator-tool
This is a tool for generating face wall to verify some AI models.
data-generator face-recognition python3
Last synced: 15 Mar 2025
https://github.com/victor-antoniassi/day-1_sales_data_generator
Generate daily sales data for the Chinook database. Perfect for building data engineering portfolios with realistic, continuously updating transactional data. Supports D-1 batch processing patterns used in production.
batch-processing chinook-database data-engineering data-generator neon neondb postgresql sample-database synthetic-data
Last synced: 16 May 2026
https://github.com/abhiramborige/yolodatagenerator
Data generation with labelling made easy with numpy, cv2 and Albumentations
data-generator labeling object-detection synthetic-data-generation synthetic-images yolo-data-preprocessing yolo-dataset yolov5
Last synced: 13 Jun 2025
https://github.com/gaizkiaadeline/rock-paper-scissor-image-classification
This image classification project focuses on classifying images of rock, paper, and scissors gestures using machine learning techniques. The model achieved an impressive validation accuracy of 98.86%, indicating its effectiveness in accurately classifying hand gestures.
callback data-generator image-augmentation image-classification sequential-models
Last synced: 02 Aug 2025
https://github.com/sim98b/tabulardatageneration
Synthetic Data Generation: Tabular & Medical Imaging. A comprehensive project focused on generating synthetic data for tabular datasets and medical imaging.
ai-research artificial-intelligence brain-tumor breast-cancer data-generator data-science dataset-augmentation deep-learning gan generative-model github-projects machine-learning medical-imaging open-source pytorch synthetic-data tabular-data
Last synced: 18 May 2026
https://github.com/mihirsoni/elk-nginx-log-shipper
Log shipper
data-generator elk log-generator opendistro
Last synced: 09 May 2026
https://github.com/kevindeyne/vardogr
Vardøgr is a CLI that can push production-like data to test environments securely and at scale
cli data-generation data-generator database mariadb mysql postgresql scrambled-data
Last synced: 12 Apr 2026
https://github.com/jesufemi-o/fake-coy-api
Dummy API for exploring the dlt REST API. Has authentication, pagination, and filtering enabled.
data-engineering data-generator dlthub fastapi
Last synced: 17 May 2026
https://github.com/DevAthul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 22 Jun 2025
https://github.com/esotericenderman/scp-secret-laboratory-translations-generator
A piece of code to generate updated SCP:SL translations.
config config-generator configuration-generator configuration-management data-generation data-generator generation scp scp-foundation scp-secret-laboratory scp-sl scpsl translation translation-files translation-management translations
Last synced: 09 Jul 2025
https://github.com/arunkmr08/figma-data-generator-plugin
Populate selected text layers with realistic names, specs, and attributes across FMCG, healthcare, farming, and clothing—without leaving Figma. Keep fonts and styles intact, choose a category, and generate in one click.
company company-details-generator data-generator figma plugin
Last synced: 15 May 2026
https://github.com/nicolasbizzozzero/datagenerator
Randomly generate various commonly used data
data data-generation data-generator data-science
Last synced: 18 Oct 2025
https://github.com/tijmenwierenga/bogusfixturesbundle
A Symfony Bundle for the tijmen-wierenga/bogus library
data-generator dummy-data fixtures symfony-bundle
Last synced: 29 Apr 2026
https://github.com/vesko-vujovic/dummy-data-rust
Data generator written in Rust. It generates users, transactions, payment providers, and user addresses.
data-generation data-generator rust
Last synced: 29 Apr 2026
https://github.com/nitor-infotech-oss/bulk-data-creation-accelerator
A command-line component for test data generation. Data can be generated dynamically from a custom Excel file, a default Excel file, or command-line arguments. Flexible and easy to use across many applications.
Last synced: 27 Mar 2025
https://github.com/petitatelier/data-generators
A collection of data generators, to play with in visualization experiments
data-generator data-visualization
Last synced: 13 Oct 2025