Projects in Awesome Lists tagged with parquet-generator
A curated list of projects in awesome lists tagged with parquet-generator .
https://github.com/tarantool/sdvg
Synthetic Data Values Generator
csv-generator data data-generation data-generator generation generator http-generator parquet-generator random-data random-data-generation synthetic-data synthetic-data-generation synthetic-dataset-generation test-data test-data-generator
Last synced: 12 Jan 2026
https://github.com/the-data-dilemma/parquettohuggingface
ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.
audio-dataset audio-processing automatic-speech-recognition data-analysis data-science dataset healthcare-application huggingface huggingface-datasets pandas parquet parquet-generator python3 speech-data speech-recognition speech-to-text speech-translation
Last synced: 21 Aug 2025
https://github.com/dgtlss/parqbridge
ParqBridge focuses on zero PHP dependency bloat while still producing spec-compliant Parquet files by delegating the final write step to a tiny, embedded Python script using PyArrow (or any custom CLI you prefer). You keep full Laravel DX for configuration and Storage; we bridge your data to Parquet.
laravel laravel-framework laravel-package parquet parquet-files parquet-generator parquet-schema php php8 powerbi python
Last synced: 03 Oct 2025
https://github.com/domvwt/parquet-inspector
A command line tool for inspecting parquet files with PyArrow.
cli parquet parquet-cli parquet-files parquet-generator parquet-tools parquet-viewer
Last synced: 19 Sep 2025
https://github.com/hwywl/business-tools
在开发中积攒下来的业务工具类,方便快速编写业务。
html parquet parquet-generator parquet-tools
Last synced: 25 Feb 2025
https://github.com/pr0mila/parquettohuggingface
ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.
audio-dataset huggingface huggingface-datasets pandas parquet parquet-generator python3 speech-data
Last synced: 15 Apr 2025
https://github.com/munz0908/parqbridge
🌉 Export Laravel database tables to Apache Parquet files effortlessly, using minimal dependencies and a simple artisan command for quick data handling.
laravel laravel-package parquet parquet-files parquet-generator parquet-schema php powerbi python
Last synced: 03 Sep 2025
https://github.com/syedabareehaali/github-repo-metadata-analytics
Jupyter Notebook analyzing GitHub repository metadata using Python, Parquet, Pandas, and DuckDB
analytics github hacktoberfest hacktoberfest-accepted hacktoberfest2025 pandas-python parquet-generator python
Last synced: 06 Nov 2025