An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with parquet-generator

A curated list of projects in awesome lists tagged with parquet-generator .

https://github.com/the-data-dilemma/parquettohuggingface

ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.

audio-dataset audio-processing automatic-speech-recognition data-analysis data-science dataset healthcare-application huggingface huggingface-datasets pandas parquet parquet-generator python3 speech-data speech-recognition speech-to-text speech-translation

Last synced: 21 Aug 2025

https://github.com/dgtlss/parqbridge

ParqBridge focuses on zero PHP dependency bloat while still producing spec-compliant Parquet files by delegating the final write step to a tiny, embedded Python script using PyArrow (or any custom CLI you prefer). You keep full Laravel DX for configuration and Storage; we bridge your data to Parquet.

laravel laravel-framework laravel-package parquet parquet-files parquet-generator parquet-schema php php8 powerbi python

Last synced: 03 Oct 2025

https://github.com/domvwt/parquet-inspector

A command line tool for inspecting parquet files with PyArrow.

cli parquet parquet-cli parquet-files parquet-generator parquet-tools parquet-viewer

Last synced: 19 Sep 2025

https://github.com/hwywl/business-tools

在开发中积攒下来的业务工具类,方便快速编写业务。

html parquet parquet-generator parquet-tools

Last synced: 25 Feb 2025

https://github.com/pr0mila/parquettohuggingface

ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.

audio-dataset huggingface huggingface-datasets pandas parquet parquet-generator python3 speech-data

Last synced: 15 Apr 2025

https://github.com/munz0908/parqbridge

🌉 Export Laravel database tables to Apache Parquet files effortlessly, using minimal dependencies and a simple artisan command for quick data handling.

laravel laravel-package parquet parquet-files parquet-generator parquet-schema php powerbi python

Last synced: 03 Sep 2025

https://github.com/syedabareehaali/github-repo-metadata-analytics

Jupyter Notebook analyzing GitHub repository metadata using Python, Parquet, Pandas, and DuckDB

analytics github hacktoberfest hacktoberfest-accepted hacktoberfest2025 pandas-python parquet-generator python

Last synced: 06 Nov 2025