An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with prepare-data

A curated list of projects in awesome lists tagged with prepare-data .

https://github.com/winvector/vtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.

categorical-variables machine-learning-algorithms nested-models prepare-data r

Last synced: 15 May 2025

https://github.com/WinVector/vtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.

categorical-variables machine-learning-algorithms nested-models prepare-data r

Last synced: 15 Mar 2025

https://github.com/hi-primus/bumblebee

🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)

bumblebee cudf dask dask-cudf data-cleaning data-preparation data-profiling datasets gpu gui optimus prepare-data python

Last synced: 02 May 2025

https://github.com/neuro-ml/reskit

A library for creating and curating reproducible pipelines for scientific and industrial machine learning

data-preparation grid-search pipeline prepare-data python reproducible-experiments reproducible-research scikit-learn

Last synced: 19 Jul 2025

https://github.com/jesussantana/ibm-data-analysis-with-python-da0101en

This course will take you from the basics of Python to exploring many different types of data.

anova correlation data-analysis model-evaluation numpy pandas prepare-data python regression-models statistics

Last synced: 17 Jul 2025