An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with dataprep

A curated list of projects in awesome lists tagged with dataprep .

https://github.com/sfu-db/dataprep

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.

apis apiwrapper cleaning connector data-exploration data-science datacleaning dataconnector dataprep datapreparation eda exploratory-data-analysis webconnector

Last synced: 14 May 2025

https://github.com/aryn-ai/sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

ai dataprep etl information-retrieval llm ml nlp opensearch search semantic-search

Last synced: 10 Mar 2026

https://github.com/sfu-db/apiconnectors

A curated list of example code to collect data from Web APIs using DataPrep.Connector.

configfile connector datacollection dataconnector dataprep example webapis webdata

Last synced: 05 Jul 2025

https://github.com/ms8909/dptron

mltrons dptron: Dirty Data in, Clean Data Out!

data dataprep datapreparation datascience datascience-machinelearning

Last synced: 10 Apr 2025

https://github.com/realkinetic/gcp-dataflow-gcf-trigger

Trigger a Dataflow job when a file is uploaded to Cloud Storage using a Cloud Function

dataprep gcp gcp-cloud-functions gcp-dataflow gcp-storage python

Last synced: 07 Aug 2025

https://github.com/twsl/china-pm2.5

Time series regression with LSTMs predicting PM2.5 concentration in China

dataprep jupyter-notebook lstm mlflow optuna python tf-keras

Last synced: 08 Oct 2025

https://github.com/sukanyabag/gcp-ai-notebooks

This repository contains all practice notebooks with which I performed hands-on labs in Google Cloud Training Program's "Cloud ML-AI Track"

bigquery cloudml-samples data-science dataprep tensorflow-tutorials

Last synced: 08 Feb 2026

https://github.com/realkinetic/gcp-dataprep-gcf-trigger

Trigger a Dataprep job when a file is uploaded to Cloud Storage using a Cloud Function

dataprep gcp gcp-cloud-functions gcp-storage python

Last synced: 29 Oct 2025

https://github.com/sanjana-bongale/eda_and_experiment_tracking_in_mlops_using_dataprep_and_neptune.ai

Titanic dataset analysis using DataPrep for data cleaning and Neptune.ai for experiment tracking. It includes exploratory data analysis (EDA), feature engineering, and model evaluation for predictive insights.

dataprep eda experiment-tracking jupyter-notebook mlops neptune-ai python

Last synced: 15 May 2025

https://github.com/kmohamedalie/autoeda-with-python

Creating quick visualizations and summary statistics using python

autoviz dataprep dtale melbourne-housing palmer-penguin sweetviz ydata-profiling

Last synced: 22 Feb 2025

https://github.com/adadalshabab/automated-data-analysis-using-python-libraries

Automated Libraries like : DataPrep, AutoViz, SweetViz, Klib, Dtale, Pandas Profiling are used here to help succeed in data analysis endeavors. Happy automating!

autoviz dataprep dtale-library klib pygwalker sweetviz

Last synced: 29 Dec 2025

https://github.com/ngupta23/data_prep_helper

A helper package for preparing and combining data from a variety of sources

data data-science dataprep datapreparation dataprocessing helpers python

Last synced: 03 Apr 2025

https://github.com/miozilla/dataprep-alteryx

dataprep-alteryx :eight_spoked_asterisk: : Political & Election # DataPrep # Alteryx # Trifacta # Wrangle # Recipe

alteryx-designer data-analytics data-cleansing data-wrangling dataprep recipe trifacta

Last synced: 29 Aug 2025

https://github.com/shivani0126/resturant_rating_analysis

Restaurant ratings Analysis is a project where real consumers from 2012, including additional information about each restaurant and their cuisines, and each consumer and their preferences are visualised through Power BI dashboard.

dashboard data-visualization dataanalysis datamodeling dataprep dax-functions powerbi

Last synced: 27 Jan 2026

https://github.com/erik-ingwersen-ey/dev-datatools

Helper functions, to transform Pandas Dataframes.

dataprep datatools etl pandas python

Last synced: 23 Feb 2025

https://github.com/sweta-kaundilya/adventureworks-cycles-powerbi-project

This project was completed to simulate real-world tasks that data professionals encounter every day on the job.

dashboarddesign data-visualization datamodeling dataprep dax exploratory-data-analysis powerbi powerquery

Last synced: 08 Mar 2026