Projects in Awesome Lists tagged with dataprep
A curated list of projects in awesome lists tagged with dataprep .
https://github.com/sfu-db/dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
apis apiwrapper cleaning connector data-exploration data-science datacleaning dataconnector dataprep datapreparation eda exploratory-data-analysis webconnector
Last synced: 14 May 2025
https://github.com/aryn-ai/sycamore
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
ai dataprep etl information-retrieval llm ml nlp opensearch search semantic-search
Last synced: 10 Mar 2026
https://github.com/sfu-db/apiconnectors
A curated list of example code to collect data from Web APIs using DataPrep.Connector.
configfile connector datacollection dataconnector dataprep example webapis webdata
Last synced: 05 Jul 2025
https://github.com/albertovpd/automated_etl_google_cloud-social_dashboard
A dashboard is worth a thousand words => https://datastudio.google.com/reporting/755f3183-dd44-4073-804e-9f7d3d993315
bigquery-table cloud-functions cloud-scheduler cloud-storage dashboard data-studio dataprep etl etl-jobs etl-pipeline gdelt google-cloud google-cloud-platform google-trends python sql twitter-api
Last synced: 07 Apr 2025
https://github.com/victorcouste/google-cloudfunctions-dataprep
Google Cloud Functions examples for Google Cloud Dataprep
api api-rest bigquery cloud-functions cloudfunctions-dataprep dataprep dataprep-job google-bigquery google-cloud-dataprep google-sheet trifacta
Last synced: 20 Jul 2025
https://github.com/ms8909/dptron
mltrons dptron: Dirty Data in, Clean Data Out!
data dataprep datapreparation datascience datascience-machinelearning
Last synced: 10 Apr 2025
https://github.com/realkinetic/gcp-dataflow-gcf-trigger
Trigger a Dataflow job when a file is uploaded to Cloud Storage using a Cloud Function
dataprep gcp gcp-cloud-functions gcp-dataflow gcp-storage python
Last synced: 07 Aug 2025
https://github.com/twsl/china-pm2.5
Time series regression with LSTMs predicting PM2.5 concentration in China
dataprep jupyter-notebook lstm mlflow optuna python tf-keras
Last synced: 08 Oct 2025
https://github.com/sukanyabag/gcp-ai-notebooks
This repository contains all practice notebooks with which I performed hands-on labs in Google Cloud Training Program's "Cloud ML-AI Track"
bigquery cloudml-samples data-science dataprep tensorflow-tutorials
Last synced: 08 Feb 2026
https://github.com/realkinetic/gcp-dataprep-gcf-trigger
Trigger a Dataprep job when a file is uploaded to Cloud Storage using a Cloud Function
dataprep gcp gcp-cloud-functions gcp-storage python
Last synced: 29 Oct 2025
https://github.com/sanjana-bongale/eda_and_experiment_tracking_in_mlops_using_dataprep_and_neptune.ai
Titanic dataset analysis using DataPrep for data cleaning and Neptune.ai for experiment tracking. It includes exploratory data analysis (EDA), feature engineering, and model evaluation for predictive insights.
dataprep eda experiment-tracking jupyter-notebook mlops neptune-ai python
Last synced: 15 May 2025
https://github.com/kmohamedalie/autoeda-with-python
Creating quick visualizations and summary statistics using python
autoviz dataprep dtale melbourne-housing palmer-penguin sweetviz ydata-profiling
Last synced: 22 Feb 2025
https://github.com/adadalshabab/automated-data-analysis-using-python-libraries
Automated Libraries like : DataPrep, AutoViz, SweetViz, Klib, Dtale, Pandas Profiling are used here to help succeed in data analysis endeavors. Happy automating!
autoviz dataprep dtale-library klib pygwalker sweetviz
Last synced: 29 Dec 2025
https://github.com/ngupta23/data_prep_helper
A helper package for preparing and combining data from a variety of sources
data data-science dataprep datapreparation dataprocessing helpers python
Last synced: 03 Apr 2025
https://github.com/miozilla/dataprep-alteryx
dataprep-alteryx :eight_spoked_asterisk: : Political & Election # DataPrep # Alteryx # Trifacta # Wrangle # Recipe
alteryx-designer data-analytics data-cleansing data-wrangling dataprep recipe trifacta
Last synced: 29 Aug 2025
https://github.com/shivani0126/resturant_rating_analysis
Restaurant ratings Analysis is a project where real consumers from 2012, including additional information about each restaurant and their cuisines, and each consumer and their preferences are visualised through Power BI dashboard.
dashboard data-visualization dataanalysis datamodeling dataprep dax-functions powerbi
Last synced: 27 Jan 2026
https://github.com/sweta-kaundilya/adventureworks-cycles-powerbi-project
This project was completed to simulate real-world tasks that data professionals encounter every day on the job.
dashboarddesign data-visualization datamodeling dataprep dax exploratory-data-analysis powerbi powerquery
Last synced: 08 Mar 2026
https://github.com/data-integrations/xml-directives
Collection of XML directives
cask-marketplace cdap cdap-plugin dataprep directory udd xml
Last synced: 25 Oct 2025