Projects in Awesome Lists tagged with data-exploration
A curated list of projects in awesome lists tagged with data-exploration .
https://github.com/kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
data-analysis data-exploration dataframe matplotlib pandas plotly tableau tableau-alternative visualization
Last synced: 09 Sep 2025
https://github.com/data-centric-ai-community/fg-data-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
big-data-analytics data-analysis data-exploration data-profiling data-quality data-science deep-learning eda exploration exploratory-data-analysis hacktoberfest html-report jupyter jupyter-notebook machine-learning pandas pandas-dataframe pandas-profiling python statistics
Last synced: 08 May 2026
https://github.com/Data-Centric-AI-Community/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
big-data-analytics data-analysis data-exploration data-profiling data-quality data-science deep-learning eda exploration exploratory-data-analysis hacktoberfest html-report jupyter jupyter-notebook machine-learning pandas pandas-dataframe pandas-profiling python statistics
Last synced: 09 Mar 2026
https://github.com/Kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
data-analysis data-exploration dataframe matplotlib pandas plotly tableau tableau-alternative visualization
Last synced: 26 Mar 2025
https://github.com/ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
big-data-analytics data-analysis data-exploration data-profiling data-quality data-science deep-learning eda exploration exploratory-data-analysis hacktoberfest html-report jupyter jupyter-notebook machine-learning pandas pandas-dataframe pandas-profiling python statistics
Last synced: 16 Jan 2026
https://github.com/kanaries/rath
Next generation of automated data exploratory analysis and visualization platform.
augmented-analytics automated-data-analysis automated-visualization autovis causal-discovery causal-inference causality data-analysis data-exploration data-visualization datamining eda k6s kanaries machine-learning tableau tableau-alternative visualization
Last synced: 14 May 2025
https://github.com/Kanaries/Rath
Next generation of automated data exploratory analysis and visualization platform.
augmented-analytics automated-data-analysis automated-visualization autovis causal-discovery causal-inference causality data-analysis data-exploration data-visualization datamining eda k6s kanaries machine-learning tableau tableau-alternative visualization
Last synced: 04 Apr 2025
https://github.com/fbdesignpro/sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
data-analysis data-exploration data-profiling data-science data-visualization eda exploration exploratory-data-analysis machine-learning pandas pandas-dataframe python statistics
Last synced: 14 May 2025
https://github.com/sfu-db/dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
apis apiwrapper cleaning connector data-exploration data-science datacleaning dataconnector dataprep datapreparation eda exploratory-data-analysis webconnector
Last synced: 14 May 2025
https://github.com/hi-primus/optimus
:truck: Agile Data Preparation Workflows madeย easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
big-data-cleaning bigdata cudf dask dask-cudf data-analysis data-cleaner data-cleaning data-cleansing data-exploration data-extraction data-preparation data-profiling data-science data-transformation data-wrangling machine-learning pyspark spark
Last synced: 14 May 2025
https://github.com/opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
alerting bigdata data-catalog data-discovery data-engineering data-exploration data-governance data-lineage data-observability data-pipelines data-platform data-profiling data-quality data-science datacatalog lineage metadata metadata-management observability oss
Last synced: 02 Apr 2026
https://github.com/cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
computer-vision data-centric-ai data-exploration data-profiling data-quality data-science data-validation deep-learning exploratory-data-analysis image-analysis image-classification image-generation image-quality image-segmentation
Last synced: 06 Jan 2026
https://github.com/comet-ml/kangas
๐ฆ Explore multimedia datasets at scale
data-analysis data-exploration dataframe datagrid machine-learning
Last synced: 14 May 2025
https://github.com/abhayspawar/featexp
Feature exploration for supervised learning
data-exploration data-science feature-engineering machine-learning visualization
Last synced: 14 Jan 2026
https://github.com/keen/explorer
Data Explorer by Keen - point-and-click interface for analyzing and visualizing event data.
analysis analytics analytics-api charts data-exploration data-visualization dataviz keen-io native-analytics web-analytics
Last synced: 15 May 2025
https://github.com/polyaxon/datatile
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
dask data-exploration data-profiling data-quality data-quality-checks data-science data-visualization dataframes dataops explainable-ai matplotlib mlops pandas pandas-summary plotly pytorch spark statistics tensorflow tracking
Last synced: 17 Aug 2025
https://github.com/polyaxon/traceml
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
dask data-exploration data-profiling data-quality data-quality-checks data-science data-visualization dataframes dataops explainable-ai matplotlib mlops pandas pandas-summary plotly pytorch spark statistics tensorflow tracking
Last synced: 12 Dec 2025
https://github.com/boxuancui/DataExplorer
Automate Data Exploration and Treatment
cran data-analysis data-exploration data-science eda r r-package rstats visualization
Last synced: 30 Jul 2025
https://github.com/marmotdata/marmot
Marmot helps teams discover, understand, and leverage their data with powerful search and lineage visualisation tools. It's designed to make data accessible for everyone.
bigdata data-catalog data-collaboration data-discovery data-exploration data-governance data-lineage data-observability datacatalog datadiscovery dataengineering lineage mcp mcp-server metadata
Last synced: 09 Apr 2026
https://github.com/infuseai/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 10 Apr 2025
https://github.com/puchaczov/musoq
SQL Syntax without any database
ai-assisted-queries cross-platform csharp csv data-analysis-sql data-exploration data-processing dotnet dotnet-core dotnetcore file-system plugin-architecture query-language sql text-processing
Last synced: 13 Mar 2026
https://github.com/InfuseAI/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 18 Apr 2025
https://github.com/desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data
Last synced: 22 Nov 2025
https://github.com/Desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data
Last synced: 03 Apr 2025
https://github.com/panel-extensions/panel-graphic-walker
A project providing a Graphic Walker Pane for use with HoloViz Panel.
business-intelligence data data-analysis data-app data-exploration data-mining data-visualization eda holoviz-panel low-code notebook pivot-table python tableau tableau-alternative vega vega-lite visualization
Last synced: 20 Oct 2025
https://github.com/tkrabel/edaviz
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
altair data-analysis data-exploration data-sciene data-visualization eda edaviz exploratory-data interactive jupyter-notebook matplotlib pandas plotly project-jupyter pyhon qgrid seaborn
Last synced: 13 Jun 2025
https://github.com/rolkra/explore
R package that makes basic data exploration radically simple (interactive data exploration, reproducible data science)
data-exploration data-visualisation decision-trees eda r rmarkdown shiny tidy
Last synced: 08 Apr 2025
https://github.com/grafana-toolbox/grafana-wtf
Grep through all Grafana entities in the spirit of git-wtf.
data-exploration grafana grafana-api grafana-client grafana-dashboard grafana-datasource grafana-search grafana-toolbox grafana-utils kotori-daq search search-and-replace search-in-text search-replace sysadmin-tool toolbox
Last synced: 16 May 2025
https://github.com/tvdboom/atom
Automated Tool for Optimized Modelling
automl dagshub data-exploration data-pipeline data-science interactive-visualizations machine-learning mlflow model-predictions modelling python scikit-learn shap visualization
Last synced: 06 Apr 2025
https://github.com/virajbhutada/bi-projects-collection
Discover a curated collection of dynamic Power BI dashboards covering financial analytics, HR metrics, streaming service trends, real estate dynamics, and more. Meticulously designed for comprehensive data exploration, this repository continues to expand with new and impactful visualizations.
analytical-insights data-analytics data-exploration data-visualization dynamic-dashboards healthcare-analysis hr-management powerbi trends-visualization visual-reporting
Last synced: 01 Mar 2026
https://github.com/facultyai/lens
Summarise and explore Pandas DataFrames
dask data-exploration data-science data-visualisation dataframe pandas
Last synced: 14 Apr 2025
https://github.com/observedobserver/pivot-chart
light and fast implementation of web pivot table / pivot chart components.
business-intelligence chart cube data-exploration data-visualization eda excel olap pivot-chart pivot-table pivot-tables react tableau typescript visualization
Last synced: 16 Mar 2025
https://github.com/federicomarini/genetonic
Enjoy your transcriptomic data and analysis responsibly - like sipping a cocktail
bioconductor bioconductor-package data-exploration data-visualization functional-enrichment-analysis gene-expression gui pathway-analysis r reproducible-research rna-seq-analysis rna-seq-data shiny transcriptome transcriptomics user-friendly
Last synced: 13 Apr 2025
https://github.com/renumics/sliceguard
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
data-analysis data-cleaning data-curation data-exploration data-science data-visualization deep-learning eda exploratory-data-analysis machine-learning python visualization
Last synced: 16 Mar 2025
https://github.com/DistrictDataLabs/cultivar
Multidimensional data explorer and visualization tool.
data-analysis data-exploration data-management visualization
Last synced: 15 Apr 2025
https://github.com/yaph/ipython-notebooks
A collection of Jupyter notebooks exploring different datasets.
data-analysis data-exploration data-visualization jupyter-notebook mapping matplotlib pandas python python-tutorials tutorials
Last synced: 08 Sep 2025
https://github.com/districtdatalabs/cultivar
Multidimensional data explorer and visualization tool.
data-analysis data-exploration data-management visualization
Last synced: 01 Feb 2026
https://github.com/evoluteur/kaggle-look-alike
Kaggle Data Explorer UI look-alike built in React.
data data-analysis data-engineering data-exploration data-mining data-platform data-science datascience exploratory-data-analysis explorer front-end frontend kaggle react spa
Last synced: 09 Apr 2025
https://github.com/debiai/debiai
Bias detection and contextual evaluation tool for your AI projects
ai bias contextual-evaluation data-agnostic data-analysis data-exploration machine-learning model-evaluation plotlyjs python visualization vuejs
Last synced: 31 Jul 2025
https://github.com/afraniomelo/kydlib
Routines for exploratory data analysis.
autocorrelation correlation-coefficient data-analysis data-exploration data-science data-visualization eda exploratory-data-analysis gaussian machine-learning noise nonlinear plotting python scatter-plot statistics time-series time-series-analysis visualization
Last synced: 09 Apr 2025
https://github.com/denisotree/tuitab
A fast, keyboard-driven terminal explorer for tabular data. Open CSV, JSON, Parquet, Excel and SQLite files directly in your terminal โ filter, sort, pivot, compute new columns, and visualise distributions without leaving the shell
cli csv data-analysis data-exploration dataframe duckdb excel parquet polars ratatui rust spreadsheet sqlite tabular-data terminal tui vim
Last synced: 15 Jun 2026
https://github.com/DECODEproject/bcnnow
Light, personalized, interactive dashboards for urban data exploration.
data-exploration data-visualization urban-dashboards
Last synced: 02 Apr 2025
https://github.com/nextkore/smartmuv
An EVM-compatible Solidity Smart Contract Storage/Slot Analyzer and Data Extractor.
blockchain-explorer code-analysis data-exploration data-extraction ethereum ethereum-blockchain explorer migrate scanner smart-contracts solidity static-analysis storage storage-analysis tracker upgrade
Last synced: 17 Jul 2025
https://github.com/8080labs/bamboolib_binder_template
bamboolib - template for creating your own binder notebook
binder-jupyter-notebook data-exploration data-science data-transformation data-visualisation data-visualization data-viz docker
Last synced: 12 Apr 2025
https://github.com/darenasc/aeda
Build a data catalog by running a single line of code
data-catalog data-exploration database eda metadata metadata-extraction
Last synced: 05 Mar 2026
https://github.com/NextKore/SmartMuv
An EVM-compatible Solidity Smart Contract Storage/Slot Analyzer and Data Extractor.
blockchain-explorer code-analysis data-exploration data-extraction ethereum ethereum-blockchain explorer migrate scanner smart-contracts solidity static-analysis storage storage-analysis tracker upgrade
Last synced: 12 Aug 2025
https://github.com/ndleah/health-analysis
This case study is contained within the Serious SQL course by Danny Ma
data-analysis data-exploration data-with-danny database serious-sql sql
Last synced: 02 Mar 2025
https://github.com/virajbhutada/us-healthcare-analysis-powerbi
Unlock insights into the U.S. healthcare landscape from 2019 to 2020. Our PowerBI-driven analysis delves into hospital performance, patient outcomes, and payer-provider dynamics. Dive into detailed reports and visualizations for informed decision-making, empowering healthcare stakeholders, and shaping the industry's future.
data-analytics data-exploration data-modeling data-visualization datascience dax-expression decision-making healthcare-analysis healthcare-datasets insights interactive-visualizations microsoftpowerbi power-query powerbi powerbi-dashboards powerbi-desktop strategic-planning
Last synced: 04 Mar 2026
https://github.com/open-edge-platform/annflux
A research tool for exploring and annotating large datasets with Active Learning
active-learning annotation classification clustering data-exploration large-scale machine-learning multilabel-clustering
Last synced: 08 Feb 2026
https://github.com/vidhi1290/deep-learning-for-eeg-emotion-classification
This repository contains a Python code script for performing emotion classification using EEG (Electroencephalogram) data. Emotion classification from EEG signals is an important application in neuroscience and human-computer interaction. The code leverages deep learning techniques to analyze EEG data and predict emotional states.
coorelation data-exploration data-preprocessing data-science data-visualization deep-learning deep-learning-algorithms eeg-emotion-recognition egg-signals emotion-distribution emotion-prediction feature-analysis heatmap human-emotions machine-learning machine-learning-algorithms pie-chart spectral-analysis time-series-visualization
Last synced: 10 Apr 2025
https://github.com/aiguofer/sql_connectors
A simple wrapper for SQL connections using SQLAlchemy and Pandas read_sql to standardize SQL workflow with multiple data sources.
data-analysis data-analytics data-exploration data-science pandas relational-databases sql sqlalchemy standardized-api
Last synced: 13 Oct 2025
https://github.com/mpolinowski/python-scikitlearn-cheatsheet
SciKit Learn Machine Learning Cheat Sheet
cheatsheet data-exploration feature-engineering python scikitlearn-machine-learning sklearn
Last synced: 06 May 2026
https://github.com/ndleah/dvd-rental-marketing-analytics
๐ฅ Email marketing campaign analysis
data-analysis data-exploration data-modelling data-structures data-wrangling database sql
Last synced: 19 Jul 2025
https://github.com/darenasc/auto-fes
Automated exploration of files in a folder structure to extract metadata and potential usage of information.
data-exploration data-profiling data-science eda plain-text python
Last synced: 16 Mar 2025
https://github.com/copyleftdev/x12-edi-tools
A comprehensive set of tools for working with X12 EDI files
data-exploration dental x12 zuub
Last synced: 01 Jul 2025
https://github.com/ideas-lab-nus/eplusr-paper
Data and code for Jia and Chong (2020): Hongyuan Jia and Adrian Chong (2020). eplusr: A framework for integrating building energy simulation and data-driven analytics. (Accepted in Energy and Buildings).
bayesian-calibration data-analysis data-driven-analytics data-exploration energyplus eplusr multi-objective-optimization parametric-analysis r
Last synced: 21 Feb 2026
https://github.com/virajbhutada/US-healthcare-analysis-powerBI
Unlock insights into the U.S. healthcare landscape from 2019 to 2020. Our PowerBI-driven analysis delves into hospital performance, patient outcomes, and payer-provider dynamics. Dive into detailed reports and visualizations for informed decision-making, empowering healthcare stakeholders, and shaping the industry's future.
data-analytics data-exploration data-modeling data-visualization datascience dax-expression decision-making healthcare-analysis healthcare-datasets insights interactive-visualizations microsoftpowerbi power-query powerbi powerbi-dashboards powerbi-desktop strategic-planning
Last synced: 09 Oct 2025
https://github.com/eikevons/pandas-paddles
Access the parent Pandas data frame in loc[], iloc[], assign(), and others Pandas helpers
data-analysis data-exploration data-science pandas pandas-dataframe pandas-library pandas-loc
Last synced: 16 Jun 2025
https://github.com/Nelson-Gon/mde
mde: Missing Data Explorer
data-analysis data-cleaning data-exploration data-science datacleaner datacleaning exploratory-data-analysis missing missing-data missing-value-treatment missing-values missingness omit r r-package r-stats recode replace rstats statistics
Last synced: 30 Jul 2025
https://github.com/adityashrm21/exploratory_data_analysis
A collection of exploratory data analysis techniques and resources
data data-analysis data-exploration data-science data-visualization dataset datasets eda exploratory-data-analysis insights kaggle
Last synced: 29 Apr 2025
https://github.com/nelson-gon/mde
mde: Missing Data Explorer
data-analysis data-cleaning data-exploration data-science datacleaner datacleaning exploratory-data-analysis missing missing-data missing-value-treatment missing-values missingness omit r r-package r-stats recode replace rstats statistics
Last synced: 24 Jul 2025
https://github.com/jgphilpott/polyplot
A data exploration application inspired by Ola Rosling's Trendalyzer software.
d3js data-exploration data-science ola-rosling threejs trendalyzer
Last synced: 11 Jul 2025
https://github.com/juzershakir/creating-customer-segments
Applied Unsupervised Learning techniques on product spending data collected for customers of a wholesale distributor to identify customer segments hidden in the data.
classification customer-segments data-exploration data-visualization feature-selection gaussian-mixture-models k-means-clustering machine-learning-nanodegree machine-learning-tutorials matplotlib numpy pandas pca python scikit-learn seaborn silhouette udacity unsupervised-learning unsupervised-machine-learning
Last synced: 22 Oct 2025
https://github.com/simonblanke/search-data-explorer
Visualize search-data from your gradient-free-optimization run.
dashboard data-exploration data-science matplotlib pandas plotly python statistics streamlit tabular-data visualization
Last synced: 12 Jun 2025
https://github.com/coding-chemist/datalens
A smart dashboard that provides automated insights and visualizations from your data. With just a few clicks, explore trends, statistics, and data quality to make informed decisions effortlessly.
data-cleaning data-exploration datalens matplotlib nltk numpy pandas streamlit
Last synced: 29 Jan 2026
https://github.com/mndrake/cliffnotes
visual summary of an R dataframe
data-exploration data-visualization r
Last synced: 23 Jan 2026
https://github.com/phillipdupuis/mbta-api-playground
Learn about the MBTA V3 API by building queries and exploring the results
data-exploration django django-rest-framework mbta-api pandas pandas-dataframe pandas-profiling python
Last synced: 22 Jan 2026
https://github.com/asifdotexe/covidporfolioproject
This is a SQL + Tableau Project on real world Covid 19 Dataset from the start of recorded case to 2nd March 2022 i.e My birthday XD
dashboard data-analysis data-exploration data-visualization sql sql-server tableau
Last synced: 08 Jun 2026
https://github.com/mpolinowski/hotel-booking-dataset
Python Pandas Dataset Exploration with Hotel Demand Data.
data-exploration hotel-booking pandas python
Last synced: 20 Apr 2026
https://github.com/muneeb706/data-exploration-system
User interface for data exploration
adminlte chartjs css cypress data-exploration django html javascript python tabulator
Last synced: 17 Apr 2026
https://github.com/acook/enumerable_deep_search
Recursively search enumerable objects
data-exploration data-mining nested-objects
Last synced: 05 Jul 2025
https://github.com/cnag-biomedical-informatics/pheno-ranker
Pheno-Ranker is a tool designed for performing semantic similarity analysis on phenotypic data structured in JSON format, such as Beacon v2 Models or Phenopackets v2.
beacon-v2 bff csv data-exploration json phenopackets-v2 pxf semantic-similarity semantic-similarity-measures
Last synced: 29 Apr 2026
https://github.com/cnag-biomedical-informatics/pheno-ranker-ui
The web ui (R-Shiny application) for Pheno-Ranker, a tool designed for performing semantic similarity analysis on phenotypic data structured in JSON format, such as Beacon v2 Models or Phenopackets v2
beacon-v2 clinical-data data-exploration json phenopackets-v2 r semantic-similarity semantic-similarity-measures shiny
Last synced: 13 Oct 2025
https://github.com/cbhihe/visualcity
Multivariate analysis and statistical modeling (with dimensional reduction) of NYC urban life pathologies
bias-detection cluster-analysis correspondence-analysis data-exploration data-science data-visualization dimensionality-reduction geolocation-data google-maps-api independence-tests knn-classification multiple-correspondence-analysis mva principal-component-analysis r
Last synced: 10 Jul 2025
https://github.com/gjbex/python-dashboards
Repository that contains material for training sessions on creating dashboards using Python.
dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization
Last synced: 13 Jul 2025
https://github.com/revogati/ecommerce_consumer_behaviour
This is a Full Data Analytics project From data cleaning, preparation, exploration, Interpretation of insights up to Presentation of findings and recommendations..
data-analysis data-exploration ecommerce jupyter-notebook python sql tableau-public visualization
Last synced: 16 Apr 2026
https://github.com/nelson-gon/nelson-gon.github.io
Biologically Plausible Programming
bioinformatics blog blogdown computational-biology data-analysis data-exploration ghost ghostwriter-theme github github-pages hugo-site hugo-theme programming python3 r side-project
Last synced: 07 Feb 2026
https://github.com/lefteris-souflas/sas-programming-and-machine-learning
Applied SAS techniques for data analysis and machine learning in a milestone project. Base SAS Programming and SAS Viya tools were utilized for preprocessing, customer profiling, sales analysis, promotions, supplier evaluation, and customer segmentation. Results were visualized comprehensively.
customer-profiling data-analytics data-exploration market-basket-analysis pre-processing recency-frequency-monetary sas-machine-learning sas-oda sas-programming sas-studio sas-visual-analytics sas-viya
Last synced: 05 Mar 2026
https://github.com/macdon112/layoff-analysis
SQL data cleaning & analysis of global layoffs
data-analysis data-cleaning data-exploration sql
Last synced: 21 Feb 2026
https://github.com/hcvazquez/data-exploration-in-spark-with-pyspark-sql
Data exploration in spark with pyspark sql
apache-spark data-exploration data-processing pyspark python spark spark-sql
Last synced: 17 May 2026
https://github.com/ahmadrazacdx/sales-data-analysis
A comprehensive data analysis project focusing on sales data exploration, cleaning, and statistical analysis. This includes key insights into sales trends, customer behavior, product categories, delivery times, and seasonality effects. Advanced statistical tests and visualizations are added to identify relationships and make data driven decisions.
data-cleaning data-exploration exploratory-data-analysis feature-engineering hypothesis-testing time-series-analysis time-series-forecasting
Last synced: 31 May 2026
https://github.com/stepandel/pinecone-explorer
Pinecone Explorer for MacOS
data-exploration electron pinecone pineconedb tanstack typescript
Last synced: 27 Jan 2026
https://github.com/daodavid/titanic-exploration-data-science-project
Titanic -applying T-test -exercises-DataCleaning,DataEplorations,Hypotesis,basic text procesing
data-cleaning data-exploration linear-regression t-independent-test titanic
Last synced: 06 Oct 2025
https://github.com/sophy8281/sms-spam-detection
Spam messages detection model
classification data-exploration data-preprocessing data-visualization oversampling-technique undersampling-technique
Last synced: 07 Sep 2025
https://github.com/as16082023/covid-19--data-exploration-
Project exploring COVID-19 data using SQL
covid data-exploration mysql sql
Last synced: 10 Apr 2025
https://github.com/anushadatta/airbnb-in-seattle
๐จ Understanding the Airbnb rental landscape in Seattle using data science.
airbnb data-analysis data-exploration data-visualization datascience sentiment-analysis
Last synced: 13 Jun 2025
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/vidhi1290/hr_employee_prediction
"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.
data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp
Last synced: 13 Apr 2026
https://github.com/mpolinowski/python-dataset-exploration
Python Data Exploration
data-exploration matplotlib-pyplot pandas plotly python seaborn
Last synced: 09 May 2026
https://github.com/lijesh010/netflix_dataset_exploratory_data_analysis_python_project
This repository contains an Exploratory Data Analysis (EDA) Python project on the Netflix dataset. The purpose of this project is to gain insights and better understand the characteristics of the content available on Netflix, including movies and TV shows.
data-analysis data-exploration data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 20 May 2026
https://github.com/hit07/data_science
Data [ Exploration, Cleaning, Manipulation, Visualisation ]
data-analysis data-cleaning data-exploration data-manipulation data-visualization eda jupyter-notebook matplotlib numpy pandas-dataframe scipy
Last synced: 27 Mar 2025
https://github.com/mokeddembillel/student-performance-prediction
Using Machine learning to predict a student final grade
data-analysis data-exploration feature-extraction feature-importance feature-selection linear-regression machine-learning power-bi principal-component-analysis regression spyder student-performance-prediction svm-regressor
Last synced: 15 Mar 2025
https://github.com/shogunbanik18/budgetify
End-to-End Budget Analysis enables effective budgeting through detailed analysis and strategic planning
analysis data data-engineering data-exploration databricks databricks-notebooks etl etl-process python3
Last synced: 09 Jun 2026
https://github.com/willie-conway/global-superstore-data-modeling-analysis
A comprehensive data modeling and analysis project for the ๐Global Super Store, focusing on database design ๐๏ธ, sales data analysis ๐, and interactive visualizations ๐ using MySQL ๐ฅ๏ธ and Tableau ๐.
business-analytics business-intelligence data-exploration data-modeling data-preprocessing data-restructuring data-visualization database-design er-diagram geographic-analysis interactive-dashboard mysql profit-analysis sales-analysis sales-performance sales-trends sql star-schema tableau time-series-analysis
Last synced: 21 Jul 2025
https://github.com/boardgameanalytics/bga-notebooks
Exploratory notebooks using the BGG dataset
data-analysis data-exploration data-visualization ipython-notebook python
Last synced: 28 Jan 2026
https://github.com/jvelezmagic/pandas-missing
A pandas extension to explore and handle missing values.
data-exploration eda missing-data missing-values pandas
Last synced: 14 Apr 2025
https://github.com/lcvriend/laserbeans
Toolbox for data exploration
altair data-exploration data-visualization
Last synced: 15 Mar 2025
https://github.com/rakshit-vasava/medisentiment-bert-lda-nlp-driven-patient-review-analysis
Using NLP techniques like BERT and LDA for sentiment analysis of patient reviews
bert data-exploration data-visualization lda-model machine-learning nlp python semantic-analysis
Last synced: 29 Apr 2026
https://github.com/anastasius21/imdb-movie-analysis
Analysis of IMDb's Top 1000 Movies dataset using Pandas, Matplotlib, and Seaborn. It provides visualizations and insights into various aspects of movies, such as ratings, genres, directors, and release years.
data-analysis data-exploration data-science data-visualization imdb imdb-dataset jupyter-notebook python
Last synced: 25 Apr 2026