An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with exploratory-data-visualizations

A curated list of projects in awesome lists tagged with exploratory-data-visualizations .

https://github.com/vega/altair_ally

Altair Ally is a companion package to Altair, which provides a few shortcuts to create common plots for exploratory data analysis.

altair eda exploratory-data-analysis exploratory-data-visualizations vega-lite visualization

Last synced: 14 Dec 2025

https://github.com/squey/squey

Squey is a visualization software designed to interactively explore and understand large amounts of tabular data (this is the read-only mirror of https://gitlab.com/squey/squey)

cybersecurity data-analysis data-science data-visualization exploratory-data-visualizations parallel-coordinates parquet parquet-files parquet-viewer pcap timeseries timeseries-analysis visualization

Last synced: 08 Mar 2025

https://github.com/ksdkamesh99/flight-price-prediction

A Flask based Web Application that Predicts the Flight Price using RandomForestRegressor.Its GUI is based on Swagger API. This is hosted on the Heroku platform.

exploratory-data-analysis exploratory-data-visualizations flask heroku random-forest-regression swagger-ui

Last synced: 12 May 2025

https://github.com/arminpasalic/vectoria

Browser-first text exploration, clustering, and semantic search.

browser cluster-analysis exploratory-data-visualizations llm rag rag-chatbot semantic-search umap-hdbscan

Last synced: 25 Feb 2026

https://github.com/kwokhing/exploratory-data-analysis-on-smrt-tweets

Demo on performing exploratory data analysis (EDA) on train service disruptions based on scrapped (user generated contents) tweets from the train operator's (SMRT) twitter account

data-analysis data-cleaning data-collection data-preparation exploratory-data-analysis exploratory-data-visualizations folium geospatial-data leaflet-map python python3 regex scraping selenium selenium-python social-media text-processing user-generated-content web-scraping webscraping

Last synced: 27 Jul 2025

https://github.com/cosmoduende/r-marvel-vs-dc

DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R

comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros

Last synced: 11 Apr 2025

https://github.com/mgobeaalcoba/exploratory_data_analysis_with_python

Explore and analyze data effectively with Python. This repository offers tools and techniques for conducting insightful exploratory data analysis (EDA) to extract valuable insights.

colab-notebook eda exploratory-data-analysis exploratory-data-visualizations jupyter-notebook models python3

Last synced: 29 Dec 2025

https://github.com/Yacine87/EDA_R_Packages

EDA is a must to do step in the data science workflow. Working on data, wrangling & transforming them is time consuming, and it determine the success degree of the next steps (pre preocessing, modelling, communicating outputs & decision making). This repo will show you how to perform EDA in R using the tidyverse ecosystem, and will introduce a comparative approach between the main packages in R whcich could let you perform automated EDA & generating automated EDA html or pdf reports, ready to be communicated.

dataexplorer dlookr eda exploratory-data-analysis exploratory-data-visualizations explorer hmisc missing-data outliers r rmarkdown rtutor smarteda statistical-tests summarytools tidyverse

Last synced: 30 Jul 2025

https://github.com/vrandezo/thesurroundingocean

An exploratory view on the the Wikidata lexicographic data

exploratory-data-visualizations lexicon-based wikidata

Last synced: 27 Dec 2025

https://github.com/mchenryspagg/prosper-loan-project

A data analysis project that entails using the data from a fictional loan company known as Prosper to perform exploratory data analysis using univariate, bivariate and multivariate visualizations to produce insights that answers questions asked from the data

datastorytelling exploratory-data-analysis exploratory-data-visualizations jupyter-notebook matplotlib-pyplot numpy pandas python seaborn-plots

Last synced: 01 Sep 2025

https://github.com/annennenne/pcadsc

An R package for performing Principal Component Analysis-based Data Structure Comparisons (PCADSC)

data-structures exploratory-data-visualizations principal-component-analysis r

Last synced: 12 Dec 2025

https://github.com/frankelavsky/ligo-virgo-mass-plot

An interactive astrophysics project, exploring the masses of dead stellar objects (black holes and neutron stars). I used d3.js, a touch of jquery, flowtype, and advanced SVG techniques (in vanilla javascript) for this project.

astronomy astrophysics client-side css d3 d3js data-visualization exploratory-data-visualizations frontend frontend-app gravitational-waves html interactive interactive-visualizations javascript modular single-page-app svg svg-filters visualization

Last synced: 27 Dec 2025

https://github.com/jds485/geothermal_esda

This repository contains exploratory spatial data analysis (ESDA) functions and scripts. These functions are designed for geothermal spatial datasets, and are applicable to other spatial datasets.

bht exploratory-data-analysis exploratory-data-visualizations geothermal heat-flux nonparametric-statistics outlier-detection sensitivity-analysis spatial-analysis spatial-data-analysis spatial-data-science

Last synced: 16 May 2025

https://github.com/evoluteur/madeleinology

Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).

baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization

Last synced: 23 Jun 2025

https://github.com/datarohit/fifa-2020--data-analysis

This is dataset is from Kaggle.com which contains data of 18000+ fifa players with more than 100 features about them for analysis. Simple analysis performed on this Dataset.

exploratory-data-analysis exploratory-data-visualizations matplotlib-pyplot numpy pandas seaborn

Last synced: 13 Jul 2025

https://github.com/saob007/modelado_retencion_personal_proyecto

Construcción de un modelo de aprendizaje automático que permite predecir si un empleado desertará o no de una empresa industrial de desarrollo automotriz

cleaning-data exploratory-data-analysis exploratory-data-visualizations jupyter-notebook logistic-regression-classifier pickle python3 random random-forest-classifier scikitlearn-machine-learning xgboost-classifier

Last synced: 01 Aug 2025

https://github.com/rakibhhridoy/uscrimeanalysis-appliedstatistics

Detecting the key reason for crime(manslaughter) happened in 30 years. There's a lot of aspect present in the data published by US government. The data and the finding can be used in any country prospect,if not all but handy few.

ab-testing applied-statistics crime crime-analysis crime-prediction crime-statistics eda exploratory-data-analysis exploratory-data-visualizations hypothesis-testing machine-learning python statistical-analysis statistical-learning statsmodels visualization

Last synced: 24 Oct 2025

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains the detailed EDA analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 20 Mar 2025

https://github.com/makoczoro/credit-default-risk-analysis-eda

This repository contains the detailed EDA Analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 20 Mar 2025

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 03 Mar 2025

https://github.com/saraasgari99/customer-big-data-analytics

In-depth analysis of customer behavior in e-commerce using big data analytics, visualization, and machine learning in Python (PCA, time-series, exploratory, sentiment, and predictive analysis)

big-data-analytics exploratory-data-analysis exploratory-data-visualizations machine-learning pandas pca python random-forest sentiment-analysis sklearn

Last synced: 30 Dec 2025

https://github.com/amoghkori/predicting-the-success-of-falcon-9-rocket-landings

Predictive algorithms for determining the success of Falcon 9 first-stage landing, enabling informed bidding strategies for rocket launch startups.

data-collection data-wrangling exploratory-data-analysis exploratory-data-visualizations machine-learning-algorithms model-selection predictive-modeling web-scraping

Last synced: 17 Mar 2025

https://github.com/anderson-andre-p/exploratory-data-analysis.roller-coaster

This repository contains an exploratory data analysis (EDA) project focused on roller coasters. The project involved organizing, cleaning, and visualizing the data to gain insights into roller coasters' characteristics and performance.

data-analysis eda exploratory-data-analysis exploratory-data-visualizations notebook

Last synced: 15 Mar 2025

https://github.com/hariprasath-v/hackerearth-novartis-data-science-hiring-challenge

Hackereath data science hiring challenge to predict the hack attacks on digital payment processs.

exploratory-data-analysis exploratory-data-visualizations machine-learning r random-forest

Last synced: 02 Mar 2025

https://github.com/jpgiant/eda_of_medical_premium

Exploratory Data Analysis (EDA) of a medical premium dataset

exploratory-data-analysis exploratory-data-visualizations pandas-dataframe python

Last synced: 02 Jul 2025

https://github.com/jpgiant/gujaratrainfallanalysis_2021

Analysis about the rainfall that occurred in the districts of Gujarat state in 2021

data-analysis exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas-python python

Last synced: 02 Jul 2025

https://github.com/benzerinsio/heartdisease-eda

📊 Análise Exploratória de Dados (EDA) - Doenças Cardíacas | Estudo em Exploração de Fatores Cardíacos para prática e demonstração de técnicas analíticas e visuais.

analise-de-dados analise-exploratoria analise-exploratoria-de-dados eda exploratory-data-analysis exploratory-data-visualizations

Last synced: 31 Mar 2025

https://github.com/muthukumar0908/energy_consumption_analysis

Project will analyze energy usage and greenhouse gas (GHG) emissions of Ontario's Broader Public Sector (BPS) organizations, leveraging a comprehensive database of reported data.

datacleaning exploratory-data-visualizations plotly powerbi

Last synced: 30 Mar 2025