Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
computer-vision data-centric-ai data-exploration data-profiling data-quality data-science data-validation deep-learning exploratory-data-analysis image-analysis image-classification image-generation image-quality image-segmentation
Last synced: 01 Jul 2024
https://github.com/devsgnr/breadroll
breadroll π₯ is a simple lightweight library for data processing operations written in Typescript and powered by Bun.
bun csv csv-parser data-engineering data-science data-transformation eda exploratory-data-analysis tsv tsv-parser
Last synced: 21 Jun 2024
https://github.com/tommyod/KDEpy
Kernel Density Estimation in Python
data-analysis exploratory-data-analysis kernel-density-estimation python python3 statistics
Last synced: 18 Jun 2024
https://github.com/rwoldford/loon
A Toolkit for Interactive Statistical Data Visualization
data-visualization exploratory-data-analysis high-dimension-visualization high-dimensional-data interactive-graphics interactive-visualizations r r-package r-programming r-stats statistical-graphics statistical-learning statistics tcl-applications tcl-extension tcl-tk tcltk
Last synced: 16 Jun 2024
https://github.com/copenhaver/factoranalysis
Alternating minimization approach to factor analysis
exploratory-data-analysis factor-analysis optimization
Last synced: 10 Jun 2024
https://github.com/joachim-gassen/ExPanDaR
R Package for Interactive Panel Data Exploration
accounting eda exploratory-data-analysis finance open-science package r replication shiny shiny-apps
Last synced: 10 Jun 2024
https://github.com/krupanss/IEDA
R Package for the Interactive Shiny Application for exploratory data analysis thru visualization
eda exploratory-data-analysis interactive-analysis interactive-data-analysis interactive-eda interactive-visualizations r rpackage rshiny
Last synced: 10 Jun 2024
https://github.com/business-science/correlationfunnel
Speed Up Exploratory Data Analysis (EDA)
correlation exploratory-analysis exploratory-data-analysis exploratory-data-visualizations r-package tidyverse
Last synced: 10 Jun 2024
https://github.com/Nazaniiin/EDA_QualityofRedWine
:wine_glass: :chart_with_upwards_trend: (EDA) R - Vizualization / Performed exploratory analysis and visualization on Red Wine Quality dataset; Mainly answering which chemical properties influence the quality of red wines.
charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization
Last synced: 10 Jun 2024
https://github.com/ben519/mltools
Exploratory and diagnostic machine learning tools for R
exploratory-data-analysis machine-learning r
Last synced: 10 Jun 2024
https://github.com/tusharnankani/whatsapp-chat-data-analysis
An Exhaustive WhatsApp Chat Data Analysis.
data-analysis data-mining data-visualization exploratory-data-analysis hacktoberfest visualization whatsapp whatsapp-analysis
Last synced: 09 Jun 2024
https://github.com/hsbc/tslumen
A library for Time Series EDA (exploratory data analysis)
analysis data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations pandas profiling python time-series time-series-analysis time-series-eda time-series-profiling timeseries timeseries-analysis timeseries-eda
Last synced: 07 Jun 2024
https://github.com/zzawadz/DepthProc
depth-functions exploratory-data-analysis r statistics
Last synced: 04 Jun 2024
https://github.com/Yacine87/EDA_R_Packages
EDA is a must to do step in the data science workflow. Working on data, wrangling & transforming them is time consuming, and it determine the success degree of the next steps (pre preocessing, modelling, communicating outputs & decision making). This repo will show you how to perform EDA in R using the tidyverse ecosystem, and will introduce a comparative approach between the main packages in R whcich could let you perform automated EDA & generating automated EDA html or pdf reports, ready to be communicated.
dataexplorer dlookr eda exploratory-data-analysis exploratory-data-visualizations explorer hmisc missing-data outliers r rmarkdown rtutor smarteda statistical-tests summarytools tidyverse
Last synced: 04 Jun 2024
https://github.com/Nelson-Gon/mde
mde: Missing Data Explorer
data-analysis data-cleaning data-exploration data-science datacleaner datacleaning exploratory-data-analysis missing missing-data missing-value-treatment missing-values missingness omit r r-package r-stats recode replace rstats statistics
Last synced: 04 Jun 2024
https://github.com/tuanle618/AEDA
AEDA - Automated Data Exploratory Analysis in R
data-science eda eda-report exploratory-data-analysis r
Last synced: 04 Jun 2024
https://github.com/Nelson-Gon/shinymde
A shiny interface to mde, the missing data explorer R package. Deployed at https://nelson-gon.shinyapps.io/shinymde
dashboard eda exploratory-data-analysis gui-application missing-data missing-values missingness open-source r r-package recoding-variables rstats scientific-visualization shiny-apps shinydashboard statistical-analysis statistics visualization
Last synced: 03 Jun 2024
https://github.com/superchordate/data-viz-talk
Resources from the "Data Visualization in Business Communication" presentation at the 2023 Gamma Iota Sigma Regional Conference in Fort Worth, TX.
data-visualization eda exploratory-data-analysis public-data
Last synced: 03 Jun 2024
https://github.com/superchordate/storyteller
AutoML R framework functions for quickly finding stories from data.
automl eda exploratory-data-analysis r
Last synced: 03 Jun 2024
https://github.com/DeDeDeDer/Personal_Projects
This holds all my personal data-related project's (Automation, Modelling, Analysis)
actuarial-science actuarial-statistics claims-reserving datascience datascraping excelvba exploratory-data-analysis feature-engineering insurance-claims modelling-framework predictive-modeling python3
Last synced: 03 Jun 2024
https://github.com/rasbt/musicmood
A machine learning approach to classify songs by mood.
exploratory-data-analysis lyrics machine-learning mood song-dataset
Last synced: 03 Jun 2024
https://github.com/great-northern-diver/loon
A Toolkit for Interactive Statistical Data Visualization
data-analysis data-science data-visualization exploratory-analysis exploratory-data-analysis high-dimensional-data interactive-graphics interactive-visualizations loon python statistical-analysis statistical-graphics statistics tcl-extension tk
Last synced: 30 May 2024
https://github.com/Jean-njoroge/Breast-cancer-risk-prediction
Classification of Breast Cancer diagnosis Using Support Vector Machines
breast-cancer-prediction breast-cancer-tumor breastcancer-classification classification data-analysis dataprocessing exploratory-data-analysis notebook pipelines prediction-model python supervised-learning svm
Last synced: 29 May 2024
https://github.com/evidence-dev/evidence
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown..
analytics business-intelligence dashboard data-engineering data-science data-visualization dbt duckdb exploratory-data-analysis finance open-source self-hosted sql statistics svelte tailwindcss webassembly
Last synced: 27 May 2024
https://github.com/data-describe/data-describe
dataβ°describe: Pythonic EDA Accelerator for Data Science
analysis data-science eda exploratory-data-analysis pypi
Last synced: 20 May 2024
https://github.com/kianweelee/Edator
A python package that performs exploratory data analysis for users. Additionally, it generates 3 types of output files (cleaned CSV, plots and a text report).
data-analysis data-science exploratory-data-analysis
Last synced: 20 May 2024
https://github.com/TysonStanley/furniture
The furniture R package contains table1 for publication-ready simple and stratified descriptive statistics, tableC for publication-ready correlation matrixes, and other tables #rstats
cran descriptive-statistics exploratory-data-analysis health table-one table1 tables tidy tidyverse
Last synced: 20 May 2024
https://github.com/ropensci/visdat
Preliminary Exploratory Visualisation of Data
exploratory-data-analysis missingness peer-reviewed r r-package ropensci rstats visualisation
Last synced: 20 May 2024
https://github.com/duttashi/learnr
Exploratory, Inferential and Predictive data analysis. Feel free to show your :heart: by giving a star :star:
exploratory-data-analysis inferential-statistics predictive-modeling r
Last synced: 20 May 2024
https://github.com/hneth/ds4psy
Data science for psychologists (ds4psy): R package supporting book and course
data-literacy data-science education exploratory-data-analysis psychology r r-package social-sciences visualisation
Last synced: 19 May 2024
https://github.com/InfuseAI/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 13 May 2024
https://github.com/achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project
Complete-Life-Cycle-of-a-Data-Science-Project
analysis data-analysis data-science dataset deep-learning eda exploratory-data-analysis feature-engineering federated-learning machine-learning nlp-models python python-library pytorch reinforcement-learning scraper supervised-learning transfer-learning unsupervised-learning web-scraping
Last synced: 13 May 2024
https://github.com/daya6489/SmartEDA
a R package for data exploratory analysis
analysis exploratory-data-analysis
Last synced: 11 May 2024
https://github.com/vega/altair_ally
Altair Ally is a companion package to Altair, which provides a few shortcuts to create common plots for exploratory data analysis.
altair eda exploratory-data-analysis exploratory-data-visualizations vega-lite visualization
Last synced: 09 May 2024
https://github.com/lux-org/lux
Automatically visualize your pandas dataframe via a single print! π π‘
data-science exploratory-data-analysis jupyter pandas python visualization visualization-tools
Last synced: 09 May 2024
https://github.com/mdh266/NYCBuildingEnergyUse
Creating Regression Models Of Building Emissions On Google Cloud
bokeh data-science energy-efficiency exploratory-data-analysis google-app-engine missing-data missing-values outlier-detection outlier-removal regression regression-models scikit-learn xgboost
Last synced: 07 May 2024
https://github.com/aeturrell/skimpy
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
data-science eda exploratory-data-analysis pandas statistics summary-statistics
Last synced: 07 May 2024
https://github.com/great-expectations/great_expectations
Always know what to expect from your data.
cleandata data-engineering data-profilers data-profiling data-quality data-science data-unit-tests datacleaner datacleaning dataquality dataunittest eda exploratory-analysis exploratory-data-analysis exploratorydataanalysis mlops pipeline pipeline-debt pipeline-testing pipeline-tests
Last synced: 28 Apr 2024
https://github.com/sfu-db/dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
apis apiwrapper cleaning connector data-exploration data-science datacleaning dataconnector dataprep datapreparation eda exploratory-data-analysis webconnector
Last synced: 28 Apr 2024
https://github.com/ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
big-data-analytics data-analysis data-exploration data-profiling data-quality data-science deep-learning eda exploration exploratory-data-analysis hacktoberfest html-report jupyter jupyter-notebook machine-learning pandas pandas-dataframe pandas-profiling python statistics
Last synced: 28 Apr 2024
https://github.com/fbdesignpro/sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
data-analysis data-exploration data-profiling data-science data-visualization eda exploration exploratory-data-analysis machine-learning pandas pandas-dataframe python statistics
Last synced: 28 Apr 2024
https://github.com/Desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data
Last synced: 21 Apr 2024
https://github.com/ahmed-mohamed-sn/olliePy
OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
ai analytics charts dashboard data data-analytics data-science data-scientist eda error-analysis exploratory-data-analysis machine-learning python visualization
Last synced: 20 Apr 2024
https://github.com/mast-group/sequence-mining
Probabilistic Sequence Mining
data-mining exploratory-data-analysis
Last synced: 19 Apr 2024
https://github.com/JasonKessler/scattertext
Beautiful visualizations of how language differs among document types.
computational-social-science d3 eda exploratory-data-analysis japanese-language machine-learning natural-language-processing nlp scatter-plot semiotic-squares sentiment stylometric stylometry text-as-data text-mining text-visualization topic-modeling visualization word-embeddings word2vec
Last synced: 17 Apr 2024
https://github.com/neerjad/DataVisualization
Tutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
exploratory-data-analysis plotly tutorial visualisation
Last synced: 15 Apr 2024
https://github.com/harunurrashid97/100-Days-Of-ML-Code
A day to day plan for this challenge. Covers both theoritical and practical aspects
100-days-of-code 100daysofmlcode article data-preprocessing data-science datascience decision-tree eda exploratory-data-analysis implementation infographics linear-regression machine-learning machine-learning-algorithms python regression-algorithms siraj-raval-challenge textsummarization tutorials vizualization
Last synced: 13 Apr 2024
https://github.com/jadianes/data-science-your-way
Ways of doing Data Science Engineering and Machine Learning in R and Python
data-frame data-science data-science-engineering exploratory-data-analysis jupyter machine-learning notebook python r tutorial
Last synced: 10 Apr 2024
https://great-northern-diver.github.io/loon/
A Toolkit for Interactive Statistical Data Visualization
data-analysis data-science data-visualization exploratory-analysis exploratory-data-analysis high-dimensional-data interactive-graphics interactive-visualizations loon python statistical-analysis statistical-graphics statistics tcl-extension tk
Last synced: 08 Apr 2024
https://github.com/dataprofessor/code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
data-professor data-science data-science-python dataprofessor datascience exploratory-data-analysis machine-learning machinelearning pandas python python-data-science r scikit-learn scikit-learn-python shiny streamlit
Last synced: 01 Apr 2024
https://github.com/zmjones/mmpf
Monte-Carlo methods for prediction functions
exploratory-data-analysis machine-learning r rstats
Last synced: 31 Mar 2024
https://github.com/zmjones/edarf
exploratory data analysis using random forests
exploratory-data-analysis machine-learning r random-forest rstats
Last synced: 31 Mar 2024
https://github.com/alastairrushworth/inspectdf
π οΈ π Tools for Exploring and Comparing Data Frames
comparison dataframe eda exploratory-data-analysis r rstats visualization
Last synced: 26 Mar 2024
https://github.com/nbarrowman/vtree
An R package for calculating and drawing variable trees
data-science data-visualization exploratory-data-analysis r statistics
Last synced: 26 Mar 2024
https://github.com/dgwozdz/HN_SO_analysis
Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality
eda exploratory-data-analysis granger-causality hackernews python relationship stackoverflow
Last synced: 24 Mar 2024
https://github.com/Renumics/spotlight
Interactively explore unstructured datasets from your dataframe.
audio computer-vision data-centric-ai data-curation data-visualization exploratory-data-analysis hacktoberfest images machine-learning meshes timeseries unstructured-data video
Last synced: 23 Mar 2024
https://github.com/mstaniak/autoEDA-resources
A list of software and papers related to automatic and fast Exploratory Data Analysis
autoeda automation eda exploratory-data-analysis visualization
Last synced: 21 Mar 2024
https://github.com/ank0409/Ditching-Excel-for-Python
Functionalities in Excel translated to Python
dataframe eda excel exploratory-data-analysis machine-learning numpy pandas pivot-tables python tutorial vba
Last synced: 17 Mar 2024
https://github.com/lozuwa/impy
Impy is a Python3 library with features that help you in your computer vision tasks.
dataset exploratory-data-analysis machine-learning preprocessing raw-data statistics tidy-data
Last synced: 16 Mar 2024