Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/devsgnr/breadroll

breadroll πŸ₯Ÿ is a simple lightweight library for data processing operations written in Typescript and powered by Bun.

bun csv csv-parser data-engineering data-science data-transformation eda exploratory-data-analysis tsv tsv-parser

Last synced: 21 Jun 2024

https://github.com/copenhaver/factoranalysis

Alternating minimization approach to factor analysis

exploratory-data-analysis factor-analysis optimization

Last synced: 10 Jun 2024

https://github.com/krupanss/IEDA

R Package for the Interactive Shiny Application for exploratory data analysis thru visualization

eda exploratory-data-analysis interactive-analysis interactive-data-analysis interactive-eda interactive-visualizations r rpackage rshiny

Last synced: 10 Jun 2024

https://github.com/Nazaniiin/EDA_QualityofRedWine

:wine_glass: :chart_with_upwards_trend: (EDA) R - Vizualization / Performed exploratory analysis and visualization on Red Wine Quality dataset; Mainly answering which chemical properties influence the quality of red wines.

charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization

Last synced: 10 Jun 2024

https://github.com/ben519/mltools

Exploratory and diagnostic machine learning tools for R

exploratory-data-analysis machine-learning r

Last synced: 10 Jun 2024

https://github.com/Yacine87/EDA_R_Packages

EDA is a must to do step in the data science workflow. Working on data, wrangling & transforming them is time consuming, and it determine the success degree of the next steps (pre preocessing, modelling, communicating outputs & decision making). This repo will show you how to perform EDA in R using the tidyverse ecosystem, and will introduce a comparative approach between the main packages in R whcich could let you perform automated EDA & generating automated EDA html or pdf reports, ready to be communicated.

dataexplorer dlookr eda exploratory-data-analysis exploratory-data-visualizations explorer hmisc missing-data outliers r rmarkdown rtutor smarteda statistical-tests summarytools tidyverse

Last synced: 04 Jun 2024

https://github.com/tuanle618/AEDA

AEDA - Automated Data Exploratory Analysis in R

data-science eda eda-report exploratory-data-analysis r

Last synced: 04 Jun 2024

https://github.com/superchordate/data-viz-talk

Resources from the "Data Visualization in Business Communication" presentation at the 2023 Gamma Iota Sigma Regional Conference in Fort Worth, TX.

data-visualization eda exploratory-data-analysis public-data

Last synced: 03 Jun 2024

https://github.com/superchordate/storyteller

AutoML R framework functions for quickly finding stories from data.

automl eda exploratory-data-analysis r

Last synced: 03 Jun 2024

https://github.com/rasbt/musicmood

A machine learning approach to classify songs by mood.

exploratory-data-analysis lyrics machine-learning mood song-dataset

Last synced: 03 Jun 2024

https://github.com/data-describe/data-describe

data⎰describe: Pythonic EDA Accelerator for Data Science

analysis data-science eda exploratory-data-analysis pypi

Last synced: 20 May 2024

https://github.com/kianweelee/Edator

A python package that performs exploratory data analysis for users. Additionally, it generates 3 types of output files (cleaned CSV, plots and a text report).

data-analysis data-science exploratory-data-analysis

Last synced: 20 May 2024

https://github.com/TysonStanley/furniture

The furniture R package contains table1 for publication-ready simple and stratified descriptive statistics, tableC for publication-ready correlation matrixes, and other tables #rstats

cran descriptive-statistics exploratory-data-analysis health table-one table1 tables tidy tidyverse

Last synced: 20 May 2024

https://github.com/ropensci/visdat

Preliminary Exploratory Visualisation of Data

exploratory-data-analysis missingness peer-reviewed r r-package ropensci rstats visualisation

Last synced: 20 May 2024

https://github.com/duttashi/learnr

Exploratory, Inferential and Predictive data analysis. Feel free to show your :heart: by giving a star :star:

exploratory-data-analysis inferential-statistics predictive-modeling r

Last synced: 20 May 2024

https://github.com/hneth/ds4psy

Data science for psychologists (ds4psy): R package supporting book and course

data-literacy data-science education exploratory-data-analysis psychology r r-package social-sciences visualisation

Last synced: 19 May 2024

https://github.com/daya6489/SmartEDA

a R package for data exploratory analysis

analysis exploratory-data-analysis

Last synced: 11 May 2024

https://github.com/vega/altair_ally

Altair Ally is a companion package to Altair, which provides a few shortcuts to create common plots for exploratory data analysis.

altair eda exploratory-data-analysis exploratory-data-visualizations vega-lite visualization

Last synced: 09 May 2024

https://github.com/lux-org/lux

Automatically visualize your pandas dataframe via a single print! πŸ“Š πŸ’‘

data-science exploratory-data-analysis jupyter pandas python visualization visualization-tools

Last synced: 09 May 2024

https://github.com/aeturrell/skimpy

skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.

data-science eda exploratory-data-analysis pandas statistics summary-statistics

Last synced: 07 May 2024

https://github.com/sfu-db/dataprep

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.

apis apiwrapper cleaning connector data-exploration data-science datacleaning dataconnector dataprep datapreparation eda exploratory-data-analysis webconnector

Last synced: 28 Apr 2024

https://github.com/Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data

Last synced: 21 Apr 2024

https://github.com/ahmed-mohamed-sn/olliePy

OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.

ai analytics charts dashboard data data-analytics data-science data-scientist eda error-analysis exploratory-data-analysis machine-learning python visualization

Last synced: 20 Apr 2024

https://github.com/mast-group/sequence-mining

Probabilistic Sequence Mining

data-mining exploratory-data-analysis

Last synced: 19 Apr 2024

https://github.com/neerjad/DataVisualization

Tutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph

exploratory-data-analysis plotly tutorial visualisation

Last synced: 15 Apr 2024

https://github.com/jadianes/data-science-your-way

Ways of doing Data Science Engineering and Machine Learning in R and Python

data-frame data-science data-science-engineering exploratory-data-analysis jupyter machine-learning notebook python r tutorial

Last synced: 10 Apr 2024

https://github.com/zmjones/mmpf

Monte-Carlo methods for prediction functions

exploratory-data-analysis machine-learning r rstats

Last synced: 31 Mar 2024

https://github.com/zmjones/edarf

exploratory data analysis using random forests

exploratory-data-analysis machine-learning r random-forest rstats

Last synced: 31 Mar 2024

https://github.com/alastairrushworth/inspectdf

πŸ› οΈ πŸ“Š Tools for Exploring and Comparing Data Frames

comparison dataframe eda exploratory-data-analysis r rstats visualization

Last synced: 26 Mar 2024

https://github.com/nbarrowman/vtree

An R package for calculating and drawing variable trees

data-science data-visualization exploratory-data-analysis r statistics

Last synced: 26 Mar 2024

https://github.com/dgwozdz/HN_SO_analysis

Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality

eda exploratory-data-analysis granger-causality hackernews python relationship stackoverflow

Last synced: 24 Mar 2024

https://github.com/mstaniak/autoEDA-resources

A list of software and papers related to automatic and fast Exploratory Data Analysis

autoeda automation eda exploratory-data-analysis visualization

Last synced: 21 Mar 2024

https://github.com/lozuwa/impy

Impy is a Python3 library with features that help you in your computer vision tasks.

dataset exploratory-data-analysis machine-learning preprocessing raw-data statistics tidy-data

Last synced: 16 Mar 2024