Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/CICIFLY/Data-Analytics-Projects

This repository contains the projects related to data collecting, assessing,cleaning,visualizations and analyzing

data-analysis data-visualization jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 12 Nov 2024

https://github.com/tkrabel/edaviz

edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab

altair data-analysis data-exploration data-sciene data-visualization eda edaviz exploratory-data interactive jupyter-notebook matplotlib pandas plotly project-jupyter pyhon qgrid seaborn

Last synced: 25 Dec 2024

https://github.com/milaan9/11_python_matplotlib_module

Matplotlib is an amazing visualization library in Python for 2D plots of arrays. Matplotlib is a multi-platform data visualization library built on NumPy arrays and designed to work with the broader SciPy stack. It was introduced by John Hunter in the year 2002. One of the greatest benefits of visualization is that it allows us visual access to huge amounts of data in easily digestible visuals. Matplotlib consists of several plots like line, bar, scatter, histogram, etc

data-analysis data-visualization ipython-notebook matplotlib matplotlib-examples matplotlib-exercises matplotlib-figures matplotlib-heatmap matplotlib-pyplot matplotlib-python matplotlib-tutorial python-matplotlib python-tutor python-tutorial-github python-tutorial-notebook python-tutorials python4beginner python4datascience python4everybody tutor-milaan9

Last synced: 23 Dec 2024

https://github.com/nickslevine/zebras

Data analysis library for JavaScript built with Ramda

data-analysis data-science functional-programming javascript pandas ramda

Last synced: 07 Nov 2024

https://github.com/acerbilab/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 26 Dec 2024

https://github.com/pavelkomarov/exportify

Export Spotify playlists using the Web API. Analyze them in the Jupyter notebook.

data-analysis github-pages-website javascript javascript-promise jupyter-notebook spotify spotify-api spotify-web-api

Last synced: 22 Dec 2024

https://github.com/koldlight/curso-python-analisis-datos

Curso de python básico orientado al análisis de datos, en español

course data data-analysis folium hacktoberfest numpy pandas python requests seaborn spanish

Last synced: 25 Dec 2024

https://github.com/dataplane-app/dataplane

Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.

airflow data data-analysis data-engineering data-integration data-pipelines data-science dataplane datawarehouse etl finance golang kubernetes pipelines robotics-process-automation rpa scheduler workflow workflow-automation workflows

Last synced: 12 Nov 2024

https://github.com/aws/amazon-redshift-python-driver

Redshift Python Connector. It supports Python Database API Specification v2.0.

amazon-redshift aws-redshift data-analysis data-science

Last synced: 26 Dec 2024

https://github.com/ayush1997/visualize_ML

Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and continuous datasets.

data-analysis machine-learning matplotlib python statisics visualization

Last synced: 25 Oct 2024

https://github.com/nshiab/simple-data-analysis

Easy-to-use and high-performance JavaScript library for data analysis.

data data-analysis data-science duckdb javascript nodejs typescript

Last synced: 28 Oct 2024

https://github.com/codekitchen/pipeline

the `pipeline` shell command

data-analysis data-mining shell-scripting

Last synced: 24 Nov 2024

https://github.com/totalhack/zillion

Make sense of it all. Semantic data modeling and analytics with a sprinkle of AI. https://totalhack.github.io/zillion/

ai analytics data-analysis data-warehousing datasources openai python query-builder reporting semantic-data-model semantic-layer sql text-to-sql warehouse

Last synced: 08 Nov 2024

https://github.com/Azure/DataScienceVM

Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)

ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver

Last synced: 27 Nov 2024

https://github.com/azure/datasciencevm

Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)

ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver

Last synced: 26 Dec 2024

https://github.com/briatte/ida

Introduction to Data Analysis, using R (2013)

course data-analysis r

Last synced: 27 Oct 2024

https://github.com/koolreport/core

An Open Source PHP Reporting Framework that helps you to write perfect data reports or to construct awesome dashboards in PHP. Working great with all PHP versions from 5.6 to latest 8.0. Fully compatible with all kinds of MVC frameworks like Laravel, CodeIgniter, Symfony.

data-analysis data-pipelines data-pivot data-summarization data-visualization data-viz framework mysql-reporting-tools php php-reporting-tools php-reports report-generator reporting reporting-engine reporting-tool

Last synced: 27 Dec 2024

https://github.com/calculist/calculist

the open source thinking tool for problem solvers

data-analysis note-taking tree-structure

Last synced: 04 Nov 2024

https://github.com/phillipdupuis/dtale-desktop

Build a data visualization dashboard with simple snippets of python code

data-analysis data-science data-visualization fastapi pandas python react typescript visualization

Last synced: 27 Dec 2024

https://github.com/archd3sai/Customer-Survival-Analysis-and-Churn-Prediction

In this project, I have utilized survival analysis models to see how the likelihood of the customer churn changes over time and to calculate customer LTV. I have also implemented the Random Forest model to predict if a customer is going to churn and deployed a model using the flask web app.

customer-churn-prediction customer-survival-analysis data-analysis explainable-ai flask-application hazard partial-dependence-plot random-forest shap-values survival-analysis

Last synced: 06 Nov 2024

https://github.com/risenw/datasist

A Python library for easy data analysis, visualization, exploration and modeling

data-analysis data-science data-visualization feature-engineering machine-learning python-3

Last synced: 22 Dec 2024

https://github.com/toobigdata/papa

一个浏览器端数据爬虫,做每个人的数据助手

chrome data-analysis kickstarter spider

Last synced: 07 Nov 2024

https://github.com/unytics/airbyte_serverless

Airbyte made simple (no UI, no database, no cluster)

airbyte bigquery data data-analysis data-engineering data-warehouse elt etl pipeline

Last synced: 22 Dec 2024

https://github.com/cuducos/calculadora-do-cidadao

💵 Tool for Brazilian Reais monetary adjustment/correction

brasil brazil data-analysis hacktoberfest monetary python

Last synced: 25 Oct 2024

https://github.com/javascriptdata/dnotebook

Dnotebook is a Jupyter-like library for javaScript environment. It allows you to create and share pages that contain live code, text and visualizations.

data-analysis interactive-visualizations javascript live-code notebook notebook-javascript

Last synced: 12 Nov 2024

https://github.com/opensource9ja/dnotebook

Dnotebook is a Jupyter-like library for javaScript environment. It allows you to create and share pages that contain live code, text and visualizations.

data-analysis interactive-visualizations javascript live-code notebook notebook-javascript

Last synced: 14 Nov 2024

https://github.com/moosetechnology/moose

MOOSE - Platform for software and data analysis.

data-analysis moose pharo smalltalk software-analysis

Last synced: 27 Dec 2024

https://github.com/moosetechnology/Moose

MOOSE - Platform for software and data analysis.

data-analysis moose pharo smalltalk software-analysis

Last synced: 17 Nov 2024

https://github.com/ing-bank/probatus

Validation (like Recursive Feature Elimination for SHAP) of (multiclass) classifiers & regressors and data used to develop them.

binary-classifiers data-analysis data-science feature-elimination machine-learning multi-class-classification recursive-feature-elimination regressors shap statistics tree-model

Last synced: 21 Dec 2024

https://github.com/hbuschme/TextGridTools

Read, write, and manipulate Praat TextGrid files with Python

annotation data-analysis elan linguistics praat python textgrid

Last synced: 27 Nov 2024

https://github.com/iam-mhaseeb/skytrax-data-warehouse

A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.

airflow data-analysis data-analytics data-cleaning data-engineering data-orchestration data-processing data-visualization data-warehouse data-warehousing database docker metabase python python3 redshift s3 s3-bucket sql

Last synced: 14 Dec 2024

https://github.com/anicetngrt/jiro-nn

A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.

adam classification cuda data-analysis deep-learning dropout gpu gpu-computing machine-learning ml nalgebra neural-networks nn opencl pipelines regression rust sgd

Last synced: 25 Dec 2024

https://github.com/NCAS-CMS/cf-python

A CF-compliant Earth Science data analysis library

cf cfdm cfunits data-analysis earth-science metadata netcdf pp python um

Last synced: 27 Nov 2024

https://github.com/ArturSepp/QuantInvestStrats

Quantitative Investment Strategies (QIS) package implements Python analytics for visualisation of financial data, performance reporting, analysis of quantitative strategies.

asset-management data-analysis data-visualization investment-analysis performance-attribution portfolio-optimization portfolio-risk-management python quantitative-finance

Last synced: 14 Oct 2024

https://github.com/canner/wren-engine

🤖 The semantic engine for LLMs, bringing semantic context to AI agents. 🔥

business-intelligence data data-analysis data-analytics data-lake data-warehouse hacktoberfest llm semantic semantic-layer sql

Last synced: 22 Dec 2024

https://github.com/AnicetNgrt/jiro-nn

A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.

adam classification cuda data-analysis deep-learning dropout gpu gpu-computing machine-learning ml nalgebra neural-networks nn opencl pipelines regression rust sgd

Last synced: 24 Sep 2024

https://github.com/hay/dataknead

Effortless conversion between data formats like JSON, XML and CSV

csv data-analysis data-conversion json python python3

Last synced: 26 Dec 2024

https://github.com/juliadata/indexedtables.jl

Flexible tables with ordered indices

data-analysis data-manipulation indexedtables julia juliadb

Last synced: 25 Dec 2024

https://github.com/hdfgroup/hsds

Cloud-native, service based access to HDF data

asyncio aws data-analysis docker hdf5 multi-dimensional python scientific-data

Last synced: 26 Dec 2024

https://github.com/jadianes/spark-r-notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

big-data bigdata data-analysis data-science exploratory-data-analysis jupyter jupyter-notebook notebook r sparkr

Last synced: 09 Nov 2024

https://github.com/apachecn/ds100-textbook-zh

:book: [译] UCB DS100 数据科学的原理与技巧

data-analysis ds100 machine-learning python textbook ucb

Last synced: 12 Nov 2024

https://github.com/acerbilab/pyvbmc

PyVBMC: Variational Bayesian Monte Carlo algorithm for posterior and model inference in Python

bayesian-inference data-analysis gaussian-processes machine-learning python variational-inference

Last synced: 28 Dec 2024

https://github.com/ajayarunachalam/msda

Library for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector

anamoly-detection-using-graphs anomaly-detection correlation data-analysis deep-learning deep-neural-networks explainable-artificial-intelligence feature-engineering feature-selection multidimensional-data multisensor python pytorch sensor sensor-data signal-processing tabular-data time-series variation visualization

Last synced: 18 Dec 2024

https://github.com/winvector/data_algebra

Codd method-chained SQL generator and Pandas data processing in Python.

data-analysis data-science pandas python

Last synced: 25 Dec 2024

https://github.com/ujjwalkarn/xda

R package for exploratory data analysis

data-analysis data-science exploratory-data-analysis r

Last synced: 11 Nov 2024

https://github.com/tiannaparris/data-analysis-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

data-analysis data-science data-visualization excel matplotlib pandas portfolio powerbi python r scipy seaborn sql tableau

Last synced: 10 Dec 2024

https://github.com/abhiamishra/ggshakeR

An analysis and visualization R package that works with publicly available soccer data

analysis data-analysis data-visualization football-analytics library machine-learning plotting r soccer soccer-analytics visualization

Last synced: 13 Nov 2024

https://github.com/bccp/nbodykit

Analysis kit for large-scale structure datasets, the massively parallel way

astrophysics clustering cosmology data-analysis large-scale-structure mpi mpi4py parallel-computing python

Last synced: 29 Nov 2024

https://github.com/Nesvilab/philosopher

PeptideProphet, PTMProphet, ProteinProphet, iProphet, Abacus, and FDR filtering

bioinformatics data-analysis go mass-spectrometry ms-data proteomics

Last synced: 09 Nov 2024

https://github.com/deanmarchiori/analysis-flow

Data Analysis Workflows & Reproducibility Learning Resources

data-analysis reproducibility reproducible-data-science reproducible-science tooling workflow

Last synced: 04 Dec 2024

https://github.com/innat/ML-Resource

A concise resource repository for machine learning

data-analysis data-science deep-learning kaggle machine-learning python spark

Last synced: 11 Nov 2024

https://github.com/imsanjoykb/data-science-regular-bootcamp

Regular practice on Data Science, Machien Learning, Deep Learning, Solving ML Project problem, Analytical Issue. Regular boost up my knowledge. The goal is to help learner with learning resource on Data Science filed.

artificial-intelligence data-analysis data-science data-science-notebook data-science-projects data-visualization database-connection deep-learning etl-pipeline etl-process feature-engineering machine-learning mysql-database neural-network numpy pandas postgresql python python-automation sqlite

Last synced: 12 Oct 2024

https://github.com/basedosdados/analises

📊 Repositório de códigos simples e replicáveis das análises publicadas.

data-analysis data-visualization open-source

Last synced: 23 Dec 2024

https://github.com/phanxuanquang/askdb

Revolutionize the way we interact with SQL databases using Generative AI

csharp dapper data-analysis database dotnet gemini genai generative-ai llm llms sql windows winui winui3

Last synced: 23 Dec 2024

https://github.com/dcwuser/metanumerics

Meta.Numerics is library for advanced numerical computing on the .NET platform. It offers an object-oriented API for statistical analysis, advanced functions, Fourier transforms, numerical integration and optimization, and matrix algebra.

csharp-library data-analysis dotnet math math-library matrix matrix-algebra matrix-factorization matrix-library matrix-multiplication numerical-analysis numerical-integration numerical-optimization numerics optimization scientific-computing special-functions statistical-analysis statistical-tests statistics

Last synced: 12 Oct 2024

https://github.com/sciruby/daru-view

daru-view is for easy and interactive plotting in web application & IRuby notebook. daru-view is a plugin gem to the existing daru gem.

charts daru daru-view data-analysis data-visualization graphs iruby-notebook nanoc plot-library rails ruby sinatra

Last synced: 14 Nov 2024

https://github.com/SciRuby/daru-view

daru-view is for easy and interactive plotting in web application & IRuby notebook. daru-view is a plugin gem to the existing daru gem.

charts daru daru-view data-analysis data-visualization graphs iruby-notebook nanoc plot-library rails ruby sinatra

Last synced: 30 Oct 2024

https://github.com/Coorsaa/shinyMlr

shiny-mlr: Integration of the mlr package into shiny

data-analysis data-visualization machine-learning mlr r r-package shiny shiny-apps

Last synced: 04 Dec 2024