An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/svelteplot/svelteplot

Svelte-native plotting framework based on the grammar of graphics

data-analysis data-visualization grammar graphics interactive-visualization svelte

Last synced: 11 Apr 2026

https://github.com/toxictoskey/dex-autotrader-bot

This is a Cryptocurrency Trading bot on DeFi that works in multiple Chain with unique trading strategies for cryptocurrencies. It performs automated technical analysis of cryptocurrencies, manages risk, reduces slippage and has customizable strategies such as Stop Loss and Buy the Dip.

arbitrum automated-trading base-network binance binance-smart-chain blockchain bybit curve data-analysis defi dydx eth fraxtal kucoin layer2 polygon sniping-bot solana starknet zksync

Last synced: 25 Jun 2025

https://github.com/databrickslabs/tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

data-analysis data-science pandas python scala time-series timeseries timeseries-analysis timeseries-data

Last synced: 29 Apr 2025

https://github.com/pydpiper/pylightxl

A light weight, zero dependency, minimal functionality excel read/writer python library

api data-analysis excel microsoft office pypi python python-library python2 python3

Last synced: 16 May 2025

https://github.com/PydPiper/pylightxl

A light weight, zero dependency, minimal functionality excel read/writer python library

api data-analysis excel microsoft office pypi python python-library python2 python3

Last synced: 29 Mar 2025

https://github.com/helicalinsight/helicalinsight

Helical Insight software is world’s first Open Source Business Intelligence framework which helps you to make sense out of your data and make well informed decisions.

amazon-redshift big-data business-intelligence dashboard data-analysis data-visualization druid graph-database hive mongodb mysql neo4j nosql oracle-database postgresql rdbms reporting sql-editor sqllite

Last synced: 06 Apr 2025

https://github.com/boostorg/histogram

Fast multi-dimensional generalized histogram with convenient interface for C++14

boost boost-libraries c-plus-plus c-plus-plus-14 convenient convenient-interface data-analysis header-only histogram statistics

Last synced: 05 Apr 2025

https://github.com/CJWorkbench/cjworkbench

The data journalism platform with built in training

data-analysis data-journalism data-science data-visualization journalism notebook

Last synced: 17 Jul 2025

https://github.com/kde/labplot

LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.

data-analysis data-science data-visualization fitting graph graph2d plotting scientific-plotting scientific-visualization

Last synced: 16 May 2025

https://github.com/pavelkomarov/exportify

Export Spotify playlists using the Web API. Analyze them in the Jupyter notebook.

data-analysis github-pages-website javascript javascript-promise jupyter-notebook spotify spotify-api spotify-web-api

Last synced: 12 Apr 2025

https://github.com/X-lab2017/open-digger

Open source analysis tools

data-analysis github hacktoberfest openrank

Last synced: 20 Mar 2025

https://github.com/Derek-Jones/ESEUR-book

Issue handling for Evidence-based Software Engineering: based on the publicly available data

book data-analysis empirical-research engineering-data evidence-based human-cognitive-characteristics software-development software-engineering

Last synced: 19 Jul 2025

https://github.com/rasgointelligence/rasgoql

Write python locally, execute SQL in your data warehouse

data-analysis data-science pandas python sql

Last synced: 14 Jun 2025

https://github.com/rasgointelligence/RasgoQL

Write python locally, execute SQL in your data warehouse

data-analysis data-science pandas python sql

Last synced: 20 Jul 2025

https://github.com/Bears-R-Us/arkouda

Arkouda (αρκούδα): Interactive Data Analytics at Supercomputing Scale :bear:

chapel data data-analysis data-science distributed-computing eda hpc python

Last synced: 08 Jul 2025

https://github.com/wizardforcel/data-science-notebook

:book: 每一个伟大的思想和行动都有一个微不足道的开始

data-analysis data-science machine-learning notebook numpy pandas sklearn tensorflow

Last synced: 10 Apr 2025

https://github.com/lucasxlu/LagouJob

Data Analysis & Mining for lagou.com

data-analysis data-mining lagou machine-learning nlp python3 web-crawler

Last synced: 18 Jul 2025

https://github.com/bears-r-us/arkouda

Arkouda (αρκούδα): Interactive Data Analytics at Supercomputing Scale :bear:

chapel data data-analysis data-science distributed-computing eda hpc python

Last synced: 06 Apr 2025

https://github.com/curiositry/eegrunt

A Collection Python EEG (+ ECG) Analysis Utilities for OpenBCI and Muse

data-analysis data-visualization ecg eeg muse neuroscience openbci python

Last synced: 13 Apr 2025

https://github.com/recodehive/stackoverflow-analysis

Stack overflow is a professional community for developers. This repo analysis 3 years of developer Survey done by Stackoverflow and do visualization and predict the salary of Data Scientist in future.

canva collaborate data-analysis data-science data-visualization ghdesktop github github-pages machine-learning stack-overflow student-vscode survey-analysis vscode

Last synced: 15 May 2025

https://github.com/hexinfo/dat

Asking yours data in a natural language way through pre-modeling (data models and semantic models).

agent-framework agents chatbi chatdata context-engineering data-analysis llms mcp nl2sql rag text2sql

Last synced: 04 Mar 2026

https://github.com/mcekovic/tennis-crystal-ball

Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction

big-data bigdata data-analysis data-science database elo elo-rating forecast goat machine-learning prediction sports statistics tennis tennis-score

Last synced: 11 Mar 2026

https://github.com/koldlight/curso-python-analisis-datos

Curso de python básico orientado al análisis de datos, en español

course data data-analysis folium hacktoberfest numpy pandas python requests seaborn spanish

Last synced: 12 Apr 2025

https://github.com/tkrabel/edaviz

edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab

altair data-analysis data-exploration data-sciene data-visualization eda edaviz exploratory-data interactive jupyter-notebook matplotlib pandas plotly project-jupyter pyhon qgrid seaborn

Last synced: 13 Jun 2025

https://github.com/milaan9/11_python_matplotlib_module

Matplotlib is an amazing visualization library in Python for 2D plots of arrays. Matplotlib is a multi-platform data visualization library built on NumPy arrays and designed to work with the broader SciPy stack. It was introduced by John Hunter in the year 2002. One of the greatest benefits of visualization is that it allows us visual access to huge amounts of data in easily digestible visuals. Matplotlib consists of several plots like line, bar, scatter, histogram, etc

data-analysis data-visualization ipython-notebook matplotlib matplotlib-examples matplotlib-exercises matplotlib-figures matplotlib-heatmap matplotlib-pyplot matplotlib-python matplotlib-tutorial python-matplotlib python-tutor python-tutorial-github python-tutorial-notebook python-tutorials python4beginner python4datascience python4everybody tutor-milaan9

Last synced: 06 Apr 2025

https://github.com/acerbilab/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 09 Apr 2025

https://github.com/naruaika/witt-data-studio

A powerful, user-friendly, and integrated data platform

business-intelligence data-analysis data-visualization gnome-desktop gtk4 open-source python

Last synced: 15 May 2026

https://github.com/nickslevine/zebras

Data analysis library for JavaScript built with Ramda

data-analysis data-science functional-programming javascript pandas ramda

Last synced: 24 Aug 2025

https://github.com/acclab/dabestr

Data Analysis with Bootstrap Estimation in R

data-analysis data-visualization estimation r statistics

Last synced: 22 Oct 2025

https://github.com/aws/amazon-redshift-python-driver

Redshift Python Connector. It supports Python Database API Specification v2.0.

amazon-redshift aws-redshift data-analysis data-science

Last synced: 04 Mar 2026

https://github.com/isxcode/spark-yun

Ultra-Lightweight AI-Powered Big Data Center | 至轻云-超轻量级智能化大数据中心

apache cdh data-analysis docker flink hadoop hive kubernetes saas spark

Last synced: 01 Sep 2025

https://github.com/dataplane-app/dataplane

Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.

airflow data data-analysis data-engineering data-integration data-pipelines data-science dataplane datawarehouse etl finance golang kubernetes pipelines robotics-process-automation rpa scheduler workflow workflow-automation workflows

Last synced: 27 Dec 2025

https://github.com/Hack23/cia

Citizen Intelligence Agency. Open-source intelligence platform analyzing Swedish political activities using AI and data visualization. Tracks politicians, government institutions, and parliamentary data, offering detailed insights, performance metrics, and advanced analytics.

ai civic-tech css data-analysis data-visualization goverment government-data java ministries open-data osint parliament-charts parliamentary-monitoring political-analysis political-parties politics riksdagen sverigesriksdag sweden sweden-data

Last synced: 17 Jan 2026

https://github.com/ayush1997/visualize_ML

Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and continuous datasets.

data-analysis machine-learning matplotlib python statisics visualization

Last synced: 14 Mar 2025

https://github.com/archd3sai/Customer-Survival-Analysis-and-Churn-Prediction

In this project, I have utilized survival analysis models to see how the likelihood of the customer churn changes over time and to calculate customer LTV. I have also implemented the Random Forest model to predict if a customer is going to churn and deployed a model using the flask web app.

customer-churn-prediction customer-survival-analysis data-analysis explainable-ai flask-application hazard partial-dependence-plot random-forest shap-values survival-analysis

Last synced: 08 Apr 2025

https://github.com/codekitchen/pipeline

the `pipeline` shell command

data-analysis data-mining shell-scripting

Last synced: 16 Oct 2025

https://github.com/Azure/DataScienceVM

Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)

ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver

Last synced: 20 Jul 2025

https://github.com/totalhack/zillion

Make sense of it all. Semantic data modeling and analytics with a sprinkle of AI. https://totalhack.github.io/zillion/

ai analytics data-analysis data-warehousing datasources openai python query-builder reporting semantic-data-model semantic-layer sql text-to-sql warehouse

Last synced: 07 Jan 2026

https://github.com/azure/datasciencevm

Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)

ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver

Last synced: 07 Apr 2025

https://github.com/koolreport/core

An Open Source PHP Reporting Framework that helps you to write perfect data reports or to construct awesome dashboards in PHP. Working great with all PHP versions from 5.6 to latest 8.0. Fully compatible with all kinds of MVC frameworks like Laravel, CodeIgniter, Symfony.

data-analysis data-pipelines data-pivot data-summarization data-visualization data-viz framework mysql-reporting-tools php php-reporting-tools php-reports report-generator reporting reporting-engine reporting-tool

Last synced: 22 Jan 2026

https://github.com/briatte/ida

Introduction to Data Analysis, using R (2013)

course data-analysis r

Last synced: 13 Jul 2025

https://github.com/unytics/airbyte_serverless

Airbyte made simple (no UI, no database, no cluster)

airbyte bigquery data data-analysis data-engineering data-warehouse elt etl pipeline

Last synced: 16 May 2025

https://github.com/sczesla/pyastronomy

A collection of astronomy-related routines in Python

astronomy data-analysis python

Last synced: 17 Mar 2026

https://github.com/phillipdupuis/dtale-desktop

Build a data visualization dashboard with simple snippets of python code

data-analysis data-science data-visualization fastapi pandas python react typescript visualization

Last synced: 13 Sep 2025

https://github.com/calculist/calculist

the open source thinking tool for problem solvers

data-analysis note-taking tree-structure

Last synced: 04 Apr 2026

https://github.com/Sanmeet007/logger

Logger is a Flutter-based Android app that enables you to view and export call logs in CSV or JSON format and perform lightweight on-device analysis.

andriod call-data-record-analysis call-logs csv-export csv-import data-analysis flutter json-export open-source

Last synced: 20 Feb 2026

https://github.com/pragunbhutani/dbt-llm-agent

LLM based AI Agent to automate Data Analysis for dbt projects with remote MCP server

agent agentic-ai ai ai-data-analysis data-analysis data-analyst dbt llm text-to-sql

Last synced: 19 Jan 2026

https://github.com/risenw/datasist

A Python library for easy data analysis, visualization, exploration and modeling

data-analysis data-science data-visualization feature-engineering machine-learning python-3

Last synced: 24 Oct 2025

https://github.com/toobigdata/papa

一个浏览器端数据爬虫,做每个人的数据助手

chrome data-analysis kickstarter spider

Last synced: 11 Apr 2025

https://github.com/tiannaparris/data-analysis-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

data-analysis data-science data-visualization excel matplotlib pandas portfolio powerbi python r scipy seaborn sql tableau

Last synced: 30 Oct 2025

https://github.com/cuducos/calculadora-do-cidadao

💵 Tool for Brazilian Reais monetary adjustment/correction

brasil brazil data-analysis hacktoberfest monetary python

Last synced: 14 Mar 2025

https://github.com/tidyomics/plyranges

A grammar of genomic data transformation

bioconductor data-analysis dplyr genomic-ranges genomics tidy-data

Last synced: 06 Feb 2026

https://github.com/javascriptdata/dnotebook

Dnotebook is a Jupyter-like library for javaScript environment. It allows you to create and share pages that contain live code, text and visualizations.

data-analysis interactive-visualizations javascript live-code notebook notebook-javascript

Last synced: 02 May 2025