An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/cengel/R-data-wrangling

Materials for my R data workshop. https://cengel.github.io/R-data-wrangling/

data-analysis data-workshop datascience material r rstats social-sciences teaching tidyverse workshop

Last synced: 06 May 2025

https://github.com/rubydamodar/the-ultimate-pandas-bootcamp

Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources

beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset

Last synced: 19 Apr 2025

https://github.com/pgomba/mdpi_explorer

A simple package to explore MDPI's articles by journal. A series of functions help to obtain lists of papers, extract data from them (turnaround times, special issues, and article types), and create summary graphs.

analysis data-analysis data-visualization mdpi metrics scientific-journals visualization web-scraping

Last synced: 13 May 2025

https://github.com/iBridges-for-iRODS/iBridges

A wrapper around the python-irodsclient to allow for easy interaction with iRODS servers.

data-analysis data-engineering data-science datascience irods-client

Last synced: 14 Jul 2025

https://github.com/osl-pocs/skdata

Python tools for data analysis

data data-analysis data-science open-data python

Last synced: 23 Feb 2026

https://github.com/jm199504/python-exercises

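Python exercise book (basic operations / databases / statistical data processing / image and text generation / data conversion / algorithm problems / small applications / program development)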

data-analysis database python

Last synced: 07 May 2025

https://github.com/UtrechtUniversity/iBridges

A wrapper around the python-irodsclient to allow for easy interaction with iRODS servers.

data-analysis data-engineering data-science datascience irods-client

Last synced: 29 Jun 2025

https://github.com/alessandrocorradini/harvard-data-analysis-for-life-science-xseries

Lectures, Code and Quizzes for the Data Science for Life Science XSeries

data-analysis datascience edx harvardx machine-learning

Last synced: 02 Jan 2026

https://github.com/alejandrodumas/kodiak

Enhance your feature engineering workflow with Kodiak

data-analysis pandas

Last synced: 19 Jul 2025

https://github.com/rob-med/everything-shapelets

This repo contains useful links to research papers and implementations of shapelets discovery/learning techniques from different sources.

data-analysis data-mining shapelets time-series-analysis timeseries

Last synced: 07 Mar 2026

https://github.com/lgrcia/nuance

Efficient detection of planets transiting quiet or active stars

correlated-noise data-analysis exoplanets search systematics transits variability

Last synced: 29 Dec 2025

https://github.com/squey/squey

Squey is a visualization software designed to interactively explore and understand large amounts of tabular data (this is the read-only mirror of https://gitlab.com/squey/squey)

cybersecurity data-analysis data-science data-visualization exploratory-data-visualizations parallel-coordinates parquet parquet-files parquet-viewer pcap timeseries timeseries-analysis visualization

Last synced: 08 Mar 2025

https://github.com/csinva/data-viz-utils

Functions for easily making publication-quality figures with matplotlib.

big-data data-analysis data-science data-visualization eda legend matplotlib python python3 scatterplot time-series

Last synced: 05 May 2025

https://github.com/ptyadana/sql-for-data-analysis-parch-and-posey

SQL for Data Analysis using PostgreSQL - analyzing the fictional company Parch & Posey

challenges data-analysis pgadmin pgadmin4 posey-company postgres postgresql postgresql-database schema sql udacity

Last synced: 11 Mar 2026

https://github.com/bcgov/shinyrems

An R package to launch shinyrems, an online application that allows a user to access, download, clean, and plot data from the B.C. government Environmental Monitoring System database and to calculate simple statistics on it.

data-analysis environment environmental-data water-quality

Last synced: 30 Jul 2025

https://github.com/yashksaini-coder/zomato-data-analysis

Zomato data analysis that explores insights and builds predictive models, with a dynamic, interactive dashboard built as a Streamlit web application. Also covers deploying and scaling with Cyclops-UI.

data-analysis docker kubernets streamlit streamlit-webapp

Last synced: 26 Jul 2025

https://github.com/ajayarunachalam/gui-pandas-ai

GUIPandasAI - Integrating generative AI capabilities into Pandas as a web interface, along with keyword-based data analysis services

ai chatgpt data data-analysis data-analytics data-science generative-ai gpt-3 gpt-4 llm pandas python streamlit web-app

Last synced: 06 Jul 2025

https://github.com/mkmcc/watch-calibration

Estimate the accuracy of a clock from a sound recording

calibration data-analysis fourier-analysis signal-processing watches

Last synced: 25 Jan 2026

https://github.com/llnl/topoms

Topological Analysis for Molecular Systems

data-analysis data-viz

Last synced: 29 Apr 2025

https://github.com/aifred-health/vulcanai

A high-level deep learning framework for quickly prototyping networks, with added tools for data visualisation, model interpretability, and performance metrics

data-analysis data-cleaning data-science data-visualization deep-learning deep-neural-networks feature-engineering mental-health python3 pytorch scikit-learn

Last synced: 01 Aug 2025

https://github.com/andrewreynen/lazylyst

Lazylyst is a GUI created for time series review, using a flexible framework for new workflows

data-analysis earthquakes gui python qt seismology

Last synced: 22 Apr 2025

https://github.com/samuroi/samuroi

SamuROI - Structured analysis of multiple user-defined ROIs, an open source Python-based analysis environment for imaging data.

calcium-imaging conda data-analysis data-visualization opencv python scikit-image

Last synced: 15 Apr 2025

https://github.com/mcwaage1/qs

Quantified Self: A Personal Data Aggregator and Dashboard for Self-Trackers and Quantified Self Enthusiasts

activity-tracking data-analysis data-visualization fitbit goodreads google-sheets lastfm mood personal-data quantified-self quantifiedself self-tracking writing

Last synced: 16 Apr 2025

https://github.com/chuongmep/aps-bot

Explore Data By CLI With Autodesk Platform Services

aps autodesk-forge autodesk-platform-services cli data-analysis data-science forge

Last synced: 12 Apr 2025

https://github.com/thesimonho/rok-data

Data for Rise of Kingdoms

d3js data-analysis dataset dataviz gaming statistics

Last synced: 12 Apr 2025

https://github.com/adamdempsey90/ndtamr

N-dimensional adaptive mesh refinement tree structure in Python

amr data-analysis python visual

Last synced: 28 Oct 2025

https://github.com/nicovandenhooff/top-repo-analysis

This repository contains my work that supports my article on Towards Data Science: "Exploring the Most Popular Machine Learning and Deep Learning GitHub Repositories."

altair automation data-analysis data-science data-visualization pygithub python

Last synced: 21 Aug 2025

https://github.com/chyikwei/bnp

Bayesian nonparametric models for Python

bayesian data-analysis probabilistic-graphical-models python topic-modeling

Last synced: 03 May 2025

https://github.com/sigbla/sigbla-app

Sigbla is a framework for working with data in tables, using the Kotlin programming language. It supports various data types, reactive programming and events, user input, charts, and more.

dashboard dashboard-application data-analysis data-science data-visualization kotlin kotlin-dsl kotlin-library sigbla spreadsheet table

Last synced: 17 Jan 2026

https://github.com/zekeriyyaa/pyspark-structured-streaming-ros-kafka-apachespark-cassandra

Structured streaming applied to robot data from a ROS-Gazebo simulation environment using Apache Spark. Data is collected in Kafka, analyzed by Apache Spark, and stored in Cassandra.

apache-cassandra apache-kafka apache-spark cqlsh data-analysis kafka-consumer kafka-producer pyspark python python3 ros ros-noetic spark-cassandra spark-cassandra-connector spark-kafka-connector spark-kafka-integration spark-sql spark-streaming structured-streaming

Last synced: 30 Jun 2025

https://github.com/alipsa/ride

A nice R development and analytics environment, for the Renjin JVM implementation of R

analytics data-analysis data-science data-visualization integrated-development-environment r sql

Last synced: 14 Oct 2025

https://github.com/msyriac/orphics

A library containing analysis and theory tools for cosmological data.

analysis cmb cosmology data-analysis theory

Last synced: 16 Mar 2025

https://github.com/nceas/metajam

Bringing data and metadata togetheR

data data-analysis metadata r repositories

Last synced: 14 Feb 2026

https://github.com/tushar2704/everyday_python

Welcome to Everyday Python Sheets – your go-to resource for everyday Python cheat sheets, pro tips, interview questions, Python one-liners, and Python data structures. Whether you're a beginner looking to learn Python or an experienced developer seeking quick reference materials, this Streamlit application has got you covered.

artificial-intelligence cheatsheet data data-analysis data-science data-structures data-visualization database protips python streamlit streamlit-tushar2704 tushar2704

Last synced: 09 May 2026

https://github.com/kennethleungty/fifa-football-world-rankings

Analyzing FIFA World Football Rankings with Python and R

data-analysis data-analytics data-science football python r soccer sports

Last synced: 12 Jul 2025

https://github.com/kongruksiamza/python-datascience

Teaching materials covering Python for Data Science and Machine Learning work

data-analysis data-science numpy pandas python

Last synced: 05 May 2025

https://github.com/junioralive/indian-medicine-dataset

A curated dataset of Indian medicines, organized by brand. Essential for healthcare research and pharmaceutical analysis.

brand-medicines data-analysis drug-information healthcare-analytics healthcare-data indian-medicine medical-research medicine-database pharmaceuticals pharmacy-data

Last synced: 18 Aug 2025

https://github.com/agnes-rs/agnes

A data wrangling library for Rust

data-analysis rust

Last synced: 30 Apr 2026

https://github.com/tanglespace/hotstepper

A Numpy based step function library for analysis and profit. More than just taking you up and down.

data-analysis kernel-methods linear-algebra numpy pandas step-functions time-series

Last synced: 19 Mar 2025

https://github.com/asnelt/mixedvinetoolbox

Matlab toolbox for canonical vine copula trees with mixed continuous and discrete marginals

c-vines copula copula-models copulae copulas data-analysis dependency-analysis dependency-modeling matlab modeling regular-vines statistics

Last synced: 17 Mar 2025

https://github.com/juliadatacubes/pyramidscheme.jl

Building and using pyramids for large raster data

data-analysis data-visualization geospatial julia visualization

Last synced: 21 Mar 2025

https://github.com/rickiepark/python4daml

Code repository for the book <코딩 뇌를 깨우는 파이썬> (Hanbit Media, 2023)

data-analysis machine-learning problem-solving python

Last synced: 17 Oct 2025

https://github.com/betta-cyber/netease_music_api

NetEase Cloud Music API for Python

crawler data-analysis netease-cloud-music

Last synced: 30 Jul 2025

https://github.com/ndleah/health-analysis

This case study is part of the Serious SQL course by Danny Ma

data-analysis data-exploration data-with-danny database serious-sql sql

Last synced: 02 Mar 2025

https://github.com/rameerez/bicimad-data-analysis

🚲 BiciMAD - Data analysis + ML usage predictions of Madrid's public bike system data

artificial-intelligence bicimad bicimad-data bike city data-analysis emt-opendata ipython-notebook machine-learning madrid neural-network python

Last synced: 17 Sep 2025

https://github.com/rugk/crops-parser

🌱🍎🍆 A shell script to parse the data on crops and fruits published by the Food and Agriculture Organization of the United Nations.

agriculture agriculture-research crop crops data-analysis data-science food fruit fruits statistics streetcomplete tree vegetables

Last synced: 11 Mar 2026

https://github.com/yashuv/python-for-data-science-ai-and-development

Python for Data Science, AI & Development - offered by IBM on Coursera

coursera-course data-analysis data-science ibm numpy pandas python

Last synced: 12 Apr 2025

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes deep-learning exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms pandas python scikit-learn visualization

Last synced: 06 Oct 2025

https://github.com/friendly/psy6136

Psychology 6136: Categorical Data Analysis

categorical-data data-analysis statistics visualization

Last synced: 23 Jan 2026

https://github.com/awadell1/datamaster

Tool for accessing and extracting data from MoTeC Log Files

data-analysis data-visualization fsae matlab motec

Last synced: 22 Apr 2025

https://github.com/easonlai/chat_with_csv_streamlit_with_chart

In this repository, you will find example code for creating an interactive chat experience that lets you ask questions about your CSV data, with chart visualization capabilities.

azure-openai azure-openai-api azure-openai-service chart-visualization data-analysis data-visualization gpt gpt-3 langchain langchain-python matplotlib openai python streamlit streamlit-webapp

Last synced: 26 Apr 2025

https://github.com/omarsar/data_mining_2017_fall_lab

Contains information and instructions for the first Data Mining lab session for 2017 Fall.

data data-analysis data-mining data-science data-visualization

Last synced: 08 Sep 2025

https://github.com/wildtreetech/explore-open-data

📈🔍 Exploring open data from Zürich

binder data-analysis notebook open-data opendata python tutorial

Last synced: 13 Mar 2026

https://github.com/dsnchz/solid-uplot

SolidJS wrapper for uPlot — an ultra-fast, tiny time-series & charting library with a SolidJS enhanced plugin system

analysis canvas canvas-chart canvas-chart-library charting-library charts data-analysis data-analytics data-visualization solidjs trading visualization

Last synced: 13 Oct 2025

https://github.com/charliezcr/Kpop-Data-Analysis

Data analysis of the K-pop industry, artists, and companies. Visualizes the business performance of public K-pop companies and analyzes artist management and international marketing strategies.

data-analysis data-visualization kpop pandas python

Last synced: 15 Apr 2025

https://github.com/eshikashah/ibm-data-science-ml-with-python-project

Capstone project for the IBM data science course - ML with Python.

algorithms data-analysis data-science ibm machine-learning python

Last synced: 07 May 2025

https://github.com/llnl/pydv

PyDV: Python Data Visualizer

1d data-analysis data-viz python visualization

Last synced: 21 Jan 2026

https://github.com/thechymera/repsep

Reproducible Self-Publishing - Demo Publications in the Most Common Formats

article data-analysis foss latex poster presentation publishing python reexecutable-publication reproducible-research

Last synced: 02 Apr 2026

https://github.com/mehmetkahya0/web-resource-downloader

This is a Python script that downloads all resources (images, scripts, stylesheets, etc.) from a given website.

algorithms beautifulsoup4 bs4 bs4-requests data-analysis data-science datascience python python3 requests scraper scraping

Last synced: 12 Oct 2025

https://github.com/amhsirak/ickle

DataFrame, analysis & manipulation library for tiny labeled datasets

data-analysis dataframe datascience ickle pandas python

Last synced: 22 Apr 2026

https://github.com/chalmerlowe/jupyter_tutorial

An introduction to Jupyter and JupyterLab for data analysis, data science, and Python development

data-analysis data-science jupyter jupyter-notebook jupyterlab notebook python tutorial

Last synced: 10 Apr 2025

https://github.com/open-risk/correlationmatrix

correlationMatrix is a Python-powered library for the statistical analysis and visualization of correlations

correlation-analysis correlation-matrices data-analysis data-science statistics

Last synced: 04 Jul 2025

https://github.com/Big-Bytes-Labs/fisher-rs

A fast, powerful, flexible, and easy-to-use open-source data analysis and manipulation tool written in Rust

data-analysis dataframe-library machine-learning rust-lang

Last synced: 03 Apr 2025

https://github.com/glentner/dataphile

Data analytics library for Python and a suite of open-source, command-line data ops tools.

data-analysis data-ops data-science python scientific-computing

Last synced: 07 May 2025

https://github.com/maastrichtlawtech/law3025-legal-analytics

📚 Materials for Legal Analytics (LAW3025) @ Maastricht University

course-materials data-analysis data-visualization legaltech python

Last synced: 23 Jan 2026

https://github.com/nihaljn/datahawk

Viewer for text datasets in formats like HuggingFace, JSONL, etc.

data-analysis data-browser nlp research

Last synced: 14 Jan 2026

https://github.com/toutiaoio/howtos

How-To Recipes, bite-sized practical tutorials, development tips

cookbook data-analysis howto-tutorial howtos pandas recipes tips-and-tricks tutorials

Last synced: 14 Apr 2025

https://github.com/GuntharDeNiro/gunbot-quant

An open-source toolkit for quantitative analysis of crypto & stock markets, featuring an advanced market screener, portfolio backtester, and companion tools for the Gunbot trading bot.

algorithmic-trading backtesting cryptocurrency data-analysis fastapi fintech gunbot market-screener pandas python quantitative-finance react stocks trading trading-bot trading-strategies

Last synced: 21 Aug 2025

https://github.com/ugentbiomath/wwdata

Python package to analyse, validate, fill and visualise data acquired in the context of (waste) water treatment

data-analysis jupyter-notebook

Last synced: 24 Oct 2025

https://github.com/hoangsonww/standard-deviation-calculator

📊 This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.

algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations

Last synced: 22 Sep 2025

https://github.com/rubenknex/qtplot

Data visualization application for data taken with qtlab or QCoDeS

data-analysis data-visualization physics plotting python science

Last synced: 07 May 2025