An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/rameerez/bicimad-data-analysis

🚲 BiciMAD - Data analysis + ML usage predictions of Madrid's public bike system data

artificial-intelligence bicimad bicimad-data bike city data-analysis emt-opendata ipython-notebook machine-learning madrid neural-network python

Last synced: 17 Sep 2025

https://github.com/yashuv/python-for-data-science-ai-and-development

Python for Data Science, AI & Development - offered by IBM on Coursera

coursera-course data-analysis data-science ibm numpy pandas python

Last synced: 12 Apr 2025

https://github.com/junioralive/indian-medicine-dataset

A curated dataset of Indian medicines, organized by brand. Essential for healthcare research and pharmaceutical analysis.

brand-medicines data-analysis drug-information healthcare-analytics healthcare-data indian-medicine medical-research medicine-database pharmaceuticals pharmacy-data

Last synced: 18 Aug 2025

https://github.com/tanglespace/hotstepper

A Numpy based step function library for analysis and profit. More than just taking you up and down.

data-analysis kernel-methods linear-algebra numpy pandas step-functions time-series

Last synced: 19 Mar 2025

https://github.com/rugk/crops-parser

🌱🍎🍆 A shell script to parse the data by the Food and Agriculture Organization of the United Nations on crops/fruits.

agriculture agriculture-research crop crops data-analysis data-science food fruit fruits statistics streetcomplete tree vegetables

Last synced: 18 Oct 2025

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes deep-learning exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms pandas python scikit-learn visualization

Last synced: 06 Oct 2025

https://github.com/asnelt/mixedvinetoolbox

Matlab toolbox for canonical vine copula trees with mixed continuous and discrete marginals

c-vines copula copula-models copulae copulas data-analysis dependency-analysis dependency-modeling matlab modeling regular-vines statistics

Last synced: 17 Mar 2025

https://github.com/ndleah/health-analysis

This case study is contained within the Serious SQL course by Danny Ma

data-analysis data-exploration data-with-danny database serious-sql sql

Last synced: 02 Mar 2025

https://github.com/dsnchz/solid-uplot

SolidJS wrapper for uPlot — an ultra-fast, tiny time-series & charting library with a SolidJS enhanced plugin system

analysis canvas canvas-chart canvas-chart-library charting-library charts data-analysis data-analytics data-visualization solidjs trading visualization

Last synced: 13 Oct 2025

https://github.com/friendly/psy6136

Psychology 6136: Categorical Data Analysis

categorical-data data-analysis statistics visualization

Last synced: 13 Apr 2025

https://github.com/chalmerlowe/jupyter_tutorial

An introduction to Jupyter and Jupyter Labs for data analysis, data science, and Python development

data-analysis data-science jupyter jupyter-notebook jupyterlab notebook python tutorial

Last synced: 10 Apr 2025

https://github.com/eshikashah/ibm-data-science-ml-with-python-project

Capstone project for IBM data science course - ML with python.

algorithms data-analysis data-science ibm machine-learning python

Last synced: 07 May 2025

https://github.com/charliezcr/Kpop-Data-Analysis

Data analysis about K-pop industry, artists, and companies. Visualized business performances of public K-pop companies and analyzed artist management and international marketing strategies

data-analysis data-visualization kpop pandas python

Last synced: 15 Apr 2025

https://github.com/amhsirak/ickle

DataFrame, analysis & manipulation library for tiny labeled datasets

data-analysis dataframe datascience ickle pandas python

Last synced: 07 May 2025

https://github.com/open-risk/correlationmatrix

correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations

correlation-analysis correlation-matrices data-analysis data-science statistics

Last synced: 04 Jul 2025

https://github.com/mehmetkahya0/web-resource-downloader

This is a Python script that downloads all resources (images, scripts, stylesheets, etc.) from a given website.

algorithms beautifulsoup4 bs4 bs4-requests data-analysis data-science datascience python python3 requests scraper scraping

Last synced: 12 Oct 2025

https://github.com/glentner/dataphile

Data analytics library for Python and suite of open source, command line based data ops tools.

data-analysis data-ops data-science python scientific-computing

Last synced: 07 May 2025

https://github.com/librariesio/metrics

:chart_with_upwards_trend: What to measure, how to measure it.

data-analysis measure metrics open-source

Last synced: 24 Feb 2025

https://github.com/Big-Bytes-Labs/fisher-rs

A fast, powerful, flexible and easy to use open source data analysis and manipulation tool written in Rust

data-analysis dataframe-library machine-learning rust-lang

Last synced: 03 Apr 2025

https://github.com/easonlai/chat_with_csv_streamlit_with_chart

In this repository, you will find an example code for creating an interactive chat experience that allows you to ask questions about your CSV data with chart visualization capabilities.

azure-openai azure-openai-api azure-openai-service chart-visualization data-analysis data-visualization gpt gpt-3 langchain langchain-python matplotlib openai python streamlit streamlit-webapp

Last synced: 26 Apr 2025

https://github.com/thechymera/repsep

Reproducible Self-Publishing - Demo Publications in the Most Common Formats

article data-analysis foss latex poster presentation publishing python reexecutable-publication reproducible-research

Last synced: 19 Apr 2025

https://github.com/awadell1/datamaster

Tool for accessing and extracting data from MoTeC Log Files

data-analysis data-visualization fsae matlab motec

Last synced: 22 Apr 2025

https://github.com/omarsar/data_mining_2017_fall_lab

Contains information and instructions for the first Data Mining lab session for 2017 Fall.

data data-analysis data-mining data-science data-visualization

Last synced: 08 Sep 2025

https://github.com/wildtreetech/explore-open-data

📈🔍 Exploring open data from Zürich

binder data-analysis notebook open-data opendata python tutorial

Last synced: 12 Aug 2025

https://github.com/hoangsonww/standard-deviation-calculator

📊 This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.

algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations

Last synced: 22 Sep 2025

https://github.com/mathewroy/ynabr

Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets R programming language.

api data-analysis data-science data-visualization r ynab ynab-api

Last synced: 30 Jul 2025

https://github.com/ugentbiomath/wwdata

Python package to analyse, validate, fill and visualise data acquired in the context of (waste) water treatment

data-analysis jupyter-notebook

Last synced: 24 Oct 2025

https://github.com/rubenknex/qtplot

Data visualization application for data taken with qtlab or QCoDeS

data-analysis data-visualization physics plotting python science

Last synced: 07 May 2025

https://github.com/chaganti-reddy/evmarket-india

Electric Vehicle Market Segmentation Analysis in India

data-analysis data-science machine-learning market-segmentation pandas python

Last synced: 12 Apr 2025

https://github.com/nconrad/hotmap

WebGL Heatmap Viewer for Big Data and Bioinformatics

big-data bioinformatics charts data-analysis data-visualization dataviz heatmap pixijs webgl

Last synced: 20 Jan 2026

https://github.com/toutiaoio/howtos

How-To Recipes, 碎片化实用教程, 开发技巧

cookbook data-analysis howto-tutorial howtos pandas recipes tips-and-tricks tutorials

Last synced: 14 Apr 2025

https://github.com/30mb1/pandas-boost

This repo contains code with comparison of Pandas speedup libs, such as modin, dask, swifter, pandarallel and numba

data-analysis machine-learning multiprocessing pandas pandas-tutorial python

Last synced: 30 Apr 2025

https://github.com/powermobileweb/tokenplex

A middleware framework and persistence layer to aggregate and normalize crypto-currency data.

blockchain cql cryptocurrency cryptocurrency-exchanges data-analysis express fintech ico javascript nodejs redis scale ticker-data token

Last synced: 28 Jun 2025

https://github.com/openbridge/chatlytics

Chatlytics is a data query and visualization platform for chat!

charts chatbot data-analysis data-visualization slack slack-bot

Last synced: 10 Apr 2025

https://github.com/nihaljn/datahawk

Viewer for text datasets in formats like HuggingFace, JSONL, etc.

data-analysis data-browser nlp research

Last synced: 14 Jan 2026

https://github.com/GuntharDeNiro/gunbot-quant

An open-source toolkit for quantitative analysis of crypto & stock markets, featuring an advanced market screener, portfolio backtester, and companion tools for the Gunbot trading bot.

algorithmic-trading backtesting cryptocurrency data-analysis fastapi fintech gunbot market-screener pandas python quantitative-finance react stocks trading trading-bot trading-strategies

Last synced: 21 Aug 2025

https://github.com/llnl/pydv

PyDV: Python Data Visualizer

1d data-analysis data-viz python visualization

Last synced: 11 Dec 2025

https://github.com/xuri/excelize-py

Excelize is a Python port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.

calculation chart data-analysis data-science data-visualization ecma-376 excel excelize golang microsoft office ooxml pipy python spreadsheet visualization xlsm xlsx xlsxreader xlsxwriter

Last synced: 07 May 2025

https://github.com/engali94/twitter-account-analyzer

Using various Python libraries such as Pandas, tweetPy, JSON ans matplotLib to take a sneak peek on your Twitter account using Google Colab.

data-analysis data-visualization python3 twitter-api twitter-sentiment-analysis twitter-streaming-api

Last synced: 29 Apr 2025

https://github.com/chandraprakash-bathula/apparel-recommendations

This project implements a personalized apparel recommendation engine using content-based search with the Amazon API, NLTK, and Keras libraries.

boxplot cnn-keras data-analysis data-science deep-learning linear-regression machine-learning numpy pandas scatter-plot scikit-learn svm tensorflow xgboost

Last synced: 23 Mar 2025

https://github.com/brad-cannell/freqtables

Quickly make tables of descriptive statistics (i.e., counts, percentages, confidence intervals) for categorical variables. This package is designed to work in a tidyverse pipeline, and consideration has been given to get results from R to Microsoft Word ® with minimal pain.

categorical-data data-analysis descriptive-statistics epidemiology r

Last synced: 04 Jul 2025

https://github.com/maugavilla/well_hello_stats

Tutorials to learn R from scratch

data-analysis r statistics tutorial

Last synced: 24 Jul 2025

https://github.com/ct83/surway

SurWay is a survey/polling website for cab drivers where they can report their typical work hours and which company they work for, this data is then stored anonymously and used to generate charts and insights.

charts data-analysis data-visualization data-visualization-project docker docker-compose express node nodeexpress nodejs react react-chartjs-2 react-charts react-project react-projects reactjs reactjs-demo

Last synced: 16 Sep 2025

https://github.com/cosmoduende/r-google-search-history-analysis

Explore your activity on Google with R: How to Analyze and Visualize Your Personal Data Search History. Find out how and how much you have used the most popular search engine in the world, using a copy of your personal data.

data-analysis data-visualisation data-visualization data-viz google google-history google-takeout personal-data-analysis r-language r-programming r-script rvest search-data search-history takeout takeout-data

Last synced: 11 Apr 2025

https://github.com/yangfa-zhang/lunax

Lunax is a machine learning framework specifically designed for the processing and analysis of tabular data.

data-analysis data-science lunax machine-learning tabular-data

Last synced: 14 Dec 2025

https://github.com/thecoderpinar/spotify_trends_2023_analysis

Exploring Spotify's latest trends, top songs, genres, and artists using Python, Pandas, NumPy, Matplotlib, CNNs for image-based analysis, and advanced algorithms for music recommendation. Dive into the world of music data and discover what's trending on Spotify! 🎵📊

cnn cnn-keras data-analysis data-science data-visualization machine-learning matplotlib music-trend numpy pandas python spotify

Last synced: 30 Apr 2025

https://github.com/zekeriyyaa/traffic-data-analysis-with-apache-spark-based-on-mobile-robot-data

Mobile robot data were analyzed with Apache-Spark to extract five different statistical result such as travel time, waiting time, average speed, occupancy and density were produced.

agv apache-spark big-data data-analysis data-visualization industrial-robot mobile-robot mongodb mssql pyqt5 pyspark python spark

Last synced: 21 Apr 2025

https://github.com/qurator-spk/mods4pandas

Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis

alto alto-xml data-analysis digital-humanities library mets mods pandas qurator

Last synced: 16 Jan 2026

https://github.com/iam-mhaseeb/Satellite-Imagery-Analysis-of-Vegetation-in-Southern-Pakistan

This repository contains a study how we can examine the vegetation cover of a region with the help of satellite data. The notebook in this repository aims to familiarise with the concept of satellite imagery data and how it can be analyzed to investigate real-world environmental and humanitarian challenges.

data-analysis data-visualization jupyter-notebook notebook pakistan pakistan-analysis python3 satellite-data satellite-imagery satellite-images vegetation vegetation-analysis

Last synced: 08 May 2025

https://github.com/ptyadana/tableau_2020_a-z_hands-on

Tableau Projects for data analysis, data analytics and data visualaization on different data sets

data-analysis data-science data-visualization tableau tableau-dashboards tableau-desktop tableau-public tableau-workbooks

Last synced: 03 Aug 2025

https://github.com/olow304/sqdata

📊 Simple SQL Client for lightweight data analysis using Reactjs framework. Demo

chartsjs data-analysis highcharts javascript nodejs react reactjs sql sqljs visualization

Last synced: 07 May 2025

https://github.com/fabriziomusacchio/python_neuro_practical

This is the course material for the advanced course into Python for Data Scientists.

data-analysis data-science jupyter jupyter-notebook jupyter-notebooks open-source python teaching teaching-materials

Last synced: 22 Jul 2025

https://github.com/jethronap/jstat

A Java Library for Statistics, Data Analysis and Visualization.

classification data-analysis gradient-descent java linear-regression statistics

Last synced: 21 Jun 2025

https://github.com/codeperfectplus/dataanalysiswithjupyter

A Perfect Repository For Data Anaysis with Jupyter Notebook.:chart_with_upwards_trend::chart_with_downwards_trend:

codeperfe data-analysis database hacktoberfest jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python-3 seaborn-plots

Last synced: 13 May 2025

https://github.com/venkat-0706/social-media-analysis

Clean & analyze social media data with Python. Explore trends, sentiments, & user behavior. Includes data cleaning, visualization, & insights.

data-analysis data-visualization hashtag-trends natural-language-processing sentiment-analysis social-media-analysis text-mining user-engagement-metrics

Last synced: 13 Apr 2025

https://github.com/kennethleungty/english-premier-league-var-analysis

Analyzing Video Assistant Referee (VAR) decisions in the English Premier League (2019 - 2021)

data-analysis data-analytics data-science english-premier-league football soccer var

Last synced: 27 Aug 2025

https://github.com/luminousmen/python_for_ds

Python for Data Analysis workshop

data-analysis data-science python tutorial

Last synced: 01 May 2025

https://github.com/ammarlodhi255/student_performance_indicator_end-to-end_implementation

An end-to-end machine learning project, student performance indicator. The goal of this project is to understand the influence of the parents background, test preparation, and various other variables on the students performance.

aws cd-pipeline data-analysis data-science data-science-projects eda end-to-end-machine-learning machine-learning machine-learning-projects regression regression-analysis

Last synced: 27 Sep 2025

https://github.com/aifred-health/vulcan-old

Deprecated: A high level deep learning framework for quickly prototyping networks with added tools in data visualisation, model interpretability and performance metrics

data-analysis deep-learning deep-neural-networks lasagne machine-learning mental-health theano

Last synced: 01 Aug 2025

https://github.com/viper373/jd-comments

爬取京东商品评论数据

crawler-python data-analysis python spider

Last synced: 15 Apr 2025

https://github.com/aiguofer/sql_connectors

A simple wrapper for SQL connections using SQLAlchemy and Pandas read_sql to standardize SQL workflow with multiple data sources.

data-analysis data-analytics data-exploration data-science pandas relational-databases sql sqlalchemy standardized-api

Last synced: 13 Oct 2025

https://github.com/tusharnankani/data-analysis-with-python

A complete introduction to data analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data data-analysis data-visualization hacktoberfest jovian jovian-ml jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 07 May 2025

https://github.com/abeltavares/marketpipe

🛠 Containerized and configurable Airflow ETL pipeline for collecting and storing stock and cryptocurrency market data.

airflow aws ci-cd cryptocurrency data-analysis data-collection data-storage docker iac oop pgadmin pipeline postgresql python sql stocks unit-testing

Last synced: 22 Apr 2025

https://github.com/cschreib/vif

Easy, robust, and fast numerics in C++.

astronomy astrophysics c-plus-plus data-analysis library

Last synced: 06 May 2025

https://github.com/jongan69/soltrendio

A solana wallet address analyzer with ai and apis for building useful tools on top of

ai blockchain data-analysis solana trading-systems

Last synced: 07 Sep 2025

https://github.com/jmgirard/circumplex

R Package for the Analysis and Visualization of Circumplex Data

circular circumplex data-analysis ggplot2 interpersonal psychology r r-package rcpparmadillo rstats tidyverse

Last synced: 15 Aug 2025

https://github.com/joshuaulrich/stl-rug

Content presented at the Saint Louis R User Group

data-analysis data-science r

Last synced: 26 Aug 2025

https://github.com/iamgmujtaba/scholar_search

This project provides a tool for extracting and analyzing the quantity and distribution of scholarly articles related to a particular topic or field over a desired time span, using Google Scholar search results and built-in data visualization functionality.

academia academic academic-papers data-analysis data-visualization google google-scholar scholarly-articles

Last synced: 30 Apr 2025