An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/wtbates99/stock-indicators

A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.

backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series

Last synced: 15 Oct 2025

https://github.com/ktmud/github-life

A data explorer for GitHub projects' life cycles

data-analysis github scraper time-series

Last synced: 16 May 2026

https://github.com/nhsdigital/sde_example_analysis

Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.

data-analysis data-science databricks-notebooks machine-learning mlflow

Last synced: 25 Oct 2025

https://github.com/bishopce16/stroke_prediction_analysis

The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.

data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau

Last synced: 26 Apr 2025

https://github.com/startreedata/metabase-pinot-driver

Apache Pinot driver for the Metabase business intelligence front-end

apache data-analysis data-visualization metabase pinot

Last synced: 10 Mar 2026

https://github.com/thecoderpinar/earthquake-explorer

🌍🔍 Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.

data-analysis data-science data-visualization earthquake-analysis exploratory-data-analysis geospatial-analysis interactive-maps jupyter-notebook machine-learning python seismic-data

Last synced: 23 Aug 2025

https://github.com/prathamesh693/04_sales-and-demand-forecasting-for-retail-chains

This project aims to forecast weekly sales for retail stores using historical sales and economic data. By applying advanced time series forecasting models, we enable better inventory management, demand planning, and revenue optimization for retail chains. The project includes both traditional statistical models and deep learning techniques.

arima-forecasting data-analysis data-visualization exploratory-data-analysis jupyter-notebook prophet-facebook python spyder-python-ide streamlit-webapp

Last synced: 01 Jul 2025

https://github.com/EmirhanServeren/NFT-CollaBot

NFT CollaBot is a data-oriented project designed by the requirements of NFT ecosystem and aims to strengthen community.

data data-analysis data-analytics nft streamlit streamlit-webapp tezos tezos-api tezos-blockchain tezoswallet

Last synced: 17 Apr 2025

https://github.com/roaldarbol/anibehavr

🪲 An R package for Analysis of Animal Behaviour

animal-behavior behavioural-states data-analysis r

Last synced: 11 Oct 2025

https://github.com/garrettj403/albertaenergysources

Get grid data from Alberta Electric System Operator (AESO)

alberta canada data-analysis energy-data

Last synced: 10 Oct 2025

https://github.com/agungbudiwirawan/socioeconomic_analysis

The objective of this project is to analyze the socio-economic in Chicago.

chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server

Last synced: 04 Jul 2025

https://github.com/splch/qbs

An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.

classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling

Last synced: 01 Apr 2025

https://github.com/pmsipilot/intercom2dw

Intercom2DW is an attempt at loading all the data available in an Intercom application.

data-analysis docker intercom

Last synced: 11 Apr 2026

https://github.com/nghorbani/neuraldataanalysis

Download Data from https://bit.ly/3g8RUmi

data-analysis matlab spike-detection spike-rate

Last synced: 22 Apr 2025

https://github.com/nmicht/food-language-variety

A map to display distribution of food names using their synonyms in different regions.

data-analysis language-distribution language-variety map

Last synced: 31 Jan 2026

https://github.com/implicitlayer/alphaanalysis

Full library for analyzing financial data using ML and classical approaches

data-analysis data-science deep-learning finance machine-learning risk-analysis time-series-analysis trading transformers

Last synced: 12 Mar 2026

https://github.com/mszell/fyp2021

INSTRUCTOR's VERSION: Lectures of First Year Project 2021, Project 1

data-analysis data-science education python teaching-materials

Last synced: 02 May 2025

https://github.com/liweitianux/chandra-acis-analysis

Chandra ACIS analysis tools and documents

acis chandra data-analysis

Last synced: 19 Apr 2025

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling geographical-data r sentiment-analysis statistics text-analysis web-scraping

Last synced: 25 Apr 2025

https://github.com/tushar2704/instagram-user-analytics

This project revolves around the exploration and analysis of user engagement patterns on the popular social media platform, Instagram. By delving into user data and interaction metrics, this project aims to provide valuable insights into user behavior, content performance, and trends.

artificial-intelligence data-analysis data-science instagram project tushar2704

Last synced: 23 Jan 2026

https://github.com/bts-cm/airdrop_tool

Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.

airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc

Last synced: 17 Jan 2026

https://github.com/sebastian-gregoricchio/snakeatac

Snakemake pipeline for analysis and normalization of ATAC-seq data starting from fastq.gz files.

atac-seq atac-seq-pipeline chromatin-accessibiity data-analysis epigenomics pipeline snakemake

Last synced: 31 May 2026

https://github.com/alyssonmach/9-data-science-apps-with-python

[freeCodeCamp Course]: build interactive and data-driven web apps in Python using the Streamlit library.

data-analysis data-science data-visualization machine-learning streamlit-webapp

Last synced: 12 Feb 2026

https://github.com/erictleung/microbiome-analysis-resources

:microscope: Resources and notes on studying the human microbiome

bioinformatics data-analysis gut-microbiome microbiology microbiome notes resources

Last synced: 18 Jan 2026

https://github.com/linuxto5re/gateiodatafilteration

Gate.io API analysis: spot, margin, futures; filters high volume, identifies common cryptos.

arbitrage-bot centralized-exchanges cryptocurrency data-analysis gate gateio-api python rest trading-strategies

Last synced: 23 Oct 2025

https://github.com/andreip/twitter-authorities

Find authorities for Twitter topics. [Licenta][Undergraduate thesis]

data-analysis mongodb python twitter-topics

Last synced: 19 Apr 2026

https://github.com/mykhode/python-sic-mini-project

SAMSUNG SIC Finish Project Course - Python

data-analysis python-analysis samsung-sic

Last synced: 10 Oct 2025

https://github.com/nasdin/drone-strike-visualization

Connecting to Drone Strike API and performing an analysis on frequency of drone strikes, trend and patterns

analysis api basemap data-analysis data-science data-visualization drone drone-strikes eda geopy jupyter-notebook visualization

Last synced: 16 Oct 2025

https://github.com/julianolaurentino/sql_sample

Estudos voltados para criação de consultas em SQL. Este repositório está sendo alimentando semanalmente.

data-analysis sql sql-query sql-server

Last synced: 05 Mar 2025

https://github.com/rmnldwg/liver-smart

Data and analysis pipeline for a study on the potential advantages of daily adaptive liver SBRT performed at the University Hospital Zurich.

data-analysis fractionation jupyter-notebook liver-cancer metastasis radiation-oncology stereotactic

Last synced: 27 May 2026

https://github.com/mainakverse/ml-algorithms-starter

List of machine learning algorithms that are needed to start with ML projects and lay a foundation into data science

data-analysis data-science jupyter-notebooks machine-learning-algorithms practice

Last synced: 19 Apr 2025

https://github.com/fabienarcellier/qjoin

qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality

composable data-analysis developer-tools functools python

Last synced: 01 Mar 2026

https://github.com/yahia3200/become-an-independent-data-scientist

My final project for the Applied Plotting, Charting & Data Representation in Python Course

data-analysis data-science data-visualization matplotlib

Last synced: 16 Mar 2025

https://github.com/arzan101/ola-data-analytics

Ola - Identified the reason and trends for ride cancellation. Process - Cleaned and Processed Data from multiple sources, applied Sql queries and visualized data using PoweBi . Motive - To reduce the cancellation rate

dashboard data-analysis data-mining data-visualization dataanalytics excel powerbi sql

Last synced: 06 Jan 2026

https://github.com/kylekirkby/cardatasnatch

CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.

beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering

Last synced: 15 Apr 2025

https://github.com/llnl/hdtopology

High-dimensional topological data analysis library for NDDAV

analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization

Last synced: 29 Apr 2025

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026

https://github.com/ronylpatil/whatsapplib

WhatsApp Group Chat Analysis Python Package.

data-analysis open-source pypi-package python-library python-package

Last synced: 02 Jan 2026

https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment

Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.

data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis

Last synced: 09 May 2026

https://github.com/jimbrig/EDA

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 30 Jul 2025

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/naso7y/students-performance-analysis

A project analyzing students' academic performance to identify trends and factors affecting outcomes. Built with Python, using data visualization and statistical techniques to derive actionable insights.

data-analysis data-visualization machine-learning python

Last synced: 23 Feb 2026

https://github.com/ivanildobarauna-dev/data-pipeline-sync-ingest

ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.

business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python

Last synced: 28 Oct 2025

https://github.com/jpquast/icp-ms-data-explorer

A shiny app for the exploration of ICP-MS data.

data-analysis icp-ms r shiny shiny-apps

Last synced: 17 Jan 2026

https://github.com/fernandezfran/exma

A Python library with C extensions to analyze and manipulate molecular dynamics trajectories and electrochemical data

computational-physics data-analysis molecular-dynamics oop python science

Last synced: 16 Jan 2026

https://github.com/emso-c/stream-analyser

A tool that analyses YouTube live streams.

cli data-analysis guessing highlights python youtube-video

Last synced: 18 Jan 2026

https://github.com/jimbrig/eda

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 03 Jan 2026

https://github.com/markmelnic/scalg

List scoring algorithm. Analyse data using a range based procentual proximity algorithm.

algorithm data-analysis pypi pypi-package score scorer scoring scoring-algorithm

Last synced: 08 Oct 2025

https://github.com/markmelnic/carsen-desktop

A python dashboard app for scraping and tracking cars for sale on websites such as mobile.de

automation dashboard data-analysis interface scraping scraping-websites tkinter-python

Last synced: 08 Oct 2025

https://github.com/devexpress-examples/web-forms-pivot-grid-bind-to-sql-data-source

This example demonstrates how to create an ASPxPivotGrid and bind it to data via code.

asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms

Last synced: 06 Jul 2025

https://github.com/fenghaojiang/ethereum-etl

ETL(Extract, Transform, Load) data from Ethereum like EVM Block chain

data-analysis ethereum etl

Last synced: 14 Jan 2026

https://github.com/gustavohnsv/teamwork_mqa

Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.

data-analysis group-project r team-repo

Last synced: 15 Aug 2025

https://github.com/visionkernel/centerspoke

Centerspoke is a data management and analysis tool that allows easy access to cloud databases. Say goodbye to using excel for data management. This open-source CLI tool allows for the rapid processing and analysis of all your data, and makes it easy to upload your excel files into your cloud databases.

cli cloud-database data-aggregation data-analysis data-analysis-python data-management data-science python python3

Last synced: 24 May 2026

https://github.com/quantumudit/analyzing-goodreads-famous-quotes

This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 20 May 2026

https://github.com/ernestaroozoo/memestocks.net

MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.

dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit

Last synced: 06 Mar 2026

https://github.com/elhaban3ro/thewildtool

TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖

ai audio audio-processing data-analysis data-science dataset deeplearning python

Last synced: 03 Sep 2025

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 15 May 2025

https://github.com/lacerbi/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 11 Oct 2025

https://github.com/louislefevre/sstubs-miner

Data mining and analysis for the ManySStuBs4J dataset.

data-analysis data-mining manysstubs4j-dataset msr

Last synced: 30 Mar 2025

https://github.com/mathieu2301/pbsc-tracker

Expérience de tracking des vélos en libre service fonctionnants avec PBSC

ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker

Last synced: 23 Jun 2026

https://github.com/jimut123/ultimate_date_finder

To find the best place for dating in your country

data-analysis data-science date-cluster dating geo location maps software

Last synced: 16 Jan 2026

https://github.com/kenvilar/data-analysis-using-python

Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 04 Oct 2025

https://github.com/is-leeroy-jenkins/badger

A data science & analysis toolkit for federal analysts with the environmental protection agency based on WPF, Net 8, and is written in C#.

ai budget budget-management data-analysis data-science data-visualization federal-government large-language-models machine-learning

Last synced: 14 Oct 2025

https://github.com/mindful-ai-assistants/credit-card-prediction

💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.

artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning oneness-consciousness predictive-modeling python3 scikit-learn

Last synced: 12 Apr 2025

https://github.com/briatte/asr

Applied Stats with R and RStudio (first-year social-science tutorials)

course data-analysis data-science data-visualization r statistics

Last synced: 14 Apr 2026

https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart

The objective of this project is to analyze supermarket sales using pivot table and chart in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore data.

dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet

Last synced: 08 Jan 2026

https://github.com/tillbiskup/cwepr

A Python package based on the ASpecD framework for handling cwEPR data.

continuous-wave data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science

Last synced: 06 Sep 2025

https://github.com/codebypinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 12 Oct 2025

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 17 May 2026

https://github.com/walidalsafadi/titanic-disaster

In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).

data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction

Last synced: 16 Mar 2025

https://github.com/winter000boy/dsa-practice

This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.

data-analysis data-science leetcode leetcode-python pandas-python python3

Last synced: 06 Feb 2026

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 10 May 2026

https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result

Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result

blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization

Last synced: 08 Jan 2026