An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/implicitlayer/alphaanalysis

Full library for analyzing financial data using ML and classical approaches

data-analysis data-science deep-learning finance machine-learning risk-analysis time-series-analysis trading transformers

Last synced: 12 Mar 2026

https://github.com/startreedata/metabase-pinot-driver

Apache Pinot driver for the Metabase business intelligence front-end

apache data-analysis data-visualization metabase pinot

Last synced: 10 Mar 2026

https://github.com/tirendazacademy/data-sets

Data sets for Tirendaz Akademi Youtube

data-analysis dataset

Last synced: 12 Sep 2025

https://github.com/linuxto5re/gateiodatafilteration

Gate.io API analysis: spot, margin, futures; filters high volume, identifies common cryptos.

arbitrage-bot centralized-exchanges cryptocurrency data-analysis gate gateio-api python rest trading-strategies

Last synced: 23 Oct 2025

https://github.com/wtbates99/stock-indicators

A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.

backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series

Last synced: 15 Oct 2025

https://github.com/nasdin/drone-strike-visualization

Connecting to Drone Strike API and performing an analysis on frequency of drone strikes, trend and patterns

analysis api basemap data-analysis data-science data-visualization drone drone-strikes eda geopy jupyter-notebook visualization

Last synced: 16 Oct 2025

https://github.com/rcv911/emd

Empirical Mode Decomposition

data-analysis emd empirical-mode-decomposition

Last synced: 18 Jan 2026

https://github.com/3bdalrhmans3d/dataqualityproject

An interactive web application for data quality analysis, machine learning, and conversational AI, built with Streamlit.

data-analysis data-visualization ml numpy ollama pandas python seaborn streamlit

Last synced: 23 Feb 2026

https://github.com/erictleung/microbiome-analysis-resources

:microscope: Resources and notes on studying the human microbiome

bioinformatics data-analysis gut-microbiome microbiology microbiome notes resources

Last synced: 18 Jan 2026

https://github.com/mszell/fyp2021

INSTRUCTOR's VERSION: Lectures of First Year Project 2021, Project 1

data-analysis data-science education python teaching-materials

Last synced: 02 May 2025

https://github.com/ihabbendidi/diamond-analysis

Exploratory statistical analysis of a Diamond dataset

data-analysis data-visualization exploratory-data-analysis machine-learning r

Last synced: 17 Oct 2025

https://github.com/ahammadmejbah/ibm-project-03-analyzing-spreadsheet-data-with-python

Spreadsheets are computer programs that allow users to enter, view, and change information in a gridlike format. One of the most useful programs on a PC is a spreadsheet, which allows users to organize data in tabular form. A spreadsheet's primary purpose is to store numerical data and sometimes a few words of text.

data-analysis data-analysis-python data-analyst data-science excel python spreadsheet

Last synced: 27 Apr 2025

https://github.com/0mppula/element-compare

A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.

compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand

Last synced: 19 Jan 2026

https://github.com/bishopce16/stroke_prediction_analysis

The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.

data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau

Last synced: 26 Apr 2025

https://github.com/ktmud/github-life

A data explorer for GitHub projects' life cycles

data-analysis github scraper time-series

Last synced: 16 May 2026

https://github.com/EmirhanServeren/NFT-CollaBot

NFT CollaBot is a data-oriented project designed by the requirements of NFT ecosystem and aims to strengthen community.

data data-analysis data-analytics nft streamlit streamlit-webapp tezos tezos-api tezos-blockchain tezoswallet

Last synced: 17 Apr 2025

https://github.com/tsffarias/ibge-nomes-brasil

Análise de dados dos nomes no Brasil utilizando a base de dados disponibilizada pelo IBGE (2010)

data-analysis ibge python

Last synced: 05 Jul 2025

https://github.com/petulla/readroper

Read single and multi-card ASCII polling datasets in R

ascii data-analysis polling-data r

Last synced: 31 Mar 2025

https://github.com/arnavk-09/world-population

🌏 Taipy demo to explore world population data...

data-analysis python taipy world-population

Last synced: 31 Mar 2025

https://github.com/prathamesh693/04_sales-and-demand-forecasting-for-retail-chains

This project aims to forecast weekly sales for retail stores using historical sales and economic data. By applying advanced time series forecasting models, we enable better inventory management, demand planning, and revenue optimization for retail chains. The project includes both traditional statistical models and deep learning techniques.

arima-forecasting data-analysis data-visualization exploratory-data-analysis jupyter-notebook prophet-facebook python spyder-python-ide streamlit-webapp

Last synced: 01 Jul 2025

https://github.com/agungbudiwirawan/e-commerce_analysis_using_sql

The objective of this project is to provide an analysis of Olist Store (Brazilian E-commerce) sales.

data-analysis data-science e-commerce e-commerce-project microsoft-sql-server sql sql-server

Last synced: 24 Mar 2025

https://github.com/splch/qbs

An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.

classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling

Last synced: 01 Apr 2025

https://github.com/pmsipilot/intercom2dw

Intercom2DW is an attempt at loading all the data available in an Intercom application.

data-analysis docker intercom

Last synced: 11 Apr 2026

https://github.com/thecoderpinar/earthquake-explorer

🌍🔍 Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.

data-analysis data-science data-visualization earthquake-analysis exploratory-data-analysis geospatial-analysis interactive-maps jupyter-notebook machine-learning python seismic-data

Last synced: 23 Aug 2025

https://github.com/singhkunall/-india-census-2011-population-demographics-dashboard

Interactive Excel dashboard visualizing India's 2011 Census Population Demographics using charts, pivot tables, and slicers.

data-analysis data-visualization excel-dashboard india-census population-data

Last synced: 04 Feb 2026

https://github.com/nghorbani/neuraldataanalysis

Download Data from https://bit.ly/3g8RUmi

data-analysis matlab spike-detection spike-rate

Last synced: 22 Apr 2025

https://github.com/agungbudiwirawan/socioeconomic_analysis

The objective of this project is to analyze the socio-economic in Chicago.

chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server

Last synced: 04 Jul 2025

https://github.com/andreip/twitter-authorities

Find authorities for Twitter topics. [Licenta][Undergraduate thesis]

data-analysis mongodb python twitter-topics

Last synced: 19 Apr 2026

https://github.com/liweitianux/chandra-acis-analysis

Chandra ACIS analysis tools and documents

acis chandra data-analysis

Last synced: 19 Apr 2025

https://github.com/julianolaurentino/sql_sample

Estudos voltados para criação de consultas em SQL. Este repositório está sendo alimentando semanalmente.

data-analysis sql sql-query sql-server

Last synced: 05 Mar 2025

https://github.com/aravind-selvam/covid_dashboard

With Covid death and vaccine data. I have created a dashboard.

covid-19 data-analysis data-science data-visualization tableau tableau-public visualization

Last synced: 08 Mar 2026

https://github.com/luminati-io/Amazon-dataset-samples

A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.

amazon api data-analysis data-science dataset ecommerce products web-scraping

Last synced: 09 Apr 2025

https://github.com/elhaban3ro/thewildtool

TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖

ai audio audio-processing data-analysis data-science dataset deeplearning python

Last synced: 03 Sep 2025

https://github.com/walidalsafadi/titanic-disaster

In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).

data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction

Last synced: 16 Mar 2025

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 10 May 2026

https://github.com/code-jl/nfl-point-kicker-data-scraper

A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.

automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping

Last synced: 06 Sep 2025

https://github.com/goggle/dataisbeautiful

Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.

data-analysis data-visualisation jupyter-notebook notebook reddit

Last synced: 16 May 2025

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 13 Jun 2025

https://github.com/cosmoduende/r-arduino

Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two

arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport

Last synced: 24 Apr 2026

https://github.com/camille-maslin/simulfcimage

🔍 SimulFCImage: A professional multispectral image processing application developed for ImViA Laboratory.

academic-project computer-vision data-analysis gui-application image-processing image-viewer multispectral-images pyqt5 python scientific-visualization spectral-analysis

Last synced: 05 Feb 2026

https://github.com/mustafah/dream-my-plots

Create visual plots in Python with the help of text prompting popular LLMs through langchain

ai artificial-intelligence automation data-analysis data-visualization langchain llms machine-learning plotting python

Last synced: 13 Apr 2026

https://github.com/gjbex/python-dashboards

Repository that contains material for training sessions on creating dashboards using Python.

dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization

Last synced: 13 Jul 2025

https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql

This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in competitive landscape.

campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics

Last synced: 27 Feb 2025

https://github.com/DCS-training/intromachinelearning

This course is aimed at providing an introduction to machine learning for those with some beginner level python/Rstudio skills. Go to the readme file

data-analysis data-wrangling machine-learning python statistics

Last synced: 25 Apr 2025

https://github.com/virajbhutada/music-recommendation-system

This project is designed to provide personalized music recommendations for relaxation and meditation. Leveraging ML and data analysis, the system suggests tracks based on user preferences such as tempo, energy, and genre. Join us in enhancing music discovery through advanced algorithms and community-driven contributions.

data-analysis data-science-projects data-visualization eda html machine-learning ml-algortihms model-deployment model-evaluation music-recommendation-system nlp pivot-table principal-component-analysis python python-library similarity-matrix spotify-data streamlit-web user-experience

Last synced: 24 Jan 2026

https://github.com/scailfin/flowserv-core

Reproducible and Reusable Data Analysis Workflow Server

benchmarks data-analysis reproducibility reusability workflows

Last synced: 14 Jan 2026

https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result

Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result

blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization

Last synced: 08 Jan 2026

https://github.com/dcs-training/bayesian-statistics

Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file

bayesian-statistics data-analysis r statistics

Last synced: 05 Feb 2026

https://github.com/tillbiskup/cwepr

A Python package based on the ASpecD framework for handling cwEPR data.

continuous-wave data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science

Last synced: 06 Sep 2025

https://github.com/cbg-ethz/scdna-pipe

Python data analysis pipeline for single cell copy number event history reconstruction

bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow

Last synced: 05 Jan 2026

https://github.com/draym/covid19tracker

Coronavirus COVID-19 dashboard to track global cases

covid-19 covid19-tracker dashboard data-analysis

Last synced: 07 Jan 2026

https://github.com/shlokashah/student-depression-and-suicide-rate-prediction

https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/

data-analysis data-visualization machine-learning student suicide-rate-prediction

Last synced: 19 Nov 2025

https://github.com/narius2030/hive-datawarehouse-analysis

Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems

apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics

Last synced: 01 Apr 2025

https://github.com/haloapping/malas-ngetik-clf

Saya malas ngetik, makanya saya buat aja template proyek kompetisi Kaggle 😜. Template ini khusus untuk kasus klasifikasi.

data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart

The objective of this project is to analyze supermarket sales using pivot table and chart in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore data.

dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet

Last synced: 08 Jan 2026

https://github.com/mathieu2301/pbsc-tracker

Expérience de tracking des vélos en libre service fonctionnants avec PBSC

ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker

Last synced: 23 Jun 2026

https://github.com/aad99bxp/whatsapp-chat-analyzer

A project intended for Business Owners / Managers to analyze Whatsapp chats between their customer care executives and their customers.

data-analysis heroku-deployment python3

Last synced: 15 Mar 2025

https://github.com/is-leeroy-jenkins/badger

A data science & analysis toolkit for federal analysts with the environmental protection agency based on WPF, Net 8, and is written in C#.

ai budget budget-management data-analysis data-science data-visualization federal-government large-language-models machine-learning

Last synced: 14 Oct 2025

https://github.com/yusufcinarci/covid-19-data-analysis-visualization

The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.

covid-19-data-visualization data-analysis data-science data-visualization

Last synced: 22 Jul 2025

https://github.com/quantumudit/analyzing-goodreads-famous-quotes

This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 20 May 2026

https://github.com/luizassimoes/fitness-report

Create a personalized Fitness Wrapped report using your Apple Health data with this Streamlit application. Generate comprehensive, detailed summaries of your annual fitness activities, providing valuable insights into your year-long progress and achievements.

data-analysis data-visualization python streamlit

Last synced: 12 Feb 2026

https://github.com/visionkernel/centerspoke

Centerspoke is a data management and analysis tool that allows easy access to cloud databases. Say goodbye to using excel for data management. This open-source CLI tool allows for the rapid processing and analysis of all your data, and makes it easy to upload your excel files into your cloud databases.

cli cloud-database data-aggregation data-analysis data-analysis-python data-management data-science python python3

Last synced: 24 May 2026

https://github.com/martincastroalvarez/django-data-analytics

Data Analytics, PnL, LTV & retention analysis with Django

analytics beautifulsoup4 d3 d3js data-analysis django ltv rest-api visualization

Last synced: 06 May 2026

https://github.com/kevinschoon/qviz

QViz Interactive Plotting

data-analysis data-visualization go gonum qframe yaegi

Last synced: 01 Jun 2026

https://github.com/ernestaroozoo/memestocks.net

MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.

dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit

Last synced: 06 Mar 2026

https://github.com/aditiiprasad/whatsstat

A fun and insightful WhatsApp chat analyzer that turns your conversations into beautiful stats, juicy graphs, and quirky insights.

chat-analyzer data-analysis data-visualization nlp streamlit text-processing whatsapp

Last synced: 02 Sep 2025

https://github.com/grburgess/gbm_kitty

Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.

3ml catalogue data-analysis fermi-science grbs pipelines

Last synced: 29 Jun 2025

https://github.com/devexpress-examples/web-forms-pivot-grid-bind-to-sql-data-source

This example demonstrates how to create an ASPxPivotGrid and bind it to data via code.

asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms

Last synced: 06 Jul 2025

https://github.com/alieymsxxn/sql_project_data_job_analysis

This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.

data-analysis postgresql sql sqlite

Last synced: 16 Apr 2025

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 15 May 2025

https://github.com/johnsesana/eda-liquor-sales

Exploratory Data Analysis on Public Datasets

data-analysis data-visualization sql tableau-dashboards

Last synced: 09 Mar 2026

https://github.com/gxelab/tutorials

Tutorials of frequently used software packages and libraries in the lab

bioinformatics data-analysis evolution genetics genomics julia python3 r-language statistics visualization

Last synced: 18 Jan 2026

https://github.com/ronylpatil/whatsapplib

WhatsApp Group Chat Analysis Python Package.

data-analysis open-source pypi-package python-library python-package

Last synced: 02 Jan 2026

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/fernandezfran/exma

A Python library with C extensions to analyze and manipulate molecular dynamics trajectories and electrochemical data

computational-physics data-analysis molecular-dynamics oop python science

Last synced: 16 Jan 2026

https://github.com/akshat0427/spotify_history

code to find out some insights in spotify streaming data (work in progress)

data-analysis data-visualization

Last synced: 04 Feb 2026

https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate

The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques

big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql

Last synced: 27 Jan 2026

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026