An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/nafisalawalidris/bitcoin-price-analysis-api

Explore this comprehensive API for Bitcoin price analysis, featuring historical data and real-time updates from major exchanges. Built with FastAPI, Python and PostgreSQL, this open-source project is optimised for efficiency and scalability catering to developers, traders and analysts. Contribute and collaborate to enhance functionality.

api-development bitcoin bitcoin-api cryptocurrency data-analysis fastapi financial-data machine-learning open-source postgresql python restful-api

Last synced: 12 Mar 2026

https://github.com/misaghmomenib/nba-games-analysis

This is a Data Analysis Project That You Can Monitor the Number of Games and Wins and Losses of Different NBA Teams and Have Different Analyzes From It.

data-analysis data-analyst data-visualization git open-source python

Last synced: 04 Apr 2025

https://github.com/shlokashah/ipl-data-analysis

Data Analysis and Visualizations done on IPL dataset

data-analysis data-visualization pandas powerbi

Last synced: 01 Sep 2025

https://github.com/amirhosseinhonardoust/community-price-tracker

A community-driven app for tracking and comparing the prices of everyday goods across cities. Built with Python, SQLite, Pandas, and Streamlit, it lets users log, visualize, and analyze item price trends, inflation, and cost-of-living differences with clear charts and full local data privacy.

community-project cost-of-living dashboard data-analysis data-visualization economic-analysis inflation-tracking local-data open-data pandas price-tracker public-insight python sqlite streamlit

Last synced: 08 Nov 2025

https://github.com/sondosaabed/advanced-data-wrangling

In this advanced course, Learning the three phases of data wrangling: gathering, assessing, and cleaning data.

data-analysis data-analyst-nanodegree data-wrangling numpy pandas python

Last synced: 09 Apr 2025

https://github.com/banitalebi/data-visualization-dashboard

The Data Visualization Dashboard serves as an excellent starting point for projects focused on data analysis and representation.

data-analysis data-visualization interactive-charts nextjs shadcn-ui starter talwindcss typescript

Last synced: 10 Apr 2025

https://github.com/kingabzpro/annual-recycled-energy-saved-in-singapore

Learn how much Singapore is saving energy per years by recycling plastics, paper, glass, ferrous and non-ferrous metal

cleaning-data data-analysis data-science deepnote energy environment

Last synced: 19 Jun 2025

https://github.com/msk-access/access_data_analysis

Scripts for downstream analysis plotting of pipeline output

analysis data-analysis msk-access python r

Last synced: 10 Feb 2026

https://github.com/simeononsecurity/track-helium-mobile-wifi

A collection of scripts and tools that tracks the availability of helium mobile wifi networks in the wild from the Wigle Dataset and Helium API. Updates every 24 hours.

automation bigdata carrieroffload data data-analysis dataset dataset-generation helium openroaming passpoint python

Last synced: 23 Apr 2025

https://github.com/madebyarthouse/momentum-coalition-compass

Application for analysing and visualizing data from a political questionnaire

austria data-analysis politics venn-diagram

Last synced: 27 Jul 2025

https://github.com/jgolafshan/wallstreetsocial

Analyze Reddit posts/comments, uses an NLP model to recognize stock symbols and options positions

data-analysis flexible machine-learning mentioned-stocks named-entity-recognition natural-language-processing nlp reddit social-media-analysis sql wallstreetbets

Last synced: 22 Jul 2025

https://github.com/misaghmomenib/social-engagement-analysis

In This Repository, Using a Csv File Related to Social Network Interactions I Made a General Analysis That You Can Have at Your Disposal

data-analysis data-visualization git open-source pandas-python python

Last synced: 21 Nov 2025

https://github.com/Haritha1005/data_analyst_coursework_and_practice

This repository showcases my data analytics and data science skills through projects, fostering collaboration and community engagement

data-analysis data-visualization etl excel matplotlib numpy-library pandas powerbi-report python3 r scipy sql tableau

Last synced: 16 Oct 2025

https://github.com/abubakkar32/machine-learning-practice

This GitHub repository is a valuable resource for machine learning and Python enthusiasts. It includes a wide range of projects and tools, covering topics like Data Visualization, Data Analysis, ML, DL, Automation, NLP, Web Scraping, and more. Contributors are welcome to join and learn together in this supportive community. Happy coding!

automation big-data chatbot data-analysis data-visualization datastractures machine-learning mathplotlib nlp-machine-learning numpy pandas plotly pypdf2-lib pytest python3 seaborn selenium-webdriver skit-learn webscraping

Last synced: 23 Mar 2025

https://github.com/deshima-dev/decode

:zap: DESHIMA code for data analysis

astronomy data-analysis deshima python spectroscopy submillimeter

Last synced: 18 Jun 2025

https://github.com/alexcj10/analyzing-amazon-sales-data

This repository is dedicated to analyzing Amazon sales data to identify trends and insights that can help improve sales strategies and performance.

amazon beautifulsoup data-analysis data-science data-visualization ecommerce machine-learning matlpotlib numpy pandas python sales skilearn

Last synced: 10 Jul 2025

https://github.com/ptyadana/dv-data-visualization-with-python

Data analysis and Data Visualization of Countries's GDP, Life Expectancy comparison across continents, GDP per Capita Relative Growth, Population Reative Growth comparison etc using Pandas, Matplotlib.

csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3

Last synced: 12 Apr 2025

https://github.com/taylorteixeira/projeto-ed-satc

Este projeto foi desenvolvido para demonstrar as funcionalidades e práticas de Data Engineering por meio da integração eficiente de infraestrutura, ingestão, processamento e análise de dados em larga escala

azure data-analysis databricks mongodb terraform

Last synced: 17 Feb 2026

https://github.com/moataz-elmesmary/data-analysis-using-excel-and-power-bi-sessions

Excel & Power BI Sessions from basics to advanced for Data Analysis

data-analysis dax excel powerbi powerpivot powerquery

Last synced: 02 Jan 2026

https://github.com/leonism/sample-superstore

This is the Python version analysis approach, towards the legendary Sample Superstore Dataset with Pandas

data-analysis datamining datascience dataset eda jupyter-notebook machine-learning python

Last synced: 09 Oct 2025

https://github.com/thecoderpinar/hms-brainactivity-analysiss

Welcome to the GitHub repo for "HMS - EEG Exploration & Neurocritical Care Journey"! Explore EEG data, understand wave patterns, and delve into conditions like LPDs, GPDs, LRDA, and GRDA.

critical-care data-analysis data-science data-visualization deep-neural-networks eeg eeg-signals exploratory-data-analysis healthcare medical-research neuroscience signal-processing

Last synced: 30 Apr 2025

https://github.com/ksm26/function-calling-and-data-extraction-with-llms

Master the techniques of function-calling and structured data extraction with LLMs. Learn to enhance LLM capabilities, integrate web services, and build practical applications for real-world data usability.

advanced-workflows ai-integration custom-functionality customer-service-transcripts data-analysis data-extraction end-to-end-applications function-calling llms natural-language-processing openapi practical-implementation structured-data web-services-integration

Last synced: 01 May 2026

https://github.com/data-engineering-community/data-engineering-meetup-in-a-box

A collection of guides, resources, and support for DE meetup organizers.

data data-analysis data-engineering data-mining data-structures database meetups

Last synced: 02 Aug 2025

https://github.com/bluebrain/data-validation-framework

Simple framework to create data validation workflows.

data data-analysis python validation validation-tool

Last synced: 14 May 2025

https://github.com/edaaydinea/estimating-the-probability-of-confirmed-covid-19-cases-taking-into-the-intensive-care-unit-icu-

This repository includes the slides and coding parts for the Estimating the Probability of Confirmed COVID-19 Cases Taking into the Intensive Care Unit (ICU).

covid-19 data-analysis data-science data-visualization machine-learning

Last synced: 11 Apr 2025

https://github.com/avannaldas/quickview

QuickView is a python package for a quick glance of the dataset, just pass the pandas dataframe. It gives some useful summary and plots with just one line of code. All the summary that it outputs is made available through member variables. It is built using matplotlib.

data-analysis data-visualization dataframe machine-learning matplotlib pandas pandas-dataframe plots python3

Last synced: 01 Jul 2025

https://github.com/kennethleungty/pymysql-demo

PyMySQL - Connecting Python and SQL for Data Science

data-analysis data-science mysql pandas python sql

Last synced: 12 Jul 2025

https://github.com/jwngr/notreda.me

Historical Notre Dame Fighting Irish football schedules, stats, and analyses

college-football data-analysis data-visualization football notre-dame sports stats

Last synced: 09 Apr 2025

https://github.com/the-data-dilemma/parquettohuggingface

ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.

audio-dataset audio-processing automatic-speech-recognition data-analysis data-science dataset healthcare-application huggingface huggingface-datasets pandas parquet parquet-generator python3 speech-data speech-recognition speech-to-text speech-translation

Last synced: 21 Aug 2025

https://github.com/rolv-io/rolvapp

Rolv is your AI-powered research assistant for life sciences!

ai biology data-analysis data-science genomics life-sciences medicine

Last synced: 02 Mar 2026

https://github.com/dmytrovoytko/data-engineering-amazon-reviews

Data Engineering project for ZoomCamp`24: JSONL -> PostgreSQL/BigQuery + Metabase + Mage.AI

bash-script bigquery codespaces data-analysis data-visualization etl metabase pipeline python-script

Last synced: 31 Jul 2025

https://github.com/shreeparab1890/fifa-wc-2022-qatar-data-analysis-eda

This is a Jupyter Notebook( iPython Notebook) with Data Analysis (EDA) on FIFA WC Qatar 2022 match data.

data-analysis data-analysis-python data-science data-visualization eda fifa matplotlib-pyplot numpy pandas plotly-express python-3

Last synced: 08 Mar 2026

https://github.com/mrgeislinger/udacitydand_proj_wrangleandanalyzedata

Wrangling and analyzing data project for Udacity's Data Analyst Nanodegree. Wrangles WeRateDogs™ (@dog_rates) Twitter data from local, online, and Twitter API sources.

data-analysis data-analyst data-science datascience jupyter-notebook python3 twitter udacity-data-analyst-nanodegree udacity-nanodegree

Last synced: 09 Oct 2025

https://github.com/dataforgeopenaihub/steam-sales-analysis

This repository features an ETL pipeline for retrieving, processing, validating, and ingesting game metadata and sales data from SteamSpy and Steam APIs. Data is stored in a MySQL database on Aiven Cloud and visualized using Tableau dashboards for insightful analysis of gaming trends and sales performance.

cloud-computing data-analysis data-engineering data-pipepline data-warehousing games mysql-database python steam-api tableau typer-cli

Last synced: 06 Feb 2026

https://github.com/qzcool/sac

中国证券业协会数据分析和可视化。Data Analysis and Visualization for Securities Association of China (SAC).

china data-analysis finance python

Last synced: 11 Jul 2025

https://github.com/abhaysingh71/ai-powered-healthcare-intelligence-network

The AI-Powered Healthcare Intelligence Network is an AI-driven system offering disease prediction, drug recommendations, heart disease risk assessment, and an AI medical chatbot. Using ML, NLP, and LLMs, it provides accurate diagnoses, insights, and recommendations, enhancing healthcare accessibility, efficiency, and decision-making .

airtificialintelligence chatbot data-analysis data-science datawrangling disease-prediction healthcare-ai heart-disease huggingface langchain large-language-models lightgbm machine-learning mistral-7b recommendation-system retrieval-augmented-generation sentence-transformers shap vector-database

Last synced: 01 Feb 2026

https://github.com/bpkaur/word-frequency-in-moby-dick

To find out the most frequent words in the novel Moby Dick using Python.

beautifulsoup data-analysis data-science moby-dick nltk notebook-jupyter python3

Last synced: 09 Nov 2025

https://github.com/stappit/blog

I often post solutions to textbook exercises, including: Bayesian Data Analysis (BDA) by Gelman et al; Causal Inference in Statistics Primer (CISP) by Pearl et al; Purely Functional Data Structures (PFDS) by Okasaki.

bayesian-data-analysis blog data-analysis data-science gelman hakyll haskell pearl purely-functional-data-structures solutions stan static-site statistical-inference statistics

Last synced: 14 Mar 2025

https://github.com/nico-curti/data-analysis

Data Analysis Utilities (from Statistics to Machine Learning)

data-analysis deep-learning-algorithms linear-algebra machine-learning-algorithms statistics

Last synced: 07 Oct 2025

https://github.com/ideas-lab-nus/eplusr-paper

Data and code for Jia and Chong (2020): Hongyuan Jia and Adrian Chong (2020). eplusr: A framework for integrating building energy simulation and data-driven analytics. (Accepted in Energy and Buildings).

bayesian-calibration data-analysis data-driven-analytics data-exploration energyplus eplusr multi-objective-optimization parametric-analysis r

Last synced: 21 Feb 2026

https://github.com/arv-anshul/yt-watch-history-v2

Analyse your YouTube watch history using ML with Graphs.

data-analysis docker fastapi machine-learning python streamlit youtube

Last synced: 01 Jul 2025

https://github.com/frauddi/dataspot

Find data concentration patterns and hotspots. Built for fraud detection and risk analysis.

anomalies anomalies-detection data-analysis data-science fraud-detection hotspots pattern-mining python

Last synced: 09 Apr 2026

https://github.com/ogoodness/vbreaker-js

CSC 483 Project - Ciphers: Caeser, Multiplicitive, Affine, Vigenere, Hill, Columnar Transposition

affine-cipher caesar-cipher columnar-transposition-cipher cryptography data-analysis decoder decryption encoder encryption hill-cipher parsing vigenere-cipher

Last synced: 11 Apr 2025

https://github.com/cusyio/datenanalyse-in-python

Kurs zur automatisierten Aufbereitung, Zusammenfassung und Erstellung von Diagrammen tabellarischer Daten mit Python.

data-analysis pandas-python pandas-tutorial python

Last synced: 24 Apr 2025

https://github.com/drkostas/eestech-bigdata-challenge

EESTech Challenge is a brand new competition organized by EESTEC, that has the aim to create opportunities for European students to gain knowledge in the field of EECS and develop a professional network. The technological topic of 2017-2018ths competition was Big Data. This is the code I sumbitted with my team (BFS), which consisted of 3 members in total.

apache-spark data-analysis data-visualization eestec machine-learning matplotlib python

Last synced: 25 Apr 2025

https://github.com/invia-flights/blitzly

Lightning-fast way to get plots with Plotly ⚡️

data-analysis data-science plotly plotting-in-python python visualization

Last synced: 14 Jan 2026

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 28 Apr 2025

https://github.com/txn2/mcp-data-platform

A semantic data platform MCP server that composes multiple data tools with bidirectional cross-injection - tool responses automatically include critical context from other services.

data-analysis data-lake data-warehouse golang golang-library mcp mcp-server

Last synced: 31 May 2026

https://github.com/neelshah18/scopus-analysis-for-indian-researcher

It contains data analysis of indian researcher in scopus journal from 2000 to 2016. It also have dataset.

artificial-intelligence data-analysis deep-learning india industry jupyter-notebook machine-learning scopus-analysis universities world

Last synced: 06 Jul 2025

https://github.com/datadesk/extreme-heat-excess-deaths-analysis

A statistical analysis of excess deaths attributable to extreme heat in California's most populous counties

data-analysis data-journalism journalism news python r statistics

Last synced: 11 Oct 2025

https://github.com/ankan24/machine-learning-data-analysis

This repository contains a collection of Jupyter Notebooks that demonstrate various machine learning and data analysis techniques. The project does not provide a detailed description or specific use cases, but the notebooks cover a range of topics related to machine learning and data analysis.

data-analysis jupiter-notebook machine-learning

Last synced: 19 Oct 2025

https://github.com/acs/ghtorrent

Experimental GItHub project analysis based on GHTorrent

data-analysis data-visualization ghtorrent github visualization

Last synced: 20 Mar 2025

https://github.com/erictleung/phyloseq-cheatsheet

:notebook: Minimal cheatsheet for functions in the phyloseq R package

bioconductor bioinformatics cheatsheet data-analysis microbiome microbiota notes phyloseq r

Last synced: 25 Mar 2025

https://github.com/abmenzel/messenger-wrapped

Explore your Facebook Messenger chat history, in a manner inspired by Spotify Wrapped

data-analysis data-visualization facebook meta nextjs reactjs spotify

Last synced: 18 Jul 2025

https://github.com/bydevmar/master_masd_fpo

Ce dépôt GitHub regroupe tous les cours, TP, TD, projets, et exercices de ma formation en master en mathématiques appliquées pour la science des données. Parcourez-le pour une vue complète de mon parcours académique, offrant une perspective détaillée de mon apprentissage dans ce domaine.

acp afc algebra big-data-analytics dashboards data-analysis datascience economics english graph-theory latex linear-algebra non-linear-algebra probability prog python scientific-research software-package statistics

Last synced: 05 May 2025

https://github.com/dbeaudoinfortin/napsdataanalysis

Canadian National Air Pollution Surveillance Program (NAPS) data downloader, importer, extractor, analysis, and visualization toolbox.

air-pollution air-quality air-quality-data aqhi aqhi-canada canada cesi data-analysis eccc geoscience heatmap naps pollution

Last synced: 16 Mar 2025

https://github.com/bobleesj/cif-bond-analyzer

An interactive Python script that computes the minimum atomic bonding distances from sites, generating histograms and pair counts.

crystal-structure data-analysis high-throughput materials-infomatics solid-state

Last synced: 16 Jan 2026

https://github.com/rameerez/dashboards

🍱 Ruby gem to create customizable bento-style admin dashboards in your Rails app

admin admin-dashboard admin-panel bento business business-analytics business-intelligence dashboard data-analysis gem rails ruby

Last synced: 13 Apr 2025

https://github.com/misaghmomenib/data-analysis-projects

A Repository Featuring a Collection of Data Analysis Projects, Showcasing Various Techniques and Tools for Extracting Insights From Data. Explore, Learn, and Utilize These Projects to Enhance Your Data Analysis Skills and Workflows.

data-analysis data-analysis-python data-visualization jupyter-notebook open-source python

Last synced: 13 Apr 2025

https://github.com/tzerk/gammaspec

A collection of functions to analyse gamma spectra

data-analysis gamma-ray-spectrometry r

Last synced: 15 Apr 2025

https://github.com/djoshea/trial-data

Interfaces and utilities for analysis of neurophysiology and behavioral data

data-analysis electrophysiology neuroscience

Last synced: 23 Jan 2026

https://github.com/iamantimpal/iamantimpal

👋 Hi, I'm Antim Pal, the Founder of Optimism Educator. An online platform dedicated to empowering students with skills in Computer Science, Web Design, Graphic

data-analysis data-science data-visualization database database-design database-management datascience graphical-user-interface graphics grapic-design reading-list readme readme-badges readme-generator readme-md readme-profile readme-stats readme-template

Last synced: 10 Apr 2025

https://github.com/iamyajat/whatsapp-chat-analyzer-api

An API to analyse WhatsApp chats and generate insights

data-analysis data-science fastapi python whatsapp

Last synced: 17 Oct 2025

https://github.com/anotherkamila/stalky

Self-hosted app to keep track of anything you want. Don't let others stalk you, stalk yourself!

csv data-analysis data-storage geeks privacy-by-design self-hosted timeseries tracking

Last synced: 12 Apr 2025

https://github.com/nicodupont/mooc

All my finished Moocs on the subject of the data science mainly

data-analysis data-science data-visualization datacamp jupyter-notebook machine-learning mooc pandas python sas sql

Last synced: 28 Apr 2025

https://github.com/coalio/Assistant

A data science library providing flexible dataframes for Lua 5.1+

data-analysis data-science data-structures dataframe lua

Last synced: 11 Apr 2025

https://github.com/sanvishal/Exoplanet-Explore

An Interactive data visualization of Exoplanets

animation d3js data-analysis data-science exoplanet python space visualization

Last synced: 14 Apr 2025

https://github.com/flexmonster/pivot-jupyter-notebook

Jupyter Notebook pivot table example with Flexmonster

data-analysis data-science interactive jupyter-notebook pivot-tables python

Last synced: 16 Jun 2025

https://github.com/pseudomanifold/enchiridion-tda

An enchiridion for instructing mortals in the hidden arts of topological data analysis

data-analysis graph-kernels persistent-homology topology

Last synced: 12 Mar 2026

https://github.com/zeeshanahmad4/my-path-to-python

Templates and Referance code in Python for Web development,Data Science, Analysing,visualization,Cleaning and Scraping, Machine Learning, Artificial Intelegence

code data-analysis data-visualization machine-learning python refereance selenium template

Last synced: 12 Aug 2025

https://github.com/thecoderpinar/gen-expression

Gene expression analysis is a fundamental component of genomics research, providing valuable insights into how genes are regulated and their impact on various biological processes. This project delves into the realm of gene expression data, aiming to uncover hidden patterns and relationships within complex datasets. 🚀

bioinformatics biotechnology data-analysis data-science data-visualization genomics kaggle machine-learning pca python

Last synced: 30 Apr 2025

https://github.com/globeandmail/upstartr

An R package powering core startr functionality

data data-analysis data-journalism data-visualization journalism news r r-package

Last synced: 03 Oct 2025