An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/famarks/grafarg

Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License

analytics charts data data-analysis data-science data-visualization grafana grafarg graph

Last synced: 19 Jan 2026

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/shreeparab1890/flipkart-laptops-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Laptops listed on Flipkart.

data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 14 Apr 2026

https://github.com/titanscouting/tra-superscript

The Red Alliance data analysis package

data-analysis frc-scouting hacktoberfest python

Last synced: 11 Oct 2025

https://github.com/abhi-lab2/ipl-data-analysis

IPL data analysis for future predictions

data-analysis data-science python

Last synced: 14 Apr 2026

https://github.com/bcko/ud-da-stroopeffect

Udacity Data Analyst Nanodegree Project : Test a Perceptual Phenomenon (Stroop Effect)

data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree

Last synced: 04 Jul 2025

https://github.com/thomascenni/anfavea-data-analysis

Data analysis with Pandas and Datapane.

data-analysis datapane pandas

Last synced: 06 May 2026

https://github.com/phomint/udacity_dataanalysis

All projects and activities

data-analysis python udacity-nanodegree

Last synced: 11 Oct 2025

https://github.com/ibnaleem/cyberchef-discord

A versatile Discord bot that implements CyberChef's features for encoding, decoding, encrypting, compressing, analysing data directly and more in your Discord server

compression cti cyberchef cybersecurity data-analysis data-manipulation discord-bot discord-js encoding encryption hashing infosec parsing redteam

Last synced: 28 Jan 2026

https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel

This script takes a single audio file and collect all the sound features in it including (Frequency - mean - variance - minimum value - maximum value - Median - Kurtosis - Skewness) and save it in an array and import this array in a row in the datasheet.

automation data-analysis data-science dataset-generation excel-import signal-processing

Last synced: 18 Apr 2026

https://github.com/carolinedotxyz/dp_sgd_classification

A hands-on educational walkthrough of training a CelebA (Eyeglasses) image classifier with Differentially Private SGD using PyTorch and Opacus. The focus of this repo is on clarity and reproducibility through balanced subsets, deterministic preprocessing, and side-by-side baseline vs. DP training, while acknowledging real trade-offs.

celeba-dataset classification data-analysis dp-sgd machine-learning opacus python pytorch

Last synced: 16 May 2026

https://github.com/nomadsdev/financial-trend-analyzer

FinancialTrendAnalyzer helps analyze and visualize sales data to uncover financial trends. It uses Python to calculate total sales, track changes, and generate insightful charts for better decision-making.

business-intelligence data-analysis data-visualization financial-analysis matplotlib numpy pandas python revenue-trends sales-data seaborn time-series-analysis

Last synced: 19 Jan 2026

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 30 Mar 2025

https://github.com/ahammadshawki8/playing-with-pandas

🐼 Pandas is one of my favourite library in python. It is well-known for "Analyzing" data. Learn basics and beyond the basics of Pandas from this repository. 🤍🖤

beginner-friendly data-analysis favourite-library pandas python

Last synced: 17 Apr 2026

https://github.com/ndohvich/ibm-data-science-professional-certificate

Kickstart your career in data science & ML. Build data science skills, learn Python & SQL, analyze & visualize data, build machine learning models. No degree or prior experience required.

coursera dash data-analysis data-science html5 ibm ibm-professional-certificate javascript machine-learnng python sql

Last synced: 16 Nov 2025

https://github.com/strampelligiovanni/straklip

An HST pipeline for reducing wide-field imaging observations not specifically designed for High Contrast Imaging analysis. Published in Strampelli et al. 2022.

binaries data-analysis data-reduction direct-imaging exoplanets high-contrast-imaging hst wide-field-surveys

Last synced: 12 Oct 2025

https://github.com/soumya-thoutam/revenue-and-demand-forecasting-analysis

End-to-end data analysis project with SQL and Power BI for revenue and demand forecasting in bike-sharing.

data-analysis data-visualization powerbi sql sql-server

Last synced: 16 Mar 2025

https://github.com/listiangr/ecommerce_sales_data_analysis

Proyek ini menganalisis data penjualan e-commerce untuk membantu bisnis memahami tren penjualan, performa produk, dan segmen pelanggan. Tujuan utamanya adalah memberikan wawasan yang dapat meningkatkan strategi pemasaran dan pengelolaan produk.

dashboard data-analysis data-cleaning data-collection data-penjualan data-visualization exploratory-data-analysis microsoft-excel

Last synced: 19 Jan 2026

https://github.com/gustavo-zamai/shop_data_analisys

Analysis diferents shopping mall sells

data-analysis openpyxl pandas python3 pywin32

Last synced: 01 Mar 2025

https://github.com/satvikvirmani/engineering-graduate-salary-analysis

A Data Science/Machine Learning project to analyse and study salary patterns of a engineering graduate in India.

data-analysis data-science jupyter-notebook machine-learning prediction python regression

Last synced: 19 Jun 2026

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 10 May 2026

https://github.com/gustavo-zamai/product_return_data_analysis

Analysis returns products of differents stores

data-analysis excel pandas plotly-express python3 pywin32

Last synced: 13 May 2026

https://github.com/devexpress-examples/aspnet-pivot-grid-custom-aggregates

This example shows how to aggregate data by the field's first value.

asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms

Last synced: 06 Jul 2025

https://github.com/anurag-kumar-molankala/blinkit-grocery-sales-dashboard

The BlinkIT Grocery Sales Dashboard is an interactive Power BI dashboard that provides insights into grocery sales performance. It includes key KPIs, sales trends, and outlet performance analysis.

business-intelligence dashboards data-analysis data-visualization dax excel kpi-dashboard power-bi powerquery slicers-kpi-card-multirow-card sql-server ssis ssms ssrs-reports

Last synced: 09 Apr 2025

https://github.com/jinkogule/multi-analyst

O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.

apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application

Last synced: 12 Apr 2026

https://github.com/akash1070/data-science-virtual-internship-by-anz

Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.

data-analysis data-science predictive-analytics presentation-slides

Last synced: 24 Mar 2025

https://github.com/aritrakar/statpy

A simple package containing some functions for analysing Gaussian and Binomial distributions. Created for the Udacity AWS MLE Foundations 2021 course.

data-analysis python statistics

Last synced: 24 Oct 2025

https://github.com/zeinhasan/eksploration-and-data-visualization-course-material

Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials

data-analysis data-visualization statistics

Last synced: 24 Jun 2026

https://github.com/oguzgn/fully-automated-performance-marketing-dashboard

This project integrates data from multiple ad platforms with Google Analytics to track marketing campaigns. It uses a structured naming system and UTM tags. Data is visualized in Looker Studio dashboards to analyze campaign performance and ad spend.

bigquery data-analysis data-engineering data-modeling marketing-analytics marketing-automation marketing-data-science marketingdata sql

Last synced: 24 Mar 2025

https://github.com/raccoon-hero/gender-equality-tracker

A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.

chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api

Last synced: 07 May 2026

https://github.com/ayobami6/tweet-data-analysis

WeRateDogs Tweets Scrape using twitter Api

data-analysis data-science twitter webscraping

Last synced: 31 May 2026

https://github.com/prernarohra/quakeguard

QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.

artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization

Last synced: 21 May 2026

https://github.com/prernarohra/mental-health-prediction

This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.

data-analysis data-science data-visualization machine-learning mental-health python

Last synced: 20 May 2026

https://github.com/gracysapra/r-in-data-science

This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.

data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science

Last synced: 19 Apr 2026

https://github.com/bristolmyerssquibb/blockr.workshop

R in Pharma 2024 blockr workshop

data-analysis nocode r

Last synced: 18 Apr 2026

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 16 Mar 2025

https://github.com/nafisalawalidris/tools-for-data-science

It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.

arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development

Last synced: 11 Apr 2026

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 23 Mar 2025

https://github.com/quantitext/quantitext

Official repository for QuantiText applications in the .NET ecosystem.

api aspnet-core csharp data-analysis dotnet-core mvc-architecture

Last synced: 30 Mar 2025

https://github.com/m-faizan-mahmood/detailed-exploratory-data-analysis-eda-marketing-recomendations.

This project focuses on cleaning, preprocessing, and analyzing data using Pandas and NumPy. Key steps include handling missing values, removing outliers, feature engineering, and exploratory data analysis (EDA). Visualizations with Matplotlib and Seaborn highlight trends in customer spending, campaign performance, and product sales.

big-data data-analysis data-processing data-science eda exploratory-data-analysis numpy pandas python

Last synced: 11 Apr 2026

https://github.com/nirmit27/book-recommender-system

This is a book recommendation system based on item-based Collaborative Filtering memory-based model created using Flask.

data-analysis data-science flask python python3 recommender-system render

Last synced: 05 May 2026

https://github.com/derrickbaruga7/mapping-median-age-europe

An R project that creates an interactive map of the median age across European regions using Eurostat data and spatial visualization packages.

data-analysis data-science data-visualization datascience european-union mapping r

Last synced: 25 Mar 2025

https://github.com/raad07/sql_project-world_layoffs_dataset

This is a SQL project which comprises the Data Cleaning in the first part and Exploratory Data Analysis (EDA) in the second part.

data-analysis database mysql sql

Last synced: 27 Jan 2026

https://github.com/neerajcodes888/whatsapp-chat-analyzer

A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱

analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud

Last synced: 12 Apr 2026

https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel

This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.

capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql

Last synced: 20 Jun 2025

https://github.com/agustinmusanti/delitosencaba-proyectofinal-dataanalytics-coderhouse

En este repositorio muestro mi proyecto final en el curso "Data Analytics" de Coderhouse.

data-analysis excel powerbi

Last synced: 22 Jan 2026

https://github.com/siddharthbadal/sql-case-studies-data-analysis

Data Analysis case studies on various databases using SQL . Demonstrating proficiency in solving diverse business problems. Projects cover sales, orders, products, finance, healthcare and other sectors, and highlight my ability to analyze complex datasets through SQL queries, data manipulation, and visualization techniques.

data-analysis sql sql-query sql-server sqlserver

Last synced: 08 Jan 2026

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/seabbs/explorebcgonoutcomes

Analysis to explore the association of BCG vaccination and TB outcomes.

bcg data-analysis regression rstats tuberculosis

Last synced: 23 Feb 2026

https://github.com/rkolehov/retail-sales-analysis-project

End-to-end e-commerce analysis showcasing SQL and data visualization skills. Tracks sales, customer behavior, product performance, and delivery efficiency. Interactive dashboards provide actionable insights for business decision-making

analytics dashboard data-analysis ecommerce jupyter-notebook postgresql python sql tableau vscode

Last synced: 19 Apr 2026

https://github.com/antonijn/polyfit

Fits a polygon to a given data input

c data-analysis linear-algebra toy

Last synced: 16 Jul 2025

https://github.com/bretsw/beds

Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists

analytics data-analysis data-analytics data-science

Last synced: 31 Mar 2025

https://github.com/unnatmalik/dattavism-ai-powered-data-insight-generator-

Dattavism is an AI-powered data insight platform that transforms raw CSV files into comprehensive, contextualized reports—complete with visualizations, statistical summaries, and natural language insights. Dattavism is designed to handle datasets across diverse domains. it is Built using Python, Streamlit, Gemini API, Pandas, Matplotlib, NumPy,

data-analysis python streamlit

Last synced: 24 Jul 2025

https://github.com/silvano315/med-physics

This would be a repository about medical physics. It will based on 4 paths: medical data to analyse, SOTA programs for medical purposes, computer vision and eXplainability.

computer-vision data-analysis data-science explainable-ai medical-imaging medical-physics medical-tool

Last synced: 24 Mar 2025

https://github.com/ryanfranklin237/data-visualization-spreadsheets

Data visualization done with microsoft excel and google spreadsheets

data-analysis data-science data-visualization google-spreadsheets microsoft-excel

Last synced: 22 Feb 2026

https://github.com/rajnish93/jpandas

A lightweight JavaScript library for working with tabular data, inspired by Pandas in Python. Built with TypeScript, it provides an intuitive API for data manipulation and analysis.

data-analysis data-analytics data-manipulation data-science dataframe javascript pandas stream-processing table typescript

Last synced: 11 Jun 2025

https://github.com/nafisalawalidris/buybuy-e-commerce-company

The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.

buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql

Last synced: 16 Mar 2025

https://github.com/ajimaulana123/e-commerce-data-analis

Analisis dataset e-commerce guna menjawab kebutuhan product mana yang paling laris dibeli customer

data-analysis

Last synced: 28 Apr 2026

https://github.com/nafisalawalidris/sales-performance-dashboard

Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.

analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business

Last synced: 03 Feb 2026

https://github.com/ansh420/mcdonald_case-study

It is basically depend on the market Segment Analysis. It is a case study of mcDonald.

algorithms-implemented data-analysis python3 segmentation

Last synced: 12 Apr 2026

https://github.com/nafisalawalidris/springforth-university-foodbank

Springforth University Food Bank: A collaborative initiative with UNESCO to address student food insecurity. Contains code and resources for the web application, data analysis, and insights into the prevalence and impact of food insecurity on academic performance.

academic-performance collaborative-initiative data-analysis data-visualization excel pivot-tables powerbi springforth-university-food-bank student-food-insecurity unesco

Last synced: 17 Feb 2026

https://github.com/azmainadel/twitter-data-neo4j

Playing with graph database on a large dataset of twitter data.

data-analysis data-visualization neo4j-database snap

Last synced: 06 Apr 2025

https://github.com/ryannapp12/quant_trading_engine

A modular, and scalable quantitative trading engine built in Python. This project demonstrates efficient data caching with SQLite, concurrent backtesting, and advanced risk analytics, showcasing best practices in clean code architecture and performance optimization.

algorithmic-trading backtesting dash data-analysis data-visualization fintech lstm machine-learning numpy pandas plotly python quantitative-finance real-time risk-management sqlite technical-analysis tensorflow time-series-analysis trading-strategies

Last synced: 11 Apr 2026

https://github.com/dcs-training/intronetworkanalysis

This is a repository for the Introduction to Network Analysis course provided by Brian Wong for the CDCS. Within the repository there are files with sample datasets and a guide to building datasets. It will be updated before each section. Go to the Readme file

data-analysis data-visualisation gephi network-analysis text-analysis

Last synced: 27 Jan 2026

https://github.com/gorhkdwj/da_portfolio

Kim Jae Chun's DA_Portfolio

data data-analysis python sql

Last synced: 20 Feb 2026

https://github.com/ssreeramj/youtube_channels_analysis

This web app gives a detailed analysis of the videos uploaded in a particular youtube channel.

data-analysis heroku pandas python streamlit youtube

Last synced: 29 Apr 2026

https://github.com/harshals499/ecosecure-visualization

Data visualization project using Qlik to analyze sales performance for EcoSecure Systems.

business-intelligence data-analysis data-visualization qlik-sense sales-analysis

Last synced: 12 Jun 2026

https://github.com/ahmad-ali-rafique/weather-prediction-fcnn

This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.

ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information

Last synced: 28 Aug 2025

https://github.com/macdon112/layoff-analysis

SQL data cleaning & analysis of global layoffs

data-analysis data-cleaning data-exploration sql

Last synced: 21 Feb 2026

https://github.com/willie-conway/datavista

DataVista is a comprehensive, production-grade data analysis and machine learning platform that combines real-time data ingestion from live APIs, interactive visualizations, statistical analysis, hypothesis testing, and machine learning model training — all in a unified, professional-grade interface. Built with React and Recharts.

analytics-platform api-integration classification coingecko-api csv-import data-analysis data-cleaning-and-preprocessing data-pipeline data-science data-visualizations etl hypothesis-testing json-export machine-learning-models open-meteo react recharts regression statistics world-bank

Last synced: 30 May 2026

https://github.com/realorangeone/docker-cyberchef

A containerized deployment of CyberChef, with additional protections

cyberchef data-analysis data-manipulation docker encoding

Last synced: 24 Aug 2025

https://github.com/paezha/isdas

Companion package for An Introduction to Spatial Data Analysis and Statistics with R

data-analysis gis rstats spatial-analysis spatial-statistics

Last synced: 04 Jan 2026