An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/soufianboukir/ecom-analytics-platform

End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.

data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries

Last synced: 14 Jun 2026

https://github.com/chetanmalviya513/Firm-Financial-Transaction-Analysis

📊 Financial Analysis & Forecasting: processed large-scale financial data using Python for trend analysis and insights. Developed interactive Tableau dashboards to improve forecasting accuracy and reduce costs by 25%.

data-analysis financial-data forecasting insights msexcel pandas python reporting tableau-dashboards

Last synced: 15 Jun 2026

https://github.com/mindgamesnl/yanderestats

https://mindgamesnl.github.io/YandereStats/

data-analysis reporting-pipeline yandere yandere-sim

Last synced: 18 Jun 2026

https://github.com/kirkalyn13/open-signal-report-generator

Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file.

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 19 Jun 2026

https://github.com/souravsuvarna/whatsapp-chat-analyzer-api

The WhatsApp Chat Analyzer API is a public API designed for frontend enthusiasts who want to build a WhatsApp Chat Data Visualizer project. Built on FastAPI, it processes chat data and returns the processed results in JSON format.

api data-analysis data-science fastapi publicapi python

Last synced: 20 Jun 2026

https://github.com/markmusic27/data-statistics-calculator

💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.

data-analysis standard-deviation statistics statistics-calculator

Last synced: 20 Jun 2026
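The five statistics named in the entry above can be sketched with Python's standard library. This is an illustrative sketch, not the repository's code: the `describe` function name and the sample data are hypothetical, and the population standard deviation (`pstdev`) is an assumed choice, since the listing does not say whether the project uses the population or sample form.

```python
import statistics


def describe(values):
    """Return mean, median, mode(s), range, and standard deviation of a list of numbers."""
    if not values:
        raise ValueError("values must not be empty")
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        # multimode returns every most-common value, so ties are not hidden
        "modes": statistics.multimode(values),
        "range": max(values) - min(values),
        # Assumption: population standard deviation; use statistics.stdev for the sample form
        "stdev": statistics.pstdev(values),
    }


# Hypothetical sample data for illustration only
summary = describe([2, 4, 4, 4, 5, 5, 7, 9])
print(summary)
```

Returning a list of modes rather than a single value avoids silently picking one winner when several values are equally common.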

https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries

🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈

ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics

Last synced: 21 Jun 2026

https://github.com/rogernet/desafio-profissional-produto-data-driven

Helps train Product Analysts, PMs, and Business Managers capable of making strategic, data-driven decisions.

data-analysis data-science data-visualization product

Last synced: 23 Jun 2026

https://github.com/rudra-g-23/find-my-joint

A utility to find potential join keys (matching columns) across multiple DataFrames.

data-analysis data-visualization join network-graph pandas pandas-dataframe

Last synced: 24 Jun 2026
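One common way to look for potential join keys, as the entry above describes, is to compare the distinct values of every column pair across tables and keep pairs with a high overlap. The sketch below is an assumption about the general idea, not the repository's implementation: tables are represented as plain dictionaries of column lists standing in for DataFrames, and `candidate_join_keys`, `min_overlap`, and the sample tables are hypothetical.

```python
from itertools import combinations


def candidate_join_keys(tables, min_overlap=0.5):
    """Find column pairs across tables whose distinct values overlap.

    tables: mapping of table name -> mapping of column name -> list of values.
    Returns tuples (table_a, column_a, table_b, column_b, overlap), best first.
    """
    results = []
    for (name_a, cols_a), (name_b, cols_b) in combinations(tables.items(), 2):
        for col_a, vals_a in cols_a.items():
            set_a = set(vals_a)
            for col_b, vals_b in cols_b.items():
                set_b = set(vals_b)
                smaller = min(len(set_a), len(set_b))
                if smaller == 0:
                    continue  # an empty column cannot be a join key
                # Share of the smaller column's distinct values found in the other column
                overlap = len(set_a & set_b) / smaller
                if overlap >= min_overlap:
                    results.append((name_a, col_a, name_b, col_b, overlap))
    return sorted(results, key=lambda r: r[4], reverse=True)


# Hypothetical tables for illustration only
tables = {
    "orders": {"order_id": [1, 2, 3], "customer_id": [10, 11, 10]},
    "customers": {"id": [10, 11, 12], "name": ["Ann", "Bo", "Cy"]},
}
matches = candidate_join_keys(tables)
print(matches)
```

Value overlap only suggests a candidate; columns with small integer ranges can overlap by coincidence, so results still need a manual check.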

https://github.com/jcaperella29/financial-data-scraper

Financial Data Scraper is a Python-based web scraping tool using Selenium to extract financial data from Stock Analysis. It scrapes Income Statement, Balance Sheet, Cash Flow, and Ratios for multiple companies and saves them as CSV files.

automation data-analysis finance financial-statements investment python selenium stock-market web-scraping

Last synced: 28 Jul 2025

https://github.com/madhuresh2011/amazon-sales-report-analysis-using-python

This project focuses on analyzing Amazon sales data using Python to uncover insights into sales performance, customer behavior, and product trends.

charts cleaning-data data-analysis jupyter-notebook matplotlib numpy pandas python seaborn visualization

Last synced: 17 Apr 2026

https://github.com/sumitkumargiri/machine-learning-project

This repository contains best practices for managing a GitHub repository.

data-analysis github machine-learning opensource project python

Last synced: 05 Apr 2026

https://github.com/as16082023/nashville-housing-data-cleaning-project

This project involved using MySQL to clean and optimize a Nashville housing dataset, addressing key data quality issues to ensure it was ready for accurate analysis.

data-analysis data-cleaning mysql nashville-housing-data

Last synced: 10 Apr 2025

https://github.com/shriram-vibhute/digit_classification

This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.

classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn

Last synced: 28 Oct 2025

https://github.com/sayantanidalui/student-mental-health-analysis

A SQL-based analysis project exploring student mental health, stress, and lifestyle patterns. Uncovers key insights using joins, CTEs, and window functions — no other tools used.

data-analysis mental-health mysql sql studentdata

Last synced: 07 Jul 2025

https://github.com/mynenik/xyplot-win32

XYPLOT Plotting and Data Analysis Program for 32-bit Windows

cpp data-analysis data-manipulation data-visualization forth mfc windows-app

Last synced: 18 Mar 2025

https://github.com/tim-hub/python-course

A new Python course: an attempt to offer MOOC-style learning resources and content for Python learners.

data-analysis learning python

Last synced: 17 Mar 2025

https://github.com/michellepellon/jobx

A modern, powerful job scraper for LinkedIn, Indeed and beyond.

compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper

Last synced: 17 Jan 2026

https://github.com/gad-dimnt-cptec/scanplot

A simple plotting system for SCANTEC.

data-analysis jupyter-notebook pandas python scantec

Last synced: 17 Jan 2026

https://github.com/tynoee/covid19_data_analysis

This is an analysis of a COVID-19 dataset using multiple SQL queries. The dataset includes information on COVID-19 cases such as confirmed cases, deaths, and recoveries, segmented by geographical location and time period.

data-analysis excel sql sqlserver-2019 tableau tableau-public

Last synced: 16 Feb 2026

https://github.com/hemanthkumarsunkari27/pmay_analysis_project

Built for the 1st AI for Good Hackathon by Snowflake, this project uses data analytics and AI to explore housing and sanitation trends in India under PMAY. Using Snowflake and Streamlit, it provides interactive insights into regional disparities, helping guide sustainable infrastructure development.

data-analysis data-visualization pmay-analysis sanitation-coverage snowflake-integration streamlit-dashboard sustainable-development

Last synced: 26 Mar 2025

https://github.com/hoangsonww/fred-banking-data-analysis

💸 AI-powered banking data explorer that combines FRED API insights with vector search, regression analysis, and interactive chat via OpenAI, Claude, and Gemini. Built with TypeScript, React, and Express for seamless full-stack performance.

anthropic chartjs claude-ai data data-analysis data-analytics data-science data-visualization fred fred-api gemini google-generative-ai logistic-regression multiple-regression openai pinecone react regression typescript vector-database

Last synced: 09 Apr 2025

https://github.com/wiseaidev/truth-guard

Analyzing a 79k Dataset of Misinformation and Fake News

data-analysis fastapi lstm machine-learning python supervised-learning

Last synced: 19 Jan 2026

https://github.com/sd7campeon/yelp-sentiment-analysis-with-python-bs4-and-llm

A scalable pipeline for automated extraction, preprocessing, and sentiment analysis of Yelp reviews. Uses advanced HTTP requests, HTML parsing, and text normalization (tokenization, stopword removal, lemmatization) to enable precise polarity and subjectivity analysis for consumer insights and business analytics.

beautifulsoup beautifulsoup4 business-analytics cuda data-analysis nlp-machine-learning nltk opinion-mining pandas python python3 requests-library-python sentiment-analysis text-preprocessing textblob torch web-scraping yelp-reviews

Last synced: 06 May 2026

https://github.com/rayyan9477/coin-detection-project

This Coin Detection Project leverages machine learning techniques to identify coins using a dataset from Kaggle. Key libraries utilized include OpenCV for image processing, TensorFlow for model training, and Pandas for data manipulation. The project also employs NumPy for numerical operations and Matplotlib for visualization.

computer-vision data-analysis data-science data-visualization machine-learning notebook python

Last synced: 15 May 2026

https://github.com/jasontanx/capstone-project-machine-learning

A final semester project from my MSc Data Science course

data-analysis datascience machinelearningprojects tourism-data

Last synced: 26 Mar 2025

https://github.com/mijisu0103/sk-ai-data-academy

This repository contains the projects and assignments completed during the SKADA course, which focused on foundational skills in Python, data analysis, and machine learning. Here, you will find various scripts, notebooks, and documentation that showcase my learning journey and the practical applications of the concepts covered in the course.

data-analysis machine-learning python

Last synced: 10 Jul 2025

https://github.com/hafeez-urrehman/mobile-price-classification

In the Mobile Price Classification project, I built a predictive model to categorize mobile phones into different price ranges based on their features by applying machine learning techniques.

data-analysis linear-regression machine-learning mobile-price-prediction model-save-and-load predictive-modeling

Last synced: 15 May 2026

https://github.com/misaghmomenib/stock-momentum-analysis

A Python-based Data Analysis Tool Designed to Evaluate Stock Momentum. Leverages Historical Market Data to Identify Trends, Predict Price Movements, and Assist in Making Informed Investment Decisions.

data-analysis data-analysis-python data-visualization git open-source python

Last synced: 10 Apr 2025

https://github.com/revan-alqahmi/summarize-talabat-company-reviews

A natural language processing project that analyzes Arabic comments about Talabat Company and classifies them as positive, negative, or neutral using machine learning algorithms and natural language processing techniques.

artificial-intelligence data-analysis machine-learning-algorithms natural-language-processing python

Last synced: 11 Jan 2026

https://github.com/sarathchandranpm/cleaning-and-exploratory-analysis-of-global-layoff-data

This project involves a thorough data analysis and cleaning process centered on global layoff data. It showcases advanced data management abilities by integrating data cleaning methods with a detailed exploration of workforce reduction patterns across various companies, industries, and countries.

data-analysis data-cleaning mysql sql

Last synced: 22 Sep 2025

https://github.com/garcane/nike_web_crawler

This project involves web scraping Nike's product pages to extract product names, prices and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.

beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup

Last synced: 18 Apr 2026

https://github.com/grypesc/graduateadmissions

Visualization, analysis and predictive modeling of a Kaggle graduate admissions dataset.

data-analysis data-mining data-science data-visualization dataset

Last synced: 08 Jul 2025

https://github.com/patilni3/matplotlib-in-depth

Python's Matplotlib Library for Data Analysis, Machine Learning, Data Science and many more...

data-analysis data-representation data-science data-visualization matplotlib matplotlib-pyplot plots-in-python powerbi seaborn

Last synced: 03 Apr 2025

https://github.com/phillbertnevinemmanuel/coviddeathvaceda

An exploratory data analysis based on a dataset of COVID statistics from 2020-2022.

data-analysis database sql

Last synced: 09 Apr 2025

https://github.com/frankelavsky/political-polarization-challenge

I had 8 hours to build a solution to the research claim that "politics have become more divided in the past 50 years." You can navigate views of congressional voting patterns using arrows. I used d3, require, MVC pattern, and vanilla js. Pre-processed the data in node.js. Data is from DW-NOMINATE: ftp://k7moa.com/junkord/HANDSL01114A20_STAND_ALONE_30.DAT

client-side css d3 d3js data-analysis data-visualization frontend frontend-app html interactive interactive-visualizations javascript modular nodejs political-science politics requirejs research single-page-app visualization

Last synced: 06 Apr 2026

https://github.com/vitia-fritelle/ipynb_converter

Jupyter notebook to Python file converter

data-analysis data-science jupyter-notebook python

Last synced: 28 Apr 2026

https://github.com/akash1070/data-science-virtual-internship-by-accenture

Data merging and data cleaning in Python, as well as data visualisation with a dashboard in Tableau.

data-analysis data-cleaning data-science python3 tableau visualization

Last synced: 15 May 2026

https://github.com/arnabsaha7/customer-churn_prediction---analysis

Predict customer churn using machine learning. This project employs a RandomForestClassifier to analyze customer data and determine the likelihood of churn. Explore the Jupyter Notebook for insights into the data and model, and contribute to the project's development.

customer-churn-prediction data-analysis machine-learning

Last synced: 02 Mar 2025

https://github.com/denko5/valentine-s-analysis

This repository dives into Valentine's trends and behaviors using SQL, focusing on exploratory data analysis to uncover patterns and insights. It features SQL queries, datasets, and documentation to guide readers through the process. Designed for collaboration and educational use.

africa analytics data-analysis eda exploratory-data-analysis insights kenya sql sql-server sqlworkbench trends valentines-day

Last synced: 04 May 2025

https://github.com/dimits-ts/synthetic_moderation_experiments

Experiments relating to synthetic LLM user-agents and LLM facilitators in online discussions

data-analysis dataset-generation llms llms-reasoning nlp

Last synced: 06 Mar 2026

https://github.com/li-pearl/gene-count-normalizer

First step of data wrangling in MERFISH data project

data-analysis merfish merscope python

Last synced: 13 May 2026

https://github.com/jasoncobra3/whatsapp_chat_analyzer

WhatsApp Chat Analyzer is a powerful tool that provides insightful analytics from your WhatsApp conversations. Whether you're curious about your chatting habits, want to analyze group dynamics, or need to extract meaningful data from your conversations, this tool has got you covered!

data-analysis data-science data-visualization machine-learning streamlit streamlit-webapp whatsapp-chat whatsapp-chat-analyzer

Last synced: 31 Jan 2026

https://github.com/giordano-lucas/tesco-extension

Products clustering and interactive visualization

clustering data-analysis data-visualization tesco

Last synced: 17 Jun 2026

https://github.com/gmbeddard/ee152-realtime_embedded_systems-finalproject

An STM32-based implementation of the Pan-Tompkins algorithm for real-time QRS detection. Includes robust debugging tools, heart rate monitoring, and live ECG signal support via a python graphing script.

cpp-programming data-analysis ecg embedded-c freertos stm32

Last synced: 21 Apr 2026

https://github.com/nishchal-kansara/loan_eligibility_prediction

This project aims to create a robust machine learning model that accurately predicts an applicant's eligibility for a loan based on various features such as income, credit history, and marital status.

data-analysis data-cleaning data-science data-visualization datascience dataset loan-eligibility

Last synced: 23 Jun 2026

https://github.com/docuvesta/la-prairie-luxury-skincare-makeup-analysis

Web scraping La Prairie skincare regional websites for brand and product insights 🛍️

cosmetics data-analysis data-analytics data-visualization jupyter-notebook luxury python science skincare

Last synced: 19 Apr 2026

https://github.com/stevapple/elasticsearch-utils

Asynchronous data processing and import/export for Elasticsearch, written in Python.

data-analysis data-processing elasticsearch python

Last synced: 15 May 2026

https://github.com/jishen-harilal/analytical-models-in-excel

A curated Excel workbook showcasing core data analysis techniques - including regression, classification, dimensionality reduction, and cross-validation - implemented entirely within spreadsheets. Ideal for demonstrating manual model logic, clean formatting, and advanced Excel proficiency without code.

analytics cross-validation data-analysis data-analytics data-science-portfolio data-visualization decision-trees excel excel-models knn linear-regression logistic-regression machine-learning manual-calculations no-code-machine-learning pca predictive-modeling spreadsheet-models statistical-analysis

Last synced: 15 May 2026

https://github.com/saksham-jain177/automated-data-analysis-and-visualization

Automated Data Analysis and Visualization is a Streamlit web application designed for quick and insightful data analysis. Users can easily upload CSV files, perform automated preprocessing, and generate interactive visualizations such as histograms, scatter plots, and heatmaps.

automated-reporting data-analysis data-preprocessing data-science data-visualization datasets exploratory-data-analysis interactive-visualizations machine-learning python streamlit

Last synced: 15 May 2026

https://github.com/zachpinto/real-time-indicators

Streamlit-based analytics dashboard visualizing real-time economic indicators. This project uses cron jobs to provide real-time updates of common economic indicators

analytics-engineering data-analysis plotly streamlit visualization

Last synced: 15 May 2026

https://github.com/patilni3/numpy-in-depth

Python's NumPy Library for Data Analysis, Machine Learning, Data Science and many more...

data-analysis data-engineering data-science machine-learning numpy pandas

Last synced: 10 May 2026

https://github.com/patilni3/seaborn-in-depth

Python's Seaborn Library for Data Analysis, Machine Learning, Data Science and many more...

data-analysis data-reporting data-representation data-science data-visualization plots-in-python powerbi seaborn sns

Last synced: 03 Apr 2025

https://github.com/unrndm/dataanalysis

Artifacts and solutions for homework from the "Data Analysis" course in the HSE master's program, 2023-2024

2023-2024 data-analysis hse

Last synced: 27 Mar 2025

https://github.com/pinedah/escom_development-of-applications-for-data-analysis

This repository is a personal collection of programs, exercises, and notes from the Development of Applications for Data Analysis course at Instituto Politécnico Nacional (IPN). As part of the Bachelor's in Data Science, the course focuses on developing practical skills in Python for data analysis.

data-analysis data-science data-visualization jupyter-notebook python python-data-analysis

Last synced: 20 Jan 2026

https://github.com/listiangr/product_sales_data_analysis

This project analyzes sales data to provide insights into sales trends, profitability, and product demand, helping companies plan more effective pricing, promotion, and inventory management strategies.

corrplot data-analysis data-preprocessing data-visualization dplyr ggcorrplot ggplot2 product-sales r-language rstudio

Last synced: 03 Apr 2025

https://github.com/mdaffailhami/customer-data-analysis

This repository contains code and analysis for exploring customer data, focusing on profiling and contact preferences. The project includes various stages of data processing, from raw data preparation to final cleaned datasets, and employs Python and popular data analysis libraries to uncover insights and trends.

data-analysis data-cleaning data-science data-visualization jupyter jupyter-notebook pandas plotly python

Last synced: 03 Mar 2026

https://github.com/noeyislearning/sharpe-ratio-amazon-facebook

Explore the Sharpe Ratio and its application to evaluate the performance of two tech giants: Amazon and Facebook.

amazon data-analysis data-science data-visualization facebook python3 sharpe-ratio

Last synced: 27 Mar 2025
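The Sharpe ratio mentioned in the entry above is the mean excess return over a risk-free rate divided by the standard deviation of that excess return, often scaled to a yearly figure. The sketch below is illustrative only: the `sharpe_ratio` function, its parameters, and the sample returns are hypothetical and are not taken from the repository or from real Amazon or Facebook data.

```python
import math
import statistics


def sharpe_ratio(returns, risk_free_rate=0.0, periods_per_year=252):
    """Annualized Sharpe ratio from per-period returns.

    risk_free_rate is per period; periods_per_year=252 assumes daily trading data.
    Needs at least two returns, because the sample standard deviation is used.
    """
    excess = [r - risk_free_rate for r in returns]
    spread = statistics.stdev(excess)
    if spread == 0:
        raise ValueError("returns have zero variance; Sharpe ratio is undefined")
    return statistics.mean(excess) / spread * math.sqrt(periods_per_year)


# Hypothetical per-period returns; periods_per_year=1 leaves the ratio unscaled
ratio = sharpe_ratio([0.01, 0.02, 0.03], periods_per_year=1)
print(ratio)
```

A higher ratio means more return per unit of volatility, which is what makes it usable for comparing two stocks with different risk levels.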

https://github.com/dogoncouch/dhcptranslate

Parses ISC DHCP server config, performs DNS resolution as needed, and outputs lease data in CSV format.

configuration csv-format data-analysis isc-dhcp isc-dhcp-server migration-tool

Last synced: 20 Mar 2025

https://github.com/vivienneforreal/covid4eu-sorbonne

Economy: “Analysis of Labor Market decisions of men and women during the COVID-19 pandemic in the 4EU+ countries”.

covid-19 data-analysis data-science data-visualization pandas

Last synced: 20 Mar 2025

https://github.com/thecoderpinar/customer-segmentation-clv-analysis

Optimize marketing strategies and enhance decision-making. Explore customer data, segment behavior, calculate CLV, analyze demographics, and visualize insights. 🚀

clv-analysis customer-segmentation data-analysis data-science data-visualization jupyter-notebook machine-learning marketing-strategy python

Last synced: 03 Apr 2025

https://github.com/pawlo77/kaggle-project

Repository for 'kaggle' project of Data Science Scientific Circle at Faculty of Mathematics and Information Science, Warsaw University of Technology

data-analysis data-science eda maschine-learning

Last synced: 20 Mar 2025

https://github.com/akashparley/stocklyzer

Stocklyzer is a real-time stock analysis web app built with Streamlit. It features stock performance tracking, technical indicators, CAPM-based risk-return insights, and ARIMA-based price prediction. Ideal for finance enthusiasts, analysts, and learners exploring data-driven investing tools.

arima-forecasting data-analysis financial-analysis machine-learning stock-price-prediction

Last synced: 16 May 2026

https://github.com/valeriopagliarino/electronics-2021-unito-public

Data analysis and simulations for the course "Electronics laboratory" held at Physics Dep. - University of Turin, 2021

data-analysis electronics physics

Last synced: 27 Mar 2025

https://github.com/rdrahul123/sales-dashboard

The Sales Analysis Dashboard was developed to provide insights into sales, profits, and product performance across different categories, timeframes, and geographic locations. By leveraging Power BI, the project aimed to transform raw data into actionable visualizations, facilitating better decision-making for stakeholders.

data-analysis data-science data-visualization dax powerbi

Last synced: 06 Jan 2026

https://github.com/enamhasan/analyzing-the-impact-of-recession-on-automobile-sales

Data Analysis and Visualization Dashboard of the Impact of Recession on Automobile Sales

dashboard data-analysis data-science data-visualization pandas plotly plotly-dash python

Last synced: 05 May 2026

https://github.com/nabilshadman/r-data-analysis

A modular R framework for data analysis, with emphasis on data processing and reproducible workflows.

data-analysis data-cleaning data-manipulation data-science descriptive-statistics programming r r-studio statistical-analysis statistical-computing t-test

Last synced: 04 Apr 2025

https://github.com/reusjimenez/data-analysis-labs

Complete case studies and hands-on data analysis exercises. 📊

data-analysis data-visualization jupyter-notebook machine-learning matplotib numpy panel python sklearn

Last synced: 04 Apr 2025

https://github.com/foxriver76/iobroker.intelliflow

Stream data analysis adapter for ioBroker.

data-analysis iobroker machine-learning streaming-data

Last synced: 04 Apr 2025

https://github.com/rishabhraj43/diwali-sales-analysis

A Data Analysis project made in Python

data-analysis python

Last synced: 01 May 2026

https://github.com/nafiealhilaly/analyzing-sa-schools-data

A simple Python Streamlit app to explore and analyze a Saudi Arabia schools dataset from data.gov.sa

data-analysis data-visualization eda python streamlit

Last synced: 16 May 2026

https://github.com/quantumudit/sales-statistical-analysis

This project focuses on a statistical analysis (using SQL queries) of key metrics that impact the overall sales of a fictitious store.

data-analysis postgresql sales-analysis sql statistics

Last synced: 16 May 2026

https://github.com/avikdatta/python_data_docker_files

A repository for docker files for data analysis using Python and Hadoop

data-analysis dockerfile python-docker raspbian spark ubuntu1604

Last synced: 06 May 2025

https://github.com/adityav42/deloitte-forage-virtual-internship

Submission for Deloitte's STEM Virtual Program on Forage, focusing on data analysis, forensic technology, and cybersecurity.

coding cybersecurity data-analysis deloitte development forage forensics-technology virtual-program

Last synced: 29 Oct 2025

https://github.com/anoni-net/onionoo-fastapi

Semantic/OpenAPI proxy for the Tor Metrics Onionoo API, built with FastAPI for easier integration and automated analysis.

agentic-ai ai-agents data-analysis fastapi network-metrics observability onionoo openapi privacy pydantic python semantic-apis tor tor-metrics

Last synced: 16 May 2026

https://github.com/cuadernin/regex_importance

A simple essay on regular expressions

clean-code data-analysis data-mining data-science python r regex

Last synced: 05 Apr 2025