An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/rohithsaji97/open_gate_dip

An automatic gate opening system with an additional parking system (using Raspberry PI).

automated data-analysis digital-image-processing opencv python3 raspberry-pi-3 trained-models

Last synced: 04 Feb 2026

https://github.com/an1mch1k-theone/project_2_hh_analyze

Анализ вакансий из HeadHunter

data-analysis data-analysis-project postgresql python sql

Last synced: 14 Apr 2026

https://github.com/ryanfranklin237/data-visualization-python

A tool that allows you to visualize data from a csv or excel file in a graph or charts form

data-analysis data-science data-visualization matplotlib pandas-dataframe python

Last synced: 11 Jun 2026

https://github.com/jossimmar/ensa-ss25

Repositorio destinado al manejo de datos de consumo de los Clientes Mayores de ENSA del Grupo Distriluz.

data-analysis electrical-engineering python sqlite

Last synced: 30 Mar 2025

https://github.com/alejo1630/chicago_crimes

A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium

data-analysis data-visualization folium pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection

The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.

banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier

Last synced: 06 Oct 2025

https://github.com/victoriapm/analyze_a-b_test_results

Understand the results of an A/B test run by an e-commerce website.

ab-testing data-analysis ecommerce-website

Last synced: 06 Oct 2025

https://github.com/vi/rendercsv

Tool to convert CSV table to a picture.

animation csv csv2pic csv2png data-analysis picture png table table-renderer visualization

Last synced: 01 Apr 2025

https://github.com/nurfakhri/e-commerce-data-analyst

E-commerce data analysis supported by data wrangling, EDA, and web dashboard

dashboard data-analysis e-commerce flask-application python

Last synced: 10 Feb 2026

https://github.com/aryansharma5/data-visualization-and-thorough-analysis

comprehensive guide for data analysis and visualization

data-analysis data-visualization

Last synced: 18 Mar 2025

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 21 Jan 2026

https://github.com/aaryan-agr/canadian-energy

This project analyzes Canada's energy trade, focusing on imports, exports, and market trends in the energy sector.

data-analysis data-cleaning data-manipulation data-processing data-science data-vizualisation energy-sector time-series-analysis

Last synced: 10 Jun 2025

https://github.com/komailmk/instagram-reach-forecasting

This repository provides a Python-based solution for forecasting Instagram reach using historical data and SARIMA modeling techniques.

data-analysis data-visualizations machine-learning

Last synced: 05 Oct 2025

https://github.com/erickkhosasi/thelook-data_analysis

Final project for my SQL mini bootcamp. This project explores an e-commerce dataset to uncover key business insights. Data insights were queried in Google BigQuery and visualized with Google Sheets.

bigquery data-analysis e-commerce sql

Last synced: 05 Oct 2025

https://github.com/Zen204/airbnb-availability

A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.

binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning

Last synced: 02 Apr 2025

https://github.com/saeun-park/lg-aimers-4th

MQL 데이터 기반 B2B 영업기회 창출 예측 모델 개발

b2b data-analysis data-science machine-learning mql

Last synced: 08 Apr 2025

https://github.com/mafda/seattle_airbnb_data_analysis

This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.

crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data

Last synced: 29 May 2026

https://github.com/solrikk/pictrace-web

PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image

automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api

Last synced: 12 Apr 2026

https://github.com/tks18/pyquery

PyQuery is a local-first data operating system built on lazy execution that processes 100GB+ files while you doomscroll. No cap. 🧢

data-analysis data-science etl hdfs parquet pipeline polars python

Last synced: 14 Jan 2026

https://github.com/happybono/sonatasmooth

Provides three different noise reduction algorithms for smoothing out data : Rectangular Averaging, Binomial Median Filtering, and Binomial Averaging. It processes data from a list and displays the results in another list.

algorithms average binomial binomial-coefficient binomial-theorem calibration csharp data-analysis data-calibration dynamic-noise-reduction median noise-algorithms noise-reduction noise-reduction-kernel outliers rectangular-averaging windows-desktop windows-desktop-application windows-forms winforms

Last synced: 30 Oct 2025

https://github.com/lobooooooo14/badwords-pt-br

💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"

badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist

Last synced: 25 Mar 2025

https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity

In this project a brief analysis has been done by our team in the export economy of Bangldesh for the past three decades.

data-analysis data-science data-visualization inequalipy matplotlib pandas plotly

Last synced: 22 May 2026

https://github.com/rara-ch/data-analysis-portfolio

This repository to store my data analytics projects, showcasing my skills in SQL and Python.

data-analysis mathematics matplotlib numpy pandas portfolio probability python seaborn sql statistics

Last synced: 12 Mar 2025

https://github.com/ddihora1604/iitk_esg

Researching and Analyzing key ESG (Environmental, Social, Governance indicators) metrics and their impact on stock performance and market behavior. Leveraging AI techniques (like Machine Learning and NLP) in finance to extract insights from ESG disclosures, enhancing financial predictions and sustainable investment strategies.

data-analysis data-visualization esg python yahoo-finance

Last synced: 24 Apr 2025

https://github.com/geo-y20/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber

Last synced: 13 Apr 2026

https://github.com/jeniljani-4444/end-to-end-world-cup-analysis-web-app

Our streamlined Streamlit web app fetches and processes ESPN CricInfo data delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.

aws-ec2 data-analysis plotly preprocessing streamlit-webapp

Last synced: 02 Apr 2025

https://github.com/rani-sikdar/pwc-virtual-internship-powerbi

Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.

business-analytics business-intelligence data-analysis data-cleaning data-visualization interactive interactive-visualizations powerbi

Last synced: 07 Jan 2026

https://github.com/sivas-2/food-demand

This project aims to predict food demand based on historical data, leveraging various statistical methods to achieve accurate forecasts.

data-analysis data-science dataanalysis food-demand-forecasting statistics

Last synced: 12 Aug 2025

https://github.com/chen0040/pyspark-advanced-algorithms

Samples of Advanced Algorithms and Data Analysis implemented in pyspark

advanced-algorithms data-analysis map-reduce pyspark

Last synced: 12 Jan 2026

https://github.com/prajakta1321/kaggle-ai-report-2023

A Report describing the trends in emergence of AI over the years !

data-analysis data-visualization python3

Last synced: 28 Jun 2025

https://github.com/phillbertnevinemmanuel/dataprofessionalquestionnaireanalysis_pbix

This project delves into the demographics and job satisfaction of data professionals, presenting insights through a user-friendly dashboard built with Power BI. The dataset has been graciously provided by Alex The Analyst

dashboard data-analysis powerbi visualization

Last synced: 19 Mar 2026

https://github.com/fer-aguirre/taller-cookiecutter

Taller sobre cómo usar Cookiecutter para análisis de datos.

cookiecutter data-analysis project-template workshop

Last synced: 19 Mar 2026

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 04 Mar 2025

https://github.com/allanotieno254/employee-performance-tracker-excel-

An Excel-based tool to track and evaluate employee performance, compliance, and skills assessments with summary statistics and visual charts

compliance-tracker data-analysis employee-performance-analysis excel human-resources

Last synced: 19 Mar 2026

https://github.com/allanotieno254/road-accident-data-analysis-dashboard-using-excel

This repository contains the Road Accident Data Analysis Dashboard, a comprehensive Excel-based tool designed to provide in-depth analysis and visualization of road accident data.

dashboards-excel data-analysis excel kpi visualization

Last synced: 19 Mar 2026

https://github.com/5ekastanx/data-analysis

Extracting data from parsing, for example, like hacking using Python using all sorts of function methods

data-analysis html python

Last synced: 14 Mar 2025

https://github.com/jerela/mola

A Python library for matrix algebra

data-analysis linear-algebra-library matrix-algebra python

Last synced: 14 Jan 2026

https://github.com/lmuffato/analise-de-diarias-prefeituas-do-es

Esse código faz parte de um projeto de descoberta e combate a esquemas de corrupção, através do tratamento e cruzamento de dados abertos disponíveis em diversas prefeituras do Espirito Santo através do portal da transparência. Junção e análise de várias tabelas importadas em csv.

data-analysis personal-project r rstudio

Last synced: 12 Jun 2025

https://github.com/luminati-io/shopee-dataset-samples

A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..

api data-analysis data-mining datasets products shopee web-scraping

Last synced: 12 Feb 2026

https://github.com/sijuswamy/data-analytics-using-r

Course Repository for Data Analysis using R- Add-on course

data-analysis

Last synced: 12 Apr 2025

https://github.com/sarathchandranpm/restaurant_order_analysis

This project entails an in-depth analysis of a restaurant's order and menu data. The focus is on exploring customer ordering behaviors, menu item attributes, and order specifics. By investigating the connections between order details, menu items, and order dates, the project seeks to generate valuable insights into the restaurant's operations.

data-analysis mysql sql

Last synced: 10 Apr 2025

https://github.com/arhcoder/base-hackathon-2022

💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.

algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions

Last synced: 16 Apr 2026

https://github.com/as16082023/hotel-booking-analysis-eda-

Exploratory Data Analysis on hotel booking data using Python

data-analysis data-visualization exploratory-data-analysis jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/as16082023/restaurant-order-analysis

Analyzing order data to identify the most and least popular menu items and types of cuisine

data-analysis maven-analytics mysql restaurant-order sql

Last synced: 10 Apr 2025

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app is utilizing React and React-Chartjs-2

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 11 May 2026

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 11 Apr 2026

https://github.com/thevinh-ha-1710/diabetes-predictive-model

This project aims to train a predictive model to diagnose diabetes on women patients.

data-analysis data-science data-visualization model-training-and-evaluation python

Last synced: 13 Feb 2026

https://github.com/realorangeone/docker-cyberchef

A containerized deployment of CyberChef, with additional protections

cyberchef data-analysis data-manipulation docker encoding

Last synced: 24 Aug 2025

https://github.com/mr-vozhyk/test-tasks

Выполненные тестовые задания (не запрещенные к публикации)

analysis data-analysis digital-analysis e-commerce excel google-colab google-sheets marketing marketplace power-bi power-query python sql

Last synced: 07 May 2026

https://github.com/bcko/ud-da-stroopeffect

Udacity Data Analyst Nanodegree Project : Test a Perceptual Phenomenon (Stroop Effect)

data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree

Last synced: 04 Jul 2025

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/nafisalawalidris/sales-performance-dashboard

Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.

analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business

Last synced: 03 Feb 2026

https://github.com/md-emon-hasan/1-simple-stock-price-ml-app

A simple mahcine learning application for stock prices, demonstrating data preprocessing, model training, and deployment using scikit-learn.

data-analysis data-science eda ml-app streamlit-webapp time-series time-series-analysis webapp

Last synced: 31 May 2026

https://github.com/lmuffato/project-ting-trybe

Projeto ting - Projeto avaliativo da Trybe do Bloco 37: Estrutura de Dados II: Listas, Filas e Pilhas

data data-analysis python queue read-file stack trybe trybe-projects

Last synced: 12 Jun 2025

https://github.com/lmuffato/dados-meteorologicos-inmet-tratamento

Tratamento e enálise de dados meteorológicos das estações locais fornecidos pelo INMET, utilizando a linguagem R

data-analysis personal-project r rstudio

Last synced: 12 Jun 2025

https://github.com/daniel1kp/openrtb-dashboard

This is a demo project designed to illustrate using Rill to analyze programmatic bid logs using the canonical open RTB framework.

data-analysis openrtb real-time-bidding rill

Last synced: 19 Mar 2026

https://github.com/dina-hosny/telco-customer-churn-analysis-using-power-bi

An interactive dashboard to represent some analysis of "Telco customer churn" data and the reasons that made customers churn using Microsoft Power BI.

business-intelligence data-analysis data-modeling data-visualization power-bi powerbi

Last synced: 19 Mar 2026

https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel

This script takes a single audio file and collect all the sound features in it including (Frequency - mean - variance - minimum value - maximum value - Median - Kurtosis - Skewness) and save it in an array and import this array in a row in the datasheet.

automation data-analysis data-science dataset-generation excel-import signal-processing

Last synced: 18 Apr 2026

https://github.com/sathyasris27/statistical-analysis-on-rehoming-time-for-different-dog-breeds-in-animal-shelter

The aim of this project is given a collection of records documenting the stray, unwanted, or neglected dogs sent to animal shelters to be rehomed, we analyse their rehoming patterns based on their breeds.

data-analysis r statistical-analysis statistical-inference statistical-models

Last synced: 05 Jun 2026

https://github.com/spaghettifunk/gvb

Analysis of GVB in Amsterdam

data-analysis public-transportation

Last synced: 28 Feb 2026

https://github.com/rayyan9477/household-transactions-analysis-and-clustering

This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.

customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning

Last synced: 27 Feb 2025

https://github.com/garcane/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 19 Mar 2026

https://github.com/vitia-fritelle/analise_dieese

Análise realizada com base nos dados extraídos do site https://www.dieese.org.br/analisecestabasica/salarioMinimo.html

data-analysis economic-data

Last synced: 09 Apr 2025

https://github.com/ssreeramj/youtube_channels_analysis

This web app gives a detailed analysis of the videos uploaded in a particular youtube channel.

data-analysis heroku pandas python streamlit youtube

Last synced: 29 Apr 2026

https://github.com/nafisalawalidris/buybuy-e-commerce-company

The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.

buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql

Last synced: 16 Mar 2025

https://github.com/nafisalawalidris/tools-for-data-science

It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.

arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development

Last synced: 11 Apr 2026

https://github.com/pradeepchegur/seamantic_web_design

We designed a semantic web for Instagram in Wix platform.

data-analysis framework instagram semantic-web website-design wix

Last synced: 19 Mar 2026

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 16 Mar 2025

https://github.com/devexpress-examples/aspnet-pivot-grid-custom-aggregates

This example shows how to aggregate data by the field's first value.

asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms

Last synced: 06 Jul 2025

https://github.com/viztruth/google-play-store-data-analysis

This repository contains all the materials of my final project 'Google Play store Data Analysis' for the 'Telling Stories with Data' course at PES University.

data-analysis data-visualization

Last synced: 21 Aug 2025

https://github.com/ahmednasef3/udemy-courses-full-eda

Exploratory Data Analysis on the factors that can affect the promotions and earnings in Udemy Courses and the perfect way to make a good saled course in Udemy.

data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project

Last synced: 01 May 2026

https://github.com/edisedis777/duckdb-analyzer

A powerful tool for analyzing large CSV datasets using DuckDB.

csv data-analysis database duckdb

Last synced: 16 Apr 2026

https://github.com/beallio/wherewolf

Wherewolf is a production-grade, local SQL workbench designed for data engineers and analysts to query local files (CSV, Parquet, JSON) with ease. Built with Streamlit, it provides a unified interface to execute SQL against either DuckDB or PySpark engines without requiring complex setup.

big-data data-analysis data-engineering etl parquet performance pyspark python spark-sql sql uv

Last synced: 28 Apr 2026

https://github.com/anurag-kumar-molankala/blinkit-grocery-sales-dashboard

The BlinkIT Grocery Sales Dashboard is an interactive Power BI dashboard that provides insights into grocery sales performance. It includes key KPIs, sales trends, and outlet performance analysis.

business-intelligence dashboards data-analysis data-visualization dax excel kpi-dashboard power-bi powerquery slicers-kpi-card-multirow-card sql-server ssis ssms ssrs-reports

Last synced: 09 Apr 2025

https://github.com/leosimoes/datascienceacademy-powerbi-3.0

Projetos do curso Microsoft Power BI Para Data Science Versão 3.0 da DataScienceAcademy. Dashboards para diversos casos de negócios.

business-intelligence dashboards data-analysis data-visualization microsoft-power-bi

Last synced: 19 Mar 2026

https://github.com/shadan100/sales-prediction-analysis

The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction

Last synced: 01 Mar 2026

https://github.com/akankshaaa013/30-day-machine-learning-deep-learning

To practically Learn, Explore, and Share my Insights on the Libraries and Tools that power Machine Learning.

data-analysis machine-learning python

Last synced: 15 Mar 2025

https://github.com/antononcube/wl-datareshapers-paclet

Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.

contingency-table cross-tabulation data-analysis data-transformation long-form wide-form

Last synced: 20 Mar 2026

https://github.com/antononcube/wl-quantileregression-paclet

Wolfram Language (aka Mathematica) paclet that provides various Quantile Regression functions.

data-analysis machine-learning quantile-regression time-series time-series-analysis

Last synced: 20 Mar 2026

https://github.com/emredurukn/data-analysis

Example notebooks for analyzing data

data-analysis data-visualization python

Last synced: 12 May 2026

https://github.com/harshmule1/store-sales-analysis

Sales Analysis Using Power Bi

data-analysis powerbi

Last synced: 19 Mar 2026