An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/abhijeet107/task-4

Design an interactive dashboard for business stakeholders.

data-analysis excel-csv tableau-dashboards tableau-public

Last synced: 22 Jan 2026

https://github.com/codeslash21/analyze-a-b-test-results

Analyze results of an A/B test run by an e-commerce website.

ab-test data-analysis

Last synced: 22 Jan 2026

https://github.com/abishek0103/olist-ecommerce-sql-project

SQL Project using Olist Dataset – E-commerce analysis with MS SQL Server to extract business insights.

business-insights data-analysis sql-server

Last synced: 19 Oct 2025

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/mothraa/etl-marketanalysis-webscraping-poo

OC project 2 refactoring (OOP version not yet completed)

data-analysis etl poo python web-scraping

Last synced: 20 Oct 2025

https://github.com/tomkyle/binning

Determine optimal number of bins 𝒌 for histogram creation and optimal bin width 𝒉 using various statistical methods.

binning data-analysis distributions doanes-rule freedman-diaconis histogram histogram-binning math php-math rice-rule scotts-rule square-root statistics sturges-rule terrell-scotts-rule

Last synced: 21 Oct 2025
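
The rules named in the tomkyle/binning entry above can be illustrated with a minimal Python sketch. It is not the repository's PHP code; the function names are hypothetical, and the formulas are the common textbook forms of each rule. The quartiles come from Python's default "exclusive" method, which is an assumption (other quantile conventions give slightly different widths).

```python
import math
import statistics


def sturges_bins(n: int) -> int:
    """Sturges' rule: k = ceil(log2(n)) + 1."""
    return math.ceil(math.log2(n)) + 1


def rice_bins(n: int) -> int:
    """Rice rule: k = ceil(2 * n^(1/3))."""
    return math.ceil(2 * n ** (1 / 3))


def square_root_bins(n: int) -> int:
    """Square-root rule: k = ceil(sqrt(n))."""
    return math.ceil(math.sqrt(n))


def scott_width(data) -> float:
    """Scott's rule: h = 3.49 * s / n^(1/3), with s the sample standard deviation."""
    return 3.49 * statistics.stdev(data) / len(data) ** (1 / 3)


def freedman_diaconis_width(data) -> float:
    """Freedman-Diaconis rule: h = 2 * IQR / n^(1/3)."""
    q1, _, q3 = statistics.quantiles(data, n=4)  # default 'exclusive' method
    return 2 * (q3 - q1) / len(data) ** (1 / 3)


if __name__ == "__main__":
    sample = list(range(1, 101))  # illustrative data only: the integers 1..100
    print(sturges_bins(len(sample)))      # 8
    print(rice_bins(len(sample)))         # 10
    print(square_root_bins(len(sample)))  # 10
    print(freedman_diaconis_width(sample))
    print(scott_width(sample))
```

The count-based rules depend only on the sample size, while the width-based rules also use the spread of the data, so the two families can disagree noticeably on skewed samples.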

https://github.com/saisurajmatta/nashville-housing-data-cleaning-project

Clean and standardize Nashville Housing dataset using SQL queries for improved data quality and structure.

azure-data-studio data-analysis mssql mysql sql sql-data-cleaning sql-queries sql-server-management-studio

Last synced: 23 Jan 2026

https://github.com/navp7/hr_analysis_excel

This project uses Microsoft Excel to conduct a comprehensive analysis of HR data, focusing on identifying the reasons for employee attrition and evaluating job satisfaction.

dashboards data-analysis excel visualization

Last synced: 23 Jan 2026

https://github.com/changyeop-yang/study-datasciencefoundation

Big data science and analytics play a major role in this decade. Cleaning and preparing data for analysis is still a challenge, as are performing basic visualization, modeling the data, curve-fitting it, and finally presenting the findings in a way that wows the audience.

data-analysis ios kyungpook-national-university swift

Last synced: 23 Oct 2025

https://github.com/albertobarrago/sentinel

A contribution to the research of Corrado Malanga and Filippo Biondi

data-analysis sar

Last synced: 24 Oct 2025

https://github.com/brianlesko/r_data_science_stat5730

Written by Brian Lesko, this repository contains R scripts demonstrating data science topics, largely originating from study at Ohio State. Contents are written in RStudio using R Markdown files. As of 1/21/23, future projects concerning data science, statistics, and machine learning will be in Python in my machine learning repository.

data data-analysis flight-data ggplot2 olympics-data r-markdown tidyverse

Last synced: 23 Jan 2026

https://github.com/jofaval/sonar

Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987

data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci

Last synced: 09 Apr 2026

https://github.com/alessandroryo/bike-rental-data-analysis

A data analysis project focused on understanding and predicting bike rental patterns. This project utilizes data processing, visualization, and predictive modeling techniques to gain insights into bike rental usage, fulfilling the final submission requirement for Dicoding Indonesia's Data Analysis course.

bike-rental data-analysis data-visualization jupyter-notebook machine-learning python streamlit

Last synced: 09 Apr 2026

https://github.com/a26nine/kortext-usage-dashboard

An interactive data visualisation dashboard built with Tableau to understand the value of digital resources issued on the Kortext platform at Middlesex University, London.

data-analysis data-science data-visualization knime tableau

Last synced: 01 Feb 2026

https://github.com/psychelzh/cogstruct-old

Data Analysis on Cognitive Structure

cognition data-analysis intelligence psychology

Last synced: 25 Oct 2025

https://github.com/ljadhav25/linear_regression_data_science

Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable. The variable you are using to predict the other variable's value is called the independent variable.

data-analysis data-science linear-regression machine-learning

Last synced: 26 Oct 2025
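
The description in the linear_regression_data_science entry above can be made concrete with a minimal sketch of simple (one-variable) least-squares regression. This is not taken from the repository; `fit_line` and `predict` are hypothetical helper names, and the data in the usage example is made up for illustration.

```python
def fit_line(x, y):
    """Fit y = slope * x + intercept by ordinary least squares.

    x holds the independent variable, y the dependent variable.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x_new):
    """Predict the dependent variable for a new value of the independent one."""
    return slope * x_new + intercept


if __name__ == "__main__":
    x = [1, 2, 3, 4]  # independent variable (illustrative values)
    y = [3, 5, 7, 9]  # dependent variable, here exactly y = 2x + 1
    slope, intercept = fit_line(x, y)
    print(slope, intercept)               # 2.0 1.0
    print(predict(slope, intercept, 10))  # 21.0
```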

https://github.com/madhursinghbhadoriya/data_analysis_fifa-players

Processed important information and characteristic traits of FIFA players using NumPy, Matplotlib, Pandas, etc. in Jupyter Notebook.

analysis data-analysis data-science graphs jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/srinibas-masanta/infosys-springboard-internship

An interactive Power BI dashboard developed during my Infosys Springboard Internship to visualize Indian election trends. It integrates historical and live API data to analyze vote shares, turnout patterns, and demographic insights across constituencies, helping news agencies report results in real time.

dashboard data-analysis data-cleaning data-collection data-visualization dax-functions powerbi

Last synced: 25 Feb 2026

https://github.com/vishalsiingh/deloitte-virtual-internship

Submission for the STEM Virtual Program by Deloitte via Forage.

coding cyber-security data-analysis deloitte development forage forensics

Last synced: 23 Jan 2026

https://github.com/limatix/limatix

Limatix datacollect and processtrak tools

data-analysis python scientific-workflows

Last synced: 23 Jan 2026

https://github.com/9dl/usbfalcon

Automatically copies files from plugged USB drives to a specified location, enabling quick data retrieval for analysis.

automation data-analysis data-retrieval ethical-hacking file-copying usb

Last synced: 27 Oct 2025

https://github.com/code-jl/nfl-kicker-predictor

A sophisticated Python application that provides real-time NFL kicker statistics and performance analysis with an intuitive graphical interface.

beautifulsoup data-analysis data-visualization espn football gui nfl prediction python real-time-analytics real-time-data sport-analytics sports-data statistics tkinter web-scraping

Last synced: 01 Jun 2026

https://github.com/gaurabkundu1/road-accident-data-analysis

This is an Excel project on Road Accident Data Analysis in the form of an interactive Dashboard.

dashboard data-analysis data-vizualisation excel road-accidents

Last synced: 24 Jan 2026

https://github.com/diegopino/publibdata_codexhackathon

Public Library Data processing/analysis codex hackathon attempt

data-analysis data-visualization libraries public

Last synced: 24 Jan 2026

https://github.com/aneeshmurali-n/global-superstore-sales-dashboard---power-bi-stunning-dark-theme

This Power BI dashboard provides a comprehensive view of sales data, enabling users to analyze sales trends, identify top-performing regions, and gain insights into customer behavior.

dark-theme dashboard data-analysis data-science data-visualization powerbi salesdashboard

Last synced: 28 Jan 2026

https://github.com/rahulchouhan1/sql-data-warehouse-project

Building a modern data warehouse with SQL Server, including ETL Processes, data modeling, and analytics.

data-analysis data-cleaning data-engineering data-science data-warehouse datascience etl etl-pipeline sql sql-query sql-server

Last synced: 24 Jan 2026

https://github.com/wassimhd/pwc-switzerland-power-bi-in-data-analytics-virtual-case-experience

This project, provided by the PwC virtual internship, helps build a foundation in data analysis and Power BI.

data-analysis data-visualization datastorytelling powerbi

Last synced: 28 Jan 2026

https://github.com/yash1882/music-store-data-analysis

A project focused on analyzing music store data using SQL ♬

begineer-friendly data-analysis music music-store-data music-store-data-analysis sql-project

Last synced: 28 Jan 2026

https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin

This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.

arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series

Last synced: 06 May 2026
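
The three error metrics named in the time-series-forecasting-for-bitcoin entry above (MAE, RMSE, MAPE) can be sketched in a few lines of Python. This is a generic illustration with made-up numbers, not the repository's evaluation code or its results; the function names are hypothetical.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error; penalizes large misses more than MAE does."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


def mape(actual, predicted):
    """Mean absolute percentage error, in percent. Assumes no actual value is 0."""
    return 100 * sum(
        abs((a - p) / a) for a, p in zip(actual, predicted)
    ) / len(actual)


if __name__ == "__main__":
    actual = [100.0, 200.0, 400.0]     # illustrative closing prices
    predicted = [110.0, 190.0, 400.0]  # illustrative model forecasts
    print(mae(actual, predicted))
    print(rmse(actual, predicted))
    print(mape(actual, predicted))
```

MAE and RMSE are in the units of the series (price), while MAPE is scale-free, which is why the three are often reported together when comparing models.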

https://github.com/srimantapal205/dataengineerwireframedesigns

Data Engineer Wireframe Designs are essential for planning and visualizing data pipelines, architecture, and workflows before implementation.

data-analysis data-engineering dataflow dataflow-programming datapipeline dataprocessing development visualization

Last synced: 29 Jan 2026

https://github.com/angchekar28/sales-report-power-bi

A Power BI sales report analyzing country-wise and product-wise sales trends. Includes dashboards, decomposition trees, and key influencers analysis for business insights.

dashboard data-analysis data-cleaning data-visualization powerbi sales-report

Last synced: 16 Mar 2026

https://github.com/engineertolulope/us_states_living_ranking_analysis

Python script for analyzing and ranking U.S. states based on factors like cost of living, tax burden, diversity, crime rates, and climate. Uses weighted criteria to identify the best states to live in according to these metrics. Ideal for decision-making on relocation.

data-analysis data-science linear-regression machine-learning python scikit-learn

Last synced: 29 Jan 2026
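
One common way to rank items by weighted criteria, as the us_states_living_ranking_analysis entry above describes, is to min-max normalize each criterion and take a weighted sum. The sketch below assumes that approach; the repository may do something different (its tags mention linear regression). The names `min_max` and `rank_by_weighted_score`, the placeholder states, and all numbers are hypothetical.

```python
def min_max(values):
    """Scale values to [0, 1]; a constant column maps to 0.5."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.5] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def rank_by_weighted_score(items, weights, lower_is_better):
    """Return (name, score) pairs sorted from best to worst.

    items: {name: {criterion: raw value}}
    weights: {criterion: weight}
    lower_is_better: criteria where a smaller raw value is preferable
    """
    names = list(items)
    scores = {name: 0.0 for name in names}
    for criterion, weight in weights.items():
        normalized = min_max([items[name][criterion] for name in names])
        for name, value in zip(names, normalized):
            if criterion in lower_is_better:
                value = 1.0 - value  # flip so that higher is always better
            scores[name] += weight * value
    return sorted(scores.items(), key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Made-up placeholder data, not real state statistics.
    states = {
        "State A": {"cost_of_living": 100, "crime_rate": 5},
        "State B": {"cost_of_living": 200, "crime_rate": 1},
    }
    weights = {"cost_of_living": 0.7, "crime_rate": 0.3}
    ranking = rank_by_weighted_score(
        states, weights, lower_is_better={"cost_of_living", "crime_rate"}
    )
    for name, score in ranking:
        print(name, round(score, 2))
```

The ranking is sensitive to the chosen weights, so it helps to rerun it with a few alternative weightings before drawing conclusions.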

https://github.com/wareflowx/excel-toolkit

A powerful command-line toolkit for Excel and CSV data manipulation, analysis, and transformation.

data-analysis data-wrangling excel pandas python uv

Last synced: 29 Jan 2026

https://github.com/mattdelaune/powerbi_healthcare_dashboard

Interactive Hospital Insights Dashboard built with Power BI, showcasing comprehensive analysis of patient demographics, treatment outcomes, and hospital performance.

data-analysis healthcare power-bi visualization

Last synced: 29 Jan 2026

https://github.com/smahala02/magnetism-lab

This repository contains Python scripts and data for analyzing inductance in toroidal coils to calculate the magnetic permeability of ferrite materials. The project helps classify materials as soft or hard magnets based on experimental data.

data-analysis inductance jupyter-notebook magnetism python toroids

Last synced: 29 Jan 2026
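
The calculation mentioned in the magnetism-lab entry above can be sketched under a stated assumption: a toroidal core with rectangular cross-section, for which the inductance is L = mu_0 * mu_r * N^2 * h * ln(r_outer / r_inner) / (2 * pi). The repository may use a different geometry or approximation; the function names and the numbers below are hypothetical.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # vacuum permeability in H/m (classical value)


def toroid_inductance(mu_r, turns, height, r_inner, r_outer):
    """Inductance (H) of a toroid with rectangular cross-section (SI units)."""
    return (
        MU_0 * mu_r * turns ** 2 * height
        * math.log(r_outer / r_inner) / (2 * math.pi)
    )


def relative_permeability(inductance, turns, height, r_inner, r_outer):
    """Solve the same formula for mu_r, given a measured inductance."""
    return 2 * math.pi * inductance / (
        MU_0 * turns ** 2 * height * math.log(r_outer / r_inner)
    )


if __name__ == "__main__":
    # Illustrative values only: 20 turns on a core 10 mm high,
    # inner radius 10 mm, outer radius 20 mm, measured L = 1 mH.
    mu_r = relative_permeability(1e-3, 20, 0.010, 0.010, 0.020)
    print(mu_r)
```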

https://github.com/shrutiijoshi/marketing-campaign-report

The dataset includes information on campaign types, recipient segments, interactions (clicks, opens, bounces, etc.), and conversion metrics.

dashboard data-analysis data-visualization tableau-public

Last synced: 25 Feb 2026

https://github.com/joannescode/regex_with_py

Learning by practicing with Regex (Python)

data-analysis python3 regex

Last synced: 30 Jan 2026

https://github.com/surajwate/datalab

DataLab is a versatile toolkit designed to simplify data exploration, analysis, and visualization for data scientists.

data-analysis data-science python visualization

Last synced: 30 Jan 2026

https://github.com/mfakhriazhar/healthcare-dashboard-project

This project is a comprehensive data analysis and visualization of healthcare data using Power BI. It focuses on understanding patient distribution, billing trends, and hospital performance through a clean and interactive dashboard.

dashboard dashboardreporting data-analysis datacleaning excel powerbi powerquery

Last synced: 30 Jan 2026

https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-

This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms

data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier

Last synced: 15 Apr 2026

https://github.com/manishabarse/hr_data_analysis

Used Microsoft SQL Server Management Studio and Power BI

data-analysis powerbi sql ssms

Last synced: 30 Jan 2026

https://github.com/nehar-2404/airbnb-nyc-eda-ml

This project analyzes Airbnb listings in New York City to uncover key insights about pricing, host activity, and neighborhood trends. It covers data cleaning, EDA, and basic machine learning to predict listing prices.

airbnb data-analysis eda machine-learning matplotlib pandas pyhton seaborn visualization

Last synced: 15 Apr 2026

https://github.com/jcaperella29/jc_bioinformatics_hub

A personal hub to showcase my bioinformatics applications including RNA-Seq, ATAC-Seq, and miRNA-Seq analysis tools. Powered by simple HTML, CSS, and JavaScript with a biotech-themed design.

atac-seq bioinformatics biotech data-analysis github-pages portal rna-seq webapp

Last synced: 25 Feb 2026

https://github.com/gurpreet17/uc-davis-sql-for-data-science-specialization

Completed the SQL Basics for Data Science Specialization from the University of California, Davis, gaining proficiency in Data Analysis, SQL, Apache Spark, and Delta Lake.

apache-spark bigdata data-analysis data-science delta-lake sqlite

Last synced: 15 Apr 2026

https://github.com/auliannee/new-york-uber-pickups-analysis

This repository contains projects related to data collection, quality checking, manipulation, analysis, and visualization.

data-analysis data-science ipython-notebook jupyter-notebook python

Last synced: 07 Feb 2026

https://github.com/tralahm/parliament-2017-dataset

Concise, Clean data sets of the 2017 Kenyan General Election results for the Members of the Senate and National Assembly Composition

csv-parsing data-analysis data-visualization datasets election-data ipynb-jupyter-notebook kaggle-dataset kenya-constituencies kenya-counties matplotlib python3 tralahtek

Last synced: 31 Jan 2026

https://github.com/amishidesai04/flipkart-mobile-sales-analysis

Flipkart Mobile Sales Analysis is a Tableau project that visualizes mobile sales data from Flipkart. It highlights trends in brand performance, pricing, ratings, and customer preferences. The interactive dashboard helps users explore key insights for data-driven decisions in e-commerce and retail.

dashboard data-analysis data-visualization storyboard tableau

Last synced: 31 Jan 2026

https://github.com/traore-07/fedex-sales-analysis

Analysis of the FedEx Sales Transaction

data-analysis data-visualization sales-analysis tabeau

Last synced: 31 Jan 2026

https://github.com/cca/panopto-session-data

analyzing Panopto session data for retention purposes

data-analysis ipython-notebook video

Last synced: 07 Feb 2026

https://github.com/ginanti-riski/streamlit_datapenyewaansepeda

Bike Sharing Analysis is a project that aims to understand bike rental patterns based on factors such as weather, season, and day. The project uses data analysis techniques to gain deeper insight into bike rental trends.

data-analysis data-analysis-python data-science data-visualization python streamlit

Last synced: 15 Apr 2026

https://github.com/malthejorgensen/repx

Python regular expression file transformer

command-line-tool data-analysis text-processing

Last synced: 31 Jan 2026

https://github.com/steviecurran/gbt-scripts

IDL scripts for the reduction of Green Bank Telescope data

data-analysis data-compression data-visualization radio-astronomy spectroscopy

Last synced: 31 Jan 2026

https://github.com/ajmannust41288/data-analyst

Data analyst and Microsoft professional expert: Power BI Desktop, Tableau, and dashboards, with use of ChatGPT-4 AI

business-analytics data-analysis data-analyst data-analytics eda

Last synced: 01 Feb 2026

https://github.com/axsk/geekgraph

parse, cluster and visualize boardgamegeek.com user profiles

data-analysis scraper

Last synced: 01 Feb 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/tapas-gope/global-superstore-sales

This repository contains a Power BI dashboard designed to provide comprehensive insights into sales performance across various regions, segments, and products. The dashboard utilizes a variety of visualizations, including bar charts, line charts, maps, and tables, to effectively communicate key metrics and trends.

business-intelligence data-analysis data-modeling data-visualization financial-reporting powerbi sales-analysis

Last synced: 07 Feb 2026

https://github.com/ludreinsalvador/life-expectancy-data-analysis

Contains Power BI dashboards analyzing global life expectancy trends, mortality rates, and health expenditures. Using a dataset sourced from Google Sheets, the project explores the impact of economic and healthcare factors on longevity.

dashboard data-analysis data-visualization healthcare-analysis life-expectancy powerbi

Last synced: 25 Feb 2026

https://github.com/rohitdusane/healthcare-analytics

This Power BI dashboard is designed to provide valuable insights into patient waiting times across outpatient and inpatient healthcare services. It offers a comprehensive analysis of key factors influencing wait lists, including Age Profile, Specialty, Time Bands, and Patient Case Types.

data-analysis data-visualization dax dax-query healthcare-analysis powerbi-report

Last synced: 01 Feb 2026

https://github.com/vishnu-vamshii/data-science-jobs-salaries

Created an interactive dashboard to analyze data science job salaries by world region, experience level, average salary in USD, and type of employment, along with a geographical visual.

data-analysis data-science data-visualization tableau tableau-dashboard

Last synced: 01 Feb 2026

https://github.com/vladimiracunadev-create/python-data-science-program

Python Data Science Program: 197 classes in 9 parts. Advanced syllabus derived from Géron, VanderPlas, Huyen, ISLP, and Barocas/Hardt/Narayanan. A personal resource for learning, teaching, and continuous improvement.

bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 Jun 2026

https://github.com/shubham200137/customer-churn-analysis

In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.

data-analysis data-visualization numpy-python pandas-python sqlite tableau

Last synced: 15 Apr 2026

https://github.com/suhail25/hotel-booking-analysis

Analyzed hotel booking cancellations and summarized insights for the hotel manager to increase profit by 30%. Demonstrated data exploration, cleaning, and analysis using Python and its libraries: pandas, seaborn, and matplotlib. Documented the results in a PDF report, covering a 30% reduction in cancellations and the release of discounts for 10 days a month.

data-analysis ipynb-notebook matplotlib pandas python seaborn

Last synced: 08 Feb 2026

https://github.com/mdaltamashalam/uber-fare-prediction-models

Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.

catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm

Last synced: 26 Feb 2026

https://github.com/michalspano/maturitna-skuska-proj

Maturita exam (maturitná skúška) 2021/2022: objective data processing and analysis

data-analysis

Last synced: 19 Mar 2026

https://github.com/josericodata/statisticsapp

Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.

alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test

Last synced: 26 Feb 2026

https://github.com/jweinst1/xenon

A processing based language

data-analysis interpreter reactive-programming

Last synced: 15 Apr 2026

https://github.com/themihirmathur/uber-data-analytics

The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).

bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python

Last synced: 09 Feb 2026

https://github.com/an1mch1k-theone/project_1_hh_analyze

Project: analysis of resumes from HeadHunter

data-analysis data-analysis-project python

Last synced: 15 Apr 2026

https://github.com/aisurjyasamantaray/-optimizing-target-s-brazilian-operations-insights-from-order-processing-pricing-and-payment-trends-

This project offers an in-depth analysis of consumer behavior, logistical performance, and payment preferences within the e-commerce sector. By examining order costs, delivery times, and payment methods, businesses can uncover valuable insights into operational efficiency and customer preferences.

bigquery consumer-insights data-analysis database sql target

Last synced: 26 Feb 2026

https://github.com/27ahmad/amazon-sales-analysis

This repository contains an exploratory data analysis (EDA) and visualization project of Amazon sales data. The goal is to uncover insights and present key metrics through a Tableau dashboard.

data-analysis eda pandas python seaborn tableau

Last synced: 15 Apr 2026

https://github.com/rajeev2806/netflix-data-analysis

In this project I implemented ETL, using PostgreSQL and Python to clean and analyze a Netflix dataset.

data-analysis data-cleaning postgresql python

Last synced: 15 Apr 2026

https://github.com/nulltea/kicksware-scrapebot

Web scraping tool to retrieve sneaker details & images from web store sites

bot data-analysis pandas python sneakers web-scraping

Last synced: 15 Apr 2026

https://github.com/mathusanm6/critics-vs-players-analysis

This data analysis examines the relationship between critic scores, sales (owners), player engagement, and pricing to determine the ROI of critic reviews.

data-analysis data-science data-visualization game-reviews games-sales jupyter-notebook python-3 steam-games

Last synced: 16 Apr 2026

https://github.com/haroontrailblazer/machine_learning

A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics

Last synced: 16 Apr 2026

https://github.com/shruti23-ui/blinkit-powerbi-dashboard

A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.

data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql

Last synced: 09 Feb 2026

https://github.com/aakk23/perfomance-dashboard-tableau

This Tableau dashboard provides an interactive analysis of Superstore sales data, covering key metrics like sales, profit, orders, and customer trends. It helps visualize business performance across product categories, customer segments, and geographic regions.

data-analysis data-visualization superstore-data-analysis tableau tableau-dashboards

Last synced: 10 Feb 2026

https://github.com/tushar2704/imdb-movie-analysis

This project extracts meaningful insights, trends, and patterns from the data, shedding light on various aspects of the movie industry. By leveraging this analysis, filmmakers, studios, and enthusiasts can gain valuable information to inform decision-making, understand audience preferences, and contribute to the creation of successful movies.

artificial-intelligence data-analysis data-science imdb project tushar2704

Last synced: 10 Feb 2026

https://github.com/nickenshidqia/startup-venture-funding-dashboard-data-analysis

The Startup Venture Funding Dashboard is a comprehensive visual representation of the dynamic landscape of startup funding, providing valuable insights into the top startups, funding round types, markets, startup statuses, and investor details.

dashboard data-analysis tableau tableau-dashboards

Last synced: 11 Feb 2026

https://github.com/haonamnguyen/data-science-job-analysis

Evaluate the factors influencing salary trends in the data science industry, including experience levels, job titles, employment types, company sizes, and remote work arrangements, to help HR teams and hiring managers make data-driven decisions regarding compensation packages and recruitment strategies.

data-analysis data-science data-visualization jupyter-notebook python

Last synced: 16 Apr 2026

https://github.com/multitagging/benchmarks

Provides benchmarks to test the MultiTagging framework

benchmarks data-analysis ethereum smart-contracts vulnerabilities

Last synced: 11 Feb 2026

https://github.com/dhruwsunita/car-sales-dashboard

Car sales dashboard using Tableau visualization tool.

car-sales data-analysis data-visualization excel kpis tableau

Last synced: 27 Feb 2026

https://github.com/ohimoiza1205/mastercard-cybersecurity-simulation

Served as an analyst on Mastercard’s Security Awareness Team to identify and report security threats

cybersecurity data-analysis data-presentation security-awareness-training technical-security-awareness

Last synced: 11 Feb 2026