An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/jcm-ai/Quantium-Data-Analytics-Virtual-Experience-Program

This repository contains my proposed solutions to the assignments I was required to complete as part of the Quantium Data Analytics Virtual Experience Program. 📊📈📉👨‍💻

commercial-thinking communication-skills data-analysis data-validation data-visualisation data-wrangling jupyter-notebook matplotlib-pyplot numpy-library pandas-python presentation-skills programming python3 scipy-stats seaborn statistical-testing

Last synced: 19 Aug 2025

https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn

Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation

data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis

Last synced: 08 Apr 2026

https://github.com/l0rd-inquisit0r/data-analytics

A repository of data analytics implementations in Python

ai data-analysis data-analysis-python data-analytics

Last synced: 18 Jun 2025

https://github.com/ejw-data/tableau-drug-study

Brief analysis of drug treatments that were also analyzed with pandas

data-analysis tableau

Last synced: 02 Jan 2026

https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard

This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.

dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit

Last synced: 18 Apr 2026

https://github.com/faizantkhan/automated-eda

This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.

automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz

Last synced: 18 Apr 2026

https://github.com/anamakarevich/suicide_rates_factors

Female suicide rates analysis for a Udacity Hackathon

data-analysis data-cleaning linear-regression suicide

Last synced: 21 May 2026

https://github.com/puspacempaka/hackerrank-sql-challenges-intermediate

This repository features solutions to various intermediate-level SQL challenges from HackerRank. It includes efficient SQL queries, problem-solving techniques, and well-documented scripts. Explore these solutions to understand different SQL problems and enhance your skills.

challenges data-analysis database hackerrank-solutions queries sql sql-intermediate-level

Last synced: 02 Jan 2026

https://github.com/simranshaikh20/credit-card-dashboard

A Data Visualization Project using Microsoft Power BI

data-analysis data-visualization powerbi

Last synced: 02 Jan 2026

https://github.com/mmzong/gee_lifestyleeffectsonhypertension

Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.

aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots

Last synced: 29 Jul 2025

https://github.com/iliyasalve/cyclistic_case_study

Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"

bike-sharing data data-analysis data-visualisation r

Last synced: 06 Apr 2025

https://github.com/ginga1402/car_price_prediction

Predict the price of a car using MS Excel.

college-project data-analysis excel linear-regression

Last synced: 30 Mar 2025

https://github.com/abhishekyadav915/diwali_sales_analysis

This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python

Last synced: 05 Apr 2025

https://github.com/tashi-2004/data-visualization-tableau-traffic-collision-insights

Analysis of traffic collision data using Tableau, featuring interactive visualizations that highlight trends in injuries and fatalities, contributing factors, and geographic distributions. It includes various sheets and dashboards, with recommendations for enhancing road safety. The dataset is available for further exploration.

data-analysis data-visualization eda geospatial-analysis machine-learning predictive-modeling statistics tableau traffic-analysis

Last synced: 19 Mar 2026

https://github.com/smsraj2001/sds-datathon

A simple data science project/hackathon done as part of SDS course

data-analysis data-analysis-python data-cleaning data-science statistics statistics-for-data-science

Last synced: 16 Jul 2025

https://github.com/jabulente/t-test-python-implementation

A Python-based implementation of one-sample, two-sample, and paired t-tests for statistical analysis and hypothesis testing.

automation data-analysis data-science eda exploratory-data-analysis hypothesis-testing independent-ttest one-sample-t-test python reporting statistics ttest two-sample-t-test

Last synced: 27 Jun 2025
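
The three tests named in the t-test-python-implementation entry above can be sketched with the standard library alone. This is a minimal illustration, not the repository's code: the function names are hypothetical, the two-sample version assumes Welch's unequal-variance form, and only the t statistic is computed (no p-value).

```python
import math
import statistics


def one_sample_t(sample, popmean):
    """t statistic for H0: the sample mean equals popmean."""
    n = len(sample)
    se = statistics.stdev(sample) / math.sqrt(n)  # standard error of the mean
    return (statistics.mean(sample) - popmean) / se


def paired_t(before, after):
    """Paired t-test: a one-sample test on the per-pair differences."""
    diffs = [a - b for a, b in zip(after, before)]
    return one_sample_t(diffs, 0.0)


def welch_t(x, y):
    """Two-sample t statistic without assuming equal variances (Welch)."""
    vx = statistics.variance(x) / len(x)  # squared standard error of mean(x)
    vy = statistics.variance(y) / len(y)  # squared standard error of mean(y)
    return (statistics.mean(x) - statistics.mean(y)) / math.sqrt(vx + vy)
```

Turning a statistic into a p-value requires the t distribution's CDF, which the standard library does not provide; a library such as SciPy is the usual choice for that step.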

https://github.com/thesfinox/mltools

A collection of simple tools for data science and machine learning projects.

ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox

Last synced: 14 May 2025

https://github.com/chaedoll/teamproject-foreignerreport

Report on improving infrastructure for foreigners living in Korea

data-analysis python

Last synced: 25 Feb 2025

https://github.com/leticiamilan/dashboard-analitico-de-vendas-globais

Global Sales Analytics Dashboard - DSA - Developed with Power BI

dashboard dashboard-power-bi data-analysis power-bi powerbi

Last synced: 03 Feb 2026

https://github.com/mr-chang95/datascience_airbnb

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn

Last synced: 08 Apr 2026

https://github.com/dimits-ts/sport-repression-repl-study

A replication study for the recent paper "International Sports Events and Repression in Autocracies: Evidence from the 1978 FIFA World Cup".

data-analysis jupyter regression-models replication-study statistical-analysis

Last synced: 30 Jun 2026

https://github.com/kunalkumar2001/sales-project-using-excel-and-sql

Comprehensive sales analysis using SQL, Excel, and PowerPoint to uncover insights on top-sellers, peak times, and branch performance.

data-analysis data-analytics excel mssql sql

Last synced: 03 Nov 2025

https://github.com/rijul007/market-basket-analysis-using-r

Market Basket Analysis using association rules, leveraging R’s powerful tools for data-driven retail strategies.

data-analysis data-science r

Last synced: 02 Apr 2025

https://github.com/annnieglez/computer-vision-parking-lot

This project leverages computer vision techniques to analyze parking lot occupancy. The goal is to detect available parking spaces in real-time using image and video input.

computer-vision data-analysis data-science data-visualization google-colab image-classification image-processing machine-learning python transfer-learning

Last synced: 15 May 2026

https://github.com/kathisnehith/realestate-sales-analysis

Investigating real estate sales trends to understand market dynamics and inform investment decisions.

data-analysis excel realestate sales sql stastical-analysis-tools tableau

Last synced: 12 Feb 2026

https://github.com/myke003/data-analysis-projects

This repository serves as a collection of all my projects.

data-analysis jupyter-notebook powerbi

Last synced: 14 Mar 2025

https://github.com/vishal-038/real_estate_price_prediction

The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features

data-analysis data-science data-visualization machine-learning python

Last synced: 21 May 2026

https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau

• Performed Data Cleaning using MySQL. • Data analysis and ETL in Tableau. • Created an Interactive Dashboard with significant information about the Sales Insights, Profit and Revenue Analysis.

data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop

Last synced: 09 Apr 2025

https://github.com/easonsyc/kc-house-price-prediction

Prediction for House Price in King County.

data-analysis jupyter-notebook machine-learning python

Last synced: 21 May 2026

https://github.com/cano1998/data-visualization-project

A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.

bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot

Last synced: 17 Jul 2025

https://github.com/garcane/credit-card-transactions-fraud-detection-project

The Credit Card Transactions Fraud Detection Project repository is designed to analyse and detect fraudulent transactions in credit card data.

data-analysis postgresql sql

Last synced: 03 Feb 2026

https://github.com/ishmal793/basic-python-

Beginner-friendly Python code examples and exercises – a strong foundation for aspiring data analysts.

data-analysis data-analytics learning-python-code problem-solving python-basics python-for-beginners

Last synced: 23 Jul 2025

https://github.com/nishumehta/retail-sales-analysis

Retail sales performance analysis using Python and Power BI.

data-analysis ipynb-notebook jupyter-notebook powerbi python

Last synced: 15 May 2026

https://github.com/shrutiii1109/diwali-sales-analysis-through-python

Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.

data-analysis jupyer-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/nishumehta/british-airways-reviews-analysis

This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.

dashboard data-analysis data-visualization tableau tableau-public

Last synced: 12 Jan 2026

https://github.com/prakashjha1/whatsapp-chat-analyzer

WhatsApp Analyzer analyzes WhatsApp group activity. It tracks conversations and measures how much time we spend (or "waste") on WhatsApp.

data-analysis data-science natural-language-processing pandas pyhton regular-expression

Last synced: 15 May 2026

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits. It utilizes the Python libraries NumPy, pandas, and scikit-learn. The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice.

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 02 Jan 2026

https://github.com/vara-co/solar-eclipse-2024

Group Project on the 2024 Solar Eclipse's Path over the US with an interactive map and a couple of visualizations on the data gathered.

data-analysis data-visualizations html-css-javascript interactive-map javascript map solar-eclipse

Last synced: 15 May 2026

https://github.com/mamtapanda088/dataanalaysis-warmup-

Tasks: Create a DataFrame: Convert the dictionary into a pandas DataFrame. Top and Bottom Rows: Display the top 3 and bottom 3 rows of the DataFrame. Summary Statistics: Generate summary statistics for the dataset. Gender Count: Count the occurrences of each gender. Marks Analysis: Calculate the average, maximum, and minimum marks. Tools Used: Python, pandas.

data-analysis data-science jupyter-notebook visualization

Last synced: 04 Apr 2025
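
The warm-up tasks listed in the dataanalaysis-warmup- entry above map onto a handful of pandas calls. The sketch below assumes a made-up dictionary with hypothetical column names (`name`, `gender`, `marks`), since the repository's actual data is not shown in this listing.

```python
import pandas as pd

# Hypothetical data; the repository's own dictionary is not part of this listing.
data = {
    "name": ["Asha", "Ben", "Chen", "Dina", "Eli"],
    "gender": ["F", "M", "M", "F", "M"],
    "marks": [82, 67, 91, 74, 58],
}

df = pd.DataFrame(data)                     # create a DataFrame from the dictionary
top3, bottom3 = df.head(3), df.tail(3)      # first three and last three rows
summary = df.describe()                     # summary statistics for numeric columns
gender_counts = df["gender"].value_counts() # occurrences of each gender
marks_stats = df["marks"].agg(["mean", "max", "min"])  # average, maximum, minimum
```

Each result is itself a pandas object, so it can be printed or inspected further in a notebook cell.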

https://github.com/lucashomuniz/project-05

[STATISTICAL ANALYSIS] Integrating Automation and Visualization for Optimal Data Analytics

automation data-analysis kolmogorov-smirnov language-r nonparametric-analysis parametric-analysis shapiro-wilk shiny-apps statistics t-test wilcoxon-test

Last synced: 30 Mar 2025

https://github.com/shibbir-ahmad24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 05 Apr 2025

https://github.com/achronus/data-exploration

A repository dedicated to interesting data exploration projects I've completed

data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 02 Jan 2026

https://github.com/shadz23/smart-energy-dashboard

Power BI dashboard analyzing household electricity consumption to reveal usage patterns, peak hours, and estimated costs for smarter energy management and reduced bills. 🐙

chart data-analysis data-visualization dax energy-consumption hs110 hs300 ibm ibm-cloud influxdb jupyter-notebook kasa kp115 linuxone observability photovoltaics-dashboard plotly sense

Last synced: 19 Aug 2025

https://github.com/k31ner/inmopipeline

Comprehensive project for the analysis and predictive modeling of real estate data, covering collection, transformation, visualization, and machine learning using Python and modern data engineering and data science tools.

data-analysis data-engineering data-science fastapi python streamlit

Last synced: 08 May 2026

https://github.com/rahmamohammad/retail_project

Retail & Data analytics: KPIs, sales trends, Excel planning pack, forecasting & inventory tracking.

data-analysis data-visualization ecommerce excel jupyter-notebook matplotlib python retail-analytics storytelling

Last synced: 17 May 2026

https://github.com/saymyname1337/bachelor-s-thesis

Bachelor's thesis of Shevts G. V., a student at MPEI.

data-analysis ml python

Last synced: 23 Jul 2025

https://github.com/anas436/data-science-projects

Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in to discover insights and techniques in data science. Reach out for collaborations and feedback.

data-analysis data-science machine-learning

Last synced: 27 Mar 2025

https://github.com/sdley/logiciel-de-deliberation-uam-2022

Del-Annuel is software for the annual deliberation of higher-education schools or universities

data-analysis pandas python tkinter-gui

Last synced: 08 May 2026

https://github.com/maazie-khan/austin-housing-insights-powerbi

Working with a real estate dataset, we build a tool to evaluate trends and drivers of house prices around Austin, Texas.

dashboard data-analysis data-science data-visualization database powerbi

Last synced: 02 Jan 2026

https://github.com/admacpherson/admacpherson.github.io

This repository hosts my personal website & portfolio. You can find my work experience, endorsements, contact information, and more on it at andrewmacpherson.dev

data-analysis personal-site portfolio website

Last synced: 15 Sep 2025

https://github.com/rudra-g-23/power-bi-custom-visual

A custom Power BI visual that displays customizable, interactive charts with advanced capabilities.

custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization

Last synced: 02 Jan 2026

https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset

Engineering / Culture / Blogs data gathered from Zomato's blogs for educational and learning purposes, to spread the better problem-solving methodologies adopted by modern unicorns

data-analysis dataset regex selenium webdriver zomato-data-analysis

Last synced: 06 Apr 2025

https://github.com/aalkiyumi/project-3-docker-container-for-data-processing-script

This Dockerized Python application analyzes two text files (IF.txt and AlwaysRememberUsThisWay.txt). It counts total words, identifies the largest file, and finds the top three most frequent words in each. Results are saved to an output file and printed to the console.

cs5165 data-analysis data-engineering data-science docker introduction-to-cloud-computing statistical-analysis text-processing uc uc2026 university-of-cincinnati

Last synced: 17 May 2026
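
The processing described in the Docker container entry above (total word count, largest file, top three words per file) can be sketched as follows. This is an illustration under stated assumptions, not the repository's script: `analyze` is a hypothetical name, "largest" is taken to mean the highest word count, and in-memory strings stand in for the two text files.

```python
import re
from collections import Counter


def analyze(texts):
    """texts maps a file name to its contents; returns per-file stats and the largest file."""
    report = {}
    for name, content in texts.items():
        # Lowercase, then keep runs of letters and apostrophes as words.
        words = re.findall(r"[a-z']+", content.lower())
        report[name] = {
            "total_words": len(words),
            "top_three": Counter(words).most_common(3),
        }
    # Assumption: "largest" means the file with the most words.
    largest = max(report, key=lambda name: report[name]["total_words"])
    return report, largest


# Stand-in contents; the real script would read its files from disk.
sample = {
    "a.txt": "the cat and the dog and the bird",
    "b.txt": "one two two",
}
report, largest = analyze(sample)
```

Keeping the analysis in a function that takes strings makes it easy to test outside the container; reading the files and writing the output file can then be a thin wrapper around it.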

https://github.com/tushar2704/hiring-process-analytics

In this project, I am analyzing hiring process data to gain insights from records of previous hires within a multinational company. By analyzing this data, I aim to uncover valuable trends and information about the company's hiring process, which can contribute to informed decisions and improvements for the future.

data-analysis data-cleaning data-science data-wrangling excel tushar2704

Last synced: 25 Jan 2026

https://github.com/tushar2704/employee-distribution

This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.

data-analysis data-visualization excel postgresql powerbi sql tushar2704

Last synced: 04 Nov 2025

https://github.com/tushar2704/consumables_sales_dashboard

Welcome to the Consumable Sales Dashboard, a powerful and intuitive data visualization tool built using Power BI. This dashboard offers a comprehensive view of sales data for consumable products, allowing you to quickly and easily analyze performance and identify trends.

dashboard data-analysis data-analytics data-science excel postgresql powerbi streamlit-tushar2704 tushar2704

Last synced: 04 Nov 2025

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/maazie-khan/power-bi-projects

Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.

dashboard data-analysis data-science data-visualization database excel powerbi

Last synced: 02 Jan 2026

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/sramalhao/sleep_health_analysis

This repository contains a comprehensive project focused on analyzing various factors influencing sleep health, such as BMI, occupation, gender, age, physical activity, and stress levels.

analytics data-analysis eda matplotlib pandas python seaborn sklearn visualization

Last synced: 13 Apr 2026

https://github.com/jpgiant/training_project

Analyzing whether there is a difference between the average death ages of left-handers and right-handers using Bayes' theorem of conditional probability.

bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python

Last synced: 30 Apr 2026

https://github.com/omari-kd/transborder-freight-data-analysis

This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.

data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-science data-visualization powerbi

Last synced: 30 Mar 2025

https://github.com/aditiagrawal04/netflix-insights-mysql-

SQL-based analytical project exploring Netflix’s dataset to extract insights about content type, genre, ratings, country-based distributions, and release trends. Ideal for understanding business intelligence using SQL.

business-intelligence data-analysis data-exploration mysql netflix sql sql-project

Last synced: 28 Jun 2025

https://github.com/kailenroa/sleep-efficiency-project

This project focuses on analyzing sleep efficiency using wearable technology data. It explores patterns in sleep behavior and key factors impacting sleep quality. A dashboard was created using Python and data visualization tools to provide actionable insights and recommendations for improving sleep health.

dashboard data-analysis html phyton sleep-efficiency

Last synced: 06 Jan 2026

https://github.com/hevalhazalkurt/word_analyser

A web app developed in Python and Django that analyzes given text mathematically and sentimentally.

analyzer analyzes content data-analysis django emotion python python3 sentiment sentiment-analyser sentiment-analysis text text-analysis

Last synced: 19 May 2026

https://github.com/mmfava/analises-papers

Base scripts for some papers published between 2019 and 2021.

data-analysis r

Last synced: 22 May 2026

https://github.com/sunnyrao07/data-analysis-dashboard-in-excel

I implemented a comprehensive data analysis solution using Excel, developing multiple dashboards and tables to visualize and interpret the data. This involved a rigorous data cleaning and preprocessing pipeline followed by data visualization.

dashboard data-analysis excel visualization

Last synced: 03 Feb 2026

https://github.com/al-ghaly/hotel-revenue-excel-analysis

Excel Dashboard to analyze data of a hotel over the past three years.

dashboard data-analysis data-visualization excel excel-analysis

Last synced: 02 Jan 2026

https://github.com/analyticslover/salifort-motors-turnover-project

The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem within that scenario: employee turnover. Essential techniques such as EDA and data modeling are used to analyze and predict employee turnover rates in the company.

data data-analysis datamodeling eda machine-learning pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/poglolopez/nesarc_research

Analyzing the relationship between Social Anxiety Disorder (SAD) and family history of behavioral problems using NESARC data. Includes statistical hypothesis testing (ANOVA, Chi-Square, Pearson Correlation, Moderation Analysis). Developed as part of the Data Analysis and Interpretation Specialization from Wesleyan University (Coursera).

anova chi-square coursera-assignment data-analysis hypothesis-testing mental-health moderation-analysis nesarc pandas pearson-correlation python social-anxiety statistical-analysis

Last synced: 14 Apr 2026

https://github.com/yrohitha/titanic-data-analysis

Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.

data-analysis machine-learning matplotlib pandas scipy-stats statistical-models

Last synced: 13 Mar 2025

https://github.com/vipulbunny/ml-learning_projects

A collection of machine learning projects implemented in Python, showcasing core concepts like regression, classification, clustering, and model evaluation techniques. Ideal for learners and data science enthusiasts.

classification clustering data-analysis data-science data-visualization decision-trees jupyter-notebook machine-learning model-evaluation random-forest regression supervised-learning unsupervised-learning

Last synced: 23 Jul 2025