An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/andimashkulli/vpms

Vehicle Parking Management System for Gjon Buzuku Gymnasium

backend-api data-analysis databases frontend-react mongodb nodejs software

Last synced: 12 Feb 2026

https://github.com/syarwinaaa09/exploring-airbnb-market-trends

a data analysis project exploring NYC Airbnb listings, using data visualization and pandas for price trends, room types, and reviews.

airbnb data-analysis data-science data-visualization jupyter-notebook new-york-city nyc pandas price-analysis reviews room-types

Last synced: 30 Apr 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/ankit21111/carpredict

This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!

data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn

Last synced: 16 Apr 2026

https://github.com/l1ght14/customer-churn-prediction

Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.

churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom

Last synced: 09 May 2026

https://github.com/edoaltamura/rotational-ksz-macsis

Repository for suppelementary material from my publication on the rotational kinetic SZ effect in MACSIS

cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics

Last synced: 28 Feb 2026

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 16 Apr 2026

https://github.com/rijul007/market-basket-analysis-using-r

Market Basket Analysis using association rules, leveraging R’s powerful tools for data-driven retail strategies.

data-analysis data-science r

Last synced: 02 Apr 2025

https://github.com/kariemseiam/geoegy

An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.

accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development

Last synced: 13 Feb 2026

https://github.com/kathisnehith/realestate-sales-analysis

Investigating real estate sales trends to understand market dynamics and inform investment decisions.

data-analysis excel realestate sales sql stastical-analysis-tools tableau

Last synced: 12 Feb 2026

https://github.com/drod75/burger_king_analysis

A simple analysis on a burger king dataset.

data-analysis data-visualization jupyter-notebook pandas python seaborn

Last synced: 09 May 2026

https://github.com/nishumehta/retail-sales-analysis

Retail sales performance analysis using Python and Power BI.

data-analysis ipynb-notebook jupyter-notebook powerbi python

Last synced: 15 May 2026

https://github.com/prakashjha1/whatsapp-chat-analyzer

WhatsApp Analyzer means we are analyzing our WhatsApp group activities. It tracks our conversation and analyses how much time we are spending or saying it as “wasting” on WhatsApp.

data-analysis data-science natural-language-processing pandas pyhton regular-expression

Last synced: 15 May 2026

https://github.com/secureauditx/ecommerce-user-behavior-analysis

E-commerce User Behavior Analysis with Streamlit Dashboard

customer-segmentation data-analysis ecommerce python streamlit

Last synced: 28 Feb 2026

https://github.com/vara-co/solar-eclipse-2024

Group Project on the 2024 Solar Eclipse's Path over the US with an interactive map and a couple of visualizations on the data gathered.

data-analysis data-visualizations html-css-javascript interactive-map javascript map solar-eclipse

Last synced: 15 May 2026

https://github.com/anas436/data-science-projects

Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in to discover insights and techniques in data science. Reach out for collaborations and feedback.

data-analysis data-science machine-learning

Last synced: 27 Mar 2025

https://github.com/rizkipragustono/data_analysis_spark

Exploration: Data Analysis using Spark

apache-spark data-analysis pyspark python spark-sql sql

Last synced: 09 May 2026

https://github.com/m-ah07/text-sentiment-analysis-api

A lightweight Python project for analyzing the sentiment of textual data using the TextBlob library. This project provides a simple and effective way to measure the polarity and subjectivity of any given text.

data-analysis machine-learning python python-project sentiment-analysis text-analysis text-mining

Last synced: 14 Feb 2026

https://github.com/tsbarr/toronto-open-data

Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses

api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics

Last synced: 09 May 2026

https://github.com/lulloooo/bizdata-nexus

Collection of my Business & Data Analysis projects, from professional/academic endeavors to passion-driven explorations 📊

business-analysis data-analysis economics etl excel finance mysql python r risk-analysis

Last synced: 05 Apr 2026

https://github.com/kambleakash0/mubi_eda

Mini Project #1 for EAS503 course at SUNY Buffalo

data-analysis data-visualization eda

Last synced: 16 Apr 2026

https://github.com/mo-elshamy/machine-learning-practice

This repository serves as a collection of my work and learning in machine learning while my internship in Cellual-Technologies, including algorithm explanations, data preprocessing workflows, and two projects.

data-analysis data-science dbscan decision-trees eda gradient-boosting gxboost hierarchical-clustering kmeans-clustering knn-classification linear-regression logistic-regression machine-learning model pca polynomial-regression preprocessing random-forest support-vector-machines training

Last synced: 14 Feb 2026

https://github.com/balajimohan18/tableau-visualization-project

This repository contains Visualization Projects which is visualized through Tableau Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and also it provides social values in some cases to reduce damages by calamities.

data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-public

Last synced: 19 Mar 2026

https://github.com/edumoraes1/republicacao-produtos

SQL Query realizada para criação de automação de disparo de push via salesforce

bq data-analysis salesforce sql

Last synced: 14 Feb 2026

https://github.com/parthds02/-daily-calorie-count-meal-plan-generator-

Welcome to the Daily Calorie Count Meal Plan Generator project! This Streamlit web application is designed to create personalized meal plans based on user inputs such as age, weight, gender, and calorie goals. It also allows users to download their customized meal plans as PDFs.

calories-tracker data-analysis data-science pdf-generation streamlit vscode

Last synced: 13 May 2026

https://github.com/fhdsl/seattlestatsummer_r

A 4-day introduction to R programming, focused on Fred Hutch Research Interns

beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse

Last synced: 19 Mar 2026

https://github.com/hlexnc/project-arepo

Data-driven stroke risk assessment & personalized recommendations, powered by machine-learning and an NLU-driven chatbot.

chatbot data-analysis docker docker-compose machine-learning nlu-chatbot python rasa scikit-learn sklearn streamlit

Last synced: 15 Feb 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/achique-luisdan/tops-songs-db

Base de datos de Tops Semanales de Canciones🎵 más reproducidas en Spotify🎶. Prácticas de SQL enfocadas en el Análisis de Datos (Data Analysis).

data-analysis plpgsql sql

Last synced: 15 Feb 2026

https://github.com/ani717/pneumonia_detection_effecientnet_b7

Pneumonia Detection in Chest X-ray Image with EfficientNet-B7. Accuracy = 87.98%, Precision = 100%, Recall = 83.87%, F1 Score = 91.23.

cnn computer-vision data-analysis data-augmentation efficientnet image-classification image-processing machine-learning

Last synced: 13 May 2026

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/magnus0969/black-friday-sales-analysis

An in-depth analysis of Black Friday sales data to uncover trends, customer behavior, and product insights. Utilizing Python, data visualization, and machine learning techniques, this project provides key business intelligence to optimize sales strategies.

analysis data-analysis data-science python sales-analysis

Last synced: 09 May 2026

https://github.com/kailenroa/sleep-efficiency-project

This project focuses on analyzing sleep efficiency using wearable technology data. It explores patterns in sleep behavior and key factors impacting sleep quality. A dashboard was created using phyton and data visualization tools to provide actionable insights and recommendations for improving sleep health.

dashboard data-analysis html phyton sleep-efficiency

Last synced: 06 Jan 2026

https://github.com/nishumehta/british-airways-reviews-analysis

This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.

dashboard data-analysis data-visualization tableau tableau-public

Last synced: 12 Jan 2026

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits.ultilizes python libraries numpy pandas and scikit learn.The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 02 Jan 2026

https://github.com/shrutiii1109/diwali-sales-analysis-through-python

Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.

data-analysis jupyer-notebook matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/cano1998/data-visualization-project

A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.

bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot

Last synced: 17 Jul 2025

https://github.com/vishal-038/real_estate_price_prediction

The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features

data-analysis data-science data-visualization machine-learning python

Last synced: 21 May 2026

https://github.com/mamtapanda088/dataanalaysis-warmup-

Tasks: Create a DataFrame: Convert the dictionary into a pandas DataFrame. Top and Bottom Rows: Display the top 3 bottom ,3 rows of the DataFrame. Summary Statistics: Generate summary statistics for the dataset. Gender Count: Count the occurrences of each gender. Marks Analysis: Calculate the average, maxi, and min marks. Tools Used: Python ,pandas

data-analysis data-science jupyter-notebook visualization

Last synced: 04 Apr 2025

https://github.com/myke003/data-analysis-projects

This repository serves as a collection of all my projects.

data-analysis jupyter-notebook powerbi

Last synced: 14 Mar 2025

https://github.com/achronus/data-exploration

A repository dedicated to interesting data exploration projects I've completed

data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 02 Jan 2026

https://github.com/siddhant2105s/bring-your-own-device-boyd-system

This repository contains the design and implementation of the Bring Your Own Device (BYOD) System for managing personal devices at Life Insurance Company. It includes an ERD diagram, MySQL scripts for database creation, data insertion, and queries, as well as detailed data definitions and system requirements documentation.

data-analysis database-design database-normalization entity-relationship-diagram entity-relationship-models my-sql relational-databases relational-model sql-queries

Last synced: 15 Feb 2026

https://github.com/saymyname1337/bachelor-s-thesis

Bachelor's thesis of a student of the MPEI of Shevts G. V.

data-analysis ml python

Last synced: 23 Jul 2025

https://github.com/leticiamilan/dashboard-analitico-de-vendas-globais

Dashboard Analítico de Vendas Globais - DSA - Desenvolvido com Power BI

dashboard dashboard-power-bi data-analysis power-bi powerbi

Last synced: 03 Feb 2026

https://github.com/hossein-rahmati/credit-card-fraud-detection

This repository contains the implementation of a machine learning pipeline for detecting fraudulent credit card transactions. The project leverages common data science libraries to preprocess data, train multiple models, and evaluate their performance using appropriate classification metrics.

data-analysis fraud-detection k-fold-cross-validation machine-learning random-forest-classifier

Last synced: 15 Sep 2025

https://github.com/maazie-khan/austin-housing-insights-powerbi

Worked with a real estate dataset, we will build a tool to evaluate trends and drivers of house prices around Austin, Texas.

dashboard data-analysis data-science data-visualization database powerbi

Last synced: 02 Jan 2026

https://github.com/hevalhazalkurt/word_analyser

A web app developed in Python and Django that analyzes given text mathematically and sentimentally.

analyzer analyzes content data-analysis django emotion python python3 sentiment sentiment-analyser sentiment-analysis text text-analysis

Last synced: 19 May 2026

https://github.com/admacpherson/admacpherson.github.io

This repository hosts my personal website & portfolio. You can find my work experience, endorsements, contact information, and more on it at andrewmacpherson.dev

data-analysis personal-site portfolio website

Last synced: 15 Sep 2025

https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset

Engineering / Culture / Blogs Data gathered for Educational and Learning purposes from Zomato's Blogs and spreading the better problem solving Methodologies adapted by Modern Unicorns

data-analysis dataset regex selenium webdriver zomato-data-analysis

Last synced: 06 Apr 2025

https://github.com/tushar2704/hiring-process-analytics

In this project, I am analyzing hiring process data to gain insights from about records of previous hires within a multinational company. By analyzing this data, I am aiming to uncover valuable trends and information about the company's hiring process, which can contribute to making informed decisions and improvements for the future.

data-analysis data-cleaning data-science data-wrangling excel tushar2704

Last synced: 25 Jan 2026

https://github.com/tushar2704/employee-distribution

This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.

data-analysis data-visualization excel postgresql powerbi sql tushar2704

Last synced: 04 Nov 2025

https://github.com/tushar2704/consumables_sales_dashboard

Welcome to the Consumable Sales Dashboard, a powerful and intuitive data visualization tool built using Power BI. This dashboard offers a comprehensive view of sales data for consumable products, allowing you to quickly and easily analyze performance and identify trends.

dashboard data-analysis data-analytics data-science excel postgresql powerbi streamlit-tushar2704 tushar2704

Last synced: 04 Nov 2025

https://github.com/swethajoseph/sales-eda-project

Performed an advanced Excel-based exploratory data analysis (EDA) of an E-Commerce sales dataset to create an interactive dashboard for uncovering key business insights.

advancedexcel data-analysis data-visualization datacleaning dataformatting exploratory-data-analysis msexcel pivot-tables

Last synced: 19 Mar 2026

https://github.com/maazie-khan/power-bi-projects

Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.

dashboard data-analysis data-science data-visualization database excel powerbi

Last synced: 02 Jan 2026

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/chaedoll/teamproject-foreignerreport

국내 외국인 대상 인프라 개선을 위한 보고서 (Report on improving infrastructure for foreigners)

data-analysis python

Last synced: 25 Feb 2025

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/thesfinox/mltools

A collection of simple tools for data science and machine learning projects.

ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox

Last synced: 14 May 2025

https://github.com/abhishekyadav915/diwali_sales_analysis

This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python

Last synced: 05 Apr 2025

https://github.com/ginga1402/car_price_prediction

Predict the price of a car using MS Excel.

college-project data-analysis excel linear-regression

Last synced: 30 Mar 2025

https://github.com/wwgolay/hr1099-timelapse-vlbi

The repository for HR1099 timelapse VLBI.

astronomy astrophysics data-analysis website

Last synced: 03 Apr 2025

https://github.com/aditiagrawal04/netflix-insights-mysql-

SQL-based analytical project exploring Netflix’s dataset to extract insights about content type, genre, ratings, country-based distributions, and release trends. Ideal for understanding business intelligence using SQL.

business-intelligence data-analysis data-exploration mysql netflix sql sql-project

Last synced: 28 Jun 2025

https://github.com/iliyasalve/cyclistic_case_study

Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"

bike-sharing data data-analysis data-visualisation r

Last synced: 06 Apr 2025

https://github.com/mmzong/gee_lifestyleeffectsonhypertension

Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.

aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots

Last synced: 29 Jul 2025

https://github.com/mmfava/analises-papers

Script base de alguns papers publicados entre 2019 e 2021.

data-analysis r

Last synced: 22 May 2026

https://github.com/al-ghaly/hotel-revenue-excel-analysis

Excel Dashboard to analyze data of a hotel over the past three years.

dashboard data-analysis data-visualization excel excel-analysis

Last synced: 02 Jan 2026

https://github.com/simranshaikh20/credit-card-dashboard

A Data Visualization Project using Microsoft Power bi

data-analysis data-visualization powerbi

Last synced: 02 Jan 2026

https://github.com/poglolopez/nesarc_research

Analyzing the relationship between Social Anxiety Disorder (SAD) and family history of behavioral problems using NESARC data. Includes statistical hypothesis testing (ANOVA, Chi-Square, Pearson Correlation, Moderation Analysis). Developed as part of the Data Analysis and Interpretation Specialization from Wesleyan University (Coursera).

anova chi-square coursera-assignment data-analysis hypothesis-testing mental-health moderation-analysis nesarc pandas pearson-correlation python social-anxiety statistical-analysis

Last synced: 14 Apr 2026

https://github.com/yrohitha/titanic-data-analysis

Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.

data-analysis machine-learning matplotlib pandas scipy-stats statistical-models

Last synced: 13 Mar 2025

https://github.com/anamakarevich/suicide_rates_factors

Female suicide rates analysis for Udacity Hacathon

data-analysis data-cleaning linear-regression suicide

Last synced: 21 May 2026

https://github.com/fazej99/u.s-climate-and-temperature-analysis

This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.

data-analysis data-science data-visualization gis machine-learning streamlit

Last synced: 22 May 2026

https://github.com/faizantkhan/automated-eda

This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.

automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz

Last synced: 18 Apr 2026

https://github.com/galal-pic/advanced_regression

A project to predict house prices through machine learning different techniques

data-analysis data-science deep-learning feature-engineering flask machine-learning python regression

Last synced: 08 Jul 2025

https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard

This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.

dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit

Last synced: 18 Apr 2026