An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with dataanalytics

A curated list of projects in awesome lists tagged with dataanalytics.

https://github.com/GeostatsGuy/GeostatsPy

GeostatsPy is a Python package for spatial data analytics and geostatistics. It started as a reimplementation of GSLIB, the Geostatistical Library (Deutsch and Journel, 1992), from Fortran to Python: geostatistics in a Python package. It now includes many additional methods. I hope this resource is helpful, Prof. Michael Pyrcz

dataanalytics geostatistics modeling spatial statistics

Last synced: 14 Mar 2025

https://github.com/huangjia2019/let-us-machine-learning

Geek Time (极客时间): Machine Learning from Scratch (hands-on machine learning for complete beginners)

dataanalytics deep-learning machine-learning

Last synced: 17 Apr 2025

https://github.com/geostatsguy/excelnumericaldemos

A set of numerical demonstrations in Excel to assist with teaching / learning concepts in probability, statistics, spatial data analytics and geostatistics. I hope these resources are helpful, Prof. Michael Pyrcz

dataanalytics excel geostatistics machinelearning

Last synced: 31 Jan 2026

https://github.com/techascent/tmducken

tech.ml.dataset integration with duckdb

clojure dataanalytics duckdb

Last synced: 16 Mar 2025

https://github.com/luminati-io/amazon-popular-books-dataset

A dataset sample of the most reviewed and best-selling books on Amazon

amazon amazon-dataset books dataanalytics dataset ecommerce

Last synced: 27 Jul 2025

https://github.com/lintangwisesa/ujian_analyticsvisualization_jcds08

Exam question guide for Data Analytics & Visualization, Job Connector Data Science batch 8

dataanalytics datascience exam visualization

Last synced: 26 Feb 2025

https://github.com/tanishq-ctrl/consumer-personality-analysis

This project focuses on analyzing customer behavior and spending patterns using a comprehensive dataset. Through advanced data visualization and analysis techniques, we aim to uncover actionable insights to improve marketing strategies, optimize product targeting, and enhance customer engagement.

dataanalysis dataanalytics matplotlib numpy pandas python seaborn

Last synced: 14 Jun 2025

https://github.com/akashkobal/data-science

I'm excited to share my data science project🚀, where I've applied various techniques and insights to solve a specific problem. The project follows best practices for maintainability and reproducibility, using the Data Science Project Template. Dive into the project to explore the code, datasets, documentation, and resources that showcase MyJourney

akash akash-kobal akashkobal applied-data-science artificial-intelligence classification data-science dataanalysis dataanalytics datascienceproject datascientist deep-learning kobal machine-learning prediction regression

Last synced: 17 Mar 2026

https://github.com/ricardolsmendes/datacatalog-custom-model-manager

Python package to load user-specified metadata models into Google Cloud Data Catalog, comprising Custom Entries, Tag Templates, and Tags

bigdata csv-import dataanalytics datacatalog datagovernance gcp gcp-datacatalog google-cloud python

Last synced: 25 Aug 2025

https://github.com/malexandersalazar/covid-19-peru-distribucion-vacunacion

Charts of the distribution of people vaccinated against COVID-19 by age group and risk group, for all of Peru and by department.

covid19 dataanalytics peru pongoelhombro python

Last synced: 19 Apr 2026

https://github.com/emso-exe/orcamento_de_redes_sociais_x_vendas

Machine learning project applying linear regression to social media budget data and analyzing the relationship with sales.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics datascience kaggle kaggle-dataset machine-learning machinelearning python python-3 python3 regressao-linear regression-linear

Last synced: 19 Apr 2026

https://github.com/sarincr/data-visualizations-and-dashboards

Data visualization is an interdisciplinary field that deals with the graphic representation of data. It is a particularly efficient way of communicating when the data are numerous, as in a time series.

analytics artificial-intelligence big-data bokeh businessintelligence dashboard data-science dataanalysis dataanalytics datavisualization dataviz deep-learning machine-learning matplotlib plotly python python3 seaborn visualization

Last synced: 30 Apr 2026

https://github.com/emso-exe/churn_clientes_de_banco

Churn analysis project using machine learning to classify bank customer data according to whether or not customers are likely to close their accounts.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics kaggle kaggle-dataset machine-learning machinelearning python python-3 python3

Last synced: 08 Sep 2025

https://github.com/jabhij/twittertweets_visualization

In this repository I'll show how to visualize tweets and retweets for a particular Twitter handle on Google Maps using R, in simple steps.

data-science data-visualization dataanalytics python python3 twitter visualization

Last synced: 14 Apr 2026

https://github.com/halovina/fastapi-tutorial

FastAPI tutorial for backend engineers

backend-api dataanalytics fastapi microservice python

Last synced: 30 Apr 2026

https://github.com/yashmistry-24/ytcomment-iq

YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.

analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube

Last synced: 15 Feb 2026

https://github.com/arzan101/ola-data-analytics

Ola - Identified the reasons and trends for ride cancellation. Process - Cleaned and processed data from multiple sources, applied SQL queries, and visualized data using Power BI. Motive - To reduce the cancellation rate

dashboard data-analysis data-mining data-visualization dataanalytics excel powerbi sql

Last synced: 06 Jan 2026

https://github.com/tobilg/analyze-twitter-export

Analyze exported Twitter data

dataanalytics duckdb twitter

Last synced: 27 Jul 2025

https://github.com/tanishq-ctrl/big4-uae-banks-stock-price-prediction

This repository combines rigorous data preprocessing, advanced feature engineering, and Random Forest-based modeling to predict stock prices with precision for UAE banks

abudhabi banks data-visualization dataanalytics datanalysis dubai machine-learning machinelearningprojects predictive-analysis predictive-modeling stock-market stockprice-forecasting stockprice-prediction uae

Last synced: 06 Apr 2025

https://github.com/abdulbasit110/dashboard

A real-time data dashboard using Node.js, TypeScript, MySQL, and React. Developed at the National Center of Artificial Intelligence NEUROCOMPUTATION LAB, it visualizes device data in real-time with ApexCharts and Socket.io.

ant-design dataanalytics express mysql reactjs real-time tailwindcss typescript

Last synced: 08 Apr 2026

https://github.com/omkarpattnaik8080/studentperformanceanalysis

"This data analytics project examines student performance using Python and Pandas. Employing statistical analysis and visualization techniques with Matplotlib, it provides insights into academic trends. Explore this repository for data-driven insights essential for enhancing educational strategies and student outcomes."

data-visualization dataanalytics datascience kaggle numpy pandas

Last synced: 11 Apr 2026

https://github.com/gauravxlokhande/super-store-analysis-power-bi

Super Store Data Analysis (Oct 2022 - Nov 2022). Super Store data analysis based on different parameters using Microsoft Power BI. Using data in .xlsx and .csv format, the analysis predicts parameters such as current sales and the probability of future sales. Technology used: Microsoft Power BI

dataanalytics powerbi

Last synced: 04 Mar 2026

https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection

This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.

anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning

Last synced: 20 Apr 2026

https://github.com/hq969/olympics-dataset-analysis

The Olympic Dataset Analysis project involves exploring and analyzing historical Olympic Games data, including athletes, countries, medals, sports, and event performance. Using data analytics and visualization techniques, this project uncovers patterns in Olympic history, top-performing countries, and athlete achievements.

dataanalytics interaction medallion-architecture timeseries-analysis

Last synced: 14 Feb 2026

https://github.com/rsn601kri/electric-vehicles-market-size-analysis

The Electric Vehicles Market Size Analysis project is a comprehensive study aimed at understanding the current state and potential growth of the Electric Vehicles (EVs) market.

dataanalytics dataexploration electric-vehicles googlecolaboratory graph jupyter-notebook marketanalysis python

Last synced: 02 May 2026

https://github.com/emso-exe/investidores_do_tesouro_direto

Project analyzing the profile of Tesouro Direto investors based on data from the site tesourotransparente.gov.br.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics datascience powerbi python python-3 python3 tesouro-direto tesourodireto

Last synced: 22 Jan 2026

https://github.com/abhaysingh71/real-estate-analytics

The Gurugram Real Estate Analytics Project is a comprehensive data-driven solution that enhances property decision-making through three key modules: Prediction, Analytics & Visualization, and Recommendation. This project empowers users with valuable insights, enabling smarter and more strategic real estate investments.

analytics data-science dataanalytics datascience-machinelearning house-price-prediction machine-learning-algorithms reccomendersystem webscrapping

Last synced: 27 Jan 2026

https://github.com/tanishq-ctrl/cyberattack-analysis-and-insights

This repository contains an in-depth analysis of a cybersecurity dataset. The primary goal is to identify patterns, vulnerabilities, and trends in cyberattacks by leveraging various visualizations and statistical insights. The project provides actionable insights for enhancing cybersecurity measures.

cyberattack cyberattacks data-science data-visualization dataanalysis dataanalysisusingpython dataanalytics

Last synced: 28 Jun 2025

https://github.com/ereh11/8-week-sql-challenge

Case study solutions for the #8 Weeks SQL Challengeヾ(●ω●)ノ

8weeksqlchallenge dataanalytics database postgresql

Last synced: 17 May 2026

https://github.com/ahmad-ali-rafique/adult-income-dataset

This repository contains a Jupyter Notebook exploring the adult income dataset. The notebook performs Exploratory Data Analysis (EDA), including visualizations with charts and graphs. Additionally, it implements various classification models to predict income and analyzes their accuracy.

accuracy classification dataanalytics datavisualization-project decision-tree-classifier eda evaluation evaluation-metrics exploratory-data-analysis logistic-regression machine-learning random-forest-classifier

Last synced: 23 Jun 2025

https://github.com/vinayakdon/machine-learning-project-sentimental-classifier-

A sentiment classification tool using machine learning in Python to analyze and predict the sentiment of text data. Features preprocessing, model training, hyperparameter tuning, and evaluation for accurate sentiment analysis.

dataanalytics dataprocessing datascience python training-data

Last synced: 17 May 2026

https://github.com/whatheheckisthis/pwc_project-

Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver

case-study data data-science dataanalytics databases datavisualization powerbi virtual

Last synced: 05 Apr 2025

https://github.com/malexandersalazar/eleccionesgenerales-peru-2021-wordcloud-plandegobierno

Word clouds of the government plans of the candidates in the Peru 2021 General Elections.

dataanalytics elecciones2021pe peru python wordcloud

Last synced: 18 May 2026

https://github.com/emso-exe/compra_de_carro

Machine learning project applying linear regression to car purchase data to build a predictive model of prices for customers' new vehicle purchases.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics datascience kaggle kaggle-dataset machine-learning machinelearning python python-3 python3 regressao-linear regression-linear

Last synced: 18 Apr 2026

https://github.com/cyberoctane29/cyclistic-bike-share--analyzing-rider-behavior

Analyzed Cyclistic's bike-share data to uncover usage differences between casual riders and annual members. Utilized SQL and MySQL for data processing, R for visualisation, and Kaggle for collaboration. Insights will guide marketing strategies to convert casual riders into annual members.

data dataanalysis dataanalytics database rlanguage rmarkdown spreadsheet sql

Last synced: 22 May 2026

https://github.com/phatdev12/diem-thi-tuyen-sinh-10-da-nang

List of grade 10 entrance exam scores, Da Nang (Đà Nẵng), 2023-2024

data data-science dataanalytics dataset json

Last synced: 28 Jun 2025

https://github.com/sarincr/basics-of-julia-programming-language

Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.

data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning

Last synced: 19 May 2026

https://github.com/aravachoudhary/agrimarket_monitor

This is a repository for an AI/ML project

dataanalytics machine-learning

Last synced: 27 Jul 2025

https://github.com/akarce/elk-stack-mastery

A comprehensive project focusing on setting up and configuring the Elastic Stack (Elasticsearch, Logstash, and Kibana) for efficient log management and analytics. This project includes Elasticsearch configurations, Logstash pipelines, and Kibana visualizations, with detailed step-by-step documentation.

dataanalytics datapipeline devops elasticsearch elasticstack elkstack kibana logging logmanagement logstash monitoring opensource systemmonitoring virtualbox visualization

Last synced: 31 Jul 2025

https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml

Final project of the Bootcamp Data Girls 2025, analyzing employee turnover using machine learning. Based on the IBM HR Analytics Attrition dataset, the project identifies the main risk factors and builds predictive models (SVC and Random Forest) with up to 89% accuracy to anticipate departures and support strategic HR decisions.

analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc

Last synced: 16 Apr 2026

https://github.com/prpriesler/covid19-insights-and-analytics

This project delves into the realm of data analytics and programming, focusing on four pivotal datasets related to the COVID-19 pandemic: confirmed global, death global, vaccination & population data, and Twitter data.

covid19 covid19-data data data-science dataanalytics deep-neural-networks machine-learning natural-language-processing

Last synced: 31 Aug 2025

https://github.com/bimadevs/supervised_regression_salaryprediction

This project aims to predict the salary of employees based on their years of experience using supervised machine learning techniques.

artificial-intelligence dataanalytics datascience dibimbing machine-learning ml predictive-modeling

Last synced: 02 Jul 2025

https://github.com/emso-exe/anuncios_em_redes_sociais

Machine learning project applying logistic regression to data on customers who interacted with social media ads, and whether or not they made a purchase.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics datascience kaggle kaggle-dataset logistic-regression machine-learning machinelearning python python-3 python3 regressao-logistica

Last synced: 20 Apr 2026

https://github.com/akashash01/indian_startups_layoff

Created a simple dashboard report of startup layoffs in India in 2023 using data from Kaggle.

dataanalyst dataanalytics datavisualization powerbi

Last synced: 04 Feb 2026

https://github.com/myselfsalman/pyspark-projects

"In this repository, I will showcase my Big Data Analytics projects, utilizing powerful data processing frameworks like PySpark to efficiently handle and analyze large-scale datasets."

bigdata dataanalytics datascience kaggle pyspark

Last synced: 14 Jun 2025

https://github.com/weihan07/quantium-data-analytics-job-simulation-forage-virtual-internship

This repository showcases my work in a Data Analytics and Commercial Insights simulation. Tasks include data preparation, customer analytics, and uplift testing using transaction data to generate strategic, data-driven recommendations. Outputs include code, benchmark analysis, and reports aimed at supporting informed business decisions.

dataanalytics datavalidation datavisualisation datawrangling r statisticaltesting

Last synced: 25 Mar 2025

https://github.com/akashash01/customer-ageing-analysis

A simple Dashboard report of Customer ageing analysis using Power BI.

dataanalytics datavisualisation powerbi

Last synced: 23 Feb 2026

https://github.com/sujitmahapatra/virat-kohli-dashboard

An interactive Power BI project analyzing Virat Kohli's cricketing journey. Explore metrics like total runs, centuries, high scores, and venue-wise performance through dynamic visualizations and filters.

data-analysis-project dataanalytics powerbi virat-kohli virat-kohli-dashboard

Last synced: 27 Jan 2026

https://github.com/bishtrishu/pizza_sales_data_analysis_sql

This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.

cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database

Last synced: 14 Apr 2026

https://github.com/cescobar0/proyecto-sql-eml

Study project on the job market in the Data Analyst field

dataanalytics datascience excel git jobmarket powerquery research-project sql vscode

Last synced: 15 Mar 2026

https://github.com/vasugi2003/covid_19_daily_analysis

Analysis of day-to-day COVID-19 updates using the Power BI analysis tool.

covid19 covid19-data csv data-science data-visualization dataanalytics powerbi predictive-analytics

Last synced: 19 Mar 2026

https://github.com/saba-gul/energy-drink-launch-case-study

Analyzing survey data to enhance CodeX's energy drink marketing strategy in India, optimizing demographics, preferences, competition, and marketing channels.

dataanalytics microsoft powerbi-report powerbi-visuals powerbidashboard

Last synced: 19 Mar 2026

https://github.com/yoursrijit/e-commerce-sales-dashboard-using_power_bi

"Unlock Your Sales Potential with Real-Time Insights: Power Up Your Strategy with Power BI Dashboard" I have created E-Commerce Sales Dashboard using Power BI. With this dashboard

data-visualization dataanalytics microsoft powerbi

Last synced: 19 Mar 2026

https://github.com/mishra-krishna/analysis-and-optimization-of-supply-chain-operations

Analyzed supply chain data to identify trends and key factors. Visualized sales, defect rates, lead times, and costs. Used Decision Tree Regressor to find top features impacting product costs and lead times.

data dataanalytics datavisualization supplychain supplychainanalytics

Last synced: 20 Apr 2026

https://github.com/aravind-selvam/bikeshare-company-analysis

Capstone project for the Google Data Analytics Professional Certificate program, analyzing a bike-sharing company

analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server

Last synced: 22 Apr 2026

https://github.com/ahmad-ali-rafique/pyviznotebook

PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.

analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization

Last synced: 06 Jun 2026

https://github.com/pavankethavath/dataspark-illuminating-insights-for-global-electronics

DataSpark is a retail analytics project for Global Electronics leveraging Python, SQL, and Power BI. It uncovers customer insights, sales trends, and store performance to optimize marketing, inventory, and operations. Features include clean datasets, SQL-driven analysis, and interactive dashboards, driving data-driven growth and decision-making.

data-engineering data-visualization dataanalytics powerbi python retail-data sql

Last synced: 27 Apr 2026

https://github.com/malexandersalazar/covid-19-peru-analisis-oleadas

Technical analysis of COVID-19 deaths in Peru for detecting COVID-19 waves.

casos-covid covid19 dataanalytics defunciones peru python

Last synced: 28 Apr 2026

https://github.com/emso-exe/venda_de_medicamentos_controlados_e_antimicrobianos_-_industrializados

Project analyzing sales of controlled medicines over a 12-month period and the consumer profile, based on data made available by Anvisa.

analise-de-dados anvisa dataanalysis dataanalyst dataanalytics medicament medicamento medicamentos medicaments python python-3 python3

Last synced: 10 May 2026

https://github.com/vbhatsaccnt/student_attrition_prediction

This is a capstone project. The project focuses on finding the attrition rate of a student admission for a university.

data-science dataanalytics ensemble-learning machine-learning student-admission student-attrition-prediction

Last synced: 17 Jun 2026

https://github.com/nirmalyabag20/amazon-prime-video-dashboard-using-tableau

I have developed a comprehensive Amazon Prime Video Dashboard, which offers in-depth insights into the platform's content library. This dashboard is designed to help stakeholders understand various aspects of the available shows

dataanalytics datavisualization datawrangling python sql tableau

Last synced: 18 Jun 2026

https://github.com/steveee27/financial-profile-analysis-of-companies-using-clustering-techniques

This project analyzes the financial attributes of companies from various industries and uses clustering techniques to group them into two distinct clusters. Based on the clustering results, insights and recommendations are provided to improve business strategies, focusing on financial strength, market presence, and industry representation.

businessintelligence clustering dataanalytics financialanalysis machinelearning

Last synced: 09 Mar 2025

https://github.com/prajjwol09/interactive-dashboard-project

This project is an interactive dashboard created using Microsoft Excel, designed to analyze and visualize bike purchase data and store analysis. The dashboard leverages advanced Excel features such as pivot tables, slicers, and various formulas to provide a dynamic and user-friendly experience.

dataanalytics excel interactive-dashboards microsoft pivot-tables slicers

Last synced: 03 Feb 2026

https://github.com/malexandersalazar/covid-19-peru-evolucion-vacunacion

Charts of the progress of the national COVID-19 vaccination campaign, for all of Peru and by department.

covid19 dataanalytics peru pongoelhombro python

Last synced: 20 May 2026

https://github.com/piyushkumar2025/data-analytics-customer-segmentation-analysis

Implemented RFM model, ensured data quality, and performed EDA using Python. Developed a Tableau Dashboard for streamlined visualization and actionable insights into customer segmentation and behavior.

customer-segmentation dataanalysis dataanalytics datacleaning datavisualization exploratory-data-analysis numpy-library pandas-python python3 rfm-analysis

Last synced: 15 Apr 2026

https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql

Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.

analytics data dataanalytics mssql powerbi sql

Last synced: 26 Jun 2025

https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas

SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.

data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase

Last synced: 18 May 2026

https://github.com/lucaso21/customer-segmentation

A data analysis project segmenting customers based on certain characteristics.

dataanalytics datascience ggplot2 kmeans-clustering r tidyverse

Last synced: 16 Mar 2025

https://github.com/vinodbaste/hr-analytics-employee-attrition-and-performance-prediction

In this project, we listed the numerical and categorical attributes present in the publicly available dataset. Missing values were dropped to give better insights in data analysis. ANOVA and Chi-Square tests were carried out during statistical analysis. Machine learning algorithms were applied to understand, manage, and mitigate employee attrition.

data-science dataanalytics datavisualization machine-learning statistics

Last synced: 24 Mar 2025

https://github.com/chetanmalviya513/assembly-election-analysis-data-insights-engagement

Scraped and analyzed real-time election data to build interactive dashboards showcasing seat trends, vote share distribution, and postal ballot stats. The analysis uncovered insights on voting patterns, winning margins, and candidate forfeitures. Visual storytelling and timely data updates helped the project gain strong engagement on social media.

assembly data-visualization dataanalytics descriptive-statistics election-analysis election-data msexcel news tableau-dashboards webscraping

Last synced: 25 Apr 2026

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 08 May 2026

https://github.com/gurpreet0022/airbnb-eda

EDA on Airbnb booking data to uncover valuable insights, trends, and patterns

data data-science dataanalytics insights jupyter-notebook matplotlib numy pandas projects python3 seaborn visualization

Last synced: 11 May 2026

https://github.com/itsharshparmar/amazon-sales-dashboard-powerbi

A dynamic Power BI dashboard analyzing Amazon product sales performance using YTD, QTD, and review insights. Includes DAX measures, visualizations, and interactive filters.

dataanalytics datacleaning datamodeling datavisualization dax excel kpis powerbi powerquery salesinsights

Last synced: 26 Jan 2026

https://github.com/jigyasag18/orders-sales-analysis-report-using-power-bi

This repository analyzes and visualizes office supply sales data to improve profitability. It examines sales performance by various factors, using charts to provide insights and actionable recommendations for sales optimization, market research, and product mix.

data dataanalysis dataanalytics dataset powerbi powerbi-dashboards powerbi-report powerbi-reports powerbi-visuals powerbidashboard

Last synced: 18 Feb 2026

https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms

The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.

data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier

Last synced: 01 May 2026

https://github.com/mnitin-reddy/summer-olympics-data-analysis-web-app

An interactive web app for exploring trends in Olympic Games history, analyzing overall medal tallies, country-wise performance, and athlete demographics. Built with Python and Streamlit, this app offers insights through visualizations and data-driven statistics.

dataanalytics matplotlib numpy pandas python seaborn streamlit

Last synced: 12 Apr 2026

https://github.com/pkx8326/data_adventure_01_a_sql_adventure_medium_article

This repository contains the SQL queries and the dataset to accompany an article on Medium.com titled "The Data Adventure 01: “How Long will your Business Start to Profit?” — An SQL Adventure"

dataanalysis dataanalyst dataanalytics postgresql sql windowfunction

Last synced: 10 Jun 2026

https://github.com/saro0307/voronoi-diagram-for-classification

Uses a Voronoi diagram to subdivide a plane containing n randomly scattered points into exactly n cells, each enclosing the portion of the plane that is closest to one point

artificial-intelligence data-visualization dataanalytics graph machine-learning matplotlib plot plotting pyplot python python3 voronoi voronoi-diagram

Last synced: 08 Jun 2026

https://github.com/ahmad-ali-rafique/pandas-mastry-notebook

Pandas Mastry Notebook is a repository dedicated to exploring the capabilities of the pandas library for data manipulation, analysis, and visualization in Python. Dive into a variety of data operations, analytical techniques, and visualization methods to uncover insights from your datasets.

data-structures dataanalytics datase pandas pandas-library python

Last synced: 10 Apr 2026

https://github.com/ahmad-ali-rafique/linear-regression-modeling

In-depth exploration of linear regression models, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data dataanalytics linear-models linear-regression model multilinear-regression regression regression-models

Last synced: 19 Apr 2026

https://github.com/abdelrhman95/4-essential-python-projects-for-beginners

This repo contains simple projects for data science and data analytics

data-science dataanalytics eda python time-series xgboost

Last synced: 30 Apr 2026