Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/sabaasif2501/netflix-data-analysis
Exploratory data analysis of Netflix content using Python and pandas. Content types, genres, countries, and release years.
data-analysis netflix pandas portfolio-project python
Last synced: 08 May 2026
https://github.com/nagar2nd/jenson-usa-mysql-analysis
We are analyzing Jenson USA's dataset to gain valuable insights into customer behavior, staff performance, inventory management, and store operations. By crafting advanced SQL queries, the analysis explores key metrics such as product sales, customer spending, and order patterns, ultimately guiding strategic decision-making and operations.
data-analysis problem-solving sql
Last synced: 01 Feb 2026
https://github.com/karan-malik/r-basicda-2
Basic Data Analysis in R
data-analysis data-science r-language rprogramming
Last synced: 19 Jun 2026
https://github.com/shru924/ecommerce_customer_behavior_analysis
A machine learning project that analyzes and segments e-commerce customers based on behavior patterns using Python, Random Forest, and data visualization.
customer-segmentation data-analysis jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/yeuner/file-analysis-sql-demo
Streamlit-based application that leverages pandas, sqlite3, and file handling libraries (OpenPyXL and PyArrow) to practice SQL queries, analyze datasets, and export results. A personal project to enhance Python and SQL skills.
data-analysis dataset pandas sql sqlite streamlit vizualization
Last synced: 15 Apr 2026
https://github.com/tameronline/ai-financial-analyst
AI-driven financial analyst system utilizing LangChain and Ollama for real-time stock analysis, market trends, and financial insights.
ai data-analysis finance financial-analysis langchain machine-learning nlp ollama stock-market
Last synced: 02 Feb 2026
https://github.com/satvikpraveen/numpymasterpro
A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.
broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization
Last synced: 08 May 2026
https://github.com/jbalooshie/movies-etl
Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.
data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy
Last synced: 11 Apr 2026
https://github.com/vladimiracunadev-create/python-data-science-program
Python Data Science Program — 197 clases en 9 partes. Pauta avanzada derivada de Géron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseñanza y mejora continua.
bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 01 Jun 2026
https://github.com/chitranjan806/predicting-on-time-premium-deposits
A Predictive analysis project to predict the success rate of On-Time deposits of Premiums by Policy Holders.
analytics-vidhya analytics-vidhya-competition catboostregressor data-analysis data-science linear-regression logistic-regression python3
Last synced: 16 May 2026
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 15 Apr 2026
https://github.com/amanraghuvanshi/adidas-western-zone-sales
Adidas United States Sales Report Analysis
data-analysis datatable pandas plotly statsmodels time-series
Last synced: 08 Feb 2026
https://github.com/suhail25/hotel-booking-analysis
Analyzed the cancelling of booking of hotels and summarized insights to the Hotel Manager to increase profit by 30%. Demonstrated data exploration, cleaning, analysis using Python and its libraries: pandas, seaborn, matplot. Documented the results in PDF report: reduced cancellation by 30% and releasing discounts for 10 days in a month.
data-analysis ipynb-notebook matplotlib pandas python seaborn
Last synced: 08 Feb 2026
https://github.com/mrgeislinger/bike-data-exploration
Data exploration of bike-related data
bicycle bike data-analysis data-science
Last synced: 08 Feb 2026
https://github.com/rodrigojunqueiradev/curso-sql-para-analise-de-dados
data-analysis data-science nosql pg pgadmin4 postgresql sql
Last synced: 08 Feb 2026
https://github.com/imgabreuw/minicurso-python-para-financas
Mini curso de Python para finanças, disponibilizado por Varos.
data-analysis financial-analysis python
Last synced: 29 Jun 2026
https://github.com/sroman0/data-analytics
Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.
data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics
Last synced: 15 Apr 2026
https://github.com/shibbir24/customer-sales-analysis-dashboard-using-tableau
Customer Sales Analysis Dashboard Using Tableau
dashboard data-analysis data-visualization sales-analysis tableau
Last synced: 08 Feb 2026
https://github.com/grindelfp/datasets-analysis
The Machine Learning and Data Analysis course task dedicated to training skills of data normalizing and preprocessing.
data-analysis datasets ipynb mlda
Last synced: 05 Mar 2026
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 08 Feb 2026
https://github.com/siddhant2105s/airline-performance-analysis-dashboard
Enhancing Airline Performance Analysis for the Department of Transport
data-analysis data-visualization tableau
Last synced: 08 Feb 2026
https://github.com/ayushsiloiya619/online-food-orders-analysis
Data Analytics with Python
data-analysis data-visualization matplotlib pandas-dataframe python3 seaborn-python
Last synced: 08 May 2026
https://github.com/djm158/learning-microsoft-r
Working through https://www.gitbook.com/book/smott/introduction-to-microsoft-r-server/details and creating samples
data-analysis data-science microsoft microsoft-sql-server r
Last synced: 15 Apr 2026
https://github.com/josericodata/statisticsapp
Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.
alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test
Last synced: 26 Feb 2026
https://github.com/jweinst1/xenon
A processing based language
data-analysis interpreter reactive-programming
Last synced: 15 Apr 2026
https://github.com/debjyotisaha/power-bi-projects-phase-1
Portfolio projects related to data visualisation in Power BI
data-analysis data-visualization dax-expression powerbi powerquery
Last synced: 18 Jan 2026
https://github.com/chardyb/prob-and-stats-bmi6106
A repository for Spring 2025 BMI 6106: Statistics and Probability. This repository contains coursework, code examples, and projects exploring statistical methods and probabilistic models in biomedical informatics.
biomedical-informatics data-analysis data-science probability r statistical-modeling
Last synced: 02 Sep 2025
https://github.com/ilovenooodles/probstat-water-potability
Tugas Besar Probabilitas dan Statistika 1
csv data-analysis jupyter-notebooks python
Last synced: 03 May 2026
https://github.com/iwasakiyuuki/data-analysis-platform-etl
A collection of Airflow DAGs for automating data collection into our on-premises data analysis platform.
airflow airflow-dags data-analysis data-collection
Last synced: 01 Jul 2025
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/an1mch1k-theone/project_1_hh_analyze
Проект: анализ резюме из HeadHunter
data-analysis data-analysis-project python
Last synced: 15 Apr 2026
https://github.com/barraharrison/airbnb-price-trends
Looking at how Airbnbs differ in price when it comes to location, room type and host activity
data-analysis data-science pandas plotly python streamlit
Last synced: 09 Feb 2026
https://github.com/vanshuchaudhary/retail-sale
project uses MySQL to analyze retail sales data, focusing on customer behavior, sales trends, and product performance. The dataset includes transactions, customer demographics, and purchase details, helping businesses optimize strategies. Key Insights: 📊 Revenue Analysis – Total sales, top-spending customers 📅 Sales Trends
business-intelligence customer-behavior customer-behavior-analysis data-analysis mysql predictive-analytics retail-analytics sales-analysis sql-queries
Last synced: 23 Mar 2025
https://github.com/27ahmad/amazon-sales-analysis
This repository contains an exploratory data analysis (EDA) and visualization project of Amazon sales data. The goal is to uncover insights and present key metrics through a Tableau dashboard.
data-analysis eda pandas python seaborn tableau
Last synced: 15 Apr 2026
https://github.com/tolumie/exploratory-data-analytics-projects
Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.
data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/mehassanhmood/storyboard
data-analysis framer-motion javascript react scrollytelling storyboard tailwind-css vite
Last synced: 21 Feb 2026
https://github.com/ryan-wong1/analyzing-arrest-patterns-in-chicago-data-analysis
Chicago Police Department (CPD) arrest data on offenses, locations, and demographics
data-analysis data-cleaning data-visualization exploratory-data-analysis matplotlib pandas python seaborn
Last synced: 08 May 2026
https://github.com/nulltea/kicksware-scrapebot
Web scraping tool to retrieve sneaker details & images from web store sites
bot data-analysis pandas python sneakers web-scraping
Last synced: 15 Apr 2026
https://github.com/ludreinsalvador/global-covid-19-data-analysis
Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.
analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization
Last synced: 26 Feb 2026
https://github.com/mathusanm6/critics-vs-players-analysis
This data analysis examines the relationship between critic scores, sales (owners), player engagement, and pricing to determine the ROI of critic reviews.
data-analysis data-science data-visualization game-reviews games-sales jupyter-notebook python-3 steam-games
Last synced: 16 Apr 2026
https://github.com/haroontrailblazer/machine_learning
About This Repository A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.
data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics
Last synced: 16 Apr 2026
https://github.com/purushothamadluru/kpi-driven-insights-dashboard-customer-churn-analysis
This repository features a Power BI project designed to deliver KPI-driven insights into customer churn patterns. Leveraging a robust dataset and advanced data modeling techniques, this project uncovers trends, identifies key drivers of churn, and enables businesses to make data-driven decisions.
customer-churn-analysis data-analysis insights-dashboard kpi powerbi
Last synced: 09 Feb 2026
https://github.com/manumoolimani/data-analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 21 Feb 2026
https://github.com/animesh-chourey/power-bi
Various projects at my attempt to learn Power BI
business-analytics data-analysis data-visualization powerbi
Last synced: 10 Feb 2026
https://github.com/kathisnehith/medicare-ip-hospital-analysis
In-depth Data analysis and visualization of Medicare inpatient hospital data.
data-analysis data-cleaning-and-preprocessing data-merging excel exploratory-data-analysis medicare-claim-costs-prediction powerquery sql tableau-dashboards
Last synced: 10 Feb 2026
https://github.com/tushar2704/imdb-movie-analysis
This project extracts meaningful insights, trends, and patterns from the data, shedding light on various aspects of the movie industry. By leveraging this analysis, filmmakers, studios, and enthusiasts can gain valuable information to inform decision-making, understand audience preferences, and contribute to the creation of successful movies.
artificial-intelligence data-analysis data-science imdb project tushar2704
Last synced: 10 Feb 2026
https://github.com/gnneto/nf-analyzer
Script Python para extrair dados de Notas Fiscais Eletrônicas (XML) e gerar Excel consolidado, com foco na extração de informações financeiras, como vencimentos e valores, para uma análise mais detalhada e eficiente. mantendo formatação numérica.
data-analysis excel finance nf-analyzer pandas python xlm
Last synced: 16 Apr 2026
https://github.com/vyjayanthipolapragada/marketing_statistical_analysis
Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns
customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/singhs05/global-youtube-trends
Understand the impact of Likes, comments, dislikes on the video consumption for the videos that were trending.
data-analysis mssqlserver query sql
Last synced: 18 Mar 2026
https://github.com/bcko/ud-da-eda-redwinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : Red Wine Quality dataset
data-analysis data-analyst-nanodegree exploratory-data-analysis r-markdown rstudio udacity udacity-data-analyst-nanodegree udacity-nanodegree
Last synced: 10 Feb 2026
https://github.com/prateekbisht23/inventory_management
This project is an Inventory Management System built using Python (Pandas, NumPy, SciPy) and Jupyter Notebook. It allows efficient tracking of stock, performing data analysis, and generating useful statistical insights (mean, standard error, confidence intervals) to support better decision-making.
data-analysis jupyter-notebook management python3
Last synced: 11 Feb 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the nanodegree udacity program. it's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/georgehanymilad/mobile-usage-behavior-analysis
Excel Project for Data Analysis
data-analysis data-visualization dataanalyst dataanalytics excel-dashboard pivot-tables powerquery storytelling
Last synced: 11 Feb 2026
https://github.com/amlanmohanty1/genai-data-analysis-report-generator
Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.
ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python
Last synced: 28 Jan 2026
https://github.com/haonamnguyen/data-science-job-analysis
Evaluate the factors influencing salary trends in the data science industry, including experience levels, job titles, employment types, company sizes, and remote work arrangements, to help HR teams and hiring managers make data-driven decisions regarding compensation packages and recruitment strategies.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 16 Apr 2026
https://github.com/shrutiijoshi/crm-sales-analysis
The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.
data-analysis data-visualization dax-functions powerbi powerquery
Last synced: 11 Feb 2026
https://github.com/multitagging/benchmarks
Provides benchmarks to test the MultiTagging framework
benchmarks data-analysis ethereum smart-contracts vulnerabilities
Last synced: 11 Feb 2026
https://github.com/praveen-devknight/event-registration-analytics-dashboard
This project presents an interactive and visually-rich Power BI dashboard that analyzes registration data from a college-level technical and non-technical event, Teciton. The dashboard provides comprehensive insights into participant demographics, event preferences, food choices, and time-based trends.
data-analysis data-visualization excel powerbi sql
Last synced: 11 Feb 2026
https://github.com/ohimoiza1205/mastercard-cybersecurity-simulation
Served as an analyst on Mastercard’s Security Awareness Team to identify and report security threats
cybersecurity data-analysis data-presentation security-awareness-training technical-security-awareness
Last synced: 11 Feb 2026
https://github.com/joemull/pyjade
A data curation script for the Jane Addams Digital Edition
data-analysis digital-humanities
Last synced: 11 Feb 2026
https://github.com/rodrigojunqueiradev/python-exercises
Repositório para armazenar exercícios realizados na linguagem Python / Repository to organize exercises with Python language
data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics
Last synced: 16 Apr 2026
https://github.com/virajbhutada/telecom-customer-churn-prediction
Predict and prevent customer churn in the telecom industry with this project. Harness the power of advanced analytics and Machine Learning on a diverse dataset to develop a robust classification model. Gain deep insights into customer behavior and identify critical factors influencing churn using interactive Power BI visualizations.
churn-prediction classification-models customer-attrition-analysis customer-churn-prediction data-analysis data-science decision-tree-classifier eda logistic-regression machine-learning machine-learning-algorithms machine-learning-models pandas powerbi powerbi-desktop python random-forest-classifier roc-curve xgboost-classifier
Last synced: 09 Apr 2026
https://github.com/swethajoseph/credit-risk-assessment-eda-case-study
Conducted an Exploratory Data Analysis (EDA) using Python to assess credit risk, identifying key factors that contribute to loan defaults and improving lending decisions
data-analysis data-visualization datacleaning datapreparation exploratory-data-analysis feature-engineering jupyter-notebook matplotlib-pyplot numpy-library pandas-library python-library risk-analysis risk-assessment risk-management seaborn-plots visual-studio-code
Last synced: 27 Feb 2026
https://github.com/sharmas1ddharth/mode_of_transport_analysis
This project requires you to understand what mode of transport employees prefers to commute to their office. The data includes employee information about their mode of transport as well as their personal and professional details like age, salary, and work exp. We need to predict whether or not an employee will use private transport. Also, which variables are a significant predictor behind this decision.
Last synced: 11 Feb 2026
https://github.com/shinie19/sql-data-warehouse-project
Build a modern Data Warehouse from scratch with SQL Server, including ETL processes, data modeling and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lake data-lakehouse data-modeling data-normalization data-science data-standardization data-warehouse etl-pipeline medallion-architecture sql-server
Last synced: 29 Jun 2026
https://github.com/rodrigojunqueiradev/rodrigojunqueiradev.github.io
Professional Portfolio - Rodrigo Junqueira
analytics artificial-intelligence data-analysis data-engineering data-science data-visualization machine-learning mathematics nosql powerbi python r sql statistics
Last synced: 15 May 2026
https://github.com/farhannirzhor/vrinda_store_excel_project
This project is about excel analysis and visualization. In this project, I analyzed Vrinda Store's sales and made an annual sales report
data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting
Last synced: 05 Jan 2026
https://github.com/thlindustries/mortalidade_neonatal_python_react
Uma plataforma de visualização de dados montada utilizando Python e React com a library de visualização do Plotly
data-analysis data-visualization plotly python python3 react reactjs
Last synced: 16 Apr 2026
https://github.com/thanaraklee/exploring-and-analyzing-data-in-oracle-database
This project focuses on data analysis using SQL with Oracle Database 21c. It aims to familiarize with data management and data analysis using SQL commands and Oracle Database 21c.
data-analysis oracle-database sql sql-developer
Last synced: 12 Feb 2026
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Feb 2026
https://github.com/mahapeth/invest-track
Реализация инструмента для мониторинга активности пользователей ИС "Инвест" для ВКР по направлению 01.03.02 Прикладная математика и информатика
analitycs app data-analysis data-visualization jupyter-notebook python sites
Last synced: 20 Jun 2026
https://github.com/haideratgh/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql
Last synced: 29 Jun 2025
https://github.com/idb-devs/dataanalyticsairbnb
Construir um modelo de previsão de preço que permita uma pessoa comum que possui um imóvel possa saber quanto deve cobrar pela diária do seu imóvel.
data-analysis data-science jupyter python
Last synced: 18 Apr 2026
https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi
Atividades do curso Análise de Dados com Microsoft Power BI e Clínica de BI da Data Science Academy.
dashboards data-analysis data-visualization microsoft-power-bi power-bi
Last synced: 05 Jan 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/andimashkulli/vpms
Vehicle Parking Management System for Gjon Buzuku Gymnasium
backend-api data-analysis databases frontend-react mongodb nodejs software
Last synced: 12 Feb 2026
https://github.com/nhoiyee/other-python-projects
using Python in Jupyter Notebook
data-analysis data-engineering data-mining jupyter jupyter-notebook jupyter-notebooks python python3
Last synced: 12 Feb 2026
https://github.com/nabilshadman/power-bi-essential-training
Exercise files for Power BI Essential Training (2024): datasets and dashboards for hands-on learning
dashboard data-analysis data-science data-visualization power-bi power-bi-dashboard
Last synced: 12 Feb 2026
https://github.com/ankit21111/carpredict
This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!
data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn
Last synced: 16 Apr 2026
https://github.com/projects-developer/ransomware-prediction-using-machine-learning-project
The project aims to develop a machine learning-based system to predict and detect ransomware attacks on computer systems. Ransomware is a type of malware that encrypts a victim's files and demands a ransom in exchange for the decryption key. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
artificial-intelligence btechproject computerscienceproject cybersecurity-malware data-analysis data-mining deep-learning machinelearning mtechproject neural-networks ransomware-machine-learning
Last synced: 12 Feb 2026
https://github.com/martachesnova/big-data
Finding out whether reviews from Amazon's Vine program are trustworthy. Performed ETL process in the Cloud and uploaded a DataFrame to an RDS instance. Used PySpark and Spark SQL to perform a statistical analysis and uncover "hidden" insights.
big-data data-analysis dataset python spark sql
Last synced: 16 Apr 2026
https://github.com/edoaltamura/rotational-ksz-macsis
Repository for suppelementary material from my publication on the rotational kinetic SZ effect in MACSIS
cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics
Last synced: 28 Feb 2026
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 16 Apr 2026
https://github.com/bhavinpatel4199/artificial-intelligence---ai-for-decision-making
Artificial Intelligence for Decision Making is a collection of projects focused on applying AI and machine learning techniques to solve decision-making challenges. It includes projects on wine quality prediction, Cassandra data modeling, and text classification, showcasing a range of data science and machine learning applications.
artificial-intelligence cassandra-cql data-analysis data-engineering data-preprocessing data-structures decision-making deep-learning feature-selection machine-learning-algorithms sentiment-analysis text-classification
Last synced: 20 Jun 2026
https://github.com/ryan-wong1/nyc-job-postings-data-analysis
City of New York Current Job Postings 2024
data-analysis data-cleaning exploratory-data-analysis sql
Last synced: 13 Feb 2026
https://github.com/karlyndiary/data-visualisation-empowering-business-with-effective-insights
This Tata Group Sales Insights Dashboard uses a dataset provided by Forage.
analysis-and-presentation analytics-and-insights dashboard data-analysis data-cleanup data-interpretation data-visualization forage tableau tata-group visualisation
Last synced: 28 Feb 2026
https://github.com/kariemseiam/geoegy
An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.
accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development
Last synced: 13 Feb 2026
https://github.com/totonga/ods-exd-api-box
Helper package to build ASAM ODS EXD API grpc plugins.
asam data-analysis grpc grpc-server ods plugin python
Last synced: 03 Feb 2026
https://github.com/l1ght14/customer-churn-prediction
Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.
churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom
Last synced: 09 May 2026
https://github.com/matheusafonseca/c111
Este repositório é dedicado ao armazenamento e organização dos códigos desenvolvidos na disciplina C111 - Análise de Dados, oferecida pelo Instituto Nacional de Telecomunicações (INATEL).
data-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/europanite/data-analysis-stability-evaluator
data-analysis-stability-evaluator
data-analysis data-analysis-project data-analysis-reliability data-analysis-stability data-analytics small-data-change
Last synced: 29 Jun 2026
https://github.com/rcv911/cluster_generation
Generation of cluster test data
cluster cluster-analysis cluster-generation clustering clustering-algorithm clusters data-analysis machine-learning
Last synced: 18 Jan 2026
https://github.com/kislerdm/github-repo-details
Application and library to fetch open source libraries details from github to perform due diligence
data-analysis golang opensource
Last synced: 01 Jul 2025
https://github.com/secureauditx/ecommerce-user-behavior-analysis
E-commerce User Behavior Analysis with Streamlit Dashboard
customer-segmentation data-analysis ecommerce python streamlit
Last synced: 28 Feb 2026
https://github.com/sahilmb/social-media-data-analysis
A social media platform chat analysis system built using python for root level analysis of huge blocks of text
analytics data-analysis python3 streamlit
Last synced: 31 Mar 2025
https://github.com/ranagaballah/true-fake-news
True Fake News Detector NLP model
data-analysis data-science data-visualization deployment machine-learning matplotlib nlp numpy pandas python
Last synced: 09 May 2026
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/dcs-training/r-visualisation-and-stats
This repository contains material from a 8 classes course on Data Visualisation and statistics with R
data-analysis data-visualisation data-wrangling intro-to-programming r statistics
Last synced: 20 Jun 2026