Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
https://github.com/pararang/nams-thesis-fuzzy
A specialized data processing tool designed to help with Fuzzy Delphi Method calculations for thesis research data analysis. Later extended with new features for data processing with other methods.
data-analysis dematel hacktoberfest hacktoberfest-accepted house-of-quality python sustainability vibecoding
Last synced: 27 Apr 2026
https://github.com/akashvarma26/data-analysis-on-imbd-using-sqlite3
Data Analysis on IMDb dataset using sqlite3 and Pandas in Jupyter notebook.
data-analysis jupyter-notebook pandas-dataframe sqlite
Last synced: 27 Apr 2026
https://github.com/mumtaz4118/amazon-iphone-12-data-scrapped
Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.
data-analysis data-extraction data-science data-scraping html mark-up python
Last synced: 27 Apr 2026
https://github.com/as16082023/project-portfolio
A guide to all my projects
dashboard data-analysis data-cleaning data-visualization excel mysql power-bi python sql tableau
Last synced: 27 Apr 2026
https://github.com/alkasaliss/nosql_opendata_nyc
NoSQL project - ENSAI
data-analysis mongodb nosql open-data
Last synced: 27 Apr 2026
https://github.com/manasashetty01/regulatory-affairs-of-road-accidents
Regulatory Affairs of Road Accidents in Million-Plus Cities (India, 2020)
data-analysis data-science data-visualization exploratory-data-visualizations jupyter-notebook numpy pandas python
Last synced: 27 Apr 2026
https://github.com/elakkiya-u/digital-marketing-campaign-conversion-prediction
A predictive model of whether a customer will convert, based on digital marketing campaign data.
campaign-analytics churn-prediction data-analysis deployment digital-marketing-analytics machine-learning power-bi predictive-modelling presentation-slides python
Last synced: 27 Apr 2026
https://github.com/jofaval/supertsore-timeseries
Timeseries Data Analysis and Forecast of the sales from a superstore in 2015-2018
data-analysis data-science deep-learning deep-neural-networks forecasting google-colab lstm openml python tensroflow time-series time-series-analysis time-series-forecasting
Last synced: 27 Apr 2026
https://github.com/caesaredia/food-app-user-behavior-analysis
Analyze user behavior and optimize app experience in a food-tech startup through funnel analysis and A/A/B testing. Includes data prep, visualization, and statistical testing in Python.
a-b-testing chi-square data-analysis data-visualization funnel-analysis python statistical-testing user-behavior
Last synced: 27 Apr 2026
https://github.com/airdac/ml-palmerpenguins
Classification and analysis of the palmerpenguins dataset in Python. Team project from UPC's Master's Degree in Data Science
classification data-analysis data-science machine-learning palmer-penguin python upc
Last synced: 07 Jun 2026
https://github.com/bheemisme/employee-attrition-analysis
A Dashboard on employee-attrition-analysis
dashboard data-analysis data-science plotly plotly-dash python
Last synced: 28 Apr 2026
https://github.com/sferez/simple_linear_regression
Simple Linear Regression using Python
data-analysis data-science linear-regression python regression
Last synced: 28 Apr 2026
https://github.com/simranshaikh20/diwali-sales-analysis-for-business-insights
A data analysis project on Diwali sales. It shows how much was sold, broken down by state, gender, and age.
data-analysis data-visualization python
Last synced: 28 Apr 2026
https://github.com/emmanuelletocs/steam-game-recommender
A powerful recommendation system for Steam games, combining Content-Based and Collaborative Filtering techniques. Built with Python, Scikit-learn, and Streamlit to deliver accurate, real-time game recommendations. Perfect for gamers and data scientists interested in building intelligent recommendation engines.
als-algorithm data-analysis gaming-industry knn machine-learning mds mysql ncf neural-network pyspark recommendation-engine recommendation-system scikit-learn spark
Last synced: 28 Apr 2026
https://github.com/jovicdev97/financial-loan-datascience-notebook
Using NumPy and pandas to analyze a synthetic loan dataset with Python.
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 28 Apr 2026
https://github.com/ovuiproduction/tabletalk
TableTalk - Your Data, Your Language. Query tabular data using natural language, no SQL required! Upload your data, ask questions, and get instant insights. 🔹 Convert Natural Language to SQL 🔹 Handle Complex Queries & Aggregations 🔹 Upload CSVs for Easy Analysis 🔹 React + Flask + SQLite3 Backend 🔹 Powered by LLMs for Accuracy
ai data-analysis flask llm machine-learning natural-language-processing prompt-engineering react sql sqlite
Last synced: 28 Apr 2026
https://github.com/sufyan14/weather-data-analysis
A Streamlit dashboard that forecasts 30-day weather trends using uploaded CSV data and Facebook Prophet.
data-analysis python streamlit
Last synced: 28 Apr 2026
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This IPython notebook is an exploratory data analysis (EDA) of the 2019 Indian Lok Sabha elections.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/delonnewman/relational
Relational programming for Ruby
csv csv-import data data-analysis database export json relational relational-algebra relational-database relational-model relational-programming reporting reports ruby yaml
Last synced: 28 Apr 2026
https://github.com/rorrell/coviddeaths
A Jupyter Notebook where I create several visualizations based on data about COVID-19 deaths from 2020 to 2024
data-analysis data-visualization jupyter-notebook python3
Last synced: 28 Apr 2026
https://github.com/abhi227070/car-price-prediction
This project implements a machine learning model to predict the price of cars based on various features such as mileage, manufacturing date, fuel type, and more. Users can input car information, and the model will estimate the price of the car based on the provided data. This tool can be useful for both car buyers and sellers to estimate car price.
data-analysis machine-learning machine-learning-algorithms machinelearning python3 regression regression-models scikit-learn scikitlearn-machine-learning
Last synced: 28 Apr 2026
https://github.com/buabaj/fortran-assignment
Code repository for a Fortran and Python climatology assignment.
big-data climatology data-analysis data-visualization fortran90 python
Last synced: 28 Apr 2026
https://github.com/priyanshubiswas-tech/e-commerce_data_analysis
Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.
data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python
Last synced: 28 Apr 2026
https://github.com/szapp/candyanalysis
Case study: Analyze the candy power ranking to identify and recommend popular candy characteristics
data-analysis data-visualization feature-selection interaction-terms
Last synced: 28 Apr 2026
https://github.com/tanzeelgcuf/medical-information-rule-based-prediction-model-with-api
A rule-based system for medical information that learns to make new rules. It takes the first 11 text fields and predicts the last 2 text fields: the diagnosis and disposition. It needs to show the keywords used to make the predicted diagnosis and disposition.
data-analysis django machine-learning-algorithms openpyxl python python3 rest-api
Last synced: 28 Apr 2026
https://github.com/leosimoes/alura-7daysofcode-dados
Challenges from the Data Tracks - Data Science, Machine Learning, and Python Pandas.
data-analysis data-science jupyter-notebook machine-learning python
Last synced: 28 Apr 2026
https://github.com/ricram2/column-name-extractor
Jupyter Notebook. Takes a folder with one or more CSV files and returns one CSV with a compendium of column names and 3 example values (first, random, random).
Last synced: 29 Apr 2026
https://github.com/kisaa-fatima/data-visualization-with-tableauleu
Conducted Exploratory Data Analysis (EDA) on the Berkeley Earth Dataset (large scale dataset), which features high-resolution land and ocean time series data. Created interactive dashboards using Tableau to effectively visualize and highlight trends and patterns within the data.
data-analysis data-science exploratory-data-analysis insights python tableau visualizations
Last synced: 29 Apr 2026
https://github.com/prady2309/sales-prediction-using-python
Implemented using Multiple Linear Regression
data-analysis data-science machine-learning python
Last synced: 29 Apr 2026
https://github.com/emircanakyuzz/veri_gorsellestirilmesi_ve_analizi-analysis_and_visualization_of_dataset
In this work, I performed analysis and visualization using modules that are well known in data science, such as numpy, pandas, seaborn, and matplotlib.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/thanaraklee/pyspark-dataframe-operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
data-analysis dataframe pyspark python
Last synced: 29 Apr 2026
https://github.com/kawshik-khan/fake-news-analysis
A fake news detection ML model. It utilizes the Bag of Words model for text vectorization and a Multinomial Naive Bayes classifier to predict whether news articles are real or fake. The project covers data preprocessing, model training, and performance evaluation with accuracy metrics and a confusion matrix.
data-analysis data-science machine-learning ml python3
Last synced: 08 Jun 2026
https://github.com/prateek5525/yt-analysis-project
This project utilizes the YouTube Data API to analyze channel and video performance, offering insights into subscriber counts, views, video metrics, and monthly trends. It generates visual reports and exports data in CSV format, aiding in effective decision-making and performance tracking.
data-analysis jupyter-notebook python3 seaborn-plots youtube-api
Last synced: 29 Apr 2026
https://github.com/nivasharmaa/spiderverse
A comprehensive Java program for analyzing and managing events and data points within a fictional spiderverse. Features event handling, anomaly detection, cluster management, and robust file I/O operations.
advanced-algorithms anomaly-detection clustering data-analysis file-io object-oriented-programming
Last synced: 29 Apr 2026
https://github.com/kasraskari/learn-r-codes
A learning repository for R programming, covering data manipulation, visualization, and statistical analysis. (Work in progress!) 🚧
data-analysis data-analysis-r data-visualization r r-examples r-graphics r-statistics statistics
Last synced: 08 Jun 2026
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the JSON files scraped by the scraper medium_scrapper_post.py. To extract information from JSON files scraped by medium_scrapper_tag_archive.py (which scrapes the tags archive), use Data_Extraction_Archive_Tags.py.
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 29 Apr 2026
https://github.com/mdaffailhami/king_county_home_sales_analysis
This repository contains code and analysis for exploring home sales data in King County, featuring geospatial mapping to visualize trends and factors influencing housing prices, including location, size, and various property features, using Python and popular data analysis libraries.
data-analysis data-science folium-maps geospatial python
Last synced: 29 Apr 2026
https://github.com/eco786786/restaurant_orders
This analysis seeks to uncover patterns in customer behaviour by examining restaurant order data.
data-analysis git postgresql tableau
Last synced: 29 Apr 2026
https://github.com/saroshfarhan/kaggle-playground-s4e11
An old Kaggle competition, just for practice
data-analysis data-science data-visualization jupiter-notebook python3
Last synced: 29 Apr 2026
https://github.com/rafgpereira/obmep-analise
Code that analyzes the history of Obmep awards for a given locality and school
data-analysis excel pandas python
Last synced: 29 Apr 2026
https://github.com/lankesathwik7/sql-query-assistant
Natural language to SQL query converter using Groq LLM. Ask questions in plain English and get SQL queries, visualized results, and natural language explanations. Built with Streamlit and PostgreSQL.
data-analysis database groq llm natural-language-processing python sql
Last synced: 29 Apr 2026
https://github.com/taljindergill78/yelp-arizona-analysis
This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.
big-data data-analysis data-engineering distributed-computing pyspark spark sql yelp-dataset
Last synced: 29 Apr 2026
https://github.com/mfakhriazhar/python-data-analyst-tutorial
A collection of my Python learning files for data analysis purposes. Covers fundamental to advanced topics such as data exploration, visualization, statistical analysis, and the use of popular libraries like Pandas, NumPy, Matplotlib, and Seaborn. Suitable for personal documentation or shared learning references.
data-analysis data-science data-visualization exploratory-data-analysis portfolio python
Last synced: 29 Apr 2026
https://github.com/teja-1403/forage-standard-bank-data-science
This repository contains solutions to the 4 different tasks that must be performed during the Data Science virtual internship provided by Standard Bank via Forage.
automl communication-skills data-analysis data-science machine-learning python sql
Last synced: 29 Apr 2026
https://github.com/farhad-here/textprepx
A Multilingual Text Preprocessing Tool for English and Persian.
cleantext contractions data-analysis deep-learning emoji nlp nltk opp parsivar regex streamlit text-preprocessing textblob
Last synced: 29 Apr 2026
https://github.com/jofaval/melbourne-temperature-timeseries
Timeseries Data Analysis and Forecasting of the daily min temperature in Melbourne from 1981 to 1990
data-analysis data-science data-visualization deep-learning google-colab melbourne python temperature tensorflow timeseries timeseries-analysis
Last synced: 29 Apr 2026
https://github.com/srinibas-masanta/yelp-business-reviews-analysis
This project analyzes Yelp business reviews using Python, Snowflake, and SQL, focusing on efficient data ingestion, transformation, and analysis. We preprocess JSON data, optimize ingestion via Amazon S3, classify sentiments with Python UDFs, and extract insights using SQL queries—showcasing a streamlined end-to-end workflow.
amazon-s3 data-analysis json python snowflake sql
Last synced: 29 Apr 2026
https://github.com/sharoonjoseph11/indian-liver-diseases
Indian Liver Disease Analysis and Prediction. This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/sdley/cas_pratique-del_annuel
Del-Annuel is software for the annual deliberations of higher-education schools or universities
data-analysis pandas python tkinter-gui
Last synced: 29 Apr 2026
https://github.com/varshan1123/sql-tableau-project
We analyze key indicators for our pizza sales data to gain insights into our business performance - A Data Analysis Project performed on Tableau & SQL.
analysis data-analysis data-science data-visualization excel mysql powerbi sql sql-server tableau tableau-dashboards
Last synced: 29 Apr 2026
https://github.com/brevex/code-complexity-data-analisis
Data collection that shows different complexity scores in an algorithmic dataframe.
code-analysis data-analysis data-science python
Last synced: 29 Apr 2026
https://github.com/al-ghaly/e-commerce-a-b-testing
A statistical analysis project in which I performed an A/B test to analyze the effect of changing the user interface of an e-commerce company's website.
data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics
Last synced: 29 Apr 2026
https://github.com/yimethan/basics-of-data-analysis
2023-2 Basics of Data Analysis
data-analysis numpy pandas python
Last synced: 29 Apr 2026
https://github.com/psyplot/psy-transect
Visualize and explore transects with psyplot
data-analysis data-exploration data-science exploratory-data-analysis psyplot transects
Last synced: 29 Apr 2026
https://github.com/nishumehta/supermart-grocery-sales-retails-analytics
Tableau Dashboard Link :
data-analysis data-cleaning data-visualization jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn
Last synced: 29 Apr 2026
https://github.com/ahshah322/world-happiness-report-2025
Data analysis and visualization of the World Happiness Report 2025 using Python (pandas, seaborn, matplotlib). Explores how GDP, health, freedom, generosity, and corruption perception influence global happiness.
data-analysis data-science matplotlib numpy pandas python seaborn worldhappiness
Last synced: 29 Apr 2026
https://github.com/fanisgl/video-games-sales-data
Data Analysis of Sales Dataset using Python.
data-analysis data-science data-visualization dataset jupyter-notebooks matplotlib numpy pandas poisson-distribution python python3 sales statistics
Last synced: 29 Apr 2026
https://github.com/alam025/algo-trading-bot
Backtested 20+ strategies achieving 18% annualised returns on historical S&P 500 data
api ccxt data-analysis finance fintech pandas postgresql python
Last synced: 08 Jun 2026
https://github.com/jelhamm/model-ensembles-bagging-in-machine-learning
This repository contains implementations of the Boosting method, aimed at improving predictive performance by combining multiple models, using the Titanic dataset.
bagging bagging-ensemble data-analysis database-analysis datamining datamining-algorithms ensemble-machine-learning ensemble-model jupyter-notebook machine-learning-models machine-learning-projects matplotlib module numpy-library pandas-library python sklearn-library
Last synced: 30 Apr 2026
https://github.com/edgarhtt/uber_freight_data_analysis
Uber Freight interview homework. It consisted of solving a 2 warehouse problem and an ETL task
data-analysis data-science data-visualization python
Last synced: 30 Apr 2026
https://github.com/avazasgarov/soccer-hypothesis-testing
Statistical analysis comparing goal-scoring patterns in Men’s vs. Women’s FIFA World Cups using hypothesis testing.
data-analysis eda hypothesis-testing matplotlib-pyplot pandas pingouin python scipy
Last synced: 30 Apr 2026
https://github.com/diogojorgebasso/dataanalysis_r_minesnancy
Code and materials for the data analysis in R courses at Mines de Nancy. You will also find R scripts, notebooks, and other resources for each lesson.
analyse-data data-analysis data-science data-visualization estatistics r statistiques statistiques-descriptives
Last synced: 30 Apr 2026
https://github.com/praveendecode/analytics-for-hospitals-health-care-data
Analytics for Hospitals' Health-Care Data
covid-19 data-analysis data-visualization exploratory-data-analysis ibm-cognos-analytics ibm-watson medical-domain python
Last synced: 30 Apr 2026
https://github.com/rijul007/smartwatch-data-analysis-using-python
Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python
Last synced: 30 Apr 2026
https://github.com/samuelpillai/machine-learning-classification-regression-nlp
A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.
classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining
Last synced: 30 Apr 2026
https://github.com/celineboutinon/laplace-immo
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Project 3
data-analysis data-analytics data-structures database-design database-schema databases mysql-connector-python mysql-workbench python sql
Last synced: 30 Apr 2026
https://github.com/prgermux/defect-finder
Defect Finder is an interactive Python-based GUI application for detecting and analyzing mechanical and non-mechanical defects in data. It provides defect visualization, periodicity analysis, and statistical insights, making it ideal for research and quality control workflows.
data-analysis defect-detection gui pyqt5 python quality-control statistics visualization
Last synced: 24 Mar 2025
https://github.com/sgb31/covid-19-data-analysis
In this project, I analyzed COVID-19 data to explore trends, case growth, and key patterns. I worked on cleaning the data, performing exploratory analysis, and visualizing infection rates, recoveries, and fatalities. The goal was to gain insights into how the pandemic evolved and its overall impact.
data-analysis data-visualization matplotlib pandas python seaborn
Last synced: 13 May 2026
https://github.com/who-else-but-arjun/isro_xrf_sr
Source code for super-resolution of the lunar elemental abundance map using a semi-supervised deep spatial interpolation model. This hybrid approach combines ResNet50 for spatial feature extraction with Graph Neural Network (GATv2Conv) layers and Convolutional Neural Networks (CNNs), followed by fusion layers.
cnn data-analysis graph-neural-networks pytorch semi-supervised-learning spatial-interpolation super-resolution
Last synced: 30 Apr 2026
https://github.com/kevingastelum/mydataanalysis
My DataAnalyst Projects | Python, SQL, Excel, PowerBI & Tableau
data-analysis python sql visualization
Last synced: 20 May 2026
https://github.com/jofaval/iris-flowers
Multiclass classification of the famous Iris flowers dataset, published by Ronald Aylmer Fisher in 1936
classification data-analysis data-science data-visualization google-colab iris-flowers kaggle machine-learning python scikit-learn xgboost
Last synced: 05 Apr 2026
https://github.com/cyberoctane29/noaa-lightning-analysis
This project explores lightning strike data from the National Oceanic and Atmospheric Administration (NOAA) to identify seasonal trends and analyze strike frequency across months. It demonstrates data manipulation, aggregation, and visualization using Python, providing insights into lightning activity patterns.
data-analysis data-science data-visualization eda python
Last synced: 20 Apr 2026
https://github.com/zen204/accenture-tech-news-summarization-engine
A tool developed to analyze knowledge graphs from technology news articles, uncovering insights and trends about technology products, platforms, services, and their industry impact. Built during an internship at Accenture to inform decision-making in the tech landscape.
data-analysis decision-making graph-visualization industry-insights jupyter-notebook knowledge-graph machine-learning python tech-news tech-trends
Last synced: 29 Apr 2026
https://github.com/clchinkc/zombie
Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker
algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation
Last synced: 05 Apr 2026
https://github.com/lulloooo/bizdata-nexus
Collection of my Business & Data Analysis projects, from professional/academic endeavors to passion-driven explorations 📊
business-analysis data-analysis economics etl excel finance mysql python r risk-analysis
Last synced: 05 Apr 2026
https://github.com/prawy126/data-analysis
ai data-analysis data-visualization python python3 tinker
Last synced: 14 Jun 2025
https://github.com/darkdk123/house-valuation-model
A boot camp challenge project to create an ML model that predicts house prices in Boston, Massachusetts from multiple parameters using multivariable regression.
data-analysis data-science data-visualization matplotlib-pyplot multivariate-regression predictive-modeling statistics
Last synced: 07 Jul 2025
https://github.com/phanchenh/cohortanalysis_adventurework_sqlproject
Customer Retention and Lifetime Value Analysis Using Cohort Analysis on AdventureWorks Dataset (2011-2014)
adventureworks business-analytics business-intelligence clv-analysis customer-lifetime-value data-analysis data-visualization insights matplotlib microsoft-learn mssql mssqlserver pandas python retention-analysis retention-rate seaborn
Last synced: 12 Apr 2026
https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook
Using NumPy and pandas to analyze a synthetic loan dataset with Python.
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 12 Mar 2025
https://github.com/finnishcancerregistry/directadjusting
Compute estimates of weighted averages with confidence intervals.
biostatistics confidence-intervals data-analysis direct-adjusting direct-adjustment epidemiology health-statistics r r-package statistical-adjusting statistical-adjustment weighted-analysis weighted-average weighted-averages
Last synced: 26 Mar 2025
https://github.com/antononcube/wl-tilestats-paclet
Wolfram Language (aka Mathematica) paclet for statistics over 2D tilings. (Tile binning, application of aggregation functions, etc.)
2d-data data-analysis geospatial-data mathematica wolfram-language
Last synced: 20 Mar 2026
https://github.com/gui-sitton/prepaid
In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 22 May 2026
https://github.com/antononcube/java-tilestats
Java package for statistics over 2D tilings. (Tile binning, application of aggregation functions, etc.)
Last synced: 02 Apr 2025
https://github.com/r8vnhill/hdp
Hentai data processing
data-analysis e-hentai hentai kotlin
Last synced: 02 Apr 2025
https://github.com/susshiii/sql-layoffs-data-cleaning-and-eda
Full SQL project using MySQL to clean and analyze a real-world tech layoff dataset from 2020–2023.
data-analysis data-analytics-project data-cleaning eda layoffs mysql sql
Last synced: 07 Jul 2025
https://github.com/m-coder-umer/sales-dashboard-power-bi-project
An interactive Sales Dashboard built with Power BI using MySQL data, showcasing monthly trends, top-performing products, and key sales KPIs (Key Performance Indicators).
business-intelligence data-analysis data-cleaning data-modeling data-visualization dax interactive-dashboard mysql power-query powerbi sales-dashboard sql time-series-analysis
Last synced: 07 Jul 2025
https://github.com/lashawnfofung/super-heroes-analysis-project
This portfolio project involves a detailed analysis of 732 superhero records from the heroes_information.csv dataset, comprising 11 columns of unique characteristics for each hero. The primary goal is to showcase key insights derived from this rich dataset, demonstrating proficiency in data analysis using SQL.
data-analysis datasets mysql-database mysql-server mysql-workbench sql
Last synced: 07 Jul 2025
https://github.com/siddhartha-padhy/heart-disease-predictor
data-analysis machine-learning pandas python
Last synced: 20 Apr 2026
https://github.com/ryan-wong1/analyzing-stress-and-fatigue-drivers-in-railroad-workforces-data-analysis
Railroad dispatcher data on demographics, work, lifestyle, and stress factors
data-analysis data-cleaning data-visualization exploratory-data-analysis sql
Last synced: 25 Jan 2026
https://github.com/byte7/fifa17-analysis-and-prediction
⚽ FIFA 17 Analysis and Prediction⚽
data-analysis dreamteam fifa17 fifa17-analysis ultimate-team
Last synced: 26 Mar 2025
https://github.com/mikhaelmounay/salty-med
Salty Mediterranean - Grade 12 Data Analysis & Visualization Capstone Project
data-analysis data-visualization
Last synced: 02 Feb 2026
https://github.com/borjamome/top-goleadores
Best forwards in Europe according to the data
data-analysis data-visualization football-analytics r
Last synced: 26 Mar 2025
https://github.com/borjamome/explorando-madrid
Exploring Madrid: A Data-driven Analysis with R 🐻🌳
data-analysis data-visualization madrid r
Last synced: 26 Mar 2025
https://github.com/zeynepcol/data-science-cryptocurrencies-data-analysis-forecasting
Cryptocurrency price analysis and prediction using regression models
artificial-intelligence crytpocurrency data-analysis data-mining data-preprocessing data-processing data-science data-visualization extract-data financial-analysis linear-regression lstm machine-learning regression-algorithms xgboost
Last synced: 07 Jul 2025
https://github.com/manish506/loan-approval-prediction
Explore predictive modeling in this project by applying classification techniques to a loan approval dataset. Analyze and preprocess the data, then use models like K-Nearest Neighbors, Random Forest, SVC, and Logistic Regression to predict loan outcomes. Gain insights into approval factors and enhance prediction accuracy.
classification classification-models data-analysis data-science jupyter-notebook loan-approval-prediction machine-learning predictive-analytics predictive-modeling project python
Last synced: 19 Jan 2026
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/bpkaur/exploring-67-years-of-lego
Exploring 67 years of LEGO
data-analysis datacamp pandas python3
Last synced: 10 May 2026