Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/junpenglao/jaefa
Just Another Eye-movement Filtering Algorithm
data-analysis eye-movement-data eye-tracking
Last synced: 12 Jan 2026
https://github.com/avratanubiswas/fluorpenplugin
A MATLAB user interface for analysing OJIP curve datasets from the FluorPen instrument, serving as an additional plug-in for "quick categorical analysis".
data-analysis fluorpen ojip-curve
Last synced: 18 Mar 2026
https://github.com/anniefib/otherprojects
Powering Data Dreams: From Orchestration to Analytics with Cloud Precision
airflow-etl-orchestration aws cloud-native-data-solutions data-analysis data-visualization database datamodelling datawarehousing eda end-to-end-data-pipelines machine-learning-models pgadmin4 spark-analytics sql
Last synced: 07 May 2026
https://github.com/suhail25/pizza-sales-analysis
Detailed analysis of sales data provided in Excel by a pizza sales manager; implemented strategic pricing adjustments resulting in a 25% revenue surge and enhanced profit margins. Explored and cleaned the data set using SQL, then performed data analysis by filtering 12% of the data using SQL commands in MySQL.
data-analysis excel powerpoint-presentations sql
Last synced: 15 Feb 2026
https://github.com/risdorn/restaurant-delivery-platforms-analysis-bdm-project
This project analyzes restaurant delivery platforms to understand customer preferences, industry competition, and expansion opportunities. Conducted as part of the BDM project from IITM, it includes descriptive stats, distribution, correlation, regression, and geospatial analysis using multiple datasets.
data-analysis data-visualization jupyter-notebook kaggle
Last synced: 15 Feb 2026
https://github.com/siddhant2105s/bring-your-own-device-boyd-system
This repository contains the design and implementation of the Bring Your Own Device (BYOD) System for managing personal devices at Life Insurance Company. It includes an ERD diagram, MySQL scripts for database creation, data insertion, and queries, as well as detailed data definitions and system requirements documentation.
data-analysis database-design database-normalization entity-relationship-diagram entity-relationship-models my-sql relational-databases relational-model sql-queries
Last synced: 15 Feb 2026
https://github.com/swethajoseph/sales-eda-project
Performed an advanced Excel-based exploratory data analysis (EDA) of an E-Commerce sales dataset to create an interactive dashboard for uncovering key business insights.
advancedexcel data-analysis data-visualization datacleaning dataformatting exploratory-data-analysis msexcel pivot-tables
Last synced: 19 Mar 2026
https://github.com/arunesh-tiwari/sales-analysis
Tableau Data Analysis Project.
data-analysis data-visualization tableau
Last synced: 01 Mar 2026
https://github.com/badranalyst/covid-deaths-dashboard-with-tableau
This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.
covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization
Last synced: 02 Mar 2026
https://github.com/madusales/powerbi-etl-elt
I have been studying, through DIO's Data Analytics & Power BI bootcamp, the use of SQL to build BI solutions. This repository is dedicated to recording what I have learned so far about what BI is, types of analysis, ETL, and ELT.
big-data business-intelligence data-analysis powerbi
Last synced: 19 Mar 2026
https://github.com/steno-aarhus/mediation-analysis-course
Modern mediation analysis for basic, clinical and epidemiological research in diabetes and endocrinology
data-analysis data-analysis-in-r diabetes diabetes-epidemiology mediation-analysis open-educational-resource
Last synced: 03 Mar 2026
https://github.com/samalyarov/practicum_projects
Various data analysis projects displaying tools and instruments that I am proficient with
data-analysis datetime folium geojson matplotlib numpy pandas plotly postgresql powerpoint python regular-expressions requests-library-python scipy seaborn sql sqlalchemy tableau tqdm
Last synced: 02 Apr 2026
https://github.com/lintangwisesa/ujian_analyticsvisualization_jcds07
Exam question guide for Data Analytics & Visualization, Job Connector Data Science batch 7
data-analysis data-science data-visualisation exam
Last synced: 04 Mar 2026
https://github.com/edanur-y/bank-customer-churn-prediction-with-classification-models
Comparing the performance of multi-layer perceptron, decision tree, random forest, gradient boosting, and extreme gradient boosting classifiers on customer data to predict whether customers will leave the bank.
data-analysis data-transformation hyperparameter-tuning python
Last synced: 16 Apr 2026
https://github.com/dpb24/netflix-global-top-10-performance
Using Machine Learning to predict Netflix Global Top 10 viewership trends (Python & R)
data-analysis data-science data-visualization decision-tree-regression gradient-boosting-regressor machine-learning media netflix predictive-analytics predictive-modeling python r random-forest random-forest-regression regression-models sklearn streaming-video xgboost-regression
Last synced: 16 Apr 2026
https://github.com/themihirmathur/qlik-intern-project
Qlik analysis of road safety and accident patterns in India. 📈 Analyzed and visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik. 📉 Reduced inefficiencies by 25% by designing a data-tracking dashboard that monitored injuries.
data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview
Last synced: 04 Mar 2026
https://github.com/yasumorishima/yasumorishima
Manufacturing Engineer & Data Analyst. 17 years exp in MFG. Python, VBA, Automation Specialist. (盛島康徳 / Yasunori Morishima)
automation data-analysis manufacturing portfolio python vba
Last synced: 05 Mar 2026
https://github.com/satyacoder29/e-commerce-sales-analysis
Performed E-commerce Sales Analysis to identify trends, optimize sales, and improve decision-making. Analyzed customer patterns, seasonal trends, and product performance using Python, SQL, and Power BI. Delivered actionable insights to enhance revenue, streamline inventory management, and boost customer engagement.
data-analysis data-visualization datacleaning msexcel pivottables powerquerym visualisation vlookups
Last synced: 05 Mar 2026
https://github.com/pizofreude/divvybikes-share-success
Developing a data-driven marketing campaign for Divvy to convert casual riders into annual members. Divvy is a bike-share program of the Chicago Department of Transportation (CDOT).
airflow bi-analytics data-analysis data-engineering data-visualization database dbt docker etl jupyterlab python r redshift s3
Last synced: 17 Apr 2026
https://github.com/vaishnavis03/finlatics_ml_program
This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.
correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn
Last synced: 17 Apr 2026
https://github.com/nicodupont/kaggle
All my Data analysis and Competitions
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 17 Apr 2026
https://github.com/mahmoudwal27/manufacturing_downtime
This project focuses on improving manufacturing efficiency by analyzing production data. Using Python, SQL, and Power BI, we built interactive dashboards to uncover patterns, minimize downtime, and optimize operations. The goal is to help stakeholders make data-driven decisions for enhanced productivity.
data-analysis data-analysis-python data-visualization google-colab powerbi python sql
Last synced: 17 Apr 2026
https://github.com/sharmas1ddharth/data-analysis-with-python
Code for freeCodeCamp's Data Analysis with Python projects
data-analysis data-analysis-with-python freecodecamp-project
Last synced: 03 Jun 2026
https://github.com/rishisolanke/pdf_query_langchain
PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis with ease.
artificial-intelligence data-analysis document-query langchain natural-language-processing nlp openai pdf-analysis pdf-extraction python research-tool
Last synced: 17 Apr 2026
https://github.com/royungar/sql_chicago_data_analysis_project
SQL-based data analysis project using SQLite, pandas, and Jupyter SQL magic commands. Analyzes crime, school, and census data from Chicago to explore socioeconomic patterns using filtering, joins, aggregation, and subqueries.
aggregation census-data chicago crime-data data-analysis data-engineering education-data ibm jupyter-notebook pandas sql sqlite subqueries
Last synced: 04 Jun 2026
https://github.com/phanchenh/businessdashboard_powerbiproject_pizzadataset
Pizza Business Performance Analysis and Growth Strategies 2015
business-analytics business-intelligence dashboard data-analysis data-visualization insights powerbi python
Last synced: 17 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/sanam2405/ahs
Analysis of the results of the AHS Madhyamik Examination 2022
data-analysis data-visualization jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solutions to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/vagnerbellacosa/116_usandoamazontextractocrextracaodadosdynamodb
Using Amazon Textract as OCR for Data Extraction in DynamoDB
amazon-textract aws data-analysis data-extraction digital-innovation-one dio dynamodb lab ocr python
Last synced: 18 Apr 2026
https://github.com/wang-q/tva
tva: Tab-separated Values Assistant
cli command-line-tool csv data-analysis data-processing etl high-performance rust streaming tabular-data tsv unix-philosophy
Last synced: 05 Apr 2026
https://github.com/mtimma001/clinical-trial-data-tool
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 05 Apr 2026
https://github.com/al-ghaly/prosper-loans-analysis
A statistical analysis project that analyzes a finance company's loan data using Python packages (pandas, NumPy, seaborn, matplotlib)
data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics
Last synced: 18 Apr 2026
https://github.com/jordanconallluthaiswright/purchase-behaviour-data-analysis
This project analyzes Black Friday purchase behavior for Company XYZ, uncovering trends by gender, age, and location. Using data cleaning, statistical analysis, and visualization, it evaluates spending patterns, confidence intervals, and category preferences to provide actionable insights for optimizing marketing strategies and targeting.
business-analytics data-analysis jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/mi7773/advanced_sql_data_analytics_project
A hands-on SQL project simulating data analysis using fact and dimension tables, covering trends over time, cumulative metrics, performance breakdowns, segmentation, and reporting via SQL.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics database query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 Apr 2026
https://github.com/mksingh431/free-data-science-courses
Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.
course data data-analysis data-science data-visualization free freecou python
Last synced: 19 Apr 2026
https://github.com/arv-anshul/notebooks
My Jupyter notebooks in which I practice data science.
data-analysis data-science jupyter-notebook llm machine-learning marimo matplotlib regression transformers
Last synced: 19 Apr 2026
https://github.com/yuvrajsaraogi/unemployment-analysis-with-python
Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.
big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql
Last synced: 19 Apr 2026
https://github.com/souraevshing/data-science-01
Data analysis using jupyter notebook.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 19 Apr 2026
https://github.com/diegoglezsu/bulletin-fetcher
bulletin-py is a Python package for easily fetching bulletins and legal acts from a wide variety of European Union sources.
data-analysis european-union legal-documents python sparql
Last synced: 19 Apr 2026
https://github.com/mlucifer27/bilateral-visualization
A Streamlit app that visualizes bilateral relationship scores between 100 countries from 1945 to 2024. It supports interactive heatmaps, network graphs, pairwise comparisons, and more.
d3blocks data-analysis data-visualization plotly-python python streamlit
Last synced: 04 Jun 2026
https://github.com/akash-v7/telecom_customer_churn_prediction
A machine learning project to predict customer churn in the telecom industry using data analysis and classification models. The project includes data preprocessing, exploratory data analysis (EDA), model building, and insights to help telecom companies improve customer retention strategies.
data-analysis data-science data-visualization jupyter-notebook machine-learning predictive-modeling python
Last synced: 20 Apr 2026
https://github.com/kmbuki/uk_police_data
R programming - Using open data about crime and policing in England, Wales and Northern Ireland.
data-analysis data-visualization r
Last synced: 04 Jun 2026
https://github.com/dthung1602/goodread-bestbook-prediction
Data analysis - trying to predict the result of the Goodreads Choice Awards
data-analysis goodreads pca python r xgboost
Last synced: 20 Apr 2026
https://github.com/jerinpious/movie-recommendation-system
A content-based movie recommendation system built using Python. The system processes movie data, extracts relevant features, and provides recommendations based on user preferences
content-based-recommendation data-analysis jupyter-notebook machine-learning pandas python streamlit
Last synced: 20 Apr 2026
https://github.com/sarthakmishraa/bike_rental_predictor
Bike Sharing Dataset: this dataset contains the hourly and daily counts of rental bikes between 2011 and 2012 in the Capital Bikeshare system, with the corresponding weather and seasonal information.
data-analysis machine-learning python xgboost
Last synced: 20 Apr 2026
https://github.com/abinashsahoo007/project-bankruptcy-prevention
The project creates a classification model that predicts the chances of a business facing bankruptcy based on key features such as Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, and Operating Risk.
data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit
Last synced: 20 Apr 2026
https://github.com/ak-pydev/python_practice
Documenting my learning journey from python -> ML -> DL -> LLM/GenAI -> Agents exercises solved daily from Udemy/Kaggle/YouTube.
data-analysis data-science feature-engineering llms machine-learning mlflow mlops-workflow modeling python3 streamlit uvicorn
Last synced: 20 Apr 2026
https://github.com/salfaris/toy-data-analysis
Random toy data projects. For my portfolio data projects, see linked website
Last synced: 20 Apr 2026
https://github.com/mahmoudwal27/e-commerce-data-analysis
A collection of data analysis and visualization projects focused on ecommerce datasets. Using Python in Google Colab for analysis and Excel for exploration, these projects uncover key insights and trends, showcasing expertise in data manipulation and visualization to inform business decisions.
analytics data-analysis data-analysis-python data-set google-cloud python
Last synced: 21 Apr 2026
https://github.com/florence-nyokabi/house-power-consumption
Machine Learning: Exploring Regression Analysis
data-analysis data-cleaning data-science data-visualization feature-engineering jupyter-notebook jupyterlab machine-learning pandas-python regression-analysis regression-models
Last synced: 05 Jun 2026
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/prady2309/air-traffic-analysis
data-analysis data-science data-visualization flights jupyter-notebook python3
Last synced: 21 Apr 2026
https://github.com/anushkundu/churn-prediction
Telecom Customer Churn Prediction Using Machine Learning!
accuracy-score classification-algorithm classification-report data-analysis data-science deep-learning gradient-boosting-classifier keras-tensorflow logistic-regression machine-learning random-forest-classifier recall-precision roc-auc-score smote-sampling svm-classifier
Last synced: 21 Apr 2026
https://github.com/maddieemihle/home_sales
A PySpark-powered analysis of real estate trends using home sales data. This project explores average prices by year, room configuration, and property features, while demonstrating SparkSQL, caching, and partitioning techniques in a scalable data pipeline—all within Google Colab
apache-spark caching data-analysis googlecolab parquet pyspark sparksql
Last synced: 21 Apr 2026
https://github.com/floffah/my-listening
Various ways to analyse your Spotify extended streaming history data
convex data-analysis listening-history spotify
Last synced: 23 Apr 2026
https://github.com/al-ogr/sf_pr1_job_analysis_hh
SkillFactory DataScience PROJECT-1. Analysis of resumes from HeadHunter
data-analysis data-science ipynb plotly python
Last synced: 23 Apr 2026
https://github.com/tranngoca5039/bigquery-a5y
📊 Streamline your data analysis with bigquery-a5y, a powerful tool for optimizing BigQuery performance and improving query efficiency.
analytics api big-data bigquery cloud-computing data-analysis data-integration data-management data-pipeline data-visualization data-warehouse google-cloud machine-learning serverless sql
Last synced: 05 Jun 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/thc1006/nycu_timtable_crawler
🎓 NYCU Course Data Crawler & Timetable System | National Yang Ming Chiao Tung University course crawler and course selection system - Python web scraper for course schedules, syllabi & educational data analysis. Crawls 18K+ courses with 98% success rate. Features: interactive timetable, JSON API, Google Colab support, batch processing, resume capability.
academic course course-selection crawler data-analysis education educational-data google-colab json-api nycu open-data python schedule student-tools syllabus taiwan timetable university web-automation web-scraping
Last synced: 24 Apr 2026
https://github.com/shudhanshurp/adidas-us-data-analysis
This Power BI project analyzes Adidas sales data across different regions, retailers, and product categories in the U.S. The dashboards provide insights into sales performance, operational metrics, and future forecasts to support data-driven decision-making.
data-analysis data-transformation data-visualization forecasting powerbi python retail-analytics
Last synced: 24 Apr 2026
https://github.com/ihnokim/datk
Data Analysis Toolkit (DATK)
data-analysis data-engineering data-science deep-learning image-processing pandas signal-processing
Last synced: 24 Apr 2026
https://github.com/misszeferino/cyclistic-bike-share-analysis
Data Analysis using R
data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 06 Jun 2026
https://github.com/strixion/demoversion_ai
The demoversion of StrixionAI
ai csv data-analysis data-analytics json python txt
Last synced: 24 Apr 2026
https://github.com/arunabhagit/inventory-misalignment-and-revenue-loss-in-multi-store-bike-retail
This project focuses on identifying the inventory and demand mismatch causing stagnant sales and lost revenue in a bike retail chain. By analyzing store-level performance and regional customer preferences, the project aims to detect underperforming products.
data-analysis data-visualization powerbi python
Last synced: 24 Apr 2026
https://github.com/henriquetourinho/s.i.g.m.a
File search and analysis platform for Linux, with an advanced PySide6 GUI and a focus on rich metadata for in-depth investigations.
data-analysis developer-tools file-search metadata open-source pyqt pyside6 python python-brasil qt6 sysadmin-tools
Last synced: 24 Apr 2026
https://github.com/datalopes1/bank_marketing
This project will be based on the Bank Marketing dataset found in the UC Irvine Machine Learning Repository and made available by S. Moro, R. Laureano, and P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 24 Apr 2026
https://github.com/voidnire/redditviralmysteryposts
Analysis of posts from mystery subreddits. What defines a viral post in this type of sub?
data-analysis data-visualization mysteries mystery nlms python-3 reddit
Last synced: 24 Apr 2026
https://github.com/yxuco/ethdecoder
This CLI decodes Ethereum transactions and events, stores results in CouchDB, and then exports customized views to CSV files for data visualization and analysis.
data-analysis decoding ethereum
Last synced: 24 Apr 2026
https://github.com/yuvrajsaraogi/-iris-flower-classification
The iris flower has three species (setosa, versicolor, and virginica), which differ in their measurements. Given the measurements of iris flowers labeled by species, the task is to train a machine learning model that learns from these measurements and classifies the species.
classification data data-analysis data-science data-visualization flower flower-classification iris iris-classification iris-flower iris-flower-classification knn knn-classification machine-learning machine-learning-algorithms ml natural-language-processing nlp python
Last synced: 24 Apr 2026
https://github.com/lightning-chart/lcjs-example-0507-dashboardfiberanalysis
A demo application showcasing the use of LightningChart JS to visualize fiber analysis data.
area-plot area-series chart charts dashboard data-analysis demo heatmap javascript lcjs lightningchart-js performance visualization webgl
Last synced: 24 Apr 2026
https://github.com/manisharora96/data-analysis-of-smartwatch
The project is structured with sample data, step-by-step Jupyter notebooks, and modular Python scripts for automated analysis
data-analysis data-visualization jupyter-notebook python smartwatch-analysis
Last synced: 24 Apr 2026
https://github.com/edwinrlambert/emomap-sentiment-analysis
Analyzes public sentiment related to specific locations in a city (e.g., parks, transit stations, restaurants, neighborhoods) using geo-tagged social media posts, reviews, and comments. The goal is to visualize how people feel across different areas and times.
data-analysis jupyter-notebook python sentiment-analysis
Last synced: 24 Apr 2026
https://github.com/mariann95/sql_data_warehouse_and_analytics_project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics. This repository also contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
data-analysis data-analytics data-cleaning data-engineering data-lakehouse data-science data-science-portfolio data-warehouse data-warehousing datalake datawarehouse datawarehousing etl etl-job etl-pipeline medallion-architecture sql sql-query sql-server sqlserver
Last synced: 06 Jun 2026
https://github.com/tmoulik/bikeshare-python
Analysis of Bikeshare data from three major cities
data-analysis data-visualization python udacity-nanodegree
Last synced: 25 Apr 2026
https://github.com/m-biriulova/python-job-market-analysis
Web scraping, data analysis, and visualization of Python developer vacancies in the Czech Republic.
automation beautifulsoup data-analysis data-visualization portfolio-project python selenium web-scraping
Last synced: 25 Apr 2026
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 25 Apr 2026
https://github.com/chandansoren/customer-personality-analysis
Predict how different customer segments will respond to a particular product or service.
data-analysis data-visualization python
Last synced: 26 Apr 2026
https://github.com/ys1f/geothermal_project
Geothermal Data Analysis & Visualization for Texas – well data, temperature gradients & zone mapping
bht bottom-hole-temperature data-analysis folium geopandas geospatial geothermal gis interpolation irena jupyter-notebook mapping python rasterio spatial-analysis temperature-gradient texas visualization well-data zone-mapping
Last synced: 26 Apr 2026
https://github.com/pararang/nams-thesis-fuzzy
A specialized data processing tool designed to help with Fuzzy Delphi Method calculations for thesis research data analysis. Later extended with new features for data processing with other methods.
data-analysis dematel hacktoberfest hacktoberfest-accepted house-of-quality python sustainability vibecoding
Last synced: 27 Apr 2026
https://github.com/odinleepro/airbnbnewyorkcityanalysis
AirbnbNewYorkCityAnalysis is a comprehensive data analysis and visualization project exploring short-term Airbnb rental trends across New York City (2008–2022). Using open source Airbnb data, the project combines data cleaning, statistical summaries, and Tableau dashboards to uncover pricing patterns, borough level distribution, and insights.
airbnb analytics-project data-analysis data-cleaning data-science data-visualization new-york-city real-estate-analytics tableau urban-analysis
Last synced: 27 Apr 2026
https://github.com/malexandersalazar/covid-19-peru-estimacion-oxigeno-requerido
Technical analysis of confirmed COVID-19 cases in Peru to estimate the medical oxygen required.
covid-19 data-analysis data-science peru python
Last synced: 27 Apr 2026
https://github.com/hfzdzakii/dicoding-airqualityanalysisdata
This repo is a master submission for my Dicoding Final Project. The Air Quality Dataset is used to fulfill the submission. Feel free to explore, and I hope my work gives you some insight!
data-analysis data-visualization streamlit
Last synced: 27 Apr 2026
https://github.com/tillscode/personal-finance-ml-analysis
Machine learning analysis of personal financial data with predictive modeling and interactive dashboard
dashboard data-analysis finance machine-learning python scikit-learn
Last synced: 28 Apr 2026
https://github.com/sujata-adhikari/data-analysis
Data analysis of market sales data using Power BI, with a dashboard created to present the analysis.
data-analysis excel pandas powerbi
Last synced: 12 Jun 2026
https://github.com/jovicdev97/financial-loan-datascience-notebook
Using NumPy and pandas to analyze a synthetic loan dataset with Python
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 28 Apr 2026
https://github.com/ovuiproduction/tabletalk
TableTalk - Your Data, Your Language. Query tabular data using natural language, no SQL required! Upload your data, ask questions, and get instant insights. 🔹 Convert Natural Language to SQL 🔹 Handle Complex Queries & Aggregations 🔹 Upload CSVs for Easy Analysis 🔹 React + Flask + SQLite3 Backend 🔹 Powered by LLMs for Accuracy
ai data-analysis flask llm machine-learning natural-language-processing prompt-engineering react sql sqlite
Last synced: 28 Apr 2026
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This IPython notebook is an exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/matheusafonseca/python-data-visualization-matplotlib-seaborn-masterclass-udemy
This repository is dedicated to storing the code developed during the "Python Data Visualization: Matplotlib & Seaborn Masterclass" course on Udemy.
charts data-analysis data-analysis-python data-science data-visualization database graphics graphics-programming jupyter-notebook matplotlib matplotlib-plots python python3 seaborn seaborn-plots
Last synced: 28 Apr 2026
https://github.com/abhi227070/car-price-prediction
This project implements a machine learning model to predict the price of cars based on various features such as mileage, manufacturing date, fuel type, and more. Users can input car information, and the model will estimate the price of the car based on the provided data. This tool can be useful for both car buyers and sellers to estimate car price.
data-analysis machine-learning machine-learning-algorithms machinelearning python3 regression regression-models scikit-learn scikitlearn-machine-learning
Last synced: 28 Apr 2026
https://github.com/wei-rongrong2/openfoodfactclustering
A project that explores clustering food products based on nutritional attributes using K-Means, Fuzzy C-Means, and DBSCAN algorithms, with a Streamlit dashboard for visualizing results.
clustering dashboard data-analysis dbscan food-products fuzzy-cmeans k-means machine-learning nutrition nutrition-clustering open-food-facts streamlit
Last synced: 28 Apr 2026
https://github.com/bala-1409/titanic-survived-prediction-datascience-classification-project
This project predicts whether a passenger on the Titanic survived, using machine learning algorithms on the given passenger details.
classification-algorithm data-analysis data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning-algorithms matplotlib predictive-modeling python3 seaborn
Last synced: 28 Apr 2026
https://github.com/prady2309/sales-prediction-using-python
Implemented using Multiple Linear Regression
data-analysis data-science machine-learning python
Last synced: 29 Apr 2026
https://github.com/thanaraklee/pyspark-dataframe-operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
data-analysis dataframe pyspark python
Last synced: 29 Apr 2026
https://github.com/prateek5525/yt-analysis-project
This project utilizes the YouTube Data API to analyze channel and video performance, offering insights into subscriber counts, views, video metrics, and monthly trends. It generates visual reports and exports data in CSV format, aiding in effective decision-making and performance tracking.
data-analysis jupyter-notebook python3 seaborn-plots youtube-api
Last synced: 29 Apr 2026
https://github.com/vanshuchaudhary/zomato
This Jupyter Notebook contains an exploratory data analysis (EDA) of Zomato restaurant data. It includes data cleaning, visualization, and insights into restaurant ratings, pricing, cuisine distribution, and location-based trends.
business-analytics data-analysis data-mining data-science data-visualization datascience matplotlib pandas-dataframe pandas-python python python-3 python-library
Last synced: 29 Apr 2026