Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
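As a rough illustration of the four stages named in the definition above (inspecting, cleansing, transforming, modeling), here is a minimal sketch using only the Python standard library. The records, field names, and function names are hypothetical and do not come from any repository listed on this page; the "model" is just a per-customer mean, chosen to keep the example small.

```python
from statistics import mean

# Hypothetical raw records; names and values are invented for illustration.
raw_orders = [
    {"customer": "ana", "total": "12.50"},
    {"customer": "Ana ", "total": "7.50"},   # inconsistent spelling of the same customer
    {"customer": "bo", "total": None},       # missing value
    {"customer": "bo", "total": "20.00"},
]

def inspect_rows(rows):
    """Inspect: count the rows and how many totals are missing."""
    missing = sum(1 for r in rows if r["total"] is None)
    return {"rows": len(rows), "missing_total": missing}

def cleanse_rows(rows):
    """Cleanse: drop rows with a missing total and normalize customer names."""
    return [
        {"customer": r["customer"].strip().lower(), "total": r["total"]}
        for r in rows
        if r["total"] is not None
    ]

def transform_rows(rows):
    """Transform: parse totals as floats and group them by customer."""
    grouped = {}
    for r in rows:
        grouped.setdefault(r["customer"], []).append(float(r["total"]))
    return grouped

def model_rows(grouped):
    """Model: summarize each customer by mean order value."""
    return {customer: mean(totals) for customer, totals in grouped.items()}

summary = inspect_rows(raw_orders)
averages = model_rows(transform_rows(cleanse_rows(raw_orders)))
print(summary)
print(averages)
```

In practice each stage is usually handled by a library such as pandas, as in many of the projects listed below; the plain-Python version is only meant to make the sequence of steps visible.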
https://github.com/gemaquejr/restaurant-orders
A project aimed at applying OOP concepts and working with Set, Hashmap, and Dict. It was created for the final assessment of section 06 of the computer science module in the Web Development Course at Trybe.
data-analysis dict hashmap poo python set
Last synced: 30 Oct 2025
https://github.com/josedanielchg/nyc-schools-test-scores-exploration
DataCamp project analyzing NYC public school test scores to identify top math-performing schools, the best overall SAT scores, and borough-level variability using Python and pandas
data-analysis jupyter-notebook python
Last synced: 19 Mar 2025
https://github.com/badranalyst/startup-expansion-analysis-with-pandas-matplotlib-and-power-bi
Analyzes startup growth and expansion factors using Pandas for data analysis and Matplotlib for visualizations. Complements findings with data visualizations in Power BI, providing actionable insights into funding and market trends.
dashboard data-analysis data-visualization dataset matplotlib matplotlib-pyplot pandas power-bi powerbi
Last synced: 16 May 2026
https://github.com/estevan-ulian/py-agent-voice
A project for handling voice interactions between a human and an AI agent, allowing data from a CSV file to be read and analyzed.
agent-based-modeling data-analysis python3 whisper-ai
Last synced: 11 Apr 2025
https://github.com/fmind/malpop
Rank the popularity of malware applications by their occurrence on VirusTotal
data-analysis malware popularity ranking virustotal
Last synced: 11 Apr 2025
https://github.com/czesctuklap/sustainable-fashion-database-analysis
This project analyzes a dataset of sustainable fashion trends for 2024. It includes data preprocessing, exploration, visualization, and insights on environmental impact factors such as carbon footprint, water usage, waste production, and sustainability practices.
data-analysis data-visualization database dataset keggle sustainable-fashion
Last synced: 30 Apr 2026
https://github.com/felipe-veas/visor-sueldos-publicos
Interactive tool for visualizing and analyzing public-sector salaries in Chile, built with Streamlit.
audit chile data-analysis python streamlit transparency
Last synced: 16 May 2026
https://github.com/imnotamr/ai
A collection of machine learning and AI projects implemented in Jupyter notebooks, covering regression, classification, and neural networks
ai classification colab-notebook data-analysis data-preprocessing data-preprocessing-and-cleaning data-visualization deep-learning deep-neural-networks jupyter-notebook machine-learning model-evaluation predictive-modeling project-based-learning python supervised-learning supervised-learning-algorithms supervised-learning-classifiers unsupervised-learning unsupervised-learning-algorithms
Last synced: 17 May 2026
https://github.com/satvikpraveen/pcc-vizforge
🎨 Personal data visualization toolkit generating synthetic datasets across multiple domains (random walks, dice simulations, weather patterns, earthquakes, GitHub analytics) with beautiful Matplotlib & Plotly visualizations. Includes Jupyter notebooks, interactive dashboards & statistical analysis. Perfect for learning data science! 🚀📊
analytics dashboard data-analysis data-generation data-science data-visualization github-analytics interactive-visualization jupyter-notebook matplotlib plotly probability python random-walk scientific-computing seismology statistical-analysis synthetic-data time-series weather-data
Last synced: 17 May 2026
https://github.com/eslamdyab21/a-b-test-to-an-e-commerce-website
A/B test for an e-commerce website
csv data-analysis data-science hypothesis-testing pandas python udacity-data-analyst-nanodegree
Last synced: 17 May 2026
https://github.com/gabrielczar/machine-learning
Data Analysis and Machine Learning repository
data-analysis data-science jupyter-kernels jupyter-notebook learning-exercise machine-learning
Last synced: 14 Jul 2025
https://github.com/tinaland101/python-api-challenge
This project involves analyzing weather data from cities around the world using the OpenWeatherMap API and creating visualizations to explore the relationship between weather variables and latitude.
api-integration-and-data-retrieval data-analysis data-collection-and-geospatial-analysis problem-solving-and-decision-making statistical-analysis
Last synced: 03 Mar 2025
https://github.com/fabricioism/nyc-schools-perceptions
data-analysis data-science data-visualization dataquest r
Last synced: 29 Oct 2025
https://github.com/nishumehta/house-sales-analysis
House Sales Analysis Dashboard for King County, Washington, built with Tableau. Features interactive charts and maps to explore sales patterns, price distributions, and property conditions.
dashboard data-analysis data-visualization tableau tableau-dashboards tableau-public
Last synced: 11 Jan 2026
https://github.com/rdrahul123/my_python-codes
Python Programming codes and Notebooks
anaconda data-analysis data-science jupyter-notebook python python3 visual-studio
Last synced: 17 May 2026
https://github.com/arv-anshul/pw-api
Perform data analysis on PW Skills APIs. Made a web app using streamlit. See any course syllabus, analytics, quizzes and assignments.
api course data-analysis ineuron-ai physics-wallah project pw-skills python3 streamlit
Last synced: 18 Apr 2026
https://github.com/edjoukou/altip-sales-analysis
Sales data analysis
data-analysis mysql-database sql tableau visualization
Last synced: 20 Jul 2025
https://github.com/abidshafee/google.colaboratory_projects
This repository contains a collection of interactive Python notebooks (ipynb) from my projects on Data Science, Machine Learning (ML), and Natural Language Processing (NLP).
colaboratory data-analysis data-science lstm machine-learning nlp statistics time-series
Last synced: 09 Jul 2025
https://github.com/betkh/datascieneinpython
Jupyter Notebook files
data-analysis data-visualization
Last synced: 16 Jun 2025
https://github.com/clemence-g/heat-dome-analysis
atmospheric-science data-analysis geopotential heat-wave jupyter
Last synced: 18 Mar 2025
https://github.com/lauratrigo/codigo_roti
ROTI Analysis is a MATLAB tool for processing and visualizing ionospheric data (ROTI) from multiple GNSS stations. Developed for space geophysics research, the script generates comparative time-series plots with quality filters and handling of missing data. 📡
data-analysis geophysics image-processing matlab roti scientific-initiation
Last synced: 24 Jun 2025
https://github.com/sadratehranian/prediction-of-covid-19-diagnosis
Builds an algorithm in MATLAB using ML techniques to predict whether a person has COVID-19 based on existing medical conditions. Further research was conducted to identify the most suitable machine learning techniques and increase their prediction accuracy.
covid-19 data-analysis data-science data-visualization machine-learning matlab prediction visualization
Last synced: 11 Sep 2025
https://github.com/thyripian/ibm_data_science_capstone
data-analysis data-science data-visualization python python3
Last synced: 12 May 2026
https://github.com/qorah/vic-edu-housing-insights
Analysis of education outcomes and housing affordability in Victoria, Australia.
data-analysis jupyter-notebook
Last synced: 18 Mar 2025
https://github.com/mahdikh03/custumers_clustering_rmf
A data analysis project to implement RFM (Recency, Frequency, Monetary) analysis for customer segmentation and behavior analysis using the K-Means algorithm.
customer-segmentation data-analysis k-means-clustering unsupervised-learning
Last synced: 09 May 2025
https://github.com/niaid/categorical-data-analysis
bcbb-training data-analysis data-science r statistics
Last synced: 24 Jun 2025
https://github.com/riciokzz/covid-19-analysis
Covid-19 Analysis In South Korea
covid-19 data-analysis data-cleaning data-engineering exploratory-data-analysis machine-learning south-korea
Last synced: 20 Jul 2025
https://github.com/niaid/genetic-linkage-analysis
Materials for ACE course on Genetic Linkage Analysis.
ace ace-uganda2020 analysis bcbb-training clinical data-analysis genetics ngs ngs-analysis
Last synced: 24 Jun 2025
https://github.com/ebrizzzz/data-visualization-project-using-tableau
A data visualization project for the Visual Data Analysis course (Spring Term 2025) at the University of Skövde. This project explores the factors influencing national happiness scores across different global regions from 2005 to 2022.
analytics data data-analysis data-science data-visualization python regression tableau
Last synced: 16 Jun 2025
https://github.com/iamsainikhil/us-births-analysis
Analysis of US births during 1994-2003 based on the CDC-NCHS data set.
Last synced: 16 May 2026
https://github.com/muneeb1030/webscrapper_altnews
The project utilizes a combination of Python, Scrapy, and Selenium to navigate through the dynamic content of AltNews.in and collect valuable information for analysis and verification.
data-analysis data-collection python3 scrapy scrapy-spider selenium selenium-python
Last synced: 17 May 2026
https://github.com/macorisd/instagram-fake-account-analysis
A project in R focused on detecting fake Instagram accounts. It includes exploratory data analysis, data visualization, and analysis using three techniques: association rules, formal concept analysis, and regression. The results are presented in an interactive Quarto book.
data-analysis data-science data-visualization r
Last synced: 10 Jun 2025
https://github.com/eslamdyab21/apara-data-gui
Custom application for Apara's data wrangling scripts. Technologies used are Qt Designer and PyQt5 for the GUI, and Pandas and NumPy for the data work.
csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui
Last synced: 17 May 2026
https://github.com/tknishh/investing-platform
An investing platform application to help users get information and analyze various foreign currency assets. The investing platform uses an ETL pipeline to insert new batches of Forex data once a day.
data-analysis investing-platform pipeline
Last synced: 18 Mar 2025
https://github.com/imdadmiran17/data_analysis_exercise
data-analysis numpy numpy-arrays numpy-exercises python3
Last synced: 17 May 2026
https://github.com/shaikh-raj/data-science-portfolio
Data Science Portfolio of Raj Shaikh including Case Studies and Articles that I have completed that solve various business problems.
articles case-study data-analysis deep-learning machine-learning nlp statistics
Last synced: 20 Jul 2025
https://github.com/arction/lcjs-example-0507-dashboardfiberanalysis
A demo application showcasing using LightningChart JS to visualize fiber analysis data.
area-plot area-series chart charts dashboard data-analysis demo heatmap javascript lcjs lightningchart-js performance visualization webgl
Last synced: 12 Mar 2025
https://github.com/brevex/hotel-booking-demand-data-analysis
Data analysis in Python of demand for urban hotels and resorts showing their causes and relationships
data-analysis data-science hotel-booking-analysis kaggle python
Last synced: 08 May 2026
https://github.com/nehul1149/olympic-data-analysis
This project is an interactive data visualization and analytics platform for exploring historical Olympic Games data. Built with Python and Streamlit, it offers an in-depth analysis of medal tallies, athlete statistics, and country-wise performance trends, providing users with powerful insights into the world's biggest sporting event.
analysis data-analysis data-science data-visualization matplotlib python streamlit
Last synced: 18 May 2026
https://github.com/soumasish2005/ai-chatbot-using-snowflake
This project is a Streamlit application that allows users to upload a CSV file and ask questions about their data in natural language.
cloud data-analysis data-science data-visualization python snowflake streamlit
Last synced: 17 May 2026
https://github.com/mfakhriazhar/stock-price-prediction
Stock prices are highly volatile and influenced by various factors, making accurate prediction a major challenge in investment decisions.
data-analysis data-science deep-learning python recurrent-neural-networks
Last synced: 18 May 2026
https://github.com/thecoderpinar/globalwarmingforecast
🌍 Global Warming Forecast Tool: an advanced tool for analyzing and forecasting climate trends using ARIMA and Prophet models, with interactive visualizations and scenario simulations.
arima climate-change data-analysis environmental-science forecasting global-warming machine-learning prophet streamlit time-series-analysis visualization
Last synced: 27 Mar 2025
https://github.com/spring-0/netflix-media-data-analysis
Exploring and analyzing Netflix data to uncover trends through data visualization and statistical analysis.
Last synced: 27 Mar 2025
https://github.com/jasonsu131/cps188-term-project
A data analysis program developed in C to extract information about diabetic patients across Canada from a governmental spreadsheet available online. The program showcases summaries and averages based on the extracted data.
c data-analysis data-statictics file-reading
Last synced: 28 Mar 2025
https://github.com/bjornmelin/minneanalytics
MinneAnalytics project work.
competitive-programming data-analysis data-visualization r
Last synced: 09 Jul 2025
https://github.com/velut/thesis-sw
Software and datasets used in the "Cost-effective and Scalable Activity Matching using Crowdsourcing" thesis
bpmn cost crowdflower crowdsourcing data-analysis dataset performance-analysis plotting-algorithms r thesis
Last synced: 19 Jun 2025
https://github.com/mae776569/weratedogs-wrangling
Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations
data-analysis data-science data-visualization tweets twitter-api
Last synced: 25 Jan 2026
https://github.com/simranjeet97/covid-19
Covid-19 Data Analysis and Important Topics to be Covered to get the Impact and Solution.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/mfakhriazhar/ecom-qtt-prediction
In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.
data-analysis data-science data-visualization e-commerce-project eda machine-learning python
Last synced: 19 May 2026
https://github.com/kenwuqianghao/scotiabank-datathon-2023
Code and data analysis done for 2023 Scotiabank Datathon
data-analysis fraud-detection jupyter-notebook python
Last synced: 18 May 2026
https://github.com/yash22222/british-airways-data-science-internship
Both tasks assigned in the British Airways Data Science Virtual Internship Programme
csv data-analysis data-science data-visualization google-colaboratory jyputer-notebook machine-learning microsoft-excel microsoft-powerpoint python
Last synced: 16 May 2026
https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data
Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.
data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping
Last synced: 30 May 2026
https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset
This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations
business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis
Last synced: 07 Apr 2026
https://github.com/annaanastasy/classification-project-student-grades
A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.
catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling
Last synced: 29 Mar 2025
https://github.com/manuelgil/vscode-data-pack
This extension pack includes the essential extensions for data analysts.
data-analysis data-science data-structures data-visualization vscode-extension
Last synced: 07 Apr 2026
https://github.com/sondosaabed/data-visualization-in-tableau
data-analysis data-visualization nanodegree plot tableau udacity
Last synced: 08 Sep 2025
https://github.com/prarthana-singh/heart-attack-prediction-model
A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.
classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn
Last synced: 25 Jun 2025
https://github.com/sparkerdata/hockeyshotmap
Interactive Streamlit app for NHL shot maps & player analysis. Pulls live (or demo) play-by-play data, normalizes rink coordinates, and visualizes shots with context filters (strength, period, player).
data-analysis data-visualization duckdb hockey hockey-analytics ice-hockey nhl nhl-data python sports sports-analytics
Last synced: 18 May 2026
https://github.com/robinmillford/analyzing-e-commerce-transactions---data-cleaning-cohort-analysis-and-sql
In this project, I aimed to analyze the profitability of products in an e-commerce dataset. I performed various SQL queries to extract valuable insights about product profitability, including the identification of the top 5 products with the highest profit margin, and unique combinations of brands and product lines with the highest profitability.
cohort-analysis data-analysis data-visualization excel jupyter-notebook powerbi python3 sql
Last synced: 18 May 2026
https://github.com/ivanayala96/end-to-end-business-intelligence-solution-logistics-financial-performance-dashboard
Project Overview: This project features a comprehensive Power BI solution developed for Ayala's Consultancy. It transforms raw operational data (generated via Python) into a strategic decision-making tool, managing a dataset of $7.71M in total sales and over 2,500 transactions.
anlytics bussines-report bussiness-intelligence data-analysis dax power-bi powerbi python
Last synced: 22 Apr 2026
https://github.com/mostafa-ghorab/global-happiness-analysis
An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.
business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/dacosmicgiant/marketing-sms-analyser
Mini project for R language SEM - V
Last synced: 21 Mar 2025
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/stefagnone/unsupervised-analysis-project
This project investigates the impact of video content on social media engagement using advanced analytics techniques like PCA, k-means clustering, and logistic regression. It provides actionable insights for optimizing social media strategies for Thai fashion and cosmetics retailers.
data-analysis data-visualization engagement-metrics facebook-live-sellers k-means-clustering logistic-regression marketing-insights pca-analysis python social-media-analytics
Last synced: 05 Apr 2025
https://github.com/stefagnone/data_storyboarding_visualization
Data Storyboarding and Visualization Techniques for Effective Communication
data-analysis data-visualization ggplot2-analysis r tableau-dashboards
Last synced: 05 Apr 2025
https://github.com/stefagnone/-wedding-vendor-pricing-and-customer-satisfaction-analysis
Data-driven analysis of wedding vendor pricing and customer satisfaction, with database design, SQL optimization, and cost breakdown generation.
business-intelligence cost-optimization customer-satisfaction data-analysis database-design python sql vision-board-analysis wedding-planning wedding-vendor-pricing
Last synced: 03 May 2026
https://github.com/stefagnone/moneyball_project
Data-driven analysis inspired by the Moneyball approach, identifying affordable replacements for key Oakland A's players using R and sabermetrics to support cost-effective recruitment.
baseball-statistics data-analysis data-driven-decision-making player-replacement-strategy r-programming sabermetrics sports-analytics
Last synced: 05 Apr 2025
https://github.com/rorrell/rightwhaledata
A Jupyter Notebook where I wrangle some data on right whale sightings and create a visualization
data-analysis data-visualization jupyter-notebook python3
Last synced: 11 May 2026
https://github.com/nour-zayed/shopping-trends-analytics-sql-python-power-bi
End-to-end Shopping Trends analytics project using SQL, Python, Excel & Power BI: data cleaning, EDA, KPI generation, and interactive dashboards with DAX for actionable business insights.
business-intelligence data-analysis data-visualization dax powerbi python sql
Last synced: 18 May 2026
https://github.com/jatin-mehra119/car_price_prediction
Predicting car prices using a small dataset.
data-analysis data-visualization jupyter-notebook machine-learning python regression-models sklearn sklearn-pipeline
Last synced: 07 Apr 2026
https://github.com/tarasbln/big-quant
Official public repository of the Berlin Investment Group (BIG) Quant Team, featuring quantitative finance research, algorithmic trading strategies, market analyses, educational materials, and open-source projects.
data-analysis education finance investment investment-club python3 quantative-finance quantative-trading quantitative-research research
Last synced: 21 Mar 2025
https://github.com/misszeferino/netflix-exploratory-analysis
Netflix exploratory analysis using python
data-analysis data-visualization pandas plotly python
Last synced: 07 Apr 2026
https://github.com/jayita11/exploring-most-streamed-songs-for-last-four-decades-eda
Perform EDA to uncover trends in streaming patterns, likes, and artists over the last four decades.
data-analysis eda hypothesis-testing matplotlib most-streamed-songs pandas python seaborn
Last synced: 07 Apr 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/darshan1924/house-price-pridiction
This repository contains a machine learning project for predicting house prices based on various features, including geographical coordinates. The project includes data preprocessing steps to handle ...
data-analysis data-preprocessing house-prices jupyter-notebook machine-learning prediction
Last synced: 27 Mar 2025
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 02 Jan 2026
https://github.com/oshinrathor/Data-Science-Systems-and-Analytics-Projects
Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.
dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics
Last synced: 12 Sep 2025
https://github.com/mosalem149/pythonutilities
A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.
data-analysis educational-tools file-io file-manipulation grade-calculation python text-analysis text-processing utility word-counting
Last synced: 15 May 2026
https://github.com/hoxo-m/blog
HOXO-M Blog
data-analysis data-science r-package
Last synced: 30 Oct 2025
https://github.com/spshah1701/world-development-indicators
Analysis of World Development Indicators (WDI) using big data technologies, specifically Databricks, Apache Spark, and Scala.
apache-spark big-data data-analysis spark-sql
Last synced: 17 Mar 2025
https://github.com/akash1070/predicting-zomato-restaurant-ratings
Performs extensive exploratory data analysis (EDA) on the Zomato dataset, builds a machine learning model that helps Zomato restaurants predict their ratings based on certain features, and deploys the model via Flask.
data-analysis extratreesregressor flask linear-regression machine-learning random-forest zomato-bangalore zomato-data-analysis
Last synced: 18 May 2026
https://github.com/huynhtanphatt/diagnosing-uk-railway-performances
This project analyzes UK railway ticket and operation data to show how revenue, passenger demand, and on-time performance are connected.
data-analysis data-visualization datastorytelling python railway sql ticketing transportation
Last synced: 24 Apr 2026
https://github.com/sbera01/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 24 Dec 2025
https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal
Holistic analysis of care provided for rare diseases, orphan diseases, and transplants covered by FISSAL in Peru.
data-analysis data-visualization python
Last synced: 24 Feb 2025
https://github.com/dinamohsin/toman-bikeshare-data-analysis-sql-power-bi
This project involves data analysis using SQL, Power BI, and CSV datasets to extract insights and visualize key business metrics.
csv-files data-analysis data-visualization database powerbi sql sql-server
Last synced: 22 Apr 2026
https://github.com/jerinpious/house-price-prediction
This project is a machine learning-based application to predict house prices. A frontend interface has been developed using Streamlit to make the prediction process user-friendly for regular customers. The project is structured
data-analysis data-engineering data-science eda machine-learning pandas python random-forest scikit-learn streamlit
Last synced: 05 Apr 2026
https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics
Cleaned the data, performed EDA, and stored the data in MySQL. Queried and analyzed the data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.
data-analysis data-visualization eda powerbi python sql
Last synced: 21 May 2026
https://github.com/sreejabethu/smart-report-analyzer
An AI-powered app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.
data-analysis huggingface llm nlp pdf-analysis python question-answering streamlit summarization
Last synced: 18 May 2026
https://github.com/ljadhav25/knn-algorithm-data-science-
This repository contains a project demonstrating the implementation and application of the K-Nearest Neighbors (K-NN) algorithm in Data Science. The objective is to provide a comprehensive understanding of the K-NN algorithm, including data preprocessing, model training, evaluation, and visualization of results. This project is ideal for beginners
data-analysis data-science knn-classification machine-learning matplotlib-pyplot numpy pandas-library seaborn
Last synced: 16 Apr 2026
https://github.com/cowboymrzamo2380/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 05 Apr 2025
https://github.com/calebtheman116/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments pivot-tables sentiment-analysis tableau-public text-reviews
Last synced: 21 Jul 2025
https://github.com/jofaval/california-housing-pricing
Data Analysis about the California Housing Pricing in 1997
data-analysis data-science data-visualization deep deep-learning deep-neural-networks google-colab keras machine-learning matplotlib python regression scikit-learn seaborn tensorflow
Last synced: 05 Apr 2026
https://github.com/mindlessmuse666/iris-knn
This project demonstrates the use of the k-nearest neighbors (KNN) algorithm to classify the Iris dataset. It includes data loading, model training, performance evaluation, and visualization of results using the Pandas, Scikit-learn, Matplotlib, Seaborn, and Plotly libraries.
algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn
Last synced: 17 Aug 2025
https://github.com/clarajacintho/ig4-ds
The final project for the Multidimensional Data Analysis and Data Mining courses, where we analyze data from motorcyclists to determine what causes accidents
data-analysis data-science shiny-apps
Last synced: 11 May 2025
https://github.com/saadhaniftaj/logistic--lasso-regression-data-analysis
Iris dataset analysis with logistic and Lasso regression, using coordinate descent for feature selection and binary classification. Includes preprocessing and data visualizations
data-analysis lasso-regression-model logistic-regression python statistics
Last synced: 18 May 2026
https://github.com/andremenezesds/house_rocket_sales_insights
Data analysis for House Rocket Sales Company database, including insights for sales optimization.
data-analysis data-visualization geopandas git github jupyter-notebook linux numpy pandas powerbi python seaborn seaborn-python streamlit streamlit-webapp ubuntu vscode windows10
Last synced: 21 Jan 2026