Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/tbep-tech/piney-point-analysis
Materials for analysis of Piney Point monitoring data
data-analysis open-science piney-point tampa-bay tbep water-quality
Last synced: 19 Feb 2026
https://github.com/pizofreude/divvybikes-share-success
Developing data-driven marketing campaign for Divvy to convert casual riders into annual members. Divvy is a bike-share program of the Chicago Department of Transportation (CDOT).
airflow bi-analytics data-analysis data-engineering data-visualization database dbt docker etl jupyterlab python r redshift s3
Last synced: 17 Apr 2026
https://github.com/mainak-97/netflix-content-analysis-project
SQL-based analysis of Netflix’s movies and TV shows dataset to uncover content trends, popular genres, geographical insights, and audience preferences. Includes data queries, findings, and a presentation of key insights.
data-analysis mysql mysql-workbench powerpoint presentation-slides sql
Last synced: 23 Sep 2025
https://github.com/tbep-tech/seagrass-analysis
Materials for assessing coverage changes and analysis of drivers of change for Tampa Bay seagrass
dashboard data-analysis seagrass tampa-bay water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/fim-seagrass
Materials for analysis of FIM data, seagrass, and other datasets
data-analysis fim seagrass tampa-bay
Last synced: 19 Feb 2026
https://github.com/remram44/apex-legends-ocr-data
Get data from Apex Legends streams using OCR
apex-legends data-analysis video-games
Last synced: 31 Jul 2025
https://github.com/dina-hosny/analyze-and-model-airline-system
Analyzing Airline System and Building Data Warehouse Model to Store the Data and Answer Some Business Questions
data-analysis data-modeling data-warehouse datawarehousing dwh plsql sql
Last synced: 05 Mar 2026
https://github.com/farrelfaricaf/exploratorydataanalyst---titanic
This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.
data data-analysis data-science data-visualization eda python titanic-dataset
Last synced: 31 Jul 2025
https://github.com/tkhoa2711/twitter-hate-speech
Hate speech detection on Twitter
Last synced: 28 Jul 2025
https://github.com/nandit123/python_on_excel
Data Analysis using python libraries on excel data
csv data-analysis data-science fill fluctuations graph numpy python python-library
Last synced: 16 May 2026
https://github.com/pauliorandall/airline-passenger-satisfaction-r
Analysing the Airline Passenger Satisfaction dataset from Maven Analytics
data-analysis data-analytics r
Last synced: 01 Aug 2025
https://github.com/computingvictor/mercadona_agent
Web app to explore supermarket products with advanced filters, search, favorites, and nutritional info. Includes data analysis notebooks for deeper insights.
css data-analysis data-science data-visualization filtering html interactive-ui javascript notebooks nutritional-info pandas product-catalog python supermarket webapp
Last synced: 09 Apr 2026
https://github.com/darkdk123/handwashing-discovery-analysis
A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.
data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots
Last synced: 09 Apr 2026
https://github.com/lucas-mazzolim/superstore-bi
Project where I prepared two data sources for querying and created a BI visualization in Data Studio. Used tools as Mysql, Looker Studio, Google Spreadsheet and Python.
business-intelligence data-analysis data-visualization google-looker-studio mysql spreadsheet
Last synced: 27 Jul 2025
https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques
Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.
data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/celineboutinon/chicken-run
OpenClassrooms Data Analyst 2022-2023 - Projet 9
data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot missingno numpy pandas plotly python scikit-learn scipy seaborn statsmodels
Last synced: 09 Apr 2026
https://github.com/aygp-dr/claude-log-stream
Advanced analytics engine for Claude Code logs with real-time processing capabilities
claude-api clojure data-analysis monitoring
Last synced: 24 Sep 2025
https://github.com/palwisha-18/time_series_analysis_lex_vs_gdp
Analyzes how a country’s GDP per capita correlates with the life expectancy of its citizens over a period of about 100+ years
data-analysis data-visualization pandas plotl time
Last synced: 19 May 2026
https://github.com/owenl0000/housepricesproject
Kaggle Project
data-analysis data-science data-visualization gridsearchcv kaggle-competition kaggle-dataset linear-regression machine-learning machine-learning-algorithms numpy onehot-encoding ordinal-encoding pandas python random-forest-regression sckit-learn seaborn streamlit xgboost-regressor
Last synced: 09 Apr 2026
https://github.com/yousefmohammad/american_collage_quickanalysis
Quick Anaylsis about American Colleges
data-analysis data-visualisation data-visualization datanalysis datavisualisation datavisualization excel microsoft-excel
Last synced: 09 Mar 2026
https://github.com/tbep-tech/pep-r-training
Materials for PEP R training
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/aravind2060/employee_engagement_analysis_spark
Using Spark Structured APIs to analyze employee data and extract insights related to employee satisfaction, engagement, concerns, and job titles within an organization.
apache-spark data-analysis data-preprocessing docker docker-compose python
Last synced: 09 Apr 2026
https://github.com/shashwat9kumar/us-accidents-data-analysis
Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle
accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python
Last synced: 17 Apr 2026
https://github.com/jasoncobra3/finops-copilot
An end-to-end AI-powered FinOps platform that ingests cloud billing data, analyzes cost trends, answers natural-language questions using a RAG pipeline (LangChain + FAISS + sentence-transformers + Groq), and provides actionable cost optimization recommendations. Includes a FastAPI backend and Streamlit dashboard UI - fully containerized with Docker
ai-assistant cloud-cost-optimization cloud-enginee cost-analytics data-analysis devops docker faiss faiss-vector-database fastapi finops groq langchain llm pandas rag rag-pipeline sentence-transformers sqlite3 streamlit
Last synced: 13 Apr 2026
https://github.com/tbep-tech/tberf-oyster
Materials for evaluating TBERF oyster restoration success
ccmp-bh4 ccmp-bh6 data-analysis tampa-bay tbep tberf
Last synced: 19 Feb 2026
https://github.com/tbep-tech/pep-graphics
Materials for generating PEP graphics
data-analysis pep water-quality
Last synced: 19 Feb 2026
https://github.com/rodolfo-brandao/pos-graduacao
[pt-BR] Repositório para armazenar alguns materiais e projetos de cada módulo da minha especialização em Ciência de Dados (2025–2027)
artificial-intelligence data-analysis data-science data-visualization databases deep-learning jupyter linear-algebra machine-learning python r statistics
Last synced: 09 Apr 2026
https://github.com/0xunkn0wn4m1r/data_engineering_banking_project
🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.
automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning
Last synced: 09 Apr 2026
https://github.com/bpkaur/a-network-analysis-of-game-of-thrones
A Network analysis of Game of Thrones: To analyze the co-occurrence network of the characters in the Game of Thrones books
data-analysis data-science machine-learning networkx python3
Last synced: 01 May 2026
https://github.com/myounesdev/authorgraphanalyzer
a web-based visualization tool for analyzing and exploring author collaboration networks
algorithms binary-tree bts d3js data-analysis dijkstra-algorithm django exception-handling pandas python scss
Last synced: 08 Jun 2026
https://github.com/analyst-lochan/flight-delay-and-cancellation-dataset-2019-2023-
This project demonstrates a complete data analytics pipeline starting from raw real-world flight data to professional visual dashboards using SQL Server and Power BI. It showcases data import, cleaning, optimization, transformation, and dynamic DAX-based visual reporting.
airline-performance business-intelligence data-analysis data-cleaning data-modeling data-visualization dax etl flight-data kaggle-dataset portfolio-project powerbi powerbi-dashboard sql sql-server
Last synced: 09 Sep 2025
https://github.com/vishal-bhandary/sql-data-analytics
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-intelligence customer-segmentation dashboarding data-analysis data-reporting data-visualization data-warehouse etl kpi product-analysis sql sql-server star-schema t-sql
Last synced: 30 Jun 2026
https://github.com/tbep-tech/tbep-r-training
Repository for miscellaneous R training materials
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/ashwin331133/powerbi-data_professional_survey_breakdown
This project analyzes survey data from individuals interested in transitioning to the data field. The survey aims to understand their backgrounds, motivations, and the challenges they face. Using Power BI for data visualization, the project provides insights into the demographics and preferences of these aspirants.
data-analysis data-visualization powerbi
Last synced: 03 Jan 2026
https://github.com/phanchenh/supplychaindashboard_datacpsupplychain
Tracking Trends in Supply Chain – A Sales and Profit Review (2015-2017)
business-analytics business-intelligence data-analysis data-visualization dax-languague dax-query mssql mssqlserver powerbi supply-chain supply-chain-management
Last synced: 01 Aug 2025
https://github.com/jofaval/ionosphere
Binary Classification of Ionosphere signals at Goose Bay, Labrador in 1988
data-analysis data-science data-visualization deep-learning google-colab keras machine-learning python scikit-learn tensorflow uci xgboost
Last synced: 09 Apr 2026
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data
This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.
big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard
Last synced: 19 Feb 2026
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/tbep-tech/tbeploads
R Package for estimating nutrient loading to Tampa Bay
data-analysis loads package tampa-bay tbep tbnmc water-quality
Last synced: 19 Feb 2026
https://github.com/jimohola/flight-price-predict-deployment
Deploying Machine Learning Models via Microsoft Azure
cloud-computing css data-analysis data-preprocessing data-visualization flask html machine-learning python3
Last synced: 05 May 2026
https://github.com/grindelfp/data-analysis-example
One of my UNI Artificial Intelligence Systems course's projects.
data-analysis data-preprocessing ipynb
Last synced: 19 Sep 2025
https://github.com/nathadriele/ifood-data-governance-pipeline
Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.
airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit
Last synced: 02 Apr 2026
https://github.com/mituskillologies/aiml-dypiemr-sep24
Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.
artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3
Last synced: 05 Apr 2025
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/dlozeve/topological-persistence
Topological persistence diagram (barcode) of a triangulation
data-analysis persistence topology
Last synced: 02 Aug 2025
https://github.com/ausaaf-rh/movie-recommendation-system-collaborative-filtering
🎬 A comprehensive movie recommendation system implementing item-based collaborative filtering with cosine similarity. Features real-time recommendations, performance evaluation metrics (Precision@K, Recall@K), and interactive user interface. Built with Python, scikit-learn, and MovieLens dataset for academic research and learning purposes.
agents data-analysis jupyter-notebook python python3
Last synced: 17 Apr 2026
https://github.com/baslia/zillow_exploration
data-analysis data-science ds machine-learning pandas python
Last synced: 09 Apr 2026
https://github.com/dmvianna/python-nix
Trivial Nix environment with pandas and postgresql
Last synced: 27 Jul 2025
https://github.com/danymukesha/bioga
Apply multi-objective genetic algorithms to genomic data for biologically informed feature selection and pattern discovery.
data-analysis gene-expression genetic-algorithms genomics optimization-algorithms
Last synced: 18 Sep 2025
https://github.com/takshak26/predict_blood_donations-
About The title of the project is “Predict Blood Donations”. It uses python as language, data science, and machine learning as the field of operation, TPOT library for model selection, logistic regression for model building, and jupyter notebook as the code editor.
data-analysis data-visualization datascience machine-learning python3
Last synced: 16 May 2026
https://github.com/yuvrajs2003/formula-1-performance-analysis
Analysis of F1 races and their drivers
data-analysis data-science data-visualization hyperparameter-tuning pandas python
Last synced: 09 Apr 2026
https://github.com/hfzdzakii/dicoding-shipclusteringanalysisdataandmodelling
This repo is a master submission for my Dicoding Final Project. Ship Performance Clustering Dataset was being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
clustering data-analysis machine-learning
Last synced: 27 Jul 2025
https://github.com/jotstolu/netflix-sql-data-analysis-project
This project explores the Netflix dataset using SQL queries to uncover trends, patterns, and business insights that could help stakeholders understand content distribution, viewer preferences, and platform optimization
data-analysis sql sql-server tsql
Last synced: 02 Aug 2025
https://github.com/jabercrombia/video-game-data
This project integrates FastAPI as the backend and Next.js as the frontend to create a full-stack web application. It processes and displays vides game sales data, enabling seamless API communication while maintaining a scalable and efficient architecture.
data-analysis nextjs nintendo playstation python typescript video-game
Last synced: 02 Apr 2026
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 17 Apr 2026
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 17 Apr 2026
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/mrendiks/analyst-data-survey-monkey
Learn how to analyst data from dataset surver monkey using Excel and Python
data-analysis ipynb-jupyter-notebook python
Last synced: 07 Mar 2026
https://github.com/sharoonjoseph321/indian-liver-diseases
Indian Liver Disease Analysis and Prediction This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 27 Jul 2025
https://github.com/ramapinnimty/udacity-mlfoundation-nanodegree
This is a repository containing solutions to the assignments that are a part of the Udacity Machine Learning Foundation Nanodegree program.
assignments data-analysis python3 statistics udacity-machine-learning-nanodegree
Last synced: 26 Jul 2025
https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project
In this course, learned and solved assignments on SQL and Python. Final capstone project was on analyzing "Stock Market Data". Achieved 100% score in every assignment.
data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql
Last synced: 09 Apr 2026
https://github.com/ljadhav25/healthcare-data-collection-and-analysis
This repository contains a project focused on collecting healthcare data from the web, storing it in a structured format, and performing comprehensive analysis. The objective is to gather valuable health-related information, process and clean the data, and derive insights to support healthcare research and decision-making.
data-analysis data-visualization flask-application flask-backend html-css-javascript pycharm-ide python
Last synced: 09 Apr 2026
https://github.com/syed-amjad-ali/restaurant-sales-sql-project
This was a simple SQL project where I analyzed restaurant sales data, showcasing skills in data creation and querying. The project explores menu performance, order trends, and customer insights.
aggregations business-intelligence data-analysis guided-project joins maven-analytics querying restaurant-sales sales-data sql subqueries
Last synced: 03 Jan 2026
https://github.com/jhrcook/checkplease
Analysis of an immune checkpoint-blockade screen.
bayesian-statistics data-analysis pymc3 python python3 r
Last synced: 17 Apr 2026
https://github.com/kuuhaku86/datmingemastik19
data-analysis data-mining data-science data-visualization
Last synced: 02 Aug 2025
https://github.com/quesocosteno03/data-analysis-projects
This repository serves as a collection of all my projects.
data-analysis jupyter-notebook powerbi
Last synced: 02 Aug 2025
https://github.com/monteirooscar98/tarifas-publicas-sp-dieese
Extração de dados através de WebScraping no site do Dieese e Analise em relação as Tarifas Públicas do Município de São Paulo.
data-analysis data-visualization python webscraping
Last synced: 03 May 2026
https://github.com/jpcadena/cancer-classification
Breast cancer classification project.
cancer-detection classification data-analysis data-science deep-learning imblearn machine-learning neuronal-network numpy pandas pylint python scikit-learn supervised-learning tensorflow
Last synced: 09 Apr 2026
https://github.com/rh01/data-analysis-with-r
Duke University - Data Analysis With R
data-analysis r r-language r-studio rmarkdown
Last synced: 23 May 2026
https://github.com/yamslam/contentsunderpressure_processing
A repository for data processing and analysis for Contents Under Pressure.
data-analysis data-processing data-visualization game-based-learning judgments process-safety
Last synced: 07 Sep 2025
https://github.com/waghraj1699/car-price-prediction
Implementation of ML algorithm to predict the car price
artificial-intelligence data-analysis data-science data-visualization feature-engineering linear-regression machine-learning machine-learning-algorithms regression-models
Last synced: 02 Aug 2025
https://github.com/idaraabasiudoh/credit_card_fraud_detection
This repository contains a machine learning project focused on detecting credit card fraud using Decision Tree and Support Vector Machine (SVM) classifiers.
data-analysis jupyter-notebook machine-learning python3 scikit-learn snapml
Last synced: 19 Feb 2026
https://github.com/thedevreda/jadaerospace
A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/ariyaarka/sales-analysis
A simple analysis on random dataset of pizza sales using SQL
data-analysis presentation-slides sql
Last synced: 17 Jan 2026
https://github.com/karsterr/repeated-measurement
An R-based workflow for conducting repeated measures ANOVA using the ez package, with data wrangling via tidyverse and visualization through ggplot2. Includes data import, transformation to long format, statistical analysis, and graphical summary.
anove data-analysis experimental-design ezanove ggplot2 r repeated-measurements rstats statistics tidyverse
Last synced: 18 Sep 2025
https://github.com/abdullahashfaqvirk/PowerBI-Dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 27 Sep 2025
https://github.com/shimazadeh/ft_linear_regression
Implementing a modular linear regression from scratch to predict the price of cars using a gradient descent algorithm.
data-analysis data-science hyperparameter-tuning linear-regression predictive-modeling
Last synced: 03 Jun 2026
https://github.com/zwelz3/unofficial-survivor-knowledge-graph
A comprehensive RDF knowledge graph covering all 50 seasons of Survivor (US), with 23,000+ triples across 749 named graphs.
Last synced: 23 May 2026
https://github.com/lc-rezende/eqx_boston_dataset
Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.
data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/hecatops/ad_libs
A real time advertisement data analytics platforming, displaying important metrics in easy to understand language.
dashboard data-analysis data-visualization kpi plotly-dash python
Last synced: 07 Nov 2025
https://github.com/vitor-ace/sunspots-data-analysis
This is a Jupyter Notebook which works with Data Analysis logic and libraries implementation with Python.
data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python
Last synced: 06 May 2026
https://github.com/faint-liebfraumilch101/fraud-detection-sql-unsupervised
🕵️♂️ Detect fraud in bank transactions using SQL for feature engineering and Python's Isolation Forest for unsupervised anomaly detection.
anomaly-detection banking-data data-analysis data-science financial-analytics fraud-detection isolation-forest machine-learning portfolio-project python sql sqlite unsupervised-learning
Last synced: 07 May 2026
https://github.com/prasannnnn/real-time-share-price-scraping-and-analysis
The Stock Sentiment Analyzer is a web-based application built with Streamlit, BeautifulSoup, and Pandas to help users analyze the sentiment of a stock (BUY, SELL, or HOLD) based on its financial data. The tool extracts key financial metrics like Market Cap, Stock P/E, Dividend Yield, ROCE, ROE, and the 52-week High/Low from Screener.in.
beautifulsoup4 data-analysis python sentiment-analysis streamlit streamlit-dashboard webscraping
Last synced: 03 Aug 2025
https://github.com/ridemountainpig/education-level-data-analysis
An analysis of the relationship between education levels, unemployment rates, and credit card spending in Taiwan's six major cities.
data-analysis matplotlib pandas-python
Last synced: 17 Apr 2026
https://github.com/tashi-2004/apache-flink-spark-data-streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3
Last synced: 09 Feb 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/0xnu/england-house-prices
Predict house prices for the next five years across all English local authorities.
data-analysis england england-house-prices housing-market housing-market-analysis predictive-modeling regression
Last synced: 03 Aug 2025
https://github.com/monish-nallagondalla/sensor_fault_detection
This repo contains sensor data for analysis, focusing on sensor readings, their attributes, and classification (Good/Bad). It includes 500+ sensors with features for predictive modeling, anomaly detection, and sensor failure prediction.
anomaly-detection classification data-analysis data-science machine-learning predictive-modeling python sensor-data
Last synced: 01 May 2026
https://github.com/mxagar/space_exploration
This repository is a collection of mini-projects and tutorials related to space images and geo-spatial data.
data-analysis deep-learning geospatial machine-learning
Last synced: 29 Sep 2025
https://github.com/mhkamel/ecommerce-targeting-system
A Flask-based E-Commerce Targeting System that provides customer segmentation and personalized product recommendations. Users can upload structured interaction data for analysis, receive AI-driven recommendations, and gain insights into user behavior. The application is built with Flask, Pandas, Scikit-Learn, and integrates an interactive web inter
ai bootstrap csv-processing customer-segmentation data-analysis data-science e-commerce flask machine-learning pandas python recommendation-system scikit-learn user-behavior web-application
Last synced: 09 Apr 2026
https://github.com/hari00887/analysis-of-global-terrorism
Analysis of Global Terrorism Using AHP A quantitative study of GTD data to assess attack severity and evolution across time and space.
data-analysis data-visualization powerbi
Last synced: 02 Mar 2026
https://github.com/macnianios/retail_sales_analysis
final data science project on techpro academy data science stream
anova clustering colab-notebook data-analysis data-science data-science-projects linear-regression numpy pandas python
Last synced: 17 Apr 2026
https://github.com/labex-labs/numpy-for-beginners
This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.
array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations
Last synced: 20 Jun 2026
https://github.com/bilalhameed248/xg-boost-ts-prediction
Predict/Forecast monthly and daily charges, as well as payments associated with claims generated during the billing process
charges-prediction data-analysis data-analysis-python data-modeling data-science payment-prediction prediction rcm revenue-forecast sql sql-query time-series time-series-analysis xgboost xgboost-model xgboost-regression
Last synced: 09 Mar 2026
https://github.com/rahulsm20/car-data
A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.
data-analysis data-analytics matplotlib numpy pandas python
Last synced: 09 Apr 2026
https://github.com/asghar-rizvi/hotel_reservation_data_analysis
This project involves a comprehensive data analysis of a hotel reservation dataset using Excel. The primary focus is on examining reservation cancellations. Through detailed analysis and visual representation.
dashboard dashboard-templates data-analysis data-analysis-excel data-representation data-science excel
Last synced: 02 Mar 2026
https://github.com/agustin-caceres/arg-telecom-analisis
Telecom Argentina Insights
business-analytics business-intelligence data-analysis data-visualization database postgresql python streamlit
Last synced: 09 Apr 2026
https://github.com/lyubov0406/data_analyst_portfolio
В репозитории собраны пет-проекты, демонстрирующие мои навыки в аналитике данных
data-analysis matplotlib numpy pandas portfolio python scipy seaborn sql tableau visualization
Last synced: 09 Apr 2026
https://github.com/PanosChatzi/Healthcare_and_Bioinformatics_Analyses
This repo contains the final assignments of the Data Analyst bootcamp by Workearly. Python and SQL were used to complete the assignments.
data-analysis data-cleaning data-visualisation jupyter matplotlib pandas python seaborn
Last synced: 05 Aug 2025