Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-20 00:07:30 UTC
https://github.com/gaurav-van/house_price_predictor_streamlit_web_app
Data science project to predict house prices in Bangalore using regression. This repository is used for deploying the project.
data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit
Last synced: 02 May 2026
https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration
Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.
data-analysis data-visualization dataframe mysql pandas plotly python streamlit
Last synced: 02 May 2026
https://github.com/seankwarren/water-quality-analysis
An examination of water quality in the Atlanta watershed with a focus on identifying neglected areas and potential strategies for improving water quality monitoring
analytics data-analysis jupyter-notebook python
Last synced: 03 May 2026
https://github.com/fybex/chatgpt-conversations-analysis
Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.
chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis
Last synced: 02 May 2026
https://github.com/nfaltir/youtube-channel-analysis
YouTube channel analysis using the YouTube API and pandas
data-analysis data-science data-visualization google webscraping youtube youtube-api
Last synced: 02 May 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/gauthamnairvm/trex-app
Text Refinement EXplorer - An EDA tool for text based data.
data-analysis data-visualization groq-api large-language-models llama3 natural-language-processing text2sql
Last synced: 03 May 2026
https://github.com/ferrangarciarovira/premier-league-betting-analysis
Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.
betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics
Last synced: 03 May 2026
https://github.com/cs-joy/pandasv2.0.3
learn data analysis with pandas
data-analysis pandas pandas-learning
Last synced: 03 May 2026
https://github.com/mirseo/pandas_learning
pandas_learning
data-analysis data-analysis-python data-science data-visualization numpy numpy-example pandas pd python python-3 python3
Last synced: 03 May 2026
https://github.com/anandanraju/youtube-data-api-model
The YouTube Analytics API enables you to generate custom reports containing YouTube Analytics data. The API supports reports for channels and for content owners. Report fields are characterized as either dimensions or metrics
analytics data-analysis data-science metrics model python telemetry youtube youtube-api
Last synced: 03 May 2026
https://github.com/manikantasanjay/time_series_data_analysis_on_stocks
Time series data analysis project on the daily stock prices of the following companies (Apple, Microsoft, Google, Amazon) over a span of 5 years.
data-analysis pandas stock time-series time-series-analysis
Last synced: 03 May 2026
https://github.com/jossimmar/ensa-scripts_py
Repository for managing consumption data of the major customers (Clientes Mayores) of ENSA, part of Grupo Distriluz.
data-analysis electrical-engineering python sqlite
Last synced: 10 May 2026
https://github.com/theairbend3r/mice-memory-response
Effect of memory on current response in mice using methods from computational neuroscience and machine learning.
computational-neuroscience data-analysis data-science machine-learning neuroscience python
Last synced: 09 Jun 2026
https://github.com/zeynepcol/data-analysis-visualization
Data visualization and interactive analytics - Olympics Dataset
data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit
Last synced: 03 May 2026
https://github.com/nomadsdev/sys-moninsight
System Monitoring and Analysis Tool is a utility for real-time performance tracking. It logs CPU, memory, and disk usage, provides visual graphs, and offers performance recommendations. Perfect for optimizing system efficiency.
automation cpu-usage data-analysis data-visualization disk-usage matplotlib memory-usage performance-analysis performance-optimization psutil python real-time-monitoring resource-management sys-moninsight system-metrics
Last synced: 19 Jun 2026
https://github.com/shivshah19/movie-recommendation-system
This Movie Recommendation System is designed to provide personalized movie recommendations based on user preferences.
cosine-similarity data-analysis machine-learning pandas python streamlit
Last synced: 03 May 2026
https://github.com/0xjeremy/me-18-final
Data collection and Analysis tools for IMUs
data-analysis imu raspberry-pi
Last synced: 03 May 2026
https://github.com/chouaib-629/customersegmentation
Hadoop-based Customer Segmentation project using the Online Retail Dataset. Implements MapReduce for processing and Python for preprocessing to uncover customer purchasing patterns for targeted marketing.
big-data customer-segmentation data-analysis data-science distributed-computing hadoop hadoop-mapreduce java mapreduce marketing-analytics python
Last synced: 04 May 2026
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 04 May 2026
https://github.com/cliffordnwanna/climate_data_analysis_and_visualization
The Climate Data Visualization and Analysis project showcases a comprehensive exploration of African climate data, with an emphasis on identifying patterns and trends in average temperatures across different countries and time periods. It serves as a practical demonstration of my data analysis, data visualization, and problem-solving skills.
data-analysis data-science mathplotlib pandas plotly visualization
Last synced: 04 May 2026
https://github.com/mystique85/altseason-ethereum-analysis
Altcoin season analysis relative to Ethereum – price comparisons, technical indicators, and historical market trends
altcoins bitcoin blockchain crypto data-analysis ethereum investing
Last synced: 04 May 2026
https://github.com/ruchit0807/heart_disease_prediction
An interactive ML-powered web app that predicts the risk of heart disease based on clinical inputs like age, chest pain, cholesterol, ECG, and more. Built using Python, Streamlit, and scikit-learn, it offers early risk assessment in a simple and accessible way—just enter your health metrics and get instant feedback.
data-analysis data-science knn-regression pandas streamlit
Last synced: 04 May 2026
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/gowthamsundaresan/eigenscan
Block explorer for EigenLayer
crypto data-analysis eigenlayer nextjs web3
Last synced: 04 May 2026
https://github.com/angelgardt/wlm-sdarp-old
World of Linear Models: Statistics & Data Analysis in R for Psychologists
data-analysis data-visualization gh-pages manim-animations quarto r rstudio statistics
Last synced: 04 May 2026
https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask
An end-to-end project that uses machine learning to build a user-friendly heart disease diagnosis system, powered by Flask.
data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api
Last synced: 04 May 2026
https://github.com/scarblase/sales_insights
A data-driven analysis of 15,000 sales records using Python, Pandas, and visualizations to uncover trends, optimize strategies, and enhance business performance. 🚀📊
data-analysis data-visualization dataset matplotlib-pyplot pandas python3 sales-analysis seaborn
Last synced: 05 May 2026
https://github.com/kimtth/agent-data-analyst-stream-chainlit
⚡️Chainlit-based Data Analyst Chat Agent (Responses API, Server Sent Events) 📈
agent azure-openai chainlit code-interpreter data-analysis server-sent-events stream-response
Last synced: 09 Jun 2026
https://github.com/gursv/autoworth
Used Car Price Prediction (India)
data-analysis data-analysis-python data-analytics data-cleaning data-preprocessing data-science-projects eda fine-tuning gridsearchcv machine-learning matplotlib-pyplot pandas python3 random-forest-regressor scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/vara-co/python-api-challenge
Weather and Perfect Vacationing Spots Worldwide, by using APIs
api apis data-analysis data-visualization hvplot jupyter-notebook matplotlib pandas vacation weather
Last synced: 05 May 2026
https://github.com/shuddha2021/stellar-candidate-selector
A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.
candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn
Last synced: 05 May 2026
https://github.com/elcaiseri/udacity-advanced-data-analysis
UDACITY - Advanced-Data-Analysis Track Project
Last synced: 05 May 2026
https://github.com/gonzalo123/pivot.pandas
Data Analysis with Python. Pivot tables with Pandas
data-analysis jupyter-notebook pandas pivot-tables python
Last synced: 05 May 2026
https://github.com/wizardoftrap/football-team-analytics
This Jupyter notebook, created on Kaggle, analyzes football player and team statistics for the 2024-2025 season. It provides insights into player performance, team metrics, and playing styles across major European leagues using data from the dataset players_data-2024_2025.csv.
data-analysis data-visualization jupyter-notebook pandas python
Last synced: 05 May 2026
https://github.com/kiranmayi5/python-projects
A collection of Python projects showcasing skills in data analysis and visualization.
data-analysis data-visualization machine-learning nlp python
Last synced: 05 May 2026
https://github.com/myounus-codes/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
In this project I have cleaned the data for the model. Project Google Colab Link: https://colab.research.google.com/drive/1vQY-XEFJSdEkW2PQOSf1j13Yk8L-XXNw?usp=sharing
algorithms data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 05 May 2026
https://github.com/githubuseraccountamazing/the-amari-project
a project in which I attempted to push some of the limits of stable-diffusion while taking some data along the way
ai ai-generated-images bash data-analysis machine-learning stable-diffusion textual-inversion
Last synced: 05 May 2026
https://github.com/mr-vozhyk/karpov.courses-study
A selection of assignments, mini-projects, and the final project from karpov.courses
airflow data-analysis git python sql statistics
Last synced: 05 May 2026
https://github.com/mirokeimioniemi/classifying-software-pirates
Exploring the factors driving people into software piracy by training two machine learning models to predict whether a person with certain characteristics and sentiments is likely to possess any pirated software or not using a dataset collected via a survey targeting users of music production software.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning piracy python software-piracy survey
Last synced: 06 May 2026
https://github.com/scarblase/homeless-animals-analysis
A data-driven exploration of homeless animal statistics 🐶🐱. Analyze age distribution, shelter dynamics, and adoption patterns using Python, Pandas, and Seaborn.
animals data-analysis data-mining data-science data-science-projects data-visualization matplotlib matplotlib-pyplot numpy pandas plotly python python3 ukraine
Last synced: 06 May 2026
https://github.com/thameran/mmar
Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
cli data-analysis datascience finance generator github-config go haskell hurst markdown mmar mmark time-series xml2rfc
Last synced: 06 May 2026
https://github.com/kirkalyn13/opensignal_autogenerate_report
Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file.
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/amirhosseinhonardoust/customer-sentiment-intelligence-platform
An enterprise-grade NLP + Streamlit + SQL platform for analyzing customer feedback. Performs automated sentiment detection, stores labeled reviews in SQLite, and delivers real-time dashboards with probability insights to support business, marketing, and product optimization decisions.
community-project cost-of-living dashboard data-analysis data-visualization economic-analysis inflation-tracking local-data open-data pandas price-tracker public-insight python sqlite streamlit
Last synced: 06 May 2026
https://github.com/namratha2301/best-selling-books
Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance. This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.
data-analysis data-visualization matplotlib pandas sckiit-learn seaborn
Last synced: 06 May 2026
https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost
Titanic Survival Prediction Project (93% Accuracy) 🛳️ The goal of this notebook is to correctly predict whether someone survived the Titanic shipwreck, using different machine learning models and hyperparameter tuning.
classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost
Last synced: 06 May 2026
https://github.com/scarblase/portfolioprojects
A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊
csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql
Last synced: 06 May 2026
https://github.com/monish-nallagondalla/diamondpriceprediction
Diamond Price Prediction is an end-to-end machine learning project that predicts diamond prices based on attributes like carat, cut, color, clarity, and dimensions. It features a Flask web application for real-time predictions and utilizes models such as Linear Regression, Lasso, and Ridge.
data-analysis data-science flask jupyter-notebooks machine-learning predictive-modeling python
Last synced: 06 May 2026
https://github.com/eslamdyab21/imdb-data-analysis
This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue
data-analysis pandas python udacity-data-analyst-nanodegree
Last synced: 06 May 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus data analysis with live data streamed from the website, concluding with a Dash web app.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/sayantanidalui/indian-government-budget-analysis
A complete end-to-end data analysis project using Python, SQL, and Power BI based on a Kaggle dataset. Built to explore trends, allocations, and insights from India’s Union Budget (2021–24) for practice purposes.
data-analysis mysql pandas powerbi storytelling
Last synced: 07 May 2026
https://github.com/allanccwang/electronic_projects
Implementing circuits with a microcontroller
arduino circuit-analysis circuit-simulations circuits-and-electronics cpp data-analysis microcontroller physics python wemos
Last synced: 07 May 2026
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/eduardoedubox/health_data_analysis
Health data analysis using Jupyter Notebook
data-analysis data-science database jupyter-notebook pandas python
Last synced: 07 May 2026
https://github.com/hamzacham/data_set_projet-5
data-analysis data-science database dataset jupyter jupyter-notebook paython training
Last synced: 07 May 2026
https://github.com/idaraabasiudoh/knn-customer-classification
Labels a telecommunications customer base into groups to determine the service type each customer requires.
data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn
Last synced: 07 May 2026
https://github.com/sayedgamal99/data-science
This is a repository for Data Science Projects.
data-analysis data-science deep-learning machine-learning python regression supervised-learning
Last synced: 07 May 2026
https://github.com/backdoorali/insider-threat-detection-project
Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.
cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence
Last synced: 07 May 2026
https://github.com/gorodroz/crypto-tracker
Realtime Bitcoin price tracker using Binance WebSocket and REST API. Logs prices to CSV and supports Pandas for data analysis.
binance bitcoin crypto csv-logger data-analysis pandas python rest-api websocket
Last synced: 07 May 2026
https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis
Last synced: 07 May 2026
https://github.com/md-emon-hasan/data-science
Data science tutorials, including data preprocessing, analysis, visualization, project deployment, machine learning and deep learning algorithms.
artificial-intelligence data-analysis data-engineering data-science deep-learning machine-learning-algorithms python
Last synced: 07 May 2026
https://github.com/dogan-the-analyst/web_scraping_job_vacancies
data-analysis python web-scraping
Last synced: 07 May 2026
https://github.com/y-india/retail-sales-analysis-project
Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.
ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit
Last synced: 07 May 2026
https://github.com/1ayanabil1/iris-visualization
This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.
analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python
Last synced: 07 May 2026
https://github.com/bassamn/titanic-data-analysis
Exploratory data analysis (EDA) of the Titanic dataset using Python. Analyzed survival patterns by age, gender, and class with visualizations (seaborn/matplotlib). Non-ML focus—highlighting insights with statistics and plots.
data-analysis eda pandas python seaborn titanic visualization
Last synced: 08 May 2026
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 08 May 2026
https://github.com/robson-python/airplane-price-data-analysis
Airplane Price Data Analysis - Airplane Price Prediction
data-analysis data-science data-visualization jupyter-notebook linear-regression machine-learning matplotlib pandas python seaborn vscode
Last synced: 10 Jun 2026
https://github.com/shridhar1504/loan-clustering-datascience-project
This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes information about the loan amount and balance, then applies a clustering algorithm to group similar loans together.
clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning
Last synced: 08 May 2026
https://github.com/guglielmo/datalab-notebooks
Data analysis at openpolis
data-analysis data-science jupyter-notebooks pandas python3
Last synced: 08 May 2026
https://github.com/nickchristopherson/duluth-tourism-analysis
End-to-End Data Pipeline for Tourism Industry Analysis
data-analysis data-visualization duluth economic-analysis jupyter pandas pdf-extraction python tourism
Last synced: 08 May 2026
https://github.com/iguptashubham/ott-churn-eda-ml
Understanding why customers discontinue their subscriptions is crucial to optimizing the user experience, reducing churn, and maximizing customer lifetime value. This project uses a machine learning model to predict customer churn.
data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python
Last synced: 08 May 2026
https://github.com/jethronap/jstat-gui
Web-based GUI application for data analysis
data-analysis data-visualization java jstat mongodb
Last synced: 08 May 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materials for the course SQL Language 1 for Analysts and Data Scientists
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/cagandemirmr/google-play-yorum-analizi
I gathered insights by analyzing variables such as review sentiment, the number of likes per review, the company's replies, and user nicknames for My Supermarket Simulator 3D, the most-liked game in Turkey in 2024.
bert data-analysis data-science nlp
Last synced: 10 Jun 2026
https://github.com/allanotieno254/us-largest-companies-by-revenue-web-scraping
A Python project for web scraping and analyzing the largest companies in the United States by revenue from Wikipedia
automation beautifulsoup csv data-analysis data-cleaning data-execution data-extraction pandas python web-scraping
Last synced: 08 May 2026
https://github.com/framebuffers/mindhunter
Wrappers for Pandas DataFrames to add quicker access for common statistical values, utilities and functionality.
data-analysis data-science numpy pandas python utilities-python
Last synced: 08 May 2026
https://github.com/aekanshd/crazytics-suicidesindia
Basic interpretation of the Suicides in India data-set using R.
data-analysis data-science graph india r suicides
Last synced: 10 Jun 2026
https://github.com/md-emon-hasan/data_analytics_project
Data analytics tasks and solutions, featuring hands-on exercises for data cleaning, visualization, and analysis using Python libraries.
cars-dataset census-data covid19-data data-analysis london-house-price police-data weather-data
Last synced: 08 May 2026
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 09 May 2026
https://github.com/tanu272004/-income-mortgage-housing-insights-a-state-city-analysis-
Analyzes state and city housing trends and affordability using data analytics.
bigquery business-intelligence data-analysis data-visualization dax googlecloud kpi numpy powerbi predective-modeling python sql
Last synced: 09 May 2026
https://github.com/aminzibayi/atfc
Technology forecasting toolkit
data-analysis data-visualization graph technology-forecasting
Last synced: 09 May 2026
https://github.com/akshat0427/python_youtube_history
A set of data science operations performed on YouTube history data
data-analysis data-science extracting-features
Last synced: 10 Jun 2026
https://github.com/avijit-jana/redbus-data-scraper-dashboard
A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.
automation dashboard data-analysis data-visualisation data-visualization datadrivendecisions filtering python3 redbus selenium selenium-python streamlit streamlit-application travel web-scraping webscrapping
Last synced: 09 May 2026
https://github.com/sedatdikbas/aefes-time-series-forecasting
This project uses deep learning methods (LSTM, BiLSTM, CNN+LSTM) to forecast future closing prices from market data for Anadolu Efes Biracılık ve Malt Sanayii A.Ş. (AEFES). It details the data preprocessing, model training, and evaluation steps.
bilstm cnn-lstm data-analysis deep-learning financial-forecasting lstm machine-learning python stock-price-prediction tensorflow
Last synced: 09 May 2026
https://github.com/rubinlake/rl-academy-data-analytics
Educational data analysis project demonstrating BMW sales data analysis with AI-powered code assistance using Cursor IDE and Jupyter notebooks
cursor-ide data-analysis educational-project jupyter langchain matplotlib numpy pandas python scipy seaborn
Last synced: 09 May 2026
https://github.com/dina-hosny/explore-us-bike-share-data-project
Explore US Bike Share Data project - FWD Data Analysis Professional Track. In this project, I used Python to explore data related to bike share systems for three major cities in the United States and answer questions about it by computing descriptive statistics.
data-analysis data-science numpy pandas python
Last synced: 09 May 2026
https://github.com/dogan-the-analyst/developer_survey_analysis
Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.
data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python
Last synced: 09 May 2026
https://github.com/talha-1010/imdb-data-analysis
A data analysis project made with python using pandas
data-analysis data-visualization jupyter-notebook pandas pandas-dataframe
Last synced: 09 May 2026
https://github.com/blagojeblagojevic/lol_data_analysis
classification data-analysis data-science jupyter-notebook kaggle python3
Last synced: 09 May 2026
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk
The aim of this analysis is to understand the smoking patterns among adults in the UK.
data data-analysis data-visualization python3
Last synced: 09 May 2026
https://github.com/musaibnagani/fraud-detection
End-to-end fraud detection simulation using Python — Phase 1 (SQLite + Rules) and Phase 2 (MSSQL + Velocity/Behavioral Features) with synthetic banking data.
data-analysis fraud-detection fraudulent-transactions mssql mssql-database pandas python sqlite3 time-series
Last synced: 10 May 2026
https://github.com/christos99/scraping-project
This project is a Python-based tool for web scraping with a user-friendly GUI. Built with PyQt5 and Selenium, it allows users to scrape online listings by specifying keywords, price ranges, and exclusions. Results are displayed in a table and can be exported to an Excel file.
automation data-analysis excel gui openpyxl pandas pyqt5 python selenium web-scraping
Last synced: 10 May 2026
https://github.com/gabrielmpinho/cs50-sql
Solutions and notes from CS50’s Introduction to Databases with SQL. Covers CRUD operations, data modeling, normalization, joins, views, indexes, and connecting SQL with Python and Java. Begins with SQLite for portability and introduces PostgreSQL and MySQL for scalability.
data-analysis data-structures data-visualization database databases javascript python sql
Last synced: 10 May 2026
https://github.com/pratik-khose/data-analysis-with-pandasai
PandasAI with Llama3 for Interactive Data Analysis
data-analysis llama3 llma pandasai streamlit visualization
Last synced: 11 May 2026
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/easycris-software/easycris
Professional statistical analysis and RNA-seq for researchers — no coding required
anova bioinformatics data-analysis desktop-app genomics pharmacology research-tools rna-seq statistics tauri
Last synced: 11 May 2026