Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 17 Apr 2026
https://github.com/humayun-raza-030/restaurant-recommendation-system
This project is a Restaurant Recommendation System that helps users find restaurants in Lahore based on their location, customer reviews, and ratings. The system scrapes restaurant data from Google Maps, analyzes user reviews for sentiment, and provides a visualization dashboard using Tableau.
data-analysis data-science data-visualization python
Last synced: 17 Apr 2026
https://github.com/nicodupont/kaggle
All my Data analysis and Competitions
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 17 Apr 2026
https://github.com/satvikpraveen/numpymasterpro
A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.
broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization
Last synced: 08 May 2026
https://github.com/mahmoudwal27/manufacturing_downtime
This project focuses on improving manufacturing efficiency by analyzing production data. Using Python, SQL, and Power BI, we built interactive dashboards to uncover patterns, minimize downtime, and optimize operations. The goal is to help stakeholders make data driven decisions for enhanced productivity.
data-analysis data-analysis-python data-visualization google-colab powerbi python sql
Last synced: 17 Apr 2026
https://github.com/pedrosfaria2/analisandopostshn
Projeto para analisar as postagens da comunidade HackerNews
analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3
Last synced: 08 May 2026
https://github.com/ryan-wong1/analyzing-arrest-patterns-in-chicago-data-analysis
Chicago Police Department (CPD) arrest data on offenses, locations, and demographics
data-analysis data-cleaning data-visualization exploratory-data-analysis matplotlib pandas python seaborn
Last synced: 08 May 2026
https://github.com/kgotsosm/fcc-data-analysis
Notebooks created for the Data Analysis Course on freeCodeCamp
data-analysis data-visualization matplotlib pandas seaborn
Last synced: 17 Apr 2026
https://github.com/victoorv/maladie_cardiaque
Prédire si un individu est atteint ou non de maladie cardiaque.
classification data-analysis data-science data-visualization exploratory-data-analysis heart-disease heart-disease-analysis heart-disease-classification heart-disease-prediction hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-analysis statistical-tests statistics
Last synced: 17 Apr 2026
https://github.com/lu-m-dev/biostatistics-eda
Exploratory data analysis and visualization system for biostatistical research
biostatistics data-analysis data-visualization eda
Last synced: 25 Jun 2026
https://github.com/BingyanStudio/github-analyzer
锐评一下你都在 GitHub 写了什么
data-analysis github llm reports selfhosted typescript
Last synced: 12 May 2025
https://github.com/royungar/automotive_sales_insights_dashboard
Data visualization project analyzing automotive sales, recalls, and customer sentiment using IBM Cognos Analytics. Features KPIs, treemaps, heatmaps, and advanced visual storytelling techniques.
automotive-industry business-intelligence cognos-analytics csv customer-sentiment dashboard data-analysis data-engineering data-visualization eda excel heatmap ibm kpi recall-analysis sales-data treemap
Last synced: 04 Jun 2026
https://github.com/phanchenh/businessdashboard_powerbiproject_pizzadataset
Pizza Business Performance Analysis and Growth Strategies 2015
business-analytics business-intelligence dashboard data-analysis data-visualization insights powerbi python
Last synced: 17 Apr 2026
https://github.com/davidmalko87/steam-library-exporter
Python script to export your Steam game library to CSV — playtime, genres, reviews, metacritic scores, prices, tags & estimated owners via Steam Web API + Store API + SteamSpy
csv-export data-analysis game-data metacritic playtime-tracker python steam steam-api steam-games steam-library steamspy
Last synced: 04 Apr 2026
https://github.com/vasilescur/decsci-group-9
DECSCI 101 - Final Project (Group 9)
data-analysis decision-science qualtrics statistics visualization
Last synced: 17 Apr 2026
https://github.com/santos-k/fashion-recommender-dashboard
The project is a neural network-based fashion recommendation system built using Python. The model used for this system is Resnet50, which is a deep learning model used for image recognition. The data used for training the model is scraped from Flipkart, with a total of 65,000 images.
ann cnn dash dashboard data-analysis data-science deep-learning eda gcp heroku kera machine-learning nueral-networks plolty python tensorflow
Last synced: 04 Apr 2026
https://github.com/q-viper/blog-notebooks
This is the repo to store most of my blogs in dataqoil.com and q-viper.github.io.
data-analysis data-science machine-learning-algorithms timeseries
Last synced: 04 Apr 2026
https://github.com/kwonnayeon/urban-parks-childrens-happiness
Grad thesis on urban parks’ impact on children’s happiness – data, results, and code
causal-inference data-analysis environmental-psychology latex matching propensity-score public-health r social-science statistical-analysis thesis-project urban-studies weighting
Last synced: 17 Feb 2026
https://github.com/sanam2405/ahs
This contains the analysis of result of AHS Madhyamik Examination 2022
data-analysis data-visualization jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/niniola-creator/niniola-creator
This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.
bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql
Last synced: 07 Apr 2026
https://github.com/prangonghose/wikipedia-blocking-policies
This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.
data-analysis data-visualization matplotlib open-source pandas python3 seaborn
Last synced: 18 Apr 2026
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/rajeev2806/retail-order-data-analysis
Dataset downloaded from kaggle api and then data cleaning and analysis is performed
data-analysis data-cleaning postgresql
Last synced: 18 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/ljadhav25/django-data-analyzer
Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.
data-analysis data-visualization django-application matplotlib numpy pandas python seaborn
Last synced: 01 Mar 2026
https://github.com/steviecurran/multi-dish
Scripts to reduce data from large radio telescopes (GMRT, VLA)
data-analysis interferometer pipeline radio-astronomy telescopes
Last synced: 09 May 2026
https://github.com/bhaveshbhakta/amazon-sales-data-visualization
Amazon Sales Data Visualization
amazon-sales-data-analysis data-analysis data-preprocessing data-visualization machine-learning
Last synced: 18 May 2026
https://github.com/maheera421/pandas
Implementation of essential Pandas functions.
data-analysis data-manipulation pandas-dataframes pandas-datareader pandas-python
Last synced: 17 Jul 2025
https://github.com/andremenezesds/house_rocket_sales_insights
Data analysis for House Rocket Sales Company database, including insights for sales optimization.
data-analysis data-visualization geopandas git github jupyter-notebook linux numpy pandas powerbi python seaborn seaborn-python streamlit streamlit-webapp ubuntu vscode windows10
Last synced: 21 Jan 2026
https://github.com/prakhar-ff13/finding-donors-for-charityml
Udacity Machine Learning Engineer Nanodegree project 2
data-analysis data-science machine-learning supervised-learning udacity udacity-machine-learning-nanodegree udacity-nanodegree
Last synced: 05 Apr 2026
https://github.com/satti-hari-krishna-reddy/data-whisperer
Data Whisperer is an AI-driven tool that automates exploratory data analysis (EDA), generates actionable insights, and enables natural language querying of datasets. it combines the power of AI (Google Gemini) with interactive visualizations and professional reporting.
ai data-analysis data-visualization llm python3 streamlit
Last synced: 18 Apr 2026
https://github.com/kathisnehith/analyst_snehith_portfolio
Hello! This is My Portfolio Website
azure big-data data-analysis data-mining matplotlib mysql-database outlier-detection pandas-python powerbi python sql tableau validation
Last synced: 18 Apr 2026
https://github.com/vl1507/data_science_pro_course
Курс "Аналитик данных PRO (PRO DA-6)"
da data-analysis data-science ds jupyter-notebook machine-learning ml pro-da python
Last synced: 18 Apr 2026
https://github.com/clarajacintho/ig4-ds
The final project for the Multidimensional Data Analysis and Data Mining courses, where we analyze data from motorcyclists to determine what causes accidents
data-analysis data-science shiny-apps
Last synced: 11 May 2025
https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly
Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool
blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly
Last synced: 18 Apr 2026
https://github.com/shubh-bharadwaj/income-dataset-analysis
data-analysis data-science pandas python
Last synced: 18 Apr 2026
https://github.com/robinmillford/sales-metrics-dashboard-streamlit
This Streamlit dashboard provides an interactive and comprehensive analysis of customer behavior, regional sales trends, and revenue insights. The dashboard enables businesses to identify key performance metrics, customer segments, and revenue drivers, supporting data-driven decision-making.
dashboard data-analysis data-visualization duckdb sales-analysis sales-dashboard streamlit-dashboard
Last synced: 19 Apr 2026
https://github.com/rodriguesl1/analise-ibovespa-fiap
Modelo de previsão do índice IBOVESPA utilizando técnicas de séries temporais. O projeto inclui análise exploratória, decomposição sazonal, testes de estacionariedade e modelagem com Prophet, AutoARIMA e outros modelos estatísticos para apoiar decisões de investimento.
autoarima b3 brasil data-analysis economia finance forecasting ibovespa pandas prophet python statsmodels time-series
Last synced: 19 Apr 2026
https://github.com/robertochiosa/automatic-powerpoint-report-rmd
Automatically generate good looking powerpoint presentations from a csv dataset
data-analysis data-science medium medium-article python r
Last synced: 19 Apr 2026
https://github.com/kheriberto/linear_regression_ecommerce
Simple project showcasing crafting a linear regression model with SciKit Learn
data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn
Last synced: 19 Apr 2026
https://github.com/decepticon-ts/cap-ai-studio
Description: A modern, powerful web application for advanced image analysis and batch processing, featuring real-time AI-powered image captioning, comprehensive reporting, and an intuitive user interface. Built with Streamlit and Google's Gemini API.
artificial-intelligence batch-processing computer-vision data-analysis gemini-api image-processing image-processing-python python streamlit streamlit-webapp threading
Last synced: 19 Apr 2026
https://github.com/ajmannust41288/python-developer
all about python backend ,data analytics,Data-Visualization
artificial-intelligence data-analysis data-science data-visualization-project deeplearning machine-learning machine-learning-algorithms matplotlib mongodb-database mongoose mongoosejs pandas-python python-library python-packages python-script python3 seaborn-plots
Last synced: 19 Apr 2026
https://github.com/mlucifer27/bilateral-visualization
Streamlit app visualizes bilateral relationship scores between 100 countries from 1945 to 2024. It supports interactive heatmaps, network graphs, pairwise comparisons, and more.
d3blocks data-analysis data-visualization plotly-python python streamlit
Last synced: 04 Jun 2026
https://github.com/edwinrlambert/exploring-airbnb-market-trends
Dive into NYC's Airbnb market trends through detailed analysis of listings data, including prices, types, and review dates. This is a DataCamp project.
airbnb data-analysis jupyter-notebook market-trends python
Last synced: 19 Apr 2026
https://github.com/samwhaaa/superfoodsmax
A customer demographic & spending trend analysis on the fictional SuperFoodsMax grocery chain
data-analysis data-analytics data-visualization jupyter jupyter-notebook python
Last synced: 20 Apr 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/namratha2301/carprice_analysisandprediction
This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.
data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn
Last synced: 20 Apr 2026
https://github.com/misaghmomenib/soccer-match-analysis
This Project Predicts Football Match Outcomes (Home Win, Away Win, or Draw) Using Historical Match Data. It Involves Data Preprocessing, Exploratory Analysis, and Training a Random Forest Model to Predict Results Based on Features Like Shots, Possession, and Passes.
data-analysis git open-source python
Last synced: 20 Apr 2026
https://github.com/jbalooshie/school_district_analysis
Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.
data-analysis data-science dataframes jupyter-notebook numpy pandas python
Last synced: 20 Apr 2026
https://github.com/jofaval/california-housing-pricing
Data Analysis about the California Housing Pricing in 1997
data-analysis data-science data-visualization deep deep-learning deep-neural-networks google-colab keras machine-learning matplotlib python regression scikit-learn seaborn tensorflow
Last synced: 05 Apr 2026
https://github.com/xre22zax/roller-coaster
Explore award-winning wood and steel coasters from 2013-2018 Golden Ticket Awards & Captain Coaster, all powered by Python and interactive visualizations.
analytics data-analysis data-visualization pandas python python-lambda python3 visualization
Last synced: 20 Apr 2026
https://github.com/sarthakmishraa/bike_rental_predictor
Bike Sharing Dataset : This dataset contains the hourly and daily count of rental bikes between years 2011 and 2012 in Capital bikeshare system with the corresponding weather and seasonal information.
data-analysis machine-learning python xgboost
Last synced: 20 Apr 2026
https://github.com/abinashsahoo007/project-bankruptcy-prevention
The project is to create a classification model that predicts the chances of a business facing bankruptcy based on the key feature like Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, Operating Risk.
data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit
Last synced: 20 Apr 2026
https://github.com/amr-yasser226/interactive-sales-analytics-dashboard
An interactive web-based dashboard for visualizing multinational electronics sales data. This project for the DSAI 203 course integrates a Python/Flask backend with an amCharts frontend to provide dynamic insights into product revenues, sales distribution, and employee statistics across different countries.
am5charts amcharts business-intelligence css dashboard data-analysis data-analytics data-visualization flask html javascript python sqlalchemy sqlite web-application
Last synced: 13 Apr 2026
https://github.com/sbera01/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 24 Dec 2025
https://github.com/profasem/logistics-performance-analysis
Power BI dashboard analyzing logistics performance, delivery delays, carrier efficiency, and regional risk.
business-intelligence dashboard data-analysis logistics powerbi python supply-chain
Last synced: 21 Apr 2026
https://github.com/hoxo-m/blog
HOXO-M Blog
data-analysis data-science r-package
Last synced: 30 Oct 2025
https://github.com/rachel-xmr/data-analysis-in-health-set-csc3062
CSC3062 Data Analysis and visualization
classification-algorithm data-analysis data-visualization model-evaluation nmf pca python svm t-sne visualization
Last synced: 05 Jun 2026
https://github.com/martinkalema/power-distribution-modelling
Power Distribution Modelling for cea and cel algorithms
data-analysis python synthetic-dataset
Last synced: 21 Apr 2026
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 02 Jan 2026
https://github.com/meerantajalli/networksecuritydefense
This Network Security defense systems acts as an indicator against SMP Floods, UDP Floods, ICMP Floods. This model is trained using packets from wireshark and can easily differentiate between normal network traffic and traffic that has been targetted on the machine by an attacker using the rate of packets transfer and using the source IP.
anomaly-detection classification cyber-security data-analysis ddos-detection icmp-flood intrusion-detection machine-learning network-security packet-analysis python random-forest security smp-flood udp-flood wireshark
Last synced: 21 Apr 2026
https://github.com/nikhilfuke1/a-b-testing-and-regression-analysis-python
Python Statistical Project involves data analysis, visualization, A/B testing, and regression analysis to determine the best-performing platform.
ab-testing data-analysis hypothesis-testing libraries python regression-analysis statistics visualization
Last synced: 21 Apr 2026
https://github.com/jayita11/exploring-most-streamed-songs-for-last-four-decades-eda
Perform EDA to uncover trends in streaming patterns, likes, and artists over the last four decades.
data-analysis eda hypothesis-testing matplotlib most-streamed-songs pandas python seaborn
Last synced: 07 Apr 2026
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks done testing out JetBrains DataLore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 22 Apr 2026
https://github.com/kgelli/apple-data-analysis---apache-spark
Modular ETL pipeline for analyzing Apple product purchase patterns using Apache Spark on Databricks with factory design patterns.
apache-spark data-analysis databricks delta-lake etl-pipeline factory-pattern pyspark
Last synced: 22 Apr 2026
https://github.com/nour-zayed/shopping-trends-analytics-sql-python-power-bi
"End-to-end Shopping Trends analytics project using SQL, Python, Excel & Power BI — data cleaning, EDA, KPI generation, and interactive dashboards with DAX for actionable business insights."
business-intelligence data-analysis data-visualization dax powerbi python sql
Last synced: 18 May 2026
https://github.com/niaid/training-schedule
bcbb-training bioinformatics computational-biology data-analysis data-science pandas python
Last synced: 22 Apr 2026
https://github.com/ayushi-gajendra/buenos-aires-subway-statistics
A comprehensive data analysis of the Buenos Aires subway system ridership using Python and Pandas. This project identifies peak-hour congestion patterns, explores hourly passenger distributions, and utilizes the 95th percentile to isolate extreme traffic conditions for urban mobility insights.
95th-percentile buenos-aires data-analysis data-science-portfolio data-visualization matplotlib pandas python statistical-analysis subway-ridership transit-data urban-mobility
Last synced: 05 Jun 2026
https://github.com/floffah/my-listening
Various ways to analyse your Spotify extended streaming history data
convex data-analysis listening-history spotify
Last synced: 23 Apr 2026
https://github.com/suhas-005/eda-indian-startup-funding
Exploratory Data Analysis on Indian Startup Funding (2015-2020)
data-analysis data-analytics data-science data-visualization exploratory-data-analysis matplotlib pandas python seaborn startup-funding
Last synced: 09 May 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/ihnokim/datk
Data Analysis Toolkit (DATK)
data-analysis data-engineering data-science deep-learning image-processing pandas signal-processing
Last synced: 24 Apr 2026
https://github.com/strixion/demoversion_ai
The demoversion of StrixionAI
ai csv data-analysis data-analytics json python txt
Last synced: 24 Apr 2026
https://github.com/arunabhagit/inventory-misalignment-and-revenue-loss-in-multi-store-bike-retail
This project focuses on identifying the inventory and demand mismatch causing stagnant sales and lost revenue in a bike retail chain. By analyzing store-level performance and regional customer preferences, the project aims to detect underperforming products.
data-analysis data-visualization powerbi python
Last synced: 24 Apr 2026
https://github.com/datalopes1/bank_marketing
Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 24 Apr 2026
https://github.com/tsbarr/toronto-open-data
Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses
api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics
Last synced: 09 May 2026
https://github.com/yuvrajsaraogi/-iris-flower-classification
Iris flower has three species; setosa, versicolor, and virginica, which differs according to their measurements. Now assume that you have the measurements of the iris flowers according to their species, and the task is to train a machine learning model that can learn from the measurements of the iris species and classify them.
classification data data-analysis data-science data-visualization flower flower-classification iris iris-classification iris-flower iris-flower-classification knn knn-classification machine-learning machine-learning-algorithms ml natural-language-processing nlp python
Last synced: 24 Apr 2026
https://github.com/manisharora96/data-analysis-of-smartwatch
The project is structured with sample data, step-by-step Jupyter notebooks, and modular Python scripts for automated analysis
data-analysis data-visualization jupyter-notebook python smartwatch-analysis
Last synced: 24 Apr 2026
https://github.com/stefagnone/-wedding-vendor-pricing-and-customer-satisfaction-analysis
Data-driven analysis of wedding vendor pricing and customer satisfaction, with database design, SQL optimization, and cost breakdown generation.
business-intelligence cost-optimization customer-satisfaction data-analysis database-design python sql vision-board-analysis wedding-planning wedding-vendor-pricing
Last synced: 03 May 2026
https://github.com/mehtadigisha/clean-visualize-analyze
Clean Visualize Analyze
cleaning-data data-analysis data-cleaning data-visualization eda juypter-notebook pandas python seaborn seaborn-plots visualization
Last synced: 09 May 2026
https://github.com/anderson-andre-p/uber-data-analysis
This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.
data-analysis data-science data-visualization python
Last synced: 15 Jun 2026
https://github.com/viper373/lol-dataanalytics
腾讯游戏-英雄联盟赛事20/21/22年数据综合分析预测
crawler-python data-analysis jupyter-notebook lol python spider
Last synced: 15 Jul 2025
https://github.com/pedrohdosanjos/economic-data-analysis
This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.
api data-analysis data-visualization jupyter-notebook python
Last synced: 24 Apr 2026
https://github.com/ismielabir/pycsvsummarizer
A lightweight tool to summarize CSV files using various features.
csv data-analysis data-summary python
Last synced: 25 Apr 2026
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/rubix982/product-quality-classification
This is an implementation for the CIKM AnalytiCup 2017, around the topic of "Product Title Quality". The goal is to take SKUs and rank its title's clarity and conciseness. Referenced papers are attached to this repository. And as such, the aim is to craft ensemble models that either try to replicate results or find new methods for classification.
data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp
Last synced: 25 Apr 2026
https://github.com/xjwllmsx/hacker-news-engagement
Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.
data data-analysis jupyter python
Last synced: 25 Apr 2026
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/magnus0969/black-friday-sales-analysis
An in-depth analysis of Black Friday sales data to uncover trends, customer behavior, and product insights. Utilizing Python, data visualization, and machine learning techniques, this project provides key business intelligence to optimize sales strategies.
analysis data-analysis data-science python sales-analysis
Last synced: 09 May 2026
https://github.com/marielachirinosr/bellabeat-wellness-data-trends
Analyzing smart device data for insights on user activity patterns to optimize interventions for better health outcomes.
data data-analysis data-visualization pandas python python3 tableau tableau-public
Last synced: 25 Apr 2026
https://github.com/m-biriulova/python-job-market-analysis
Web scraping, data analysis, and visualization of Python developer vacancies in Czech Republic.
automation beautifulsoup data-analysis data-visualization portfolio-project python selenium web-scraping
Last synced: 25 Apr 2026
https://github.com/viniciusds2020/streamlit_app_adult
Protótipo APP - Machine learning - Streamlit
app data-analysis data-science front-end joblib machine-learning python streamlit
Last synced: 25 Apr 2026
https://github.com/edwinrlambert/investigating-netflix-movies
Demonstrates data analysis and visualization techniques for Netflix movies using Python in a Jupyter notebook. This is a DataCamp project.
data-analysis data-analysis-python netflix python
Last synced: 25 Apr 2026
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 25 Apr 2026
https://github.com/chandansoren/customer-personality-analysis
Predict how different customer segments will respond for a particular product or service.
data-analysis data-visualization python
Last synced: 26 Apr 2026
https://github.com/mktechai-0786/data-analysis-on-dr-visits
Data Analysis On Dr. Visits dataset
data-analysis matplotlib-pyplot numpy pandas seaborn
Last synced: 09 May 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/amyanchen/sf-airbnb
Exploratory Data Analysis of San Francisco Airbnb's
data-analysis data-science data-visualization r rmarkdown statistics
Last synced: 18 Jul 2025
https://github.com/deliprofesor/cinematic-data-analytics-and-recommendation-platform
This project analyzes a movie dataset using machine learning algorithms to predict success, explore revenue-popularity relationships, and develop recommendation systems. It employs techniques like K-Means, DBSCAN, GMM, decision trees, PCA, and NLP for insights and personalized suggestions.
clustering content-based-recommendation data-analysis data-visualization decision-tree gmm k-means machine-learning natural-language-processing nlp pca predictive-modeling python recommendation-system scikit-learn user-based-recommendation
Last synced: 26 Apr 2026
https://github.com/swapnanildutta/prediction-with-python
The projects are made using Jupyter Notebook
data-analysis jupyter-notebook machine-learning prediction python regression-models
Last synced: 27 Apr 2026