Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/27ahmad/netflix_sql_project
The Netflix SQL Project analyzes the Netflix dataset using SQL queries to gain insights into its content, identify trends, and address business problems related to movies and TV shows.
data-analysis postgresql-database sql
Last synced: 03 Feb 2026
https://github.com/sngr0x0/ranklytics-kr
OP.GG Scraping
data-analysis league-of-legends matplotlib opgg playwright-python scraping visualization
Last synced: 16 Oct 2025
https://github.com/hase3b/flask-dash-interactive-dashboard
An interactive data visualization dashboard created using Flask and Dash. This project includes comprehensive data preparation, exploratory data analysis (EDA), and dynamic visualizations with Seaborn and Plotly. Explore the multi-page Dash app with features like dropdowns and callbacks for updated plots.
callbacks dash dashboard data-analysis data-visualization dropdown eda flask interactive plotly seaborn web-app
Last synced: 19 May 2026
https://github.com/rock12231/weather-analysis-backend
Weather analysis, visualization & Data science
data-analysis data-science data-visualisation django-rest-framework jyputer-notebook prediction python
Last synced: 15 Mar 2025
https://github.com/rodrigojunqueiradev/data-exploration-and-cleaning
Credit Analysis Data: Foundations for Cleaning and Exploration
data-analysis data-engineering data-science data-visualization datascience matplotlib matplotlib-pyplot numpy pandas python python-3 python3
Last synced: 13 Apr 2026
https://github.com/kasraskari/learn-r-codes
A learning repository for R programming, covering data manipulation, visualization, and statistical analysis. (Work in progress!) 🚧
data-analysis data-analysis-r data-visualization r r-examples r-graphics r-statistics statistics
Last synced: 08 Jun 2026
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the json files scrapped by the scrapper medium_scrapper_post.py. To extract information from json files scrapped by medium_scrapper_tag_archive.py (scrapping from tags archive) then use Data_Extraction_Archive_Tags.py
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 29 Apr 2026
https://github.com/supertetelman/coursera-exdata-09
This repo contains several R scripts that were used to analyze, plot, and clean data from various datasets. These projects were part of the Coursera course, Exploratory Data Analysis. The end results of the analysis are included.
big-data course coursera data-analysis r
Last synced: 16 Oct 2025
https://github.com/felinjob/ibm-applied-data-science-capstone
Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.
data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql
Last synced: 11 Apr 2026
https://github.com/regmibijay/opencarp-analyzer
Reads Trace Files created by OpenCARP Models and exports data for easy plotting with inbuilt plotter script.
bioinformatics data-analysis opencarp
Last synced: 16 Jan 2026
https://github.com/ak-alien/combobullet
ComboBullet is a versatile log processing and credential extraction toolkit for Windows. It offers multiple features to filter, extract, and manage credentials and cookie data from raw .txt files. This tool is particularly useful for combo scrapers, data analysts, and penetration testers.
combo-extraction cookie-extraction credential-management data-analysis log-processing penetration-testing
Last synced: 30 Jun 2025
https://github.com/emcramer/clockplot
Plotting utility for a "clockplot" that puts groups into a time-ordered heterogeneity visualization
biology data-analysis data-visualization heterogeneity pseudotemporal-ordering
Last synced: 10 Mar 2026
https://github.com/danpoynor/python-number-guessing-game-with-stats
A number guessing game written in Python 3 that presents median, mode, and mean statistics
console-game data-analysis number-guessing-game python3 statistics
Last synced: 26 May 2026
https://github.com/abhay-sinha-0/carpricepredictionproject
A machine learning project that predicts the selling price of a car based on its features such as year, mileage, fuel type, transmission, and more. This model can assist individuals and dealerships in estimating fair market prices for used cars.
artificial-intelligence data-analysis data-science data-visualization exploratory-data-analysis machine-learning-algorithms matplotlib-pyplot mysql-database numpy-library pandas-library python skit-learn sklearn-library
Last synced: 15 May 2025
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/more-joao/color-distance-luminance
Data analysis project that aims to establish a relation between the Canberra distance between white and any given color in the RGB colorspace and its luminance.
canberra-distance data-analysis luminance python r rgb
Last synced: 02 May 2026
https://github.com/pizofreude/da-with-r
Data analysis with R data centric programming language
Last synced: 17 Oct 2025
https://github.com/khulnasoft/data-science-materials
data-analysis data-engineering data-science data-visualization
Last synced: 17 Oct 2025
https://github.com/samanhur/data_visualization_pcc
First experiences in data visualization with python
data-analysis data-science data-visualization python3
Last synced: 23 Mar 2025
https://github.com/anilyigitsel/istanbul-rental-apartments-analysis
This project analyzes the Istanbul Rental Apartments Dataset (2025), which includes rental apartment listings from Istanbul, Turkey.
data-analysis data-visualization jupyter-notebook matplotlib pandas python rental-housing
Last synced: 29 Apr 2026
https://github.com/prateek5525/online-shopping-analytics-project
The Online Shopping Analytics Project analyzed product trends, and regional sales using SQL and Tableau. Insights from the Sales and Location Dashboards highlighted key trends in demographics, product popularity, and regional performance. These findings empower businesses to optimize strategies, enhance marketing, and improve inventory management.
data-analysis excel kaggle-dataset sql tableau
Last synced: 20 Feb 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/abhijeet107/task-4
Design an interactive dashboard for business stakeholders.
data-analysis excel-csv tableau-dashboards tableau-public
Last synced: 22 Jan 2026
https://github.com/bala-1409/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.
dashboard data-analysis data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint power-bi powerbi powerbi-reports powerbi-visuals visualization
Last synced: 04 Jan 2026
https://github.com/codeslash21/analyze-a-b-test-results
Analyze results of an A/B test run by an e-commerce website.
Last synced: 22 Jan 2026
https://github.com/casassg/ms_thesis
Social Media Analysis for Crisis Informatics in the Cloud
casassg-thesis data-analysis google-cloud kubernetes
Last synced: 19 Oct 2025
https://github.com/prakshi-23/restaurant-ratings-analysis-dashboard-using-power-bi
A Power BI Report on Restaurant Ratings Analysis
data-analysis data-visualisation dax powerbi powerbi-dashboard
Last synced: 22 Jan 2026
https://github.com/eco786786/restaurant_orders
This analysis seeks to uncover patterns in customer behaviour by examining restaurant order data.
data-analysis git postgresql tableau
Last synced: 29 Apr 2026
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/eesunmoon/machine_learning
[Spring 2021] Machine Learning
data-analysis kaggle machine-learning ml python scikit-learn sklearn
Last synced: 14 Apr 2026
https://github.com/yulia-momotyuk/dla-data-analysis-practice
This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.
analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau
Last synced: 14 Apr 2026
https://github.com/ved-coder-king/wheat_ai_project
This project, Smart Wheat Farming AI System, was developed as part of the coursework for the Artificial Intelligence program at Esprit School of Engineering.
agriculture data-analysis data-visualization deep-learning image-classification machine-learning object-detection python wheat
Last synced: 15 Apr 2025
https://github.com/mtimma001/clinical-trial-data-tool-v2
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 14 Apr 2026
https://github.com/lucashomuniz/Project-03
Data-Driven Decision Making: Selecting the Best Regression Model for E-commerce Sales
benchmark-framework data-analysis data-driven data-visualization e-commerce-project language-python lasso-regression linear-regression-models machine-learning python ridge-regression
Last synced: 20 Oct 2025
https://github.com/lucashomuniz/Project-02
Data Analysis and Machine Learning Techniques for Liver Disease Prediction
classification-model data-analysis decision-tree healthcare-application knn-algorithm liver-disease-prediction logistic-regression machine-learning python-language python-script random-forest supervised-learning svm-model
Last synced: 20 Oct 2025
https://github.com/jimohola/zomato-restaurant-ratings-ml
Flask Deployment Machine Learning
css data-analysis flask html machine-learning python3
Last synced: 04 May 2026
https://github.com/ashwin331133/sql-pizza-outlet-sales-analysis
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
Last synced: 24 Feb 2026
https://github.com/tomkyle/binning
Determine optimal number of bins 𝒌 for histogram creation and optimal bin width 𝒉 using various statistical methods.
binning data-analysis distributions doanes-rule freedman-diaconis histogram histogram-binning math php-math rice-rule scotts-rule square-root statistics sturges-rule terrell-scotts-rule
Last synced: 21 Oct 2025
https://github.com/saisurajmatta/nashville-housing-data-cleaning-project
Clean and standardize Nashville Housing dataset using SQL queries for improved data quality and structure.
azure-data-studio data-analysis mssql mysql sql sql-data-cleaning sql-queries sql-server-management-studio
Last synced: 23 Jan 2026
https://github.com/jasontan22/aefes-time-series-forecasting
Bu proje, Anadolu Efes Biracılık ve Malt Sanayii A.Ş. (AEFES) piyasa verilerini kullanarak kapanış fiyatlarının gelecekteki değerlerini tahmin etmek amacıyla derin öğrenme yöntemleri (LSTM, BiLSTM, CNN+LSTM) kullanmaktadır. Projede, veri ön işleme, model eğitimi ve değerlendirme adımları detaylandırılmıştır.
bilstm cnn-lstm data-analysis deep-learning financial-forecasting lstm machine-learning python stock-price-prediction tensorflow
Last synced: 09 Aug 2025
https://github.com/bationoa/how_does_a_bike_share_navigate_speedy_success
Bike rendting case study
analytics business-intelligence cleaning-data data-analysis data-collection data-visualization r
Last synced: 26 May 2026
https://github.com/tenifayo/analysis-of-fordgobike-trip-data
Data Visualization using Ford GoBike Trip Data
data-analysis matplotlib pandas
Last synced: 11 Jul 2025
https://github.com/neeraj08823/cyclisticbike-case-study-with-r-tableau
Cyclistic Bike-Share Analysis
data-analysis data-cleaning data-visualization r tableau
Last synced: 19 Jun 2025
https://github.com/ttwag/p9_pandas
Problems that Introduce the DataFrame Object in Python's Pandas Library
data-analysis pandas-dataframe python
Last synced: 10 Jun 2025
https://github.com/aphp/jupyter-eds-notebooks
jupyter-eds-notebooks provides Docker images with preconfigured Jupyter environments for clinical and health data analysis, tailored for AP‑HP Datalabs and the HELIX platform.
data-analysis data-science data-visualization healthcare lab
Last synced: 13 Jan 2026
https://github.com/tashi-2004/apache-hadoop-spark-hive-cyberanalytics
This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.
ai apache-hadoop apache-hive big-data-analytics big-data-processing data-analysis data-engineering data-science data-security data-visualization hdfs machine-learning network-analysis network-security pyspark python3 threat-detection unsw-nb15-dataset
Last synced: 02 May 2026
https://github.com/dcs-training/network-analyisis-python
Course material for introducing data visualization with Altair and network analysis with NetworkX (in Python). Go to the readme file
data-analysis data-visualisation network-analysis python text-analysis
Last synced: 29 Apr 2026
https://github.com/tolumie/loan-approval-prediction
Loan Approval Prediction using Machine Learning | EDA + Decision Tree, Random Forest & Logistic Regression | Automating loan eligibility for Dream Housing Finance by analyzing customer data and predicting loan approvals.
classification credit-risk-analysis data-analysis decision-tree-classifier finance-analytics loan-approval logistic-regression-algorithm machine-learning predictive-modeling-techniques random-forest
Last synced: 30 Jun 2025
https://github.com/mahdi-meyghani/movie-recommendation-system
A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.
data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning
Last synced: 23 Jan 2026
https://github.com/diem0n/100daysofdatascience
This repository is a collection of things i do on as a data scientist each day as i am hired at a fictional company called keko corp
data-analysis data-engineering data-science data-science-from-scratch data-warehousing machine-learning python
Last synced: 09 Apr 2026
https://github.com/kaoutarmi/analyse-des-ventes-pour-optimiser-la-performance
Analyse des données de ventes pour identifier des opportunités d'amélioration des performances commerciales. Utilisation de Pandas pour le traitement des données, et Matplotlib/Seaborn pour la visualisation des tendances et des résultats.
business-intelligence data-analysis data-visualization jupyter-notebook matplotlib pandas sales-optimization seaborn
Last synced: 20 Aug 2025
https://github.com/saroshfarhan/kaggle-playground-s4e11
Kaggle old competirion just for practice
data-analysis data-science data-visualization jupiter-notebook python3
Last synced: 29 Apr 2026
https://github.com/misszeferino/erp-data-analysis
Data Analysis - ERP Data (merge and outliers)
data-analysis data-visualization matplotlib merge numpy outlier-detection python scipy
Last synced: 11 Apr 2026
https://github.com/shimaa83/eda_v2
Automatic EDA library
data-analysis data-science python
Last synced: 20 Apr 2026
https://github.com/omr5221/bi-scripts
Example of DI BI tool scripting
automation configuration-files data-analysis data-warehousing dimensional-analysis diver etl-pipeline lookup modeling perl-script python shell-script sql summarization
Last synced: 15 Mar 2026
https://github.com/dcs-training/spatial_dynamics
Use of QGIS and R to analyse first and second order geospatial effects. Go to the Readme file
data-analysis geographical-data gis qgis r statistics
Last synced: 23 Oct 2025
https://github.com/jigyasag18/bird-strikes-in-aviation-project
This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.
bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database
Last synced: 09 May 2026
https://github.com/jayita11/healthcare-management-optimization-analysis-and-visualization
This project analyzes healthcare data from 2019 to May 2024, optimizing patient care, resource allocation, and financial management. Insights include billing trends, blood bank management, doctor performance, and medication demand, supported by excel,interactive Tableau dashboards and SQL analysis.
data-analysis excel healthcare interactive-dashboards mysql sql tableau-dashboards
Last synced: 23 Mar 2025
https://github.com/apsinghanalytics/hranalytics_myersbriggspersonalityinsights
A Excel analytics study exploring the correlation between personality traits and key HR-relevant parameters, including tenure and performance
data-analysis data-visualization excel pivot-tables
Last synced: 30 Jan 2026
https://github.com/nikkvd/ipl-data-analysis-for-2024-special-edition-magazine
This project analyzes IPL data (2021-2023) using SQL to extract insights on player performances, team strategies, and trends for a special IPL 2024 edition magazine.
Last synced: 24 Feb 2026
https://github.com/elakkiya-u/digital-marketing-campaign
A machine learning project to predict whether a customer will convert based on digital marketing campaign data.
campaigns data-analysis deployment digital-marketing machine-learning predictive-modeling python
Last synced: 30 Jun 2025
https://github.com/browndwarf/contracosta
Wavelength dependent starspot contrast with Kepler/K2 and TESS
Last synced: 23 Jan 2026
https://github.com/sumitkundu102022/air-quality-report
Air Quality Report using PowerBI
data data-analysis data-visualization powerbi
Last synced: 23 Jan 2026
https://github.com/george-gca/ai_papers_analysis
Do some analysis based on main AI conferences
conferences data-analysis fasttext fasttext-embeddings fasttext-python python scikit-learn top2vec
Last synced: 29 Apr 2026
https://github.com/sugumarsrinivasan/sql-datawarehouse-project
Building Mordern datawarehouse with SQL Server, including ETL Processes, data modeling, and data analytics.
data-analysis data-analytics data-engineering data-lake data-science data-warehouse datawarehousing etl etl-pipeline medallion-architecture sql sql-query sql-server
Last synced: 19 Jun 2026
https://github.com/avinashkr-ai/weather-analysis-backend
Weather analysis, visualization & Data science
data-analysis data-science data-visualisation django-rest-framework jyputer-notebook prediction python
Last synced: 24 Oct 2025
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
Projeto em que percorro as etapas que compõem o ciclo de vida da ciência de dados a fim de prever o atrito de clientes do serviço de cartões de crédito de um banco.
data-analysis data-science data-visualization machine-learning python
Last synced: 06 May 2026
https://github.com/jofaval/sonar
Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987
data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci
Last synced: 09 Apr 2026
https://github.com/i7t5/sentimentnlp
Sentiment analysis for COMP 435 Introduction to Machine Learning, Spring 2025
data-analysis jupyter-notebook machine-learning nlp python sentiment-analysis
Last synced: 29 Apr 2026
https://github.com/stribedi-94/comcast-telecom-consumer-complaints-data-analysis
Comcast Telecom Consumer Complaints Analysis
comcast-telcom-complaints data-analysis data-science data-visualization python
Last synced: 26 May 2026
https://github.com/alessandroryo/bike-rental-data-analysis
A data analysis project focused on understanding and predicting bike rental patterns. This project utilizes data processing, visualization, and predictive modeling techniques to gain insights into bike rental usage, fulfilling the final submission requirement for Dicoding Indonesia's Data Analysis course.
bike-rental data-analysis data-visualization jupyter-notebook machine-learning python streamlit
Last synced: 09 Apr 2026
https://github.com/burcuyesilyurt/tableau_natural_disasters
Data Visualization - Tableau
data-analysis data-visualization tableau visualization
Last synced: 03 Jan 2026
https://github.com/nishumehta/coffee-beans-sales-analysis
An in-depth analysis of coffee bean sales using an interactive Excel dashboard, which highlights trends and customer insights
dashboard data-analysis data-visualization excel
Last synced: 28 Jan 2026
https://github.com/zborovskaanna/dou-salary-analysis
Python data analysis project focused on improving data manipulation skills using Pandas
Last synced: 26 Feb 2025
https://github.com/janiavdv/data-spirits
Analysis of alcohol and sports betting data, including a correlation investigation.
correlation data-analysis data-science machine-learning
Last synced: 11 Nov 2025
https://github.com/grlyntng/rpims
Django Code and documentation for the Retail Pharmacy Inventory Management System (best final year project award)
data-analysis django erp forecasting-models lstm-neural-networks reporting
Last synced: 26 May 2026
https://github.com/lotfiferaga/google-play-store-sentiment-analysis
Perform sentiment analysis on Google Play Store reviews using Python. Analyze user feedback to determine the overall sentiment (positive, negative, or neutral) towards various apps. Gain insights to aid developers and businesses in understanding user satisfaction levels and improving their products.
data-analysis data-visualization googleplayservices python reviewsanalysis-nlp
Last synced: 26 Feb 2025
https://github.com/kath92/my_data_projects
My data projects.
data-analysis data-vizualisation nlp-machine-learning poewrbi python tableau
Last synced: 23 Mar 2025
https://github.com/a26nine/kortext-usage-dashboard
An interactive data visualisation dashboard built using Tableau software to understand the value of digital resources issued on Kortext platform at Middlesex University, London.
data-analysis data-science data-visualization knime tableau
Last synced: 01 Feb 2026
https://github.com/prince-pastakiya/human-resources-tableau-project
👥 Interactive Tableau dashboard for HR analytics — includes workforce overview, demographics, income analysis, and detailed employee records with full filtering.
chatgpt data-analysis data-visualization human-resources numpy python python-faker tableau-dashboards tableau-public
Last synced: 18 Apr 2026
https://github.com/shrutiijoshi/apple_greenhouse_gas_emissions
A breakdown of Apple's greenhouse gas emissions from 2015 to 2022 as they aim to reach net zero emissions by 2030.
dashboard data-analysis data-visualization powerbi
Last synced: 06 Feb 2026
https://github.com/faisal-fida/box-office-mojo-analysis
Analyzed box office data from Box Office Mojo, exploring relationships between worldwide revenue, release year, and a combined score that considers both factors. It includes visualizations like scatter plots, bar charts, and identifies top and bottom performing movies.
box-office data-analysis data-science python revenue-prediction visualization
Last synced: 06 May 2026
https://github.com/lankesathwik7/sql-query-assistant
Natural language to SQL query converter using Groq LLM. Ask questions in plain English and get SQL queries, visualized results, and natural language explanations. Built with Streamlit and PostgreSQL.
data-analysis database groq llm natural-language-processing python sql
Last synced: 29 Apr 2026
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 11 Jun 2026
https://github.com/ljadhav25/linear_regression_data_science
Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable. The variable you are using to predict the other variable's value is called the independent variable.
data-analysis data-science linear-regression machine-learning
Last synced: 26 Oct 2025
https://github.com/madhursinghbhadoriya/data_analysis_fifa-players
• Using NumPy, Matplotlib, Pandas, etc processed important Information and Characteristic traits on Jupyter Notebook.
analysis data-analysis data-science graphs jupyter-notebook pandas python
Last synced: 07 May 2026
https://github.com/sejalkoli/british-airways-web-scraping
Data science virtual internship program by British Airways through Forage!
british-airways data-analysis data-science internship-project internship-task machine-learning present-insights project reporting web-scraping
Last synced: 26 Oct 2025
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 25 Feb 2025
https://github.com/khushi-sabarad/web_scraping
This project is a Python-based web scraper that extracts the menu from a cafe and saves it to an Excel file. It was created to automate the process of retrieving and updating menu prices, a task that was observed to be done manually at the hostel.
beautifulsoup data-analysis data-visualization market-analysis pandas python requests web-scraping wordcloud
Last synced: 29 Apr 2026
https://github.com/vishalsiingh/deloitte-virtual-internship
Submission for the STEM Virtual Program by Deloitte via Forage.
coding cyber-security data-analysis deloitte development forage forensics
Last synced: 23 Jan 2026
https://github.com/campagnucci/exercitando_pandas
Exercícios práticos de pandas com dados abertos da educação de São Paulo
data-analysis data-science education-data exercises pandas-tutorial
Last synced: 28 Jan 2026
https://github.com/codewithjazmine/bookbot
Python command-line tool that analyzes text files for word count and character statistics
command-line-tool data-analysis learning-project python text-analysis
Last synced: 23 Jan 2026
https://github.com/shivam5509/power-bi-project
Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.
advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server
Last synced: 11 Apr 2026
https://github.com/taljindergill78/yelp-arizona-analysis
This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.
big-data data-analysis data-engineering distributed-computing pyspark spark sql yelp-dataset
Last synced: 29 Apr 2026
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modelation of the conquest Battlefield gamemode
agent-based-simulation data-analysis jade java sajas swing
Last synced: 24 Jan 2026
https://github.com/carlos-edulira/mbabigdata-projeto
Entrega do projeto MBA Unipe Big Data BI
data-analysis delta minio python spark
Last synced: 29 Apr 2026
https://github.com/9dl/usbfalcon
Automatically copies files from plugged USB drives to a specified location, enabling quick data retrieval for analysis.
automation data-analysis data-retrieval ethical-hacking file-copying usb
Last synced: 27 Oct 2025
https://github.com/marknature/machine-learning-intern
Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset
data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn
Last synced: 10 Apr 2026
https://github.com/farzeen-2001/blinkit-sales-analysis-using-powerbi
The project provides an overview about the BlinkIt Sales performances
data-analysis data-visualization datacleaning excel powerbi
Last synced: 24 Jan 2026
https://github.com/code-jl/nfl-kicker-predictor
A sophisticated Python application that provides real-time NFL kicker statistics and performance analysis with an intuitive graphical interface.
beautifulsoup data-analysis data-visualization espn football gui nfl prediction python real-time-analytics real-time-data sport-analytics sports-data statistics tkinter web-scraping
Last synced: 01 Jun 2026