Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
https://github.com/sanam2405/chatinfo
Analysing the WhatsApp chat with my crush over a 6-month period
data-analysis data-visualization python
Last synced: 27 Apr 2026
https://github.com/jku-vds-lab/loops
Loops is a JupyterLab extension to support iterative and exploratory data analysis in computational notebooks.
data-analysis data-science data-visualization jupyter jupyter-notebook notebook provenance
Last synced: 29 Jan 2026
https://github.com/chelseammatta/nopd-cad-data-analysis
Analysis of 911 call data from New Orleans' 3rd & 4th police districts (2019-2022) using BigQuery
911-calls 911-data bigquery cad-data crime-analysis data-analysis emergency-response new-orleans public-safety sql
Last synced: 01 Jul 2025
https://github.com/ipanalytics/vpn-provider-overlap-intelligence
Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.
anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection
Last synced: 25 May 2026
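The two overlap measures named in the description above (exact-IP overlap and shared /24 prefixes) can be illustrated with a minimal sketch. This is not the repository's code: the function names are hypothetical, the addresses come from the reserved documentation ranges, and only aggregate counts are returned.

```python
import ipaddress

def prefixes_24(ips):
    """Return the set of /24 networks that contain the given IPv4 addresses."""
    # strict=False lets ip_network accept a host address and mask it down.
    return {ipaddress.ip_network(f"{ip}/24", strict=False) for ip in ips}

def overlap_summary(ips_a, ips_b):
    """Count exact-IP matches and shared /24 prefixes between two providers."""
    set_a, set_b = set(ips_a), set(ips_b)
    exact = set_a & set_b
    shared = prefixes_24(set_a) & prefixes_24(set_b)
    # Only aggregate counts leave this function, never the raw addresses.
    return {"exact_ip_overlap": len(exact), "shared_24_prefixes": len(shared)}

# Hypothetical inputs using documentation-only address ranges.
provider_a = ["192.0.2.10", "192.0.2.11", "198.51.100.5"]
provider_b = ["192.0.2.10", "192.0.2.200", "203.0.113.7"]
summary = overlap_summary(provider_a, provider_b)
print(summary)
```

Comparing /24 networks rather than single addresses catches providers that rent neighbouring addresses from the same host even when no individual IP matches.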
https://github.com/titanscouting/tra-analysis
Titan Robotics 2022 Strategy Team Analysis Repository
data-analysis frc frc-scouting hacktoberfest python
Last synced: 29 Jan 2026
https://github.com/gappeah/nike_web_crawler
This project involves web scraping Nike's product pages to extract product names, prices and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.
beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup
Last synced: 04 Jul 2025
https://github.com/randomshek/Working-With-Excel
Uses Excel Power Query and PowerPivot to reorganise the data into a star schema and showcase reports that data analysts can create using DAX formulae and PowerPivot
data-analysis excel power-pivot power-query
Last synced: 20 Jul 2025
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 19 Mar 2026
https://github.com/rkirlew/workoutrecommendationsdataset
This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.
classification data-analysis dataset k-nearest-neighbours machine-learning
Last synced: 08 May 2026
https://github.com/nouman6093/advanced-statistical-models
In this repository I will upload everything I have learned about advanced statistical models in data science. There are over 42 statistical models, each of which works on algorithms, and there are over 32 algorithms. Each library has its own way of writing such statistical models. After learning, I will try to upload as many statistical models as possible
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 21 Jan 2026
https://github.com/robthepcguy/ahk-mouse-heatmap
An AutoHotkey script that records left, right, and middle mouse clicks, logging the date, time, and x and y coordinates. It features automatic GUI updates and generates a visual heatmap via a Python script, accessible from the system tray. This tool is ideal for analyzing user interaction and creating detailed mouse activity maps.
autohotkey-script click-counter click-log click-map data-analysis data-visualization heatmap heatmap-visualization mouse-events mouse-tracking python
Last synced: 01 Apr 2025
https://github.com/webuccinoco/mysql-pivot-tables
Build complex MySQL pivot tables without touching a single line of code. This free PHP tool lets you visually connect your database and map out your data sources with a few simple clicks.
business-analytics business-intelligence crosstab data-analysis data-analytics data-visualization mysql mysql-database mysql-pivot-table mysql-reports mysql-virtualization php php-pivot-table php-reports pivot-tables reporting-tools
Last synced: 04 Feb 2026
https://github.com/apache/cloudberry-devops-release
DevOps and Release for Apache Cloudberry (Incubating)
ai big-data cloudberry data-analysis data-warehouse database devops distributed-database greenplum mpp olap postgres postgresql
Last synced: 04 Sep 2025
https://github.com/sandergi/designbuildfly
Useful tools made for Design Build Fly at UW, hosted on Glitch so teammates can easily access. Check out our optimization tools here: https://github.com/JPaonaskar/DBF-Optimization
data-analysis inav-blackbox webapp
Last synced: 01 Apr 2025
https://github.com/boardgameanalytics/bga-notebooks
Exploratory notebooks using the BGG dataset
data-analysis data-exploration data-visualization ipython-notebook python
Last synced: 28 Jan 2026
https://github.com/avijit-jana/redbus-data_scraping_and_filtering_with_streamlit_app
A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.
automation dashboard data-analysis data-visualization datadrivendecisions python3 redbus selenium-python streamlit-application webscrapping
Last synced: 15 Mar 2025
https://github.com/carterlasalle/sportsarbfinder
Sports Betting Arbitrage Finder: Python tool for identifying profitable arbitrage opportunities across bookmakers. Features multi-region support, customizable profit margins, interactive calculator, and web interface. Uses real-time odds data from The Odds API. Ideal for betting enthusiasts, analysts, and educational purposes.
arbitrage-betting betting-strategy data-analysis finance gambling odds-api python sports-analytics sports-betting
Last synced: 31 Mar 2025
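The arithmetic behind an arbitrage check like the one described above can be sketched in a few lines. This is an illustration under stated assumptions, not the tool's implementation: the function name and the odds are hypothetical, odds are in decimal format, and nothing is fetched from The Odds API.

```python
def arbitrage(best_odds, total_stake=100.0):
    """Check decimal odds (best price per outcome) for an arbitrage.

    Returns None when the implied probabilities sum to 1 or more,
    otherwise the stake per outcome and the guaranteed payout.
    """
    implied = [1.0 / o for o in best_odds]
    s = sum(implied)
    if s >= 1.0:
        return None  # the bookmakers' margin is intact, no arbitrage
    # Stake in proportion to implied probability so every outcome pays the same.
    stakes = [total_stake * p / s for p in implied]
    payout = total_stake / s
    return {"stakes": stakes, "payout": payout, "profit": payout - total_stake}

# Hypothetical best prices for a two-outcome market at two bookmakers.
result = arbitrage([2.10, 2.05])
if result is not None:
    print([round(x, 2) for x in result["stakes"]], round(result["profit"], 2))
```

An opportunity exists only when the inverse odds sum to less than 1; the further below 1 the sum falls, the larger the guaranteed margin.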
https://github.com/mardavsj/weather-prediction
Weather prediction model which mainly focuses on visualization.
data-analysis data-visualization matplotlib numpy pandas pandas-dataframe
Last synced: 10 Apr 2026
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 23 Mar 2025
https://github.com/shakhthi/deep-learning
All materials, practice code, and projects related to ML & DL
data-analysis deep-learning machine-learning
Last synced: 09 Apr 2025
https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-
🎧 Top Hits Spotify (2000 - 2019). Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻
analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits
Last synced: 01 Jul 2025
https://github.com/rahul-404/full_stack_data_science_with_generative_ai
Welcome to the repository for the course "Full Stack Data Science with Generative AI". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science and generative AI techniques.
data-analysis data-science data-visualization database deep-learning exploratory-data-analysis feature-engineering generative-ai machine-learning nlp python statistics
Last synced: 12 Apr 2026
https://github.com/shrawans007/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer's Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments hotel-service ms-excel ms-excel-data-analytics pivot-tables sentiment-analysis tableau tableau-public text-reviews treemap
Last synced: 22 Mar 2025
https://github.com/saidsef/ff18
A complete catalog of all the players in FIFA 2018 and their complete statistics
data-analysis data-visualization fifa18 machine-learning machine-learning-algorithms world-cup-2018 world-cup-ranking
Last synced: 29 May 2026
https://github.com/danhenriquex/data-science-project
The main goal of this project was to apply the concepts of data visualization and analysis.
data-analysis data-science numpy pandas python
Last synced: 12 Apr 2026
https://github.com/cworld1/novel-analysis
A simple project for analyzing Chinese novels
Last synced: 17 Mar 2025
https://github.com/abhi227070/whatsapp-chat-analyzer
The WhatsApp Chat Analyzer is a project that leverages machine learning and natural language processing techniques to analyze chat data from WhatsApp conversations. It provides insights such as message statistics, sentiment analysis, word clouds, and more.
artificial-intelligence data-analysis data-visualization machine-learning machine-learning-algorithms python-3 python-programming
Last synced: 29 Jun 2026
https://github.com/andr3w03/employee-attrition-problem
Employee Attrition Problem Analysis and Prediction
data-analysis data-science data-visualization dicoding gradient-boosting-classifier machine-learning problem-solving python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/kurosawaxyz/covid4eu-sorbonne
Economy: “Analysis of Labor Market decisions of men and women during the COVID-19 pandemic in the 4EU+ countries”.
covid-19 data-analysis data-science data-visualization pandas
Last synced: 04 Jul 2025
https://github.com/arnikz/piqmie
Proteomics Identifications & Quantitations Data Management & Integration Service
data-analysis data-management data-visualisation mass-spectrometry peptide-identification protein-inference protein-quantification proteomics silac web-application
Last synced: 03 Feb 2026
https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken
Surveying a physical room produces a point cloud. This point cloud describes selected points in the room, for example on the walls and the ceiling. When these points are captured in two separate measurements, perhaps even by different devices, the question afterwards is how closely the two point clouds agree. There are two fundamentally different methods for answering it, and they are compared here.
3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics
Last synced: 11 Apr 2026
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 04 Feb 2026
https://github.com/antonioscardace/mri-brainage
Showing Accelerated Brain Ageing in Alzheimer's Patients.
alzheimers-disease brain-age classification data-analysis medical-analysis predictive-modeling regression
Last synced: 18 Jan 2026
https://github.com/odeyiany2/flit-apprenticeship-data-science-projects
This repo contains all my projects for my FLiT Apprenticeship
data-analysis data-science data-visualization machine-learning sql
Last synced: 17 May 2026
https://github.com/ryanfranklin237/data-visualization-python
A tool that lets you visualize data from a CSV or Excel file as a graph or chart
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 11 Jun 2026
https://github.com/ddihora1604/iitk_esg
Researching and analyzing key ESG (Environmental, Social, Governance) metrics and their impact on stock performance and market behavior. Leveraging AI techniques (like Machine Learning and NLP) in finance to extract insights from ESG disclosures, enhancing financial predictions and sustainable investment strategies.
data-analysis data-visualization esg python yahoo-finance
Last synced: 24 Apr 2025
https://github.com/happybono/sonatasmooth
Provides three noise reduction algorithms for smoothing data: Rectangular Averaging, Binomial Median Filtering, and Binomial Averaging. It processes data from one list and displays the results in another.
algorithms average binomial binomial-coefficient binomial-theorem calibration csharp data-analysis data-calibration dynamic-noise-reduction median noise-algorithms noise-reduction noise-reduction-kernel outliers rectangular-averaging windows-desktop windows-desktop-application windows-forms winforms
Last synced: 30 Oct 2025
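Two of the techniques named in the description above, rectangular averaging and binomial averaging, are both weighted moving averages that differ only in their weights. The sketch below is a minimal Python illustration, not the repository's C# code; the function names, the odd window width, and the edge handling (repeating the first and last sample) are assumptions. Binomial median filtering is left out because the description does not define it.

```python
from math import comb

def weighted_smooth(data, weights):
    """Weighted moving average; edge samples are repeated to pad the window."""
    half = len(weights) // 2
    total = sum(weights)
    n = len(data)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(weights):
            j = min(max(i + k - half, 0), n - 1)  # clamp the index at both ends
            acc += w * data[j]
        out.append(acc / total)
    return out

def rectangular_average(data, width=5):
    """Equal weights across the window (a plain moving average)."""
    return weighted_smooth(data, [1] * width)

def binomial_average(data, width=5):
    """Weights from a row of Pascal's triangle, e.g. 1 4 6 4 1 for width 5."""
    return weighted_smooth(data, [comb(width - 1, k) for k in range(width)])

# A single spike shows how each kernel spreads the energy.
spike = [0, 0, 0, 0, 8, 0, 0, 0, 0]
rect = rectangular_average(spike)
binom = binomial_average(spike)
print(rect[4], binom[4])
```

The binomial kernel weights the centre sample more heavily, so it keeps more of a narrow peak than the rectangular kernel of the same width while still damping it.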
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Python
data data-analysis data-science data-visualization
Last synced: 12 Jun 2026
https://github.com/rohithsaji97/open_gate_dip
An automatic gate opening system with an additional parking system (using Raspberry Pi).
automated data-analysis digital-image-processing opencv python3 raspberry-pi-3 trained-models
Last synced: 04 Feb 2026
https://github.com/alejo1630/chicago_crimes
A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium
data-analysis data-visualization folium pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 13 Apr 2026
https://github.com/rani-sikdar/pwc-virtual-internship-powerbi
Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.
business-analytics business-intelligence data-analysis data-cleaning data-visualization interactive interactive-visualizations powerbi
Last synced: 07 Jan 2026
https://github.com/ahmad-ali-rafique/weather-prediction-fcnn
This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.
ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information
Last synced: 28 Aug 2025
https://github.com/bcko/ud-da-stroopeffect
Udacity Data Analyst Nanodegree Project : Test a Perceptual Phenomenon (Stroop Effect)
data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree
Last synced: 04 Jul 2025
https://github.com/umutsevdi/hr-management
HR Management, Analytics and Salary Determination System
analytics data-analysis java java17 postgresql python spring spring-boot vaadin vaadin-flow
Last synced: 10 Apr 2026
https://github.com/ivanildobarauna-dev/api-to-dataframe
Python library that simplifies obtaining data from API endpoints by converting them directly into Pandas DataFrames. This library offers robust features, including retry strategies for failed requests.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 06 Mar 2025
https://github.com/mariam-badr-mb/gtc-ml-project1-hotel-bookings
The goal of this project is to build a robust data preprocessing pipeline for a hotel booking cancellation prediction model. The focus is not on training the final machine learning model but on ensuring that the dataset is clean, consistent, and ML-ready.
cleaning-data data-analysis exploratory-data-analysis
Last synced: 05 Sep 2025
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/roboto-ai/robologs-px4-actions
A collection of actions for working with PX4 data
data-analysis data-processing data-transformation drones px4 px4-logs px4-ulog robotics sensors
Last synced: 04 Feb 2026
https://github.com/samuelsoaress/python-study-datascience-ia
My data science and AI studies
data-analysis data-crawler data-mining data-science deep-learning machine-learning-algorithms
Last synced: 13 May 2026
https://github.com/sivas-2/food-demand
This project aims to predict food demand based on historical data, leveraging various statistical methods to achieve accurate forecasts.
data-analysis data-science dataanalysis food-demand-forecasting statistics
Last synced: 12 Aug 2025
https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel
This script takes a single audio file, collects its sound features (frequency, mean, variance, minimum value, maximum value, median, kurtosis, skewness), saves them in an array, and imports this array as a row in the datasheet.
automation data-analysis data-science dataset-generation excel-import signal-processing
Last synced: 18 Apr 2026
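The summary statistics listed in the description above can be computed with the Python standard library alone. The sketch below is only an illustration under assumptions: it works on an in-memory list of sample values rather than a real audio file, skips the frequency feature, uses population moments for skewness and kurtosis, and writes a CSV row instead of an Excel row. The function name is hypothetical.

```python
import csv
import io
import statistics

def summary_features(samples):
    """Return one row of summary statistics for a list of sample values."""
    n = len(samples)
    mean = statistics.fmean(samples)
    var = statistics.pvariance(samples, mu=mean)  # population variance
    std = var ** 0.5  # assumes the samples are not all identical
    m3 = sum((x - mean) ** 3 for x in samples) / n  # third central moment
    m4 = sum((x - mean) ** 4 for x in samples) / n  # fourth central moment
    return {
        "mean": mean,
        "variance": var,
        "min": min(samples),
        "max": max(samples),
        "median": statistics.median(samples),
        "skewness": m3 / std ** 3,
        "kurtosis": m4 / std ** 4,  # non-excess kurtosis (3 for a normal distribution)
    }

# Hypothetical sample values standing in for decoded audio amplitudes.
features = summary_features([1.0, 2.0, 3.0, 4.0, 5.0])

# Write the features as a header plus one data row, as a datasheet would hold them.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(features))
writer.writeheader()
writer.writerow(features)
csv_text = buffer.getvalue()
print(csv_text)
```

Appending one such row per audio file builds up a feature table that a downstream model can read directly.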
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
Recognizing handwritten numbers and comparing life expectancy vs. fertility across regions in 1960 and 2013
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 12 Apr 2026
https://github.com/shridhar1504/power-bi-visualization-project
This repository contains visualization projects built in Power BI. The visualizations yield insights and strategies that help grow the business and raise profit margins, and that help reduce damage from accidents and calamities.
dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides
Last synced: 21 Jan 2026
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 06 Oct 2025
https://github.com/karthikmprakash/911-call-dataanalysis
Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA
911-call-analysis data-analysis data-visualization python3 united-states-data
Last synced: 10 May 2026
https://github.com/komailmk/instagram-reach-forecasting
This repository provides a Python-based solution for forecasting Instagram reach using historical data and SARIMA modeling techniques.
data-analysis data-visualizations machine-learning
Last synced: 05 Oct 2025
https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection
The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.
banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier
Last synced: 06 Oct 2025
https://github.com/an1mch1k-theone/project_2_hh_analyze
Analysis of job vacancies from HeadHunter
data-analysis data-analysis-project postgresql python sql
Last synced: 14 Apr 2026
https://github.com/hordiales/redpanal-db-analysis
Analysis of the RedPanal.org music database
creative-commons data-analysis dataset etl machine-learning music music-information-retrieval statistical-analysis
Last synced: 10 Mar 2025
https://github.com/erickkhosasi/thelook-data_analysis
Final project for my SQL mini bootcamp. This project explores an e-commerce dataset to uncover key business insights. Data insights were queried in Google BigQuery and visualized with Google Sheets.
bigquery data-analysis e-commerce sql
Last synced: 05 Oct 2025
https://github.com/luabagg/worldwide-trends
Worldwide Google Trends visualization and classification
data-analysis data-visualization google-trends trends
Last synced: 03 Feb 2026
https://github.com/sunnybibyan/call_centre_power_bi_dashboard
Create a dashboard in Power BI to visualize relevant KPIs and metrics that will help the call center manager understand trends.
call-centre-analysis dashboard data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/faisal-khann/diwali-sales-analysis
The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future
csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/allanotieno254/codsoft
This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.
classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression
Last synced: 15 May 2025
https://github.com/noturlee/imdb-dataanalysis
A data model that predicts the IMDb rating of a movie based on features like genre, director, and actors, using regression techniques to tackle the problem.
data-analysis data-cleaning data-modeling data-science data-visualization
Last synced: 08 Apr 2025
https://github.com/tqhungdev0605/crawl_200_jd_dataanalyst
Automate job data scraping for 200 Data Analyst postings on https://vn.indeed.com using Python
data-analysis jupyter-notebook python3 scraping selenium
Last synced: 11 Apr 2026
https://github.com/renatomaynard/statistical-modeling-and-regression-analysis-life-expectancy
Statistical Modeling and Regression Analysis for Life Expectancy
data-analysis healthcare linear-regression machine-learning predictive-modeling r regression-analysis statistical-models statitics
Last synced: 23 Mar 2025
https://github.com/europanite/client_side_ml
Client Side Browser-Based CART Time-Series Prediction.
cart data-analysis data-science docker docker-compose expo expo-go machine-learning metro react-native ti time-series time-series-analysis time-series-forecasting time-series-prediction typescript vite
Last synced: 10 Apr 2026
https://github.com/antonijn/polyfit
Fits a polygon to a given data input
c data-analysis linear-algebra toy
Last synced: 16 Jul 2025
https://github.com/willie-conway/datavista
DataVista is a comprehensive, production-grade data analysis and machine learning platform that combines real-time data ingestion from live APIs, interactive visualizations, statistical analysis, hypothesis testing, and machine learning model training — all in a unified, professional-grade interface. Built with React and Recharts.
analytics-platform api-integration classification coingecko-api csv-import data-analysis data-cleaning-and-preprocessing data-pipeline data-science data-visualizations etl hypothesis-testing json-export machine-learning-models open-meteo react recharts regression statistics world-bank
Last synced: 30 May 2026
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
This repo holds the course contents of the Python for Data Science training given to Siemens in cooperation with Academy Peak Information Technologies Training and Consultancy from 28 June to 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 23 Mar 2025
https://github.com/siddharthbadal/sql-case-studies-data-analysis
Data analysis case studies on various databases using SQL, demonstrating proficiency in solving diverse business problems. Projects cover sales, orders, products, finance, healthcare and other sectors, and highlight my ability to analyze complex datasets through SQL queries, data manipulation, and visualization techniques.
data-analysis sql sql-query sql-server sqlserver
Last synced: 08 Jan 2026
https://github.com/zeinhasan/eksploration-and-data-visualization-course-material
Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials
data-analysis data-visualization statistics
Last synced: 24 Jun 2026
https://github.com/nirmalvatsyayan/data-analyst-nanodegree
Udacity data analyst nanodegree project submissions and learning
data-analysis numpy pandas python statistics udacity-data-analyst-nanodegree
Last synced: 12 Apr 2026
https://github.com/celineboutinon/bottleneck
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Project 5
data-analysis data-analytics data-visualisation dataframes market-intelligence marketing-analytics matplotlib-pyplot missingno numpy pandas python seaborn
Last synced: 07 Sep 2025
https://github.com/raccoon-hero/gender-equality-tracker
A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.
chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api
Last synced: 07 May 2026
https://github.com/tks18/pyquery
PyQuery is a local-first data operating system built on lazy execution that processes 100GB+ files while you doomscroll. No cap. 🧢
data-analysis data-science etl hdfs parquet pipeline polars python
Last synced: 14 Jan 2026
https://github.com/rara-ch/data-analysis-portfolio
This repository stores my data analytics projects, showcasing my skills in SQL and Python.
data-analysis mathematics matplotlib numpy pandas portfolio probability python seaborn sql statistics
Last synced: 12 Mar 2025
https://github.com/jeniljani-4444/end-to-end-world-cup-analysis-web-app
Our streamlined Streamlit web app fetches and processes ESPN CricInfo data delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.
aws-ec2 data-analysis plotly preprocessing streamlit-webapp
Last synced: 02 Apr 2025
https://github.com/aritrakar/statpy
A simple package containing some functions for analysing Gaussian and Binomial distributions. Created for the Udacity AWS MLE Foundations 2021 course.
data-analysis python statistics
Last synced: 24 Oct 2025
https://github.com/hamada-khairi/pfda-hamada
A comprehensive R-based data analysis project that examines housing rental patterns across multiple cities, utilizing statistical methods and visualization techniques to analyze 4,746 properties' data points including rent prices, locations, and amenities. The project employs various R libraries to clean, process, and visualize rental market trends
apu data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-processing-and-analysis data-science data-visualization-project ggplot2 house-rent-prediction r-programming-projects r-statistics r-studio real-estate-analytics
Last synced: 16 Mar 2025
https://github.com/ahammadshawki8/playing-with-pandas
🐼 Pandas is one of my favourite libraries in Python. It is well known for analyzing data. Learn the basics of Pandas and beyond from this repository. 🤍🖤
beginner-friendly data-analysis favourite-library pandas python
Last synced: 17 Apr 2026
https://github.com/carolinedotxyz/dp_sgd_classification
A hands-on educational walkthrough of training a CelebA (Eyeglasses) image classifier with Differentially Private SGD using PyTorch and Opacus. The focus of this repo is on clarity and reproducibility through balanced subsets, deterministic preprocessing, and side-by-side baseline vs. DP training, while acknowledging real trade-offs.
celeba-dataset classification data-analysis dp-sgd machine-learning opacus python pytorch
Last synced: 16 May 2026
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 10 Apr 2026
https://github.com/ultrasage-danz/weather-data-analysis
Weather Data Analysis notebook project. Created using Google Colab
collaboration data-analysis data-science dataset google google-colab-notebook project
Last synced: 24 Mar 2025
https://github.com/sunnybibyan/exploratory-data-analysis-eda
Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.
data-analysis data-visualization jupyter-notebook python titanic-dataset
Last synced: 18 Jan 2026
https://github.com/rapter1990/data-visualization-examples
Data Visualization Examples
data data-analysis data-visualization folium matplotlib plot plotly python seaborn visualization
Last synced: 13 Apr 2026
https://github.com/leosimoes/uerj-tcc-analisador-dados-texto
Text of the final course project (TCC) in computer engineering. Web application for data analysis.
data-analysis data-science data-visualization python streamlit
Last synced: 24 Mar 2025
https://github.com/leosimoes/udacity-starbucks
Project 3 of the Udacity Machine Learning Engineer Nanodegree Program. Data analysis and machine learning application to Starbucks data.
aws-iam aws-s3 aws-sagemaker data-analysis data-science machine-learning python
Last synced: 24 Mar 2025
https://github.com/targetta/ankaflow
YAML-based data pipeline framework that runs both locally and fully in-browser designed for data engineers, ML teams, and SaaS developers who need flexible, SQL-powered pipelines.
bigquery clickhouse data-analysis dataops deltalake duckdb elt-pipeline etl etl-automation motherduck parquet python sql
Last synced: 09 Oct 2025
https://github.com/guruakaashjn/te_project_microsoft_ai
Statistical analysis of land-use plastic pollution in India using AI/ML techniques.
artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi
Last synced: 27 Feb 2025
https://github.com/jinkogule/multi-analyst
Multi Analyst is a data analysis tool with simple usability that uses artificial intelligence to interpret the results of the analyses performed, returning useful insights to users.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 12 Apr 2026
https://github.com/jabhij/eda_experiments
In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.
ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis
Last synced: 14 Apr 2026