Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
https://github.com/dina-hosny/retail-store-data-modeling-and-analysis-using-datastage
The project implements a star-schema data warehousing flow, then uses IBM InfoSphere DataStage to develop efficient ETL pipelines that create data marts and run analysis on them.
data-analysis datastage datawarehousing etl extract ibm load transform
Last synced: 06 Mar 2026
https://github.com/eve-ning/ppshift
Analyzes maps and scores from 2015
data-analysis data-mining osu osugame
Last synced: 13 Feb 2026
https://github.com/saroshfarhan/irish_hospital_data_anaysis
Analysis of Irish hospital patient discharge data for four counties
data-analysis data-science data-visualization healthcare irish-data r-programming-language
Last synced: 18 Feb 2026
https://github.com/sebastianurdaneguibisalaya/colocaciones-de-credito-fondo-mivivienda-peru
I explore the credit placements (Colocaciones de Crédito) of Fondo MIVIVIENDA S.A. between 2018 and 2022, using a dataset downloaded from Peru's National Open Data Portal (Portal Nacional de Datos Abiertos del Perú). 🏠
data-analysis jupyter-notebook python
Last synced: 24 Feb 2025
https://github.com/ishitaagl20/nyc-taxi_trip_prediction
Taxi Trip Duration Prediction Using the NYC Dataset
data-analysis data-exploration data-visualisation decision-trees matplotlib nyc-taxi-dataset python3 random-forest seaborn xgboost
Last synced: 19 May 2026
https://github.com/jcm-ai/Personal-Data-Science-Projects
This page contains all of my personal data science projects. 📊📈📉👨💻
data-analysis data-visualization exploratory-data-analysis jupyter-notebooks machine-learning-algorithms matplotlib-pyplot numpy-library pandas-python personal-project predictive-modeling programming python3 scikit-learn scipy seaborn statistical-analysis
Last synced: 19 Aug 2025
https://github.com/jillie-wink/sql-portfolio
SQL Data Analysis Projects
data-analysis data-manipulation portfolio sql sqlite
Last synced: 02 Jan 2026
https://github.com/twistedfrost/best-of-ml-python
Explore the best machine learning libraries in Python. Stay updated with weekly rankings and contributions. Join the community! 🐙🌟
airport airport-simulation awesome breast-cancer-prediction data-analysis data-science data-visualization decision-tree-classifier deep-learning gpt jax nlp random-forest-classifier scikit-learn svm-classifier transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 26 Jun 2025
https://github.com/parthkumarmpatel/sql-exploratory-data-analysis
SQL EDA scripts for sales data warehouse — metrics, insights, and rankings from my data warehouse project.
data-analysis exploratory-data-analysis sql-server
Last synced: 26 Jun 2025
https://github.com/sevilaymuni/project-no.2-pandas-tableau-student-mobility
Pandas assisted Feature Engineering on Study Mobility: Tableau Dashboards on Students' Preferences
data-analysis data-extraction data-visualization feature-engineering pandas python tableau-dashboards tableau-desktop tableau-public
Last synced: 03 May 2026
https://github.com/adeebkhan25/dataset_suicide_susceptible
The "Student Suicide Risk Factors Dataset" is a comprehensive collection of data aimed at understanding and mitigating the factors contributing to student suicides.
data-analysis dataset machine-learning supervised-learning
Last synced: 24 Dec 2025
https://github.com/alimiheb/advwokcube-analysis
A comprehensive SSAS cube project based on AdventureWorksDW2019, featuring data cleaning, multidimensional modeling, and visualizations in Power BI and Excel.
adventureworks data-analysis excel powerbi sql-server ssas-multidimensional visualization
Last synced: 26 Jun 2025
https://github.com/revtpark/teamseas_scrapper
Scraping Team Seas for data analysis and visualization.
chartjs data-analysis python webscraping
Last synced: 28 Mar 2025
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of hundreds of IP addresses and use of a machine learning algorithm to detect threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/nivasharmaa/friskwatch
A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.
data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data
Last synced: 19 May 2026
https://github.com/gbikram/python-data-analysis
The Counted
data-analysis matplotlib python
Last synced: 10 Jul 2025
https://github.com/zeh237/superstore-data-analytics
A Flask-based data analytics project built on the Superstore dataset using Flask, pandas, SQL, and Python
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025
https://github.com/shellynagar27/marketing-content-performance-analysis
Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.
data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing
Last synced: 26 Jun 2025
https://github.com/roma-glushko/magechurn
churn-analytics data-analysis data-science
Last synced: 06 Apr 2025
https://github.com/kevin-rsj/sectores_economicos_covid-19
Exploratory Data Analysis (EDA): Behavior of Economic Sectors before, during, and after the COVID-19 Pandemic (2019-2022)
data-analysis financial-analysis pandemic-analysis python stock-market time-series visualization yahoo-finance
Last synced: 20 May 2026
https://github.com/ryuzen6/kaggle-series
This is a series of Machine Learning/Deep Learning Models made for practice.
artificial-intelligence data-analysis data-science deep-learning machine-learning python3
Last synced: 20 May 2026
https://github.com/evamaerey/ma206distributions
data-analysis data-science ggplot2 statistics
Last synced: 22 Jul 2025
https://github.com/astrojarhead/irafscripts
IRAF cl scripts
astronomy data-analysis image-processing iraf scripts
Last synced: 12 Jan 2026
https://github.com/badranalyst/restaurant-reviews-sentiment-analysis-nlp-case-study
This project analyzes restaurant reviews using Natural Language Processing (NLP) for sentiment analysis. It covers data exploration, pre-processing (NLTK text cleaning), model building, prediction, and deployment. The goal is to predict sentiment from reviews using Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.
data-analysis data-science eda exploratory-data-analysis matplotlib-pyplot model model-building numpy pandas pre-processing predictive-modeling python seaborn
Last synced: 13 Apr 2026
https://github.com/sharoonjoseph321/e-commerce-eda
Data analysis on e-commerce, using pandas, Python, and Matplotlib.
data-analysis data-science data-science-projects data-visualization jupyter-notebook matplotlib pandas pandas-dataframe pandas-python python
Last synced: 06 Apr 2025
https://github.com/sharoonjoseph321/samsung_stock_prediction
Predicting the future price of Samsung stock using machine learning, scikit-learn, and pandas
algorithms data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms prediction predictive-analytics predictive-modeling python stock-price-prediction supervised-learning
Last synced: 06 Apr 2025
https://github.com/deliprofesor/customerseg-customer-segmentation-and-shopping-analysis
This project performs data exploration, segmentation, and modeling of wholesale customer data using clustering algorithms, PCA, and decision trees to analyze purchasing behavior and predict customer channel preferences.
clustering customer-segmentation data-analysis data-visualization dbscan decision-tree gmm kmeans machine-learning pca
Last synced: 24 Jun 2025
https://github.com/techshot25/graduateadmissions
Looking at the probability of being accepted into a graduate program using a machine learning model
bayesian-regression correlation-matrices data-analysis data-science linear-regression machie-learning random-forest-regression regression ridge-regression
Last synced: 25 Feb 2025
https://github.com/balajimohan18/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
classification-model data-analysis data-analytics data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning jupyter-notebook maachine-learning machine-learning-algorithms predictive-modeling python supervised-learning
Last synced: 20 May 2026
https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard
This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.
business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi
Last synced: 12 Jan 2026
https://github.com/emredemirbas/movie-ratings-analysis
A data analysis project investigating potential bias in movie ratings from 2015, comparing them with ratings from other platforms using Python, pandas, and visualization libraries.
data-analysis matplotlib pandas python seaborn
Last synced: 03 May 2026
https://github.com/vipulbunny/restaurant-insight-analysis
A comprehensive data analysis project exploring restaurant ratings, locations, and customer sentiments. This project includes data preprocessing, descriptive analysis, geospatial mapping, sentiment analysis, and price-rating correlations using Python and visualization tools.
data-analysis data-preprocessing data-visualization folium geospatial geospatial-analysis geospatial-visualization machine-learning nlp pandas python restaurant-insights seaborn sentiment-analysis
Last synced: 03 May 2026
https://github.com/muskanmi/data_analysis_python
Data analysis on a student results dataset using Python libraries.
boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn
Last synced: 03 May 2026
https://github.com/ankitgmishra/machinelearning
Continuously deepening my understanding of and expertise in machine learning through ongoing education and hands-on practical experience.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 03 May 2026
https://github.com/matteospanio/speed-analysis
A project to analyze internet speed
Last synced: 03 May 2026
https://github.com/nathadriele/world-marathon-run-majors-analytics-challenge
This project presents a complete data engineering, analytics, machine learning, and Streamlit dashboard pipeline focused on the Abbott World Marathon Majors: Tokyo, Boston, London, Berlin, Chicago, and New York City. Covering the 2018 to 2025 seasons, it analyzes more than 628,000 runner records and 86 verified winner entries.
challenge data-analysis data-pipeline gradient-boosting lasso-regression linear-regression machine-learning models predictive-modeling python random-forest ridge-regression run-analytics world-marathon
Last synced: 09 Jun 2026
https://github.com/nickenshidqia/uber-new-york-data-analysis
Analyze Uber pickups in New York to gain insights from the data
data-analysis data-analyst exploratory-data-analysis python
Last synced: 04 May 2026
https://github.com/arv-anshul/ipl-api
IPL API using the Flask framework and an IPL dataset.
api data-analysis fast-api flask flask-api ipl ipl-api python3
Last synced: 04 May 2026
https://github.com/ibrahimm7004/supermarket-sales-analysis
This project focuses on data mining techniques to gather insights about customer behavior regarding supermarket sales. Includes: Association Rule Mining, Temporal Patterns in customer behavior, Sequential Pattern Mining, Classification, Regression, and Outlier Detection.
apriori association-rules data-analysis data-mining data-science data-visualization fpgrowth python sales-analysis supermarket-sales
Last synced: 04 May 2026
https://github.com/hilalguleryuz/northwind_data_analysis_capstone_project
Northwind Capstone Project
capstone-project dashboard data-analysis data-visualization dax jupyter-notebook matplotlib northwind northwind-database pandas postgresql powerbi python seaborn sql
Last synced: 04 May 2026
https://github.com/halyusa16/e-commerce-analysis
This project analyzes a public e-commerce dataset to uncover valuable insights and answer critical business questions. The dataset contains customer, product, order, and transaction details, providing a comprehensive view of the e-commerce platform's operations.
data-analysis data-cleaning data-exploration data-visualization self-project
Last synced: 09 Jun 2026
https://github.com/josewebdev2000/us-violent-crime-data-analysis
Analyzing Violent Crime in the United States of America from 1960 to 2019
data-analysis data-science data-visualization interactive-visualizations jupyter-notebook pandas plotly python
Last synced: 04 May 2026
https://github.com/yokawaiik/data_science
Time series forecasting with future prediction.
data-analysis keras lstm neural-network predict-future python python-3 rnn time-series-forecast visualization
Last synced: 05 May 2026
https://github.com/tasosfotiadis/time-series-analysis-and-forecasting-of-cryptocurrency-prices
Forecasted Cardano (ADA) cryptocurrency prices using time series analysis. The project involved data preprocessing, trend and seasonality analysis, and model building with ARIMA, SARIMA, and LSTM. Models were evaluated using metrics like MAE and MAPE, providing insights for financial decision-making.
applied-st classical-statistical-models data-analysis deep-learning lstm machine-learning neural-network python r time-series
Last synced: 05 May 2026
https://github.com/dhruvsrikanth/basic-data-science
A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!
analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn
Last synced: 05 May 2026
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 05 May 2026
https://github.com/akotronis/qualitycontrol
HRH Quality Control app
data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3
Last synced: 05 May 2026
https://github.com/syarwinaaa09/exploring-nyc-public-school-test-result-scores
📊 analyzing NYC school test scores with python 🐍 to spot top performers 🏆 & trends 📈
data-analysis education pandas python visualization
Last synced: 06 May 2026
https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.
data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 06 May 2026
https://github.com/superpandas-ai/superpandas
Adding LLM integration to Pandas library
Last synced: 06 May 2026
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 10 Jun 2026
https://github.com/fbarffmann/home_sales
Analyzed 25,000+ home sales using PySpark and SparkSQL. Identified pricing trends by year built, home features, and view rating. Optimized query run-time by 70% using caching.
aws big-data data-analysis home-sales parquet pyspark python spark spark-sql sql
Last synced: 06 May 2026
https://github.com/karlyndiary/coffee-shop-sales-analysis
Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.
data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard
Last synced: 07 May 2026
https://github.com/mahmoudnamnam/fc-barcelona-reports
FC Barcelona Reports: An interactive web application to analyze and visualize FC Barcelona's match data. Built with Streamlit, it scrapes match data from WhoScored, stores it in MongoDB, and presents insights through interactive visualizations like pass networks, shot maps, and player statistics.
data-analysis data-visualization football-analytics mplsoccer pandas streamlit web-scraping
Last synced: 07 May 2026
https://github.com/lucalullo/global-emissions-and-temperature-1950-2024
Global climate analysis covering 75 years of CO₂, greenhouse gas emissions and mean surface temperatures across countries (1950–2024). Built with Pandas, Matplotlib, Seaborn and Plotly.
climate-change co2-emissions data-analysis greenhouse-gas temperature
Last synced: 10 Jun 2026
https://github.com/riborings/python_projects
Python projects and other programming experiences
data-analysis machine-learning project python regression-analysis
Last synced: 08 May 2026
https://github.com/bnvulpe/regression-and-time-series
This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.
colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting
Last synced: 08 May 2026
https://github.com/fahamidur/cuisine-analysis
This project analyzes recipes from AllRecipes.com to reveal global cooking patterns, nutritional trends, and cultural food differences, offering data-driven insights for food enthusiasts and researchers.
beautifulsoup data-analysis datavisualization pandas selenium tableau-public webscraping
Last synced: 08 May 2026
https://github.com/0290192029/apartment-price-predictor
Python project for predicting apartment rental prices using linear regression. Practical assignment on the topic "Fundamentals of Machine Learning" for the course "MDK 13.01: Fundamentals of Applying Artificial Intelligence Methods in Programming".
apartment-price-prediction apartments-for-rent api correios-api data-analysis feature-engineering feature-enginering linear-regression linear-regression-models mlops numpy prediction-model r seaborn
Last synced: 08 May 2026
https://github.com/vanshuchaudhary/flightpriceanalysis-
The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.
business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis
Last synced: 08 May 2026
https://github.com/pedrosfaria2/analisandopostshn
Project to analyze posts from the HackerNews community
analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3
Last synced: 08 May 2026
https://github.com/ryan-wong1/analyzing-arrest-patterns-in-chicago-data-analysis
Chicago Police Department (CPD) arrest data on offenses, locations, and demographics
data-analysis data-cleaning data-visualization exploratory-data-analysis matplotlib pandas python seaborn
Last synced: 08 May 2026
https://github.com/tsbarr/toronto-open-data
Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses
api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics
Last synced: 09 May 2026
https://github.com/zxjahid/matplotlib
A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊
candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx
Last synced: 09 May 2026
https://github.com/vadniks/akabigdata
Technologies and tools for big data analysis
applied-mathematics association-rule-learning classification clustering data-analysis data-visualization ensemble-learning machine-learning-algorithms python3 statistics
Last synced: 23 Sep 2025
https://github.com/drill-n-bass/ovh-project
The goal of this task is to prepare a statistical analysis of a set of data from disks.
anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots
Last synced: 09 May 2026
https://github.com/datalopes1/manufacturing_defects
EDA project using the Manufacturing Defects dataset, which can be found on Kaggle
data-analysis data-visualization eda exploratory-data-analysis python
Last synced: 09 May 2026
https://github.com/happybravo/ss4202_project
Space Astronomy project
astro astronomy astrophysics classification data-analysis data-science data-visualization galaxies machine-learning python quasar stars
Last synced: 10 May 2026
https://github.com/macdon112/credit-card-fraud-detection
Comparing ML models (Random Forest, KNN, Decision Tree) for credit card fraud detection using SMOTE and stratified cross-validation.
classification data-analysis fraud-detection imbalanced-data machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/mozeel-v/spam-detection
ML-powered SMS Spam Classifier using NLP and Scikit-learn. Detects and filters spam messages with interactive Streamlit UI.
classification data-analysis mnb streamlit
Last synced: 10 May 2026
https://github.com/pipe199x/end-to-end-prediction-california
End-to-end prediction project using various technologies to predict housing prices in California.
california-housing data-analysis machine-learning python
Last synced: 11 May 2026
https://github.com/melissaantunes/ibm-data-analyst-professional
IBM Data Analyst Professional Certificate
analyze-data data-analysis data-analyst data-manipulation data-science data-visualization ibm-data-analyst-professional pandas python
Last synced: 11 May 2026
https://github.com/lucashomuniz/project-04
STATISTICAL ANALYSIS FOR DEMAND PLANNING IN POWERBI
bigquery data-analysis data-structures data-visualization database google-cloud-platform powerbi powerbi-visuals sql sql-query
Last synced: 11 May 2026
https://github.com/sferez/gradient_descent
Multiple Linear Regression, Gradient Descent with Python
data-analysis data-science gradient-descent linear-regression python
Last synced: 12 May 2026
https://github.com/OdessaZ/Portfolio-Projects
This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science
applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics
Last synced: 12 May 2026
https://github.com/ygalvao/bra_scraper_2022
A web scraper bot for the 2nd round of the 2022 Brazilian Federal Elections.
data-analysis data-analytics selenium web-scraper webscraper
Last synced: 12 May 2026
https://github.com/sebastian-diaz-berdecia/analisis-popularidad-de-series-y-generos-de-series
SQL queries for analyzing the popularity of series and series genres in the NetflixDB database.
business-analytics bussiness-intelligence data data-analysis database mysql mysql-database sql
Last synced: 12 May 2026
https://github.com/parthds02/-daily-calorie-count-meal-plan-generator-
Welcome to the Daily Calorie Count Meal Plan Generator project! This Streamlit web application is designed to create personalized meal plans based on user inputs such as age, weight, gender, and calorie goals. It also allows users to download their customized meal plans as PDFs.
calories-tracker data-analysis data-science pdf-generation streamlit vscode
Last synced: 13 May 2026
https://github.com/nlink-jp/shell-agent-v2
macOS local-first chat & agent tool with interactive data analysis (Wails v2 + React)
data-analysis duckdb golang llm macos react wails
Last synced: 13 May 2026
https://github.com/bhaveshbhakta/emotions-analysis-using-gru
Emotions Analysis
data-analysis data-preprocessing data-visualization deep-learning emotion-analysis gru rnn
Last synced: 14 May 2026
https://github.com/yashsingh43/cdc-sleep-duration-health-analysis
Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.
beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python
Last synced: 11 Jun 2026
https://github.com/rachel-xmr/data-analysis-of-hand-drawn-symbols
Data analysis of hand-drawn symbols
cross-validation data-analysis decision-trees k-means-clustering machine-learning model-evaluation r
Last synced: 12 Jun 2026
https://github.com/saksham-jain177/cryptodataanalysis
A Python-powered project that fetches live cryptocurrency data from the CoinMarketCap API, analyzes it, and updates a live Excel sheet every 5 minutes.
api-integration coinmarketcap cryptocurrency data-analysis excel live-data python
Last synced: 12 Jun 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/tawfikhammad/sql-leetcode-solutions
The solutions of SQL 50 LeetCode problems
data-analysis data-engineering database leetcode leetcode-solutions sql
Last synced: 15 Jun 2026
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/llnl/cap
HPC workflow that automates the tedious actions of compiling, analyzing, and parsing with bincfg
data-analysis hpc python workflows
Last synced: 17 Jun 2026
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/jayavarshini-jayakumaran/nba-exploratory-data-analysis
A data analytics project that explores NBA game and player data using Python and Power BI. Features data preprocessing, EDA, feature engineering, and an interactive dashboard for visualizing team and player performance trends.
data-analysis data-visualization exploratory-data-analysis powerbi python3
Last synced: 20 Jun 2026
https://github.com/haseebn19/urban-housing-demand
A full-stack web application for visualizing housing and labour market data
data-analysis data-visualization docker full-stack gradle statistics web webapp
Last synced: 22 Jun 2026
https://github.com/engusseus/warframe-market-set-profit-analyzer
Python tool that analyzes Warframe Market data to find profitable item sets to trade
api data-analysis python trading waframe
Last synced: 23 Jun 2026
https://github.com/imosudi/unsupervised-ml-kmeans-analysis
K-Means clustering analysis using synthetic datasets generated with scikit-learn, including meshgrid visualisation, silhouette score evaluation, and investigation of cluster count and random seed effects.
clustering data-analysis jupyter-notebook kmeans kmeans-clustering machine-learning matplotlib python3 scikit-learn silhouette-score unsupervised-learning
Last synced: 25 Jun 2026
https://github.com/souza-vitor/stock-market
codecademy data data-analysis data-mining data-science sql sqlite
Last synced: 26 Jun 2026
https://github.com/chdre/data-analyzer
A small package to analyze and preprocess data.
Last synced: 28 Jun 2026
https://github.com/manganite/vibespin
VibeSpin is a Python framework for simulating and analyzing 2D lattice spin systems (Ising, XY, and q-state Clock models) with Numba-accelerated Monte Carlo dynamics, correlation/structure diagnostics, and reproducible benchmarking workflows.
clock-model critical-phenomena data-analysis ising-model lattice-models monte-carlo-simulation phase-transitions physics-simulation python scientific-computing spin-models spin-systems statistical-mechanics xy-model
Last synced: 29 Jun 2026
https://github.com/asadiahmad/word-counter-spark
Word counter with Spark
data-analysis nlp spark word-counter
Last synced: 02 May 2026
https://github.com/jcaperella29/jc_bioinformatics_hub
A personal hub to showcase my bioinformatics applications including RNA-Seq, ATAC-Seq, and miRNA-Seq analysis tools. Powered by simple HTML, CSS, and JavaScript with a biotech-themed design.
atac-seq bioinformatics biotech data-analysis github-pages portal rna-seq webapp
Last synced: 25 Feb 2026
https://github.com/tralahm/parliament-2017-dataset
Concise, Clean data sets of the 2017 Kenyan General Election results for the Members of the Senate and National Assembly Composition
csv-parsing data-analysis data-visualization datasets election-data ipynb-jupyter-notebook kaggle-dataset kenya-constituencies kenya-counties matplotlib python3 tralahtek
Last synced: 31 Jan 2026