Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/scarblase/sales_insights
A data-driven analysis of 15,000 sales records using Python, Pandas, and visualizations to uncover trends, optimize strategies, and enhance business performance. 🚀📊
data-analysis data-visualization dataset matplotlib-pyplot pandas python3 sales-analysis seaborn
Last synced: 05 May 2026
https://github.com/flexmonster/svelte-flexmonster
Svelte wrapper for Flexmonster Pivot Table & Charts
data-analysis data-visualization frontend pivot-tables svelte sveltekit
Last synced: 27 Feb 2026
https://github.com/alemalvarez/data-analysis-web-project
Web-app providing a simple interface for data storage,
data-analysis data-science javascript react webapp
Last synced: 29 Apr 2026
https://github.com/dina-hosny/telco-customer-churn-analysis-using-power-bi
An interactive dashboard to represent some analysis of "Telco customer churn" data and the reasons that made customers churn using Microsoft Power BI.
business-intelligence data-analysis data-modeling data-visualization power-bi powerbi
Last synced: 19 Mar 2026
https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask
It is an end-to-end project that combines machine learning to create a user-friendly Heart Disease Diagnosis System, powered by Flask.
data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api
Last synced: 04 May 2026
https://github.com/akarshankapoor7/tensorflow_tutorial
This is an easy and fast tutorial for tensorflow. In data science, TensorFlow is an open-source machine learning framework by Google. It's used for building and training machine learning and deep learning models.
data-analysis data-science deep-learning machine-learning tensorflow
Last synced: 27 Apr 2026
https://github.com/sarthak-0-sach/drivermasterdata_database_table
This code enables data integration from multiple sources and ensures a single source for all driver-related attributes. Designed for scalability and pipeline compatibility, this project supports clean data transformations, validations, and storage-ready outputs. Ideal for quick analytics, created using python & airflow, automated using cronjob.
apache-airflow-etl-pipeline data-analysis data-visualization database-management python
Last synced: 27 Apr 2026
https://github.com/talha-1010/imdb-data-analysis
A data analysis project made with python using pandas
data-analysis data-visualization jupyter-notebook pandas pandas-dataframe
Last synced: 09 May 2026
https://github.com/sleeplessglory/big-data
Projects regarding big data analysis, presented within Jupyter Notebook
big-data data-analysis data-visualization jupyter python
Last synced: 16 Apr 2026
https://github.com/angelgardt/wlm-sdarp-old
World of Linear Models: Statistics & Data Analysis in R for Psychologists
data-analysis data-visualization gh-pages manim-animations quarto r rstudio statistics
Last synced: 04 May 2026
https://github.com/gowthamsundaresan/eigenscan
blockexplorer for eigenlayer
crypto data-analysis eigenlayer nextjs web3
Last synced: 04 May 2026
https://github.com/alicankaya192/world-happiness-report-2025
Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).
correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report
Last synced: 21 Jun 2026
https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2
Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.
data data-analysis deloitte excel forensic-analysis
Last synced: 06 Feb 2026
https://github.com/blagojeblagojevic/lol_data_analysis
classification data-analysis data-science jupyter-notebook kaggle python3
Last synced: 09 May 2026
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/sejalmankar1012/yuvaco_data_analysis_assessment
This assignment involves writing a Python script to calculate the cost of package deliveries based on provided data and a cost grid. The script takes package details such as weight, distance, and delivery type, applies the cost calculation rules, and saves the results in an output file. You can also run the script in Google Colab for convenience.
csv-file-handling data-analysis google-colab package-delivery python python-scripting
Last synced: 29 Apr 2026
https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries
🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈
ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics
Last synced: 21 Jun 2026
https://github.com/datadotworld/dw-jupyter-contents
Jupyter ContentsManager implementation for data.world
data data-analysis data-science dwstruct-t50-public-projects jupyter jupyter-notebook jupyterlab reference-implementation
Last synced: 22 Jun 2026
https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk
The aim of this analysis is to understand the smoking patterns among adults in the UK.
data data-analysis data-visualization python3
Last synced: 09 May 2026
https://github.com/paweljakubas/j-data-analysis
Data analysis using J
data-analysis j machine-learning statistics
Last synced: 23 Jun 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/kalfasyan/filoma
profiling files, directories, image data
data-analysis profiler validation
Last synced: 05 Apr 2026
https://github.com/ryanfranklin237/data-cleansing
A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells
data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python
Last synced: 23 Jun 2026
https://github.com/ismailtekin05/caloriedetectingai
🍎🔍 Smart AI system that identifies food items in photos and calculates their calorie content automatically. Built with TensorFlow, YOLOv8, CUDA and computer vision for accurate nutrition tracking.
ai aimodel calorie-calculator computer-vision cuda data-analysis data-science data-segmentation data-visualization dataset dataset-generation image-processing image-recognition python segmentation-models tensorflow ultralytics yaml yolo yolov8
Last synced: 29 Apr 2026
https://github.com/musaibnagani/fraud-detection
End-to-end fraud detection simulation using Python — Phase 1 (SQLite + Rules) and Phase 2 (MSSQL + Velocity/Behavioral Features) with synthetic banking data.
data-analysis fraud-detection fraudulent-transactions mssql mssql-database pandas python sqlite3 time-series
Last synced: 10 May 2026
https://github.com/aravind-selvam/bikeshare-company-analysis
Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 22 Apr 2026
https://github.com/ruchit0807/heart_disease_prediction
An interactive ML-powered web app that predicts the risk of heart disease based on clinical inputs like age, chest pain, cholesterol, ECG, and more. Built using Python, Streamlit, and scikit-learn, it offers early risk assessment in a simple and accessible way—just enter your health metrics and get instant feedback.
data-analysis data-science knn-regression pandas streamlit
Last synced: 04 May 2026
https://github.com/tejas-130704/whatsapp-analyser
ChatMate is a web app that analyzes WhatsApp chats, providing insightful visualizations like word clouds, heatmaps, and activity timelines. It calculates total messages, words, media, links, and more, helping you understand chat patterns for groups or individuals with ease. Simply upload your chat file and get detailed reports instantly!
data-analysis data-visualization python streamlit web-application whatsapp-analysis
Last synced: 26 Apr 2026
https://github.com/christos99/scraping-project
This project is a Python-based tool for web scraping with a user-friendly GUI. Built with PyQt5 and Selenium, it allows users to scrape online listings by specifying keywords, price ranges, and exclusions. Results are displayed in a table and can be exported to an Excel file.
automation data-analysis excel gui openpyxl pandas pyqt5 python selenium web-scraping
Last synced: 10 May 2026
https://github.com/27ahmad/foreign-direct-investment-analytics
This repository contains an exploratory data analysis (EDA) and visualization project on a dataset of Foreign Direct Investment (FDI) by companies. The objective is to analyze FDI trends and present key insights through an interactive Tableau dashboard.
data-analysis eda matplotlib pandas python seaborn tableau
Last synced: 29 Apr 2026
https://github.com/mdaffailhami/data_science_speedrun_journey
This repository contains notebooks and projects related to my data science speedrun journey.
algebra artificial-intelligence data-analysis data-analyst data-science data-scientist jupyter-notebook machine-learning math mathematics numpy pandas postgresql probability python statistics
Last synced: 05 Apr 2026
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 26 Apr 2026
https://github.com/gabrielmpinho/cs50-sql
Solutions and notes from CS50’s Introduction to Databases with SQL. Covers CRUD operations, data modeling, normalization, joins, views, indexes, and connecting SQL with Python and Java. Begins with SQLite for portability and introduces PostgreSQL and MySQL for scalability.
data-analysis data-structures data-visualization database databases javascript python sql
Last synced: 10 May 2026
https://github.com/ubeydgur/titanic-survival-predictor
Predicting whether a person will survive the Titanic crash.
ai artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 07 Jun 2026
https://github.com/rogernet/desafio-profissional-produto-data-driven
Ajudar a formar Analistas de Produto, PMs e Gestores de Negócio capazes de tomar decisões estratégicas baseadas em dados.
data-analysis data-science data-visualization product
Last synced: 23 Jun 2026
https://github.com/edikedik/lxtractor
Library for analysing protein structures and sequences
bioinfomatics computational-biology data-analysis data-mining feature-extraction python structural-biology
Last synced: 14 Feb 2026
https://github.com/spaghettifunk/gvb
Analysis of GVB in Amsterdam
data-analysis public-transportation
Last synced: 28 Feb 2026
https://github.com/mafesan/2021-tfm-code
Revelio: Machine-Learning classifier to identify Bots integrable with GrimoireLab
bot-accounts data-analysis data-analytics data-science grimoirelab machine-learning metrics open-source open-source-community project-health python scikit-learn
Last synced: 15 Apr 2026
https://github.com/abhik1711/material-classification-and-energy-band-prediction---excavate-25
A Two-Stage Machine Learning Pipeline: A Binary Classifier to identify insulators with high accuracy and a Stacking Regressor to predict precise band gap values for insulators by leveraging advanced feature engineering techniques and ensemble learning methods
data-analysis machine-learning python
Last synced: 23 Jun 2026
https://github.com/cdeweyx/game-of-thrones-s7e1-eda
Exploratory data analysis of scraped tweets related to Game of Thrones S7E1
data-analysis data-visualization python twitter-api
Last synced: 26 Apr 2026
https://github.com/airdac/sim-ames_housing
Prediction of house prices with linear regression in R. Team project from UPC's Master's Degree in Data Science
data-analysis data-science linear-regression r statistical-models upc
Last synced: 07 Jun 2026
https://github.com/garcane/british-airways-analysis
This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.
data data-analysis data-visualization tableau
Last synced: 19 Mar 2026
https://github.com/gab-182/market-analysis-report-for-national-clothing-chain
Using custom M and DAX codes in Power BI, I conducte a thorough market analysis for a national clothing chain. The insights gathered from customer data and US Census Bureau statistics led to the formulation of a targeted marketing strategy, contributing to enhanced sales and customer satisfaction.
Last synced: 19 Mar 2026
https://github.com/mystique85/altseason-ethereum-analysis
Altcoin season analysis relative to Ethereum – price comparisons, technical indicators, and historical market trends
altcoins bitcoin blockchain crypto data-analysis ethereum investing
Last synced: 04 May 2026
https://github.com/ozlerhakan/eda
Exploratory Data Analysis Samples
data data-analysis data-virtualization eda exploratory-data-analysis matplotlib plotly python seaborn
Last synced: 16 Apr 2026
https://github.com/victor-antoniassi/junior_data_analyst_test_01
Solution developed for a technical assessment that analyzed video game sales data to support gaming partnership decisions.
asses assessment-project data-analysis data-analysis-project data-analyst duckdb etl prefect python
Last synced: 01 Jun 2026
https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes
Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.
analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3
Last synced: 02 May 2026
https://github.com/stastnypremysl/lsql-csv
lsql-csv is a tool for small CSV file data querying from a shell with short queries. It makes it possible to work with small CSV files like with a read-only relational databases. The tool implements a new language LSQL similar to SQL, specifically designed for working with CSV files in a shell. LSQL aims to be a more lapidary language than SQL.
csv data-analysis data-processing haskell language linux-shell lsql lsql-csv new-language query-language relational-database sql unix-command unix-philosophy unix-shell
Last synced: 25 Feb 2026
https://github.com/faezeh-gholamrezaie/visual-google-scholar-search
A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.
academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud
Last synced: 25 Apr 2026
https://github.com/arielle0222/data_analysis
📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.
autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin
Last synced: 30 Apr 2026
https://github.com/ayu-hack/ayu-hack
Enthusiastic learner passionate about building software and exploring the world of technology. Eager to contribute to open-source projects and collaborate with the developer community. Continuously developing my skills in Python,SQL,HTML,CSS,PowerBI, MacOS. Always open to feedback and excited to keep growing!
config css data-analysis github-config html powerbi-desktop python3 sql
Last synced: 30 Apr 2026
https://github.com/uchida16104/healthanalysis
It abstracts the health status of each device from its operational time calculated from RescueTime and analyzes the data.
data-analysis portfolio portfolio-website security security-tool
Last synced: 02 Feb 2026
https://github.com/pratik-khose/data-analysis-with-pandasai
PandasAI with Llama3 for Interactive Data Analysis
data-analysis llama3 llma pandasai streamlit visualization
Last synced: 11 May 2026
https://github.com/noodleslove/house-of-representative-analysis-i
This project uses public data about the stock trades made by members of the US House of Representatives.
data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading
Last synced: 18 Apr 2026
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/santiagortiiz/snowflake-data-warehousing
Snowflake University. Snowflake Data Warehousing. Foundamentals
big-data data-analysis data-warehouse olap snowflake
Last synced: 19 Mar 2026
https://github.com/easycris-software/easycris
Professional statistical analysis and RNA-seq for researchers — no coding required
anova bioinformatics data-analysis desktop-app genomics pharmacology research-tools rna-seq statistics tauri
Last synced: 11 May 2026
https://github.com/busraozdemir0/data_analysis_apps
Data analysis and data visualization applications with python
data-analysis data-analysis-python data-visualization matplotlib matplotlib-figures matplotlib-pyplot numpy numpy-python pandas pandas-dataframe pandas-library
Last synced: 18 Apr 2026
https://github.com/cliffordnwanna/climate_data_analysis_and_visualization
The Climate Data Visualization and Analysis project showcases a comprehensive exploration of African climate data, with an emphasis on identifying patterns and trends in average temperatures across different countries and time periods. It serves as a practical demonstration of my data analysis, data visualization, and problem-solving skills.
data-analysis data-science mathplotlib pandas plotly visualization
Last synced: 04 May 2026
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 11 May 2026
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 04 May 2026
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In the current project I have extracted graphics card current prices from an authorizer retailer in India and performed analysis
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 04 Mar 2026
https://github.com/buchananja/dpyp
A convenience tool for small-scale data pipelines in Python
data data-analysis data-cleaning data-engineering data-pipeline data-preprocessing data-processing data-science pandas pipeline
Last synced: 18 Apr 2026
https://github.com/manishjanky/analyse-ford-gobike-dataset
Analyse ford go bike dataset
data-analysis ford-gobike udacity udacity-data-analyst-nanodegree udacity-nanodegree
Last synced: 15 Feb 2026
https://github.com/chiemekaifemegbulem/make.com
A curated portfolio of Make.com automation workflows engineered to streamline operations and ensure precision. Featuring solutions for e-commerce, data integration, marketing, and bespoke business processes, it exemplifies expertise in designing scalable, efficient, and dependable automated systems.
api automate automated automation business data-analysis data-science dataengineering integration integromat make scenario software-engineering upwork workflows
Last synced: 15 Feb 2026
https://github.com/jku-vds-lab/loops
Loops is a JupyterLab extension to support iterative and exploratory data analysis in computational notebooks.
data-analysis data-science data-visualization jupyter jupyter-notebook notebook provenance
Last synced: 29 Jan 2026
https://github.com/avitai/datarax
A Differentiable Data Pipeline Framework for JAX
autograd data data-analysis data-science differentiable flax-nnx jax jit machine-learning xla
Last synced: 25 Apr 2026
https://github.com/vasishth/lecturesintrobayes
Please go to the website for these online lectures:
bayesian-inference brms data-analysis stan
Last synced: 06 Feb 2026
https://github.com/prashver/dashboard-gallery
These dashboards provide insights across diverse domains, including cryptocurrency sales, workforce challenges, disease impact analysis, and retail trends. Leveraging tools like Power BI and Excel, they offer actionable insights for decision-making.
cryptocurrency dashboards data-analysis data-profession data-visualization market-segmentation-analysis microsoft-excel monkey-pox powerbi product-analysis retail-trends
Last synced: 15 Feb 2026
https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis
The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.
codebasics data-analysis data-science data-visualization mathematics python python3 statistics
Last synced: 30 Apr 2026
https://github.com/chouaib-629/customersegmentation
Hadoop-based Customer Segmentation project using the Online Retail Dataset. Implements MapReduce for processing and Python for preprocessing to uncover customer purchasing patterns for targeted marketing.
big-data customer-segmentation data-analysis data-science distributed-computing hadoop hadoop-mapreduce java mapreduce marketing-analytics python
Last synced: 04 May 2026
https://github.com/phillbertnevinemmanuel/movieindustryanalysis-correlation
This project is a comprehensive data analysis endeavor within the Movie Industry, spanning from Data Cleaning to Exploratory Data Analysis, Correlation Analysis, and Temporal Analysis. The dataset was sourced from Kaggle, purportedly scraped using the IMDb API. Python was the primary tool utilized for analysis.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/chandansoren/diabetics_prediction
Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.
data-analysis machine-learning python svm
Last synced: 06 Jun 2026
https://github.com/szapp/magistdataanalysisvisualization
Case study: Data analysis and visualization to evaluate and recommend a business partnership. Team project for data-driven business with SQL and Tableau
dashboard data-analysis data-science data-visualization data-viz sql tableau-public
Last synced: 19 Mar 2026
https://github.com/pradeepchegur/seamantic_web_design
We designed a semantic web for Instagram in Wix platform.
data-analysis framework instagram semantic-web website-design wix
Last synced: 19 Mar 2026
https://github.com/karthikudyawar/passwordometer
To predict the strength of the password
cybersecurity data-analysis data-visualization dataset docker exploratory-data-analysis-eda fastapi jupyter-notebook mongodb password-security password-strength-meter
Last synced: 30 Apr 2026
https://github.com/asifdotexe/quickvu
Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.
automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit
Last synced: 06 Jun 2026
https://github.com/jofaval/mlai-portfolio
My portfolio about Data Analysis, Machine Learning and A.I.
computer-science data-analysis data-science machine-learning portfolio python
Last synced: 25 Apr 2026
https://github.com/alcestide/scianalytics
Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.
csv data-analysis data-science data-visualization pandas plotly python science-research statistics
Last synced: 30 Apr 2026
https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice
A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.
climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather
Last synced: 02 May 2026
https://github.com/phammings/sales-management-analysis
Sales management analysis and Power BI dashboard for sample business request and user stories
data-analysis excel powerbi sql
Last synced: 01 Feb 2026
https://github.com/nurfakhri/e-commerce-data-analyst
E-commerce data analysis supported by data wrangling, EDA, and web dashboard
dashboard data-analysis e-commerce flask-application python
Last synced: 10 Feb 2026
https://github.com/steno-aarhus/legliv
Substitution of red meat with legumes and risk of primary liver cancer in UK Biobank participants: A prospective cohort study
cancer-research data-analysis epidemiology nutritional-epidemiology nutritional-science open-science reproducibility reproducible-research rstats ukbiobank
Last synced: 03 Mar 2026
https://github.com/kalebers/economic_analysis_data_science
Data Analysis Python project using economy data base to predict percentage of good and bad payers
data-analysis data-science machine-learning pandas python scipy sklearn-library
Last synced: 18 Apr 2026
https://github.com/mauriceling/sipy
Python-Based Statistical Graphical User Interface for Python
data-analysis julia julia-language jupyter jupyter-kernels pandas pandas-python python python3 r r-packages r-project r-stats scikit-learn scipy scipy-stats statistical-analysis statistical-tests statistics
Last synced: 15 Apr 2026
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/nafisalawalidris/hici-african-foods
HiCi African Foods: Excel dashboard & pivot table analysis of EU food rejection data to identify risks & recommend focus areas for market expansion.
data-analysis data-cleaning data-visualization eu-food-rejection excel-dashboard hici-african-foods market-expansion pivot-tables
Last synced: 19 Mar 2026
https://github.com/0xjeremy/me-18-final
Data collection and Analysis tools for IMUs
data-analysis imu raspberry-pi
Last synced: 03 May 2026
https://github.com/rudra-g-23/find-my-joint
A utility to find potential join keys (matching columns) across multiple DataFrames.
data-analysis data-visualization join network-graph pandas pandas-dataframe
Last synced: 24 Jun 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/mkk-1817/hr-attrition
This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.
data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python
Last synced: 03 May 2026
https://github.com/averma205/national-power-outages-severity-analysis
DSC 80 final project at UCSD
data-analysis data-science geospatial-data pandas predictive-modeling python sklearn
Last synced: 09 Feb 2026
https://github.com/affec-ds/dashboard-ventas-vinilos
Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.
business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records
Last synced: 30 Apr 2026
https://github.com/edisedis777/duckdb-analyzer
A powerful tool for analyzing large CSV datasets using DuckDB.
csv data-analysis database duckdb
Last synced: 16 Apr 2026
https://github.com/shivshah19/movie-recommendation-system
This Movie Recommendation System is designed to provide personalized movie recommendations based on user preferences.
cosine-similarity data-analysis machine-learning pandas python streamlit
Last synced: 03 May 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026