Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-20 00:07:30 UTC
https://github.com/arjo129/image-sorter
Sort through folders of videos and images. Root out blurred and overexposed images.
computational-photography data-analysis photo-browser photo-gallery photography uwp uwp-apps
Last synced: 25 Jul 2025
https://github.com/rawsashimi1604/jobextract
Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.
data data-analysis data-analytics data-cleaning linkedin mvc python webscraper
Last synced: 06 Nov 2025
https://github.com/thealphadollar/messiah
Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity
azure backend data data-analysis flask frontend materialize natural-disasters
Last synced: 19 Jan 2026
https://github.com/chaitanyac22/customer-segmentation-using-rfm-analysis-for-online-retail
This project uses RFM (Recency, Frequency, and Monetary) segmentation to analyze customer behavior and provide insights for targeted marketing campaigns. By classifying customers based on their purchasing patterns, strategies can be tailored to improve customer retention, drive growth, and maximize the lifetime value of each customer.
customer-segmentation data-analysis data-science data-visualization exploratory-data-analysis marketing numpy pandas python python3 rfm rfm-analysis rfm-segmentation
Last synced: 11 Jun 2025
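As a rough illustration of the RFM idea described in the entry above, the sketch below computes recency, frequency, and monetary values per customer using only the Python standard library. The order records, the `rfm_table` helper, and the reference date are all hypothetical and are not taken from the linked repository, which uses pandas and may compute or score these values differently.

```python
from datetime import date

# Hypothetical order records: (customer_id, order_date, amount).
orders = [
    ("alice", date(2024, 1, 5), 40.0),
    ("alice", date(2024, 3, 1), 60.0),
    ("bob", date(2023, 11, 20), 15.0),
    ("carol", date(2024, 2, 14), 200.0),
    ("carol", date(2024, 2, 28), 120.0),
    ("carol", date(2024, 3, 10), 80.0),
]


def rfm_table(orders, as_of):
    """Return {customer: (recency_days, frequency, monetary)}.

    recency_days: days between the customer's latest order and `as_of`
    frequency:    number of orders
    monetary:     total amount spent
    """
    table = {}
    for customer, day, amount in orders:
        # Start from this order's date with zero count and zero spend.
        last, count, total = table.get(customer, (day, 0, 0.0))
        table[customer] = (max(last, day), count + 1, total + amount)
    return {
        customer: ((as_of - last).days, count, total)
        for customer, (last, count, total) in table.items()
    }


# Assumed reference date for measuring recency.
rfm = rfm_table(orders, as_of=date(2024, 3, 31))
for customer, (recency, frequency, monetary) in sorted(rfm.items()):
    print(customer, recency, frequency, monetary)
```

A segmentation step would typically bin each of the three values into scores (for example quartiles) and group customers by the combined score; that part is left out here.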
https://github.com/aalkiyumi/senior-design-project
Web scraper for collecting product and review data from e-commerce sites using Scraping Bee, AWS, Selenium, and Pandas. Focuses on cost-effective solutions, user-friendly interfaces, and efficient data extraction and analysis.
aws cs5001 data-analysis data-extraction data-processing data-storage e-commerce-analytics e-commerce-data pandas product-reviews review-sentiment-analysis scraping-bee selenium senior-design-project uc uc2026 university-of-cincinnati web-crawlers web-scraping
Last synced: 18 Feb 2026
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 20 May 2026
https://github.com/emelyantsev/digital-twin-data-analysis
Notebooks for R&D tasks
data-analysis data-visualization
Last synced: 21 Mar 2025
https://github.com/CAIDA/submarine-cable-impact-analysis-public
This repository contains tools implemented for the PAM 2020 paper "Unintended consequences: Effects of submarine cable deployment on Internet routing" to collect and analyze data depicting the impact of the South-Atlantic Cable System (SACS) launch on Internet routing. This codebase can be extended to other use-cases of cable launches, failures, etc.
africa-americas africa-south-america bgp-data-analysis caida-ark-measurement-platform data-analysis historical-traceroutes impact internet-routing ripe-atlas-measurement-platform sacs-cable sail-cable submarine-cables
Last synced: 06 Apr 2025
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 14 May 2026
https://github.com/leandronasx/agro-data
Final project for the SoulCode Academy data analyst and dashboard training program.
bigquery data-analysis gcp looker pandas powerbi python
Last synced: 18 Jul 2025
https://github.com/archived-blueprints/postgresql-blueprints
Simplified blueprints for building data pipelines with PostgreSQL.
cli data-analysis data-engineering data-pipeline data-science database elt etl postgres postgresql
Last synced: 29 Jul 2025
https://github.com/billy-enrizky/dgf-analysis
DGF AI Analysis: from exploratory data analysis and handling missing data to predicting DGF with various machine learning models such as Logistic Regression, Support Vector Machine, Gradient Boosting, and Random Forest.
data-analysis data-science exploratory-data-analysis machine-learning support-vector-machine
Last synced: 04 Aug 2025
https://github.com/mwoss/mlflow-stock-market-example
Stock market prediction - machine learning pipeline using MLFlow.
anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial
Last synced: 21 Jul 2025
https://github.com/prajwalchapke055/cognizant-artificial-intelligence-job-simulation-forage
Advise one of Cognizant’s clients on a supply chain issue by applying knowledge of machine learning models.
artificial-intelligence cognizant communication data-analysis data-modeling data-visualization development evaluation forage job-simulation machine-learning machine-learning-algorithms machine-learning-engineering model-interpretation presentation problem-statement python quality-assurance skills virtual-internship
Last synced: 17 May 2026
https://github.com/BigBangData/TimesheetAnalysis
R shiny app to help analyze a bookkeeper's business - or anyone with a timesheet and some time.
bookkeeping data-analysis data-viz r-programming shiny-apps shiny-r timesheet-management
Last synced: 29 Jul 2025
https://github.com/fbecerra/fbecerra.github.io
Source code for my website www.fernandobecerra.com
data-analysis data-science data-visualization dataviz interactive-visualizations
Last synced: 20 Mar 2025
https://github.com/zrkhadija/data-analysis-for-financial-time-series
In this notebook, we performed data analysis on financial time series data from Yahoo Finance for the US market. We examined seasonality, trends, stationarity, and other aspects such as outliers and correlations.
autocorrelation correlation-analysis data-analysis financial-analysis time-series-analysis timeseries-forecasting visualization
Last synced: 09 Feb 2026
https://github.com/bkataru/physics-ia
Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the Hipparcos catalogue.
astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting
Last synced: 07 Jul 2025
https://github.com/anonympins/data-primals-engine
Manage and automate your data at scale 🚀 With data-primals-engine you get workflows, dashboards, alerts, i18n, client integration & AI assistant — all open-source, all MongoDB powered.
api automation data data-analysis data-engineer data-visualization database expressjs low-code mongodb nodejs rest-api
Last synced: 07 Mar 2026
https://github.com/mljar/mercury-notebook-apps
Amazing apps built from Python notebooks with Mercury
data-analysis data-science data-visualization jupyter jupyter-notebook jupyterlab mljar python
Last synced: 21 May 2026
https://github.com/casualcomputer/sql.mechanic
Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).
data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server
Last synced: 30 Jul 2025
https://github.com/phollemans/cwutils
CoastWatch Utilities software for working with satellite data files from NOAA CoastWatch and elsewhere
cdat coastwatch-utilities data-analysis data-visualization install4j java noaa-coastwatch remote-sensing satellite-imagery
Last synced: 02 May 2025
https://github.com/thecoderpinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 03 Apr 2025
https://github.com/tesfamichael12/solar-farm-analysis
This repository contains code and analysis for exploring solar farm data from Benin, Sierra Leone, and Togo. It includes EDA, strategic recommendations for optimal solar farm locations, and an interactive Streamlit dashboard.
data-analysis eda ml solar-farm-analysis
Last synced: 07 Aug 2025
https://github.com/deep-diver/data-analysis-on-titanic
Applying data analysis to the Titanic data sheet
Last synced: 30 Mar 2025
https://github.com/deep-diver/enron-data-analysis
Data Analysis and Machine Learning on Enron Data
data-analysis enron-data exploratory-data-analysis machine-learning
Last synced: 08 Jan 2026
https://github.com/prathmesh2507/india-superstore-powerbi-dashboard
Interactive India Superstore Sales Dashboard built using Power BI
business-intelligence dashboard data-analysis data-visualization powerbi
Last synced: 16 May 2026
https://github.com/reddyprasade/pandas-practice
Pandas
daat data-analysis data-science flexible labeling missing-data missing-values pandas pandas-profiling
Last synced: 18 May 2026
https://github.com/joanacmbarros/ardm-website
Website to support the R in Pharma 2023 workshop on the ARDM
analysis-results automation clinical-data data-analysis data-model r-in-pharma
Last synced: 03 Apr 2025
https://github.com/rafat-decodis/robust-asr-for-low-resource-languages
Exploring Benchmark Gaps and Real-World Speech Generalization for Low-Resource Languages
artificial-intelligence automatic-speech-recognition data-analysis dataprocessing whisper
Last synced: 23 Jun 2025
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 09 Apr 2026
https://github.com/javadtorabikh/businessintelligencesystem
This API provides advanced business analytics capabilities for company data, offering 10 core analytical functions to transform raw business data into actionable insights. The system is built with Python and Flask, designed for reliability, scalability, and performance.
analysis business-analytics business-intelligence data-analysis mashine-learning
Last synced: 22 Sep 2025
https://github.com/vvipjain/ecommerce-sales-analysis
Ecommerce Sales Analysis
data-analysis pandas pandas-dataframe python sql sqlalchemy
Last synced: 16 Apr 2026
https://github.com/thecoderpinar/worldpopulationanalysis2024
World Population Analysis 2024: An In-Depth Exploration of Urban and Rural Populations and Infrastructure Accessibility
data-analysis data-science economic-indicators machine-learning population-growth prophet-forecasting
Last synced: 03 Apr 2025
https://github.com/felixcharotte/ibm_datascience_capstone
In this project, we predicted if the SpaceX Falcon 9 first stage will land successfully by following the data science methodology. We also summarized the results for the business stakeholders.
analysis data-analysis data-science data-visualization databases folium jupyter-notebook machine-learning machine-learning-alrgorithms matplotlib pandas plotly plotly-dash python scikit-learn scipy seaborn sql
Last synced: 26 Jul 2025
https://github.com/marios-mamalis/mca-visualisation
A script for automatic visualisation of Multiple Correspondence Analysis (MCA) results from FactoMineR in 3 dimensions using Plotly (exported as html)
3d-scatterplots correspondence-analysis data-analysis factominer html mca multiple-correspondence-analysis plotly visualisation
Last synced: 02 Mar 2025
https://github.com/1994nikunj/nlp-toolkit-desktop-app
The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator
Last synced: 18 Jul 2025
https://github.com/openpmd/openpmd-ccd
A Python Module & LabView Bindings for Storing CCD Images with openPMD
ccd data-analysis database hdf5 open-data open-science openpmd
Last synced: 16 May 2026
https://github.com/justsecret123/twitter-sentiment-analysis
A sentiment analysis model trained with Kaggle GPU on 1.6M examples, used to make inferences on 220k tweets about Messi and draw insights from their results.
classification data-analysis data-science deep-learning deep-neural-networks docker glove-embeddings kaggle lstm lstm-neural-networks machine-learning natural-language-processing nlp python rnn scikit-learn sentiment-analysis sentiment-classification tensorflow word-embeddings
Last synced: 03 Jan 2026
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/chaganti-reddy/case-study-market-segmentation
Market Segmentation Case Study Analysis using Clustering
case-study data-analysis machine-learning market-segmentation plotting
Last synced: 23 Jun 2025
https://github.com/frikishaan/browsing-history-analysis
This is a data analysis of my browsing history for the last 7 months.
browsing-history data-analysis jupyter-notebook python
Last synced: 18 May 2026
https://github.com/pangeo-data/foss4g-2022
Pangeo tutorial at FOSS4G 2022
data-analysis hvplot pangeo time-series xarray
Last synced: 12 Apr 2025
https://github.com/tameronline/tameronline
Showcasing Projects on Data Analysis, Programming, and AI — Developed Using Python and Modern Frameworks
data-analysis deep-learning flask machine-learning numpy pandas python3 sql web-development
Last synced: 11 Jun 2025
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/qytz/finchan
An event process framework with Python3.
data-analysis data-science dispatch-events event-driven python3
Last synced: 27 Mar 2026
https://github.com/juicedata/juicefs-deeplearning-tutorials
Deep Learning and Data Analytics Techniques with the help of JuiceFS.
data-analysis deep-learning filesystem juicefs machine-learning
Last synced: 07 Jul 2025
https://github.com/ocramz/record-encode
Generic encoding of record types
categorical-data categorical-features data-analysis data-mining data-science generic-programming machine-learning one-hot-encode preprocessing
Last synced: 14 Apr 2025
https://github.com/prithivsakthiur/data-board
Data Boards - Visualization of various plots (Analysis)
data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces
Last synced: 25 Feb 2026
https://github.com/johnsesana/eda-video-game-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization excel
Last synced: 07 Mar 2026
https://github.com/wfamous/fiv_update-data
This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Services (AWS), Terraform (Tofu), Python, Bash, Ansible, and GitHub Actions to manage data pipelines efficiently.
ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu
Last synced: 17 Feb 2026
https://github.com/mohd-faizy/07p_tumor-diagnosis-exploratory-data-analysis-on-breast-cancer-wisconsin-dataset
Tumor Diagnosis: Exploratory Data Analysis With Seaborn
data-analysis data-visualization eda exploratory-data-analysis knn-classification pca-analysis python random-forest random-forest-classifier statistics support-vector-machines tumor-detection visualization
Last synced: 17 May 2026
https://github.com/techytushar/india-odi-analysis
Analysis of ODI cricket matches of Indian Team
cricket data-analysis data-science pandas plotting python3
Last synced: 05 May 2026
https://github.com/sn2606/global-temperature-time-series
Time series analysis is performed on the Berkeley Earth Surface Temperature dataset.
arima arima-forecasting arima-model climate-change data-analysis data-visualization forecasting-model global-temperature series-analysis singular-spectrum-analysis time-series time-series-analysis time-series-forecasting
Last synced: 21 May 2026
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 18 Sep 2025
https://github.com/muzammil-13/data_analysis-inmakes
A data-driven project that leverages machine learning to predict Bitcoin price trends. Using historical Bitcoin data, this analysis provides 30-day price forecasts through advanced statistical modeling.
data-analysis data-science learning-by-doing machine-learning numpy pandas python python-library task
Last synced: 19 Feb 2026
https://github.com/cego669/datathonengopevi
Team: Embrapeiros. Proposed solution for the Datathon of the VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Note: WE WERE THE CHAMPIONS!
data-analysis data-science datathon python r streamlit xgboost-classifier
Last synced: 18 Feb 2026
https://github.com/mindful-ai-assistants/movierevenueanalysis
🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.
correlation-analysis data-analysis data-science heatmap jupyter-notebook oneness-consciousness open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest
Last synced: 14 Apr 2025
https://github.com/chrdek/linqdatacalc
📈 🎲 Linq based data statistics set of extensions.
calculations calculator data-analysis data-analytics data-science data-statictics extension-methods extensions linq linq-extensions set-theory statistical-analysis statistics
Last synced: 27 Jun 2025
https://github.com/justin-pyne/dota-liquipedia-web-scraper
Scraping information off Liquipedia from DOTA leagues with BeautifulSoup/Pandas for statistical analysis/EDA.
bs4 csv data-analysis pandas python scraper
Last synced: 13 Jul 2025
https://github.com/himanshu231204/featurementor-ai
🧠 AI-powered Feature Engineering Mentor for ML students. Upload any CSV → get smart preprocessing recommendations with Google Gemini explanations. Learn WHY, not just HOW. Built with Streamlit + Python. ⭐ Star if useful!
data-analysis data-preprocessing data-science feature-engineering generative-ai pdf-report streamlit
Last synced: 04 Apr 2026
https://github.com/i4ds/ecallisto_ng
Ecallisto NG is a Python package tailored for interacting with Ecallisto data.
data-analysis data-visualization e-callisto ecallisto-international-network numpy pandas python spectrometer
Last synced: 13 Oct 2025
https://github.com/prathmesh2507/fifa-worldcup-powerbi-dashboard
Interactive FIFA World Cup Dashboard built using Power BI
business-intelligence dashboard data-analysis data-visualization fifa powerbi
Last synced: 16 May 2026
https://github.com/quantumudit/uk-student-accommodation-analysis
This project focuses on scraping student properties related data from the UK Student Accommodation website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 27 Apr 2026
https://github.com/benweare/em_scripts
Electron microscopy scripts not associated with any specific publications.
contrast-transfer-function ctf data-analysis digital-micrograph electron-microscopy jupyter-notebook optics python tomography tomography-data-collection transmission-electron-microscopy zemlin-tableau zernike-polynomials
Last synced: 07 Apr 2026
https://github.com/zachlagden/spotify-listening-analyzer
A comprehensive Python tool for analyzing your Spotify listening history data.
analytics data-analysis pandas python spotify-web-api spotipy
Last synced: 31 Jul 2025
https://github.com/souvik09-tech/walmart_sales_dataanalysis
This end-to-end data analysis project leverages Python for processing and SQL for advanced querying to extract key business insights from Walmart sales data. It's designed for data analysts to enhance skills in data manipulation, querying, and pipeline creation.
data-analysis end-to-end etl-pipeline jupyter-notebook mysql mysql-database pandas python
Last synced: 17 Feb 2026
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/bebofekry/i_care_graduation_project
Graduation Project
ai artificial-intelligence chatbot cnn computer-vision data-analysis data-science deep-learning ecg graduation-project healthcare machine-learning medical medical-imaging natural-language-processing neural-networks nlp pattern-recognition predictive-modeling python
Last synced: 01 May 2026
https://github.com/kaguya163/ankara_coffee_sales_analysis
"Coffee shop sales analysis in Ankara. SQL, Tableau, Python, Data Analytics"
data-analysis mysql python sql tableau
Last synced: 17 Feb 2026
https://github.com/hongbo-wei/global-status-of-cc-security-certification
Data visualization of CC Security Certification using VUE, Django, and MySQL.
big-date common-criteria data-analysis data-visualisation data-visualization
Last synced: 07 Jul 2025
https://github.com/lachlanharrisdev/project-eidolon
// A modular OSINT pipeline framework that makes information gathering feel like cheating — because it almost is.
cybersecurity data-analysis docker enterprise infosec modular osint python
Last synced: 01 Mar 2026
https://github.com/beckversync/probability-and-statis_computer-parts-cpus-and-gpus-ics_
Probability and statistical analysis techniques are employed to explore data related to computer components, such as CPUs, GPUs, and Integrated Circuits (ICs). The objective is to uncover trends, identify patterns, and extract meaningful insights from real-world hardware data.
Last synced: 18 Feb 2026
https://github.com/quantumudit/demographic-data-analysis
This project analyzes correlations among three key metrics across 195 countries: birth rate, internet users, and income group.
data-analysis jupyter-notebook power-bi python
Last synced: 15 May 2026
https://github.com/emptymalei/mini-lab
Some code snippets used to explain stuff to myself in my personal data science wiki
data-analysis data-mining data-science data-visualization datascience
Last synced: 07 Apr 2025
https://github.com/stimulsoft/samples-dashboards.js-for-html
JavaScript samples for Dashboards.JS data visualization tool for HTML and native JavaScript applications
analytics automation components dashboard-application dashboard-designer dashboard-viewer data-analysis embedded html5 indicators javascript js json-database native-javascript onepage panels pivot-tables simple-dashboard transformation website
Last synced: 20 Oct 2025
https://github.com/arzan101/ev--car-data-analysis
This Power BI dashboard provides an interactive and data-driven overview of the electric vehicle (EV) landscape. It visualizes key insights across various dimensions including sales trends, model performance, manufacturer comparisons, and market growth. The purpose of the dashboard is to enable stakeholders to explore and analyze development
data-analysis data-science data-visualization database datacleaning excel powerbi
Last synced: 17 Jun 2025
https://github.com/milind220/hk-air-quality-analysis
My final project for a statistics and data analysis course. Whew that was a lot of graphs!
data-analysis jupyter-notebook numpy pandas python python3 scipy seaborn statistics
Last synced: 12 Apr 2026
https://github.com/gabysbrain/purescript-dataframe
A data structure for row-based data and queries
Last synced: 19 Feb 2026
https://github.com/leonism/customer-predictive-analysis
Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.
data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling
Last synced: 28 Mar 2025
https://github.com/patex1987/temperature-calibration
Notebook for sensor calibration evaluation
calibration data-analysis jupyter-notebook sensor
Last synced: 20 Jun 2025
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/avrtt/paysage
Pandas add-on library: find data quality issues and clean/improve dataframes in one line using scikit-learn transformer
data-analysis data-cleaning data-compression data-profiling data-quality data-quality-checks data-reporting pandas pandas-dataframe schema-validation scikit-learn scikit-learn-transformer
Last synced: 14 May 2026
https://github.com/shibam120302/black-friday-sales-data-analysis
This repository contains data analysis of Black Friday sales data using various regression ML algorithms
data-analysis eda machine-learning python random-forest regression
Last synced: 20 May 2026
https://github.com/inphyt/inphyt.github.io
Special repository hosting the InPhyT website.
computational-epidemiology computational-modelling computational-neuroscience computational-social-science computational-socialscience computer-science data-analysis data-mining machine-learning mathematical-modelling mathematics modeling network-analysis physics scientific-computing scientific-machine-learning statistical-modeling statistical-physics
Last synced: 02 Feb 2026
https://github.com/depressioncenter/data-and-design-core
Code developed by the EFDC Data and Design Core team to support mental health research.
data-analysis data-science efdc inference r statistical-analysis umich
Last synced: 19 May 2026
https://github.com/aloth/power-bi-book-resources
Official resources for "Teach Yourself VISUALLY Power BI" by Alexander Loth (Wiley). Get all Power BI project files (.pbix) and datasets to follow along with the visual, step-by-step exercises in the book.
analytics bi business-analytics business-intelligence dashboards data-analysis data-cleaning data-modeling datavisualization dax etl microsoft microsoft-power-bi power-bi-desktop power-platform powerbi powerquery reporting sql visualization
Last synced: 19 Feb 2026
https://github.com/jethronap/asylumdataku_website
Mini website for reporting analysis of Asylum Data @ DIKU
Last synced: 13 Feb 2026
https://github.com/ezzz-lui/rsm-evaluationproject
This repository documents our project for RSM, completed as the final activity of the Data Analyst bootcamp
Last synced: 13 Feb 2026
https://github.com/abeltavares/nps_performance_analysis
Analyzing and Monitoring Net Promoter Score (NPS) Performance for Healthcare Companies using SQL and Power BI
customer-satisfaction dashboard data-analysis data-visualization healthcare net-promoter-score nps-analysis performance-monitoring power-bi sql
Last synced: 19 Mar 2026
https://github.com/sivkri/imagecoloranalysis
ImageColorAnalysis is a repository with a Python script for color analysis in images using ImageMagick. It generates bash scripts for individual JPG images to analyze specific colors. It provides a flexible solution for extracting color information from images, applicable in various domains such as image classification and data analysis.
bash-scripts color-analysis computer-vision data-analysis image-classification image-processing imagemagick pavement pavement-images python-scripting stomata stomatal-index
Last synced: 13 Feb 2026
https://github.com/iraikov/chicken-dataframe
Tabular data structure for data analysis in Scheme
chicken-scheme chicken-scheme-eggs data-analysis dataframe linear-regression scheme scheme-programming-language
Last synced: 13 Feb 2026
https://github.com/w-edward/youtube-keyword-popularity-analyzer
An effort to discover the top trending keywords on YouTube.
data-analysis node-js numpy python webscraping youtube-api
Last synced: 15 Apr 2026
https://github.com/dcs-training/2024-11-18-cdcs-carpentry-social-sciences
This repo contains the material produced for a course run by the Centre in November 2024
data-analysis data-visualisation data-wrangling intro-to-programming r
Last synced: 14 Feb 2026
https://github.com/freepicheep/nu-salesforce
A nushell module to interact with Salesforce data through the Salesforce REST API.
data-analysis nu nushell salesforce salesforce-api scripting shell
Last synced: 03 Mar 2026
https://github.com/karlyndiary/global-electronics-retailer-sales-and-customer-insights
Developed an analysis using Python, SQL, and Excel to examine sales and customer demographics for a Global Electronics Retailer. The findings aim to enhance business strategies and improve overall performance.
dashboard data-analysis data-cleaning-and-preprocessing data-pipeline data-visualization etl microsoft-excel microsoft-sql-server python sql
Last synced: 14 Feb 2026
https://github.com/eerkela/bertrand
flexible type extensions for pandas
conversions data-analysis data-engineering data-science multiple-dispatch numpy pandas type-checking type-inference types
Last synced: 27 Mar 2026
https://github.com/mylethidiem/zero-to-hero
Project for learning, practicing code: Python, SQL, C/C++, Data science/Data Analysis, AI/Machine learning
ai cpp data-analysis data-science deep-learning machine-learning mlops python sql
Last synced: 02 Mar 2026
https://github.com/GeiserX/secciones-nacionalidades
Foreign Insight - WebApp providing insights about nationalities in Spain (Source: Instituto Nacional de Estadística)
census-data dashboard data-analysis data-visualization demographics geospatial government-data immigration ine nationalities open-data population r self-hosted shiny shinydashboard spain spanish statistics webapp
Last synced: 08 Apr 2026