Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
https://github.com/as16082023/motor-vehicle-thefts
Using SQL to analyze vehicle theft patterns across New Zealand, focusing on trends related to specific times and locations.
Last synced: 10 Apr 2025
https://github.com/xre22zax/airline-analysis
You run a travel agency and need to know the ins and outs of airline prices for your clients
data-analysis data-visualization python python3 visualization
Last synced: 13 Apr 2026
https://github.com/rainbowatcher/simple
Make data work easier, saving your working time
Last synced: 10 Apr 2025
https://github.com/scailfin/rob-client
Command line user interface for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 14 Jan 2026
https://github.com/mainak-97/weather-data-analysis-using-python
A comprehensive analysis of time-series weather data using Python and Pandas, focusing on data exploration, cleaning, and uncovering insights.
data-analysis jupyter-notebook pandas pandas-dataframe python python3 time-series-analysis
Last synced: 08 May 2026
https://github.com/chen0040/spark-tabular-analytics
Spark statistical inference framework for performing column pair-wise data analytics on large data tables
anova chi-square-test confidence-intervals data-analysis hypothesis-testing spark statistical-inference tabular-data
Last synced: 07 Jul 2025
https://github.com/masamallow/jupyterlab-my-local
Configuration to run my personal JupyterLab on my local machine.
data-analysis jupyter jupyter-notebook jupyterlab
Last synced: 26 Mar 2025
https://github.com/deliprofesor/behavioral-insights-and-data-exploration
This project analyzes Spanish speech data, focusing on acoustic features and demographics. It includes data cleaning, outlier detection, clustering, and time series modeling (ARIMA, Holt-Winters) to uncover patterns in speech duration and word frequency.
acoustic-features arima clustering data-analysis holt-winters k-means machine-learning speech-analysis time-series-analysis
Last synced: 10 Apr 2025
https://github.com/sabelomkhwanzi/data-alchemist-boot-camp
Built on Covalent's Unified API, Increment has the full historical data set for 40+ chains including every smart contract, event, transaction, address, etc. With access to all this data you can find:
covalent data-analysis increment
Last synced: 11 Mar 2026
https://github.com/seblehner/feldprakt
Collection of plotting routines for field exercise work using different measurement tools and Hobo weather stations.
data-analysis data-visualization jupyter-notebook python
Last synced: 05 Oct 2025
https://github.com/harkishen/Agriculture-DS
An agriculture-based M.Tech data science project that predicts crop growth based on previous years' records.
Last synced: 11 Dec 2025
https://github.com/joyceannie/sql-data-with-danny-case-studies
Case study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
8-week-sql-challenge 8weeksqlchallenge case-study data-analysis data-analytics postgresql sql
Last synced: 05 Oct 2025
https://github.com/manishkaa/google_data_analytics_capstone_case_study
This case study is a part of Google Data Analytics Capstone Project
bigquery data-analysis sql tableau
Last synced: 05 Oct 2025
https://github.com/subhamghimire/dataanavis
Learning Data analysis and visualization
data-analysis data-science data-visualization dataset
Last synced: 06 Oct 2025
https://github.com/davifeliciano/modern_physics_experiments
Collection of data analysis and visualization scripts developed in Python around some modern physics experiments
data-analysis data-visualization modern-physics physics physics-experiments
Last synced: 18 Jan 2026
https://github.com/myles/notebooks
Some of my random Jupyter Notebooks.
data-analysis data-science jupyter-notebooks
Last synced: 18 Jan 2026
https://github.com/surbhi242singh/pizza_sales_project
Used SQL to analyze pizza sales data
data-analysis mysql pizza-sales sql
Last synced: 07 Oct 2025
https://github.com/thejvdev/ml-from-scratch
Repository for Implementing ML Models from Scratch in Python
classification data-analysis data-mining data-science deep-learning jupyter-notebook machine-learning matplotlib neural-networks numpy pandas prediction python regression seaborn sklearn visualization
Last synced: 02 Apr 2026
https://github.com/roydevashish/algo8.ai-data-manipulation-assignment
This assignment performs transaction-level sales data analysis and generates reports using Pandas / SQL / Spark inside a containerized environment. The dataset contains sales transaction records and is used to analyze SKUs, customers, and sales representative performance.
data-analysis duckdb python3 sql uv
Last synced: 15 May 2026
https://github.com/gabboraron/biostatisztika_es_alkalmazasai
"Statistics is the branch of mathematics whose task is to give politicians a tool with which any claim, and its opposite, can be proven on a scientific basis"
biostatistics data-analysis data-visualization r statistics statistics-course
Last synced: 24 Oct 2025
https://github.com/rusiru-erandaka/pupil-dilation-signal-classification-pipeline-with-noise-filtering-feature-extraction
This repository works with a pupil diameter time-series dataset. It covers data sampling, blink detection and noise handling, stimulus onset alignment and ensemble averaging, baseline correction, feature extraction, and finally a patient classification ML pipeline.
anomaly-detection classification-pipeline data-analysis data-preprocessing data-science time-series
Last synced: 08 Oct 2025
https://github.com/dcs-training/machinelearning
Introduction to Machine Learning with Python, delivered by the centre in the year 2022-23. See the README file.
data-analysis data-wrangling machine-learning python statistics
Last synced: 08 Oct 2025
https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset
This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.
data-analysis data-visualization excel project
Last synced: 18 Jan 2026
https://github.com/pranavsp108/market_basket_analysis-instacart
Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.
customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/alexquilis1/spanish-fuel-stations-analysis
Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons
data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse
Last synced: 08 Oct 2025
https://github.com/sorebit/pdrpy-pd-2
Data analysis of various stackexchange.com archives.
data-analysis stackexchange time-travel university-project
Last synced: 08 Oct 2025
https://github.com/porimol/employee-turnover-prediction
Employee turnover prediction using machine learning
data-analysis data-mining data-science data-visualization datascience machine-learning prediction predictive-modeling
Last synced: 23 Feb 2026
https://github.com/takshshah-16/pizza_sales_sql
SQL-powered pizza sales analytics project using MySQL Workbench to derive business insights through data exploration and queries.
business-intelligence data-analysis database-management mysql sql
Last synced: 09 Oct 2025
https://github.com/debopriya2320/classification-of-biological-databases
A project for categorizing and analyzing various biological databases based on their type and content.
bioinformatics biological-databases computational-biology data-analysis data-classification databases genomics proteomics research-tools systems-biology
Last synced: 09 Oct 2025
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/samuelsoaress/wkd-default-reduction
Reduction of defaults from 35% to 25% or less using machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 10 Oct 2025
https://github.com/brooks-code/toulouse-biblio-chronicle
Snapshot of Toulouse public library customer habits — cleaning raw, messy datasets of musical, cinematic, and literary checkouts; includes data-cleaning steps, analysis notebook revealing cultural tastes in the Pink City.
data-analysis data-cleaning data-cleaning-and-preprocessing data-quality exploratory-data-analysis jupyter-notebook library-data misaligned-data mojibake tutorial
Last synced: 10 Oct 2025
https://github.com/sharvesh1401/battsense
BattSense is a machine learning project focused on predicting the State of Health (SOH) of lithium-ion batteries using operational parameters such as voltage, current, temperature, and capacity. The model enables accurate, data-driven diagnostics for battery performance monitoring in electric vehicles and portable devices.
battery-diagnostics battery-health battery-health-prediction battery-soh data-analysis electric-vehicles energy-storage machine-learning predictive-maintenance python regression scikit-learn
Last synced: 07 May 2026
https://github.com/suhailsallam/tips_dashboard
Dashboard using Python & Streamlit
dashboard data-analysis data-analytics data-science data-scientist data-visualization python streamlit streamlit-dashboard streamlit-webapp
Last synced: 21 Jan 2026
https://github.com/imrandil/excel_learning_dir
Excel learning practice with some sample data
Last synced: 27 Jan 2026
https://github.com/priyanshubiswas-tech/deloitte-daikibo-telemetry-analysis-task-1
Tableau dashboard analyzing Daikibo telemetry data. Tracks downtime by factory/device with interactive filters. Deloitte task solution with JSON processing.
data-analysis data-visualization deloitte json tableau tableau-public
Last synced: 11 Oct 2025
https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python
Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.
data-analysis data-visualization numpy pandas python3 sns visualization
Last synced: 03 May 2026
https://github.com/mouadtaoussi/capmpingi-employee-reviews
Analysis of Capmpingi employee reviews using Python/Pandas and Power BI
data-analysis data-science kaggle pandas powerbi python python3
Last synced: 14 Apr 2026
https://github.com/dhruvil-26/tableau-projects
This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, team, and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns.
analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public
Last synced: 19 Jan 2026
https://github.com/dzakwanalifi/stadata-x
Terminal UI for interactively exploring and downloading BPS Indonesia data
bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui
Last synced: 20 Jan 2026
https://github.com/NurFakhri/scraping-and-analysis-skincare
Scraping and data analysis of Indonesian skincare reviews.
beutifulsoup data-analysis data-scraping python requests review scraping-websites
Last synced: 12 Oct 2025
https://github.com/tzerk/esr
R package 'ESR' for plotting and analysing ESR spectra in dating applications
data-analysis data-visualization electron-spin-resonance geochronology r
Last synced: 13 Mar 2026
https://github.com/leosimoes/digitalinnovationone-analise-covid
Hands-on project "Creating models with Python and Machine Learning to predict the evolution of COVID-19 in Brazil" from Digital Innovation One.
arima-models data-analysis data-science python time-series
Last synced: 09 May 2026
https://github.com/louisfernando1204/websocket-benchmark
A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.
benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws
Last synced: 09 Apr 2026
https://github.com/jsimell/sleepanalysis
A Python data analysis project examining the factors that affect sleep quality and the temporal patterns in the sleep data of a single subject.
data-analysis matplotlib numpy pandas python scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/angelalim88/jakarta-air-quality-index-classification
This project classifies Jakarta's Air Quality Index (AQI) from 2010 to 2023 using machine learning models (Random Forest, MLP, SVM) based on pollutant concentrations.
data-analysis data-visua machine-learning scikit-learn tensorflow
Last synced: 13 Oct 2025
https://github.com/gmalbert/rugby
Rugby Data Analysis and Sports Betting
data-analysis rugby sports-betting
Last synced: 31 May 2026
https://github.com/inddrsingh/restaurant_orders_mysql
Complex SQL queries on restaurant data for better and precise insights
Last synced: 28 Jan 2026
https://github.com/ayorick23/python-data-science-cheat-sheet
A quick, practical guide to essential Python syntax, commands, and functions for data science. Perfect for remembering how to use the most common libraries, such as NumPy, Pandas, Matplotlib, and Scikit-learn, in your day-to-day analyses.
cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow
Last synced: 07 Apr 2026
https://github.com/asuquoaa/air_bnb_analysis_dashboard-tableau-
Interactive Tableau dashboards to analyze and visualize data, providing actionable insights for better decision-making
dashboard data-analysis interactive-visualization tableau
Last synced: 13 Mar 2026
https://github.com/supernyv/data_science_projects
Personal Data Science Projects
data-analysis data-science data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Oct 2025
https://github.com/saisurajmatta/healthcare-data-analytics
Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.
data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery
Last synced: 22 Jan 2026
https://github.com/rohanrony19/movie-recommendation-system
A Python project that uses the Pandas library to compute correlations and give the best movie recommendations.
data-analysis deep-learning knn-algorithm numpy pandas python recommendation-system
Last synced: 14 Apr 2026
https://github.com/pedrosfaria2/analisetitulosnetflix
Study of the popularity of Netflix movies on IMDB.
analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 14 Apr 2026
https://github.com/zeynepcol/Data-Analysis-Visualization
Data Analysis
data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit
Last synced: 15 Oct 2025
https://github.com/hase3b/flask-dash-interactive-dashboard
An interactive data visualization dashboard created using Flask and Dash. This project includes comprehensive data preparation, exploratory data analysis (EDA), and dynamic visualizations with Seaborn and Plotly. Explore the multi-page Dash app with features like dropdowns and callbacks for updated plots.
callbacks dash dashboard data-analysis data-visualization dropdown eda flask interactive plotly seaborn web-app
Last synced: 19 May 2026
https://github.com/bhaveshbhakta/fish-weight-prediction-using-ml
Fish Weight Prediction
data-analysis data-visualization fish-weight-prediction gradient-boosting machine-learning
Last synced: 16 Oct 2025
https://github.com/supertetelman/coursera-exdata-09
This repo contains several R scripts that were used to analyze, plot, and clean data from various datasets. These projects were part of the Coursera course, Exploratory Data Analysis. The end results of the analysis are included.
big-data course coursera data-analysis r
Last synced: 16 Oct 2025
https://github.com/prakshi-23/itvedant-database-management-system-analysis-using-sql
Data Analysis using SQL
Last synced: 22 Jan 2026
https://github.com/casassg/ms_thesis
Social Media Analysis for Crisis Informatics in the Cloud
casassg-thesis data-analysis google-cloud kubernetes
Last synced: 19 Oct 2025
https://github.com/abishek0103/olist-ecommerce-sql-project
SQL Project using Olist Dataset – E-commerce analysis with MS SQL Server to extract business insights.
business-insights data-analysis sql-server
Last synced: 19 Oct 2025
https://github.com/lucashomuniz/Project-03
Data-Driven Decision Making: Selecting the Best Regression Model for E-commerce Sales
benchmark-framework data-analysis data-driven data-visualization e-commerce-project language-python lasso-regression linear-regression-models machine-learning python ridge-regression
Last synced: 20 Oct 2025
https://github.com/jimohola/zomato-restaurant-ratings-ml
Flask Deployment Machine Learning
css data-analysis flask html machine-learning python3
Last synced: 04 May 2026
https://github.com/zeshanfareed/graduation_admission_prediction_ml_django_project
The predict file is the Django Framework code file
data-analysis data-visualization datasets-csv django-framework machine-learning machinelearning-python python
Last synced: 23 Jan 2026
https://github.com/omr5221/bi-scripts
Example of DI BI tool scripting
automation configuration-files data-analysis data-warehousing dimensional-analysis diver etl-pipeline lookup modeling perl-script python shell-script sql summarization
Last synced: 15 Mar 2026
https://github.com/jigyasag18/bird-strikes-in-aviation-project
This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.
bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database
Last synced: 09 May 2026
https://github.com/gunifiri/duckdb-ghw
🦆 Accelerate analytics with DuckDB's integration for GitHub workflows, enabling efficient data handling and processing directly within your repositories.
analytics analytics-engine big-data columnar-storage data-analysis data-science database duckdb in-memory-database open-source parquet python query-planner r sql
Last synced: 29 Apr 2026
https://github.com/albertobarrago/sentinel
A contribution to the research of Corrado Malanga and Filippo Biondi
Last synced: 24 Oct 2025
https://github.com/sumitkundu102022/air-quality-report
Air Quality Report using PowerBI
data data-analysis data-visualization powerbi
Last synced: 23 Jan 2026
https://github.com/brianlesko/r_data_science_stat5730
Written by Brian Lesko, this repository contains R scripts demonstrating data science topics, largely originating from study at Ohio State. Contents are written in RStudio as R Markdown files. As of 1/21/23, future projects concerning data science, statistics, and machine learning will be in Python in my machine learning repository.
data data-analysis flight-data ggplot2 olympics-data r-markdown tidyverse
Last synced: 23 Jan 2026
https://github.com/alessandroryo/bike-rental-data-analysis
A data analysis project focused on understanding and predicting bike rental patterns. This project utilizes data processing, visualization, and predictive modeling techniques to gain insights into bike rental usage, fulfilling the final submission requirement for Dicoding Indonesia's Data Analysis course.
bike-rental data-analysis data-visualization jupyter-notebook machine-learning python streamlit
Last synced: 09 Apr 2026
https://github.com/janiavdv/data-spirits
Analysis of alcohol and sports betting data, including a correlation investigation.
correlation data-analysis data-science machine-learning
Last synced: 11 Nov 2025
https://github.com/sehgal-vishal/sql-nyc-collision-analysis
This analysis is based on collisions (accidents) that happened in New York City. I used SQL Server for EDA (exploratory data analysis).
data-analysis database eda sql-server
Last synced: 06 Feb 2026
https://github.com/shrutiijoshi/apple_greenhouse_gas_emissions
A breakdown of Apple's greenhouse gas emissions from 2015 to 2022 as they aim to reach net zero emissions by 2030.
dashboard data-analysis data-visualization powerbi
Last synced: 06 Feb 2026
https://github.com/ljadhav25/linear_regression_data_science
Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable. The variable you are using to predict the other variable's value is called the independent variable.
data-analysis data-science linear-regression machine-learning
Last synced: 26 Oct 2025
https://github.com/srinibas-masanta/infosys-springboard-internship
An interactive Power BI dashboard developed during my Infosys Springboard Internship to visualize Indian election trends. It integrates historical and live API data to analyze vote shares, turnout patterns, and demographic insights across constituencies, helping news agencies report results in real time.
dashboard data-analysis data-cleaning data-collection data-visualization dax-functions powerbi
Last synced: 25 Feb 2026
https://github.com/campagnucci/exercitando_pandas
Practical pandas exercises using open education data from São Paulo
data-analysis data-science education-data exercises pandas-tutorial
Last synced: 28 Jan 2026
https://github.com/alunera-data/alunera-data
Hi, I’m Yvonne – building data solutions at the intersection of BI, SQL & Service Management
business-intelligence data-analysis data-engineering data-science github-profile portfolio rstats sql
Last synced: 28 Jan 2026
https://github.com/valentinoli/swiss-foodprint
Project in Applied Data Analysis, EPFL 2019
carbon-emissions data-analysis diet foodprint swiss switzerland
Last synced: 24 Jan 2026
https://github.com/aneeshmurali-n/global-superstore-sales-dashboard---power-bi-stunning-dark-theme
This Power BI dashboard provides a comprehensive view of sales data, enabling users to analyze sales trends, identify top-performing regions, and gain insights into customer behavior.
dark-theme dashboard data-analysis data-science data-visualization powerbi salesdashboard
Last synced: 28 Jan 2026
https://github.com/mysto-007/cyclistic-bike-share-analysis
Analyzed the dataset of the Cyclistic bike-share rental service as the capstone project for the Google Data Analytics Specialization
bigquery data-analysis excel ms-sql-server sql tableau tableau-public
Last synced: 16 Mar 2026
https://github.com/annnieglez/fraud-detection-eda
Fraud Detection - Exploratory Data Analysis (EDA). Analyzing financial transactions to detect fraud patterns using Python and Tableau. Libraries: Pandas, Seaborn and Matplotlib. Key Focus: Data cleaning, fraud trends, high-risk transactions, time-based patterns
data-analysis data-science data-visualization eda fraud-detection fraud-prevention matplotlib seaborn
Last synced: 28 Jan 2026
https://github.com/anurag-ghosh-12/library_management_system_sql
This project showcases the development of a comprehensive Library Management System utilizing Structured Query Language (SQL). It demonstrates a practical application of relational database principles to efficiently manage library resources, member information, and borrowing/returning transactions.
data-analysis data-visualisation dbms-project sql
Last synced: 29 Jan 2026
https://github.com/smahala02/magnetism-lab
This repository contains Python scripts and data for analyzing inductance in toroidal coils to calculate the magnetic permeability of ferrite materials. The project helps classify materials as soft or hard magnets based on experimental data.
data-analysis inductance jupyter-notebook magnetism python toroids
Last synced: 29 Jan 2026
https://github.com/shrutiijoshi/marketing-campaign-report
The dataset includes information on campaign types, recipient segments, interactions (clicks, opens, bounces, etc.), and conversion metrics.
dashboard data-analysis data-visualization tableau-public
Last synced: 25 Feb 2026
https://github.com/surajwate/datalab
DataLab is a versatile toolkit designed to simplify data exploration, analysis, and visualization for data scientists.
data-analysis data-science python visualization
Last synced: 30 Jan 2026
https://github.com/aygp-dr/values-compass
Tools for exploring and analyzing Anthropic's Values-in-the-Wild dataset for AI ethics research
ai-ethics anthropic-claude data-analysis nlp values
Last synced: 25 Feb 2026
https://github.com/jcaperella29/jc_bioinformatics_hub
A personal hub to showcase my bioinformatics applications including RNA-Seq, ATAC-Seq, and miRNA-Seq analysis tools. Powered by simple HTML, CSS, and JavaScript with a biotech-themed design.
atac-seq bioinformatics biotech data-analysis github-pages portal rna-seq webapp
Last synced: 25 Feb 2026
https://github.com/a-iceberg/whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops monitoring mssqlserver openai prompt-engineering python resource-management timestamps uvicorn-gunicorn whisper
Last synced: 31 Jan 2026
https://github.com/luminati-io/indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 07 Feb 2026
https://github.com/jofaval/titanic-disaster
Data Analysis of the famous Titanic Disaster in 1912 with Machine Learning
classification data-analysis data-science data-visualization google-colab kaggle machine-learning python scikit-learn
Last synced: 15 Apr 2026
https://github.com/amishidesai04/flipkart-mobile-sales-analysis
Flipkart Mobile Sales Analysis is a Tableau project that visualizes mobile sales data from Flipkart. It highlights trends in brand performance, pricing, ratings, and customer preferences. The interactive dashboard helps users explore key insights for data-driven decisions in e-commerce and retail.
dashboard data-analysis data-visualization storyboard tableau
Last synced: 31 Jan 2026
https://github.com/malthejorgensen/repx
Python regular expression file transformer
command-line-tool data-analysis text-processing
Last synced: 31 Jan 2026
https://github.com/ajmannust41288/data-analyst
Data analyst and Microsoft Professional expert: Power BI Desktop, Tableau, and dashboards, with uses of ChatGPT-4 AI
business-analytics data-analysis data-analyst data-analytics eda
Last synced: 01 Feb 2026
https://github.com/emediongfrancis/unified-data-lake-implementation-gcp-kafka-airflow-snowflake
This project demonstrates the integration of data from multiple sources into a unified data lake. The project showcases the use of Apache Airflow for ETL tasks, Google Cloud Storage as a data lake, Apache Kafka for data movement automation, Snowflake for data warehousing, and Google BigQuery for analysis.
airflow data-analysis data-warehousing etl etl-pipeline gcp-storage kafka snowflake value variety
Last synced: 07 Feb 2026
https://github.com/rohitdusane/healthcare-analytics
This Power BI dashboard is designed to provide valuable insights into patient waiting times across outpatient and inpatient healthcare services. It offers a comprehensive analysis of key factors influencing wait lists, including Age Profile, Specialty, Time Bands, and Patient Case Types.
data-analysis data-visualization dax dax-query healthcare-analysis powerbi-report
Last synced: 01 Feb 2026