Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/mtimma001/clinical-trial-data-tool-v2
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 14 Apr 2026
https://github.com/jimohola/zomato-restaurant-ratings-ml
Flask Deployment Machine Learning
css data-analysis flask html machine-learning python3
Last synced: 04 May 2026
https://github.com/tomkyle/binning
Determine optimal number of bins 𝒌 for histogram creation and optimal bin width 𝒉 using various statistical methods.
binning data-analysis distributions doanes-rule freedman-diaconis histogram histogram-binning math php-math rice-rule scotts-rule square-root statistics sturges-rule terrell-scotts-rule
Last synced: 21 Oct 2025
https://github.com/dcs-training/spatial_dynamics
Use of QGIS and R to analyse first and second order geospatial effects. Go to the Readme file
data-analysis geographical-data gis qgis r statistics
Last synced: 23 Oct 2025
https://github.com/gunifiri/duckdb-ghw
🦆 Accelerate analytics with DuckDB's integration for GitHub workflows, enabling efficient data handling and processing directly within your repositories.
analytics analytics-engine big-data columnar-storage data-analysis data-science database duckdb in-memory-database open-source parquet python query-planner r sql
Last synced: 29 Apr 2026
https://github.com/browndwarf/contracosta
Wavelength dependent starspot contrast with Kepler/K2 and TESS
Last synced: 23 Jan 2026
https://github.com/avinashkr-ai/weather-analysis-backend
Weather analysis, visualization & Data science
data-analysis data-science data-visualisation django-rest-framework jyputer-notebook prediction python
Last synced: 24 Oct 2025
https://github.com/mohamed-khaled0/covid-data-exploration.sql
Covid-19 data
covid19-data data-analysis datacleaning microsoft-sql-server sql
Last synced: 06 Feb 2026
https://github.com/nishumehta/coffee-beans-sales-analysis
An in-depth analysis of coffee bean sales using an interactive Excel dashboard, which highlights trends and customer insights
dashboard data-analysis data-visualization excel
Last synced: 28 Jan 2026
https://github.com/shrutiijoshi/apple_greenhouse_gas_emissions
A breakdown of Apple's greenhouse gas emissions from 2015 to 2022 as they aim to reach net zero emissions by 2030.
dashboard data-analysis data-visualization powerbi
Last synced: 06 Feb 2026
https://github.com/aakk23/professional-survey-powerbi
This Power BI dashboard analyzes survey data from data professionals, highlighting salary trends, job roles, and career satisfaction. It provides insights into work-life balance, programming language preferences, and industry demographics.
data-analysis data-visualization dax excel powerbi powerquery
Last synced: 23 Feb 2026
https://github.com/limatix/limatix
Limatix datacollect and processtrak tools
data-analysis python scientific-workflows
Last synced: 23 Jan 2026
https://github.com/OneMoreDavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 28 Oct 2025
https://github.com/garcane/unicorn-companies-analysis
Tracking unicorn startups (valued at $1B+) provides valuable insights for investors and analysts to identify high-growth industries and emerging trends.
data-analysis exploratory-data-analysis financial-analysis investor postgresql sql
Last synced: 24 Jan 2026
https://github.com/rahulchouhan1/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, including ETL Processes, data modeling, and analytics.
data-analysis data-cleaning data-engineering data-science data-warehouse datascience etl etl-pipeline sql sql-query sql-server
Last synced: 24 Jan 2026
https://github.com/snigdho8869/numerical-data-analysis-projects
Exploring numerical data analysis with credit card churn, fraud detection, health predictions and more.
adaboost cnn data-analysis deep-learning dnn ensemble-learning exploratory-data-analysis gradient-boosting-classifier keras logistic-regression machine-learning ml numeric numerical-analysis pandas python3 random-forest scikit-learn support-vector-machines tensorflow
Last synced: 15 Apr 2026
https://github.com/yash1882/music-store-data-analysis
A project focuses on analyzing music store data using SQL ♬
begineer-friendly data-analysis music music-store-data music-store-data-analysis sql-project
Last synced: 28 Jan 2026
https://github.com/angchekar28/sales-report-power-bi
A Power BI sales report analyzing country-wise and product-wise sales trends. Includes dashboards, decomposition trees, and key influencers analysis for business insights.
dashboard data-analysis data-cleaning data-visualization powerbi sales-report
Last synced: 16 Mar 2026
https://github.com/wareflowx/excel-toolkit
A powerful command-line toolkit for Excel and CSV data manipulation, analysis, and transformation.
data-analysis data-wrangling excel pandas python uv
Last synced: 29 Jan 2026
https://github.com/mattdelaune/powerbi_healthcare_dashboard
Interactive Hospital Insights Dashboard built with Power BI, showcasing comprehensive analysis of patient demographics, treatment outcomes, and hospital performance.
data-analysis healthcare power-bi visualization
Last synced: 29 Jan 2026
https://github.com/isaqueiros/newspapersoldout-predictions-logistic_regression
This notebook is a study of the application of sklearn Logistic Regression model and analysis of metric quality with a focus on the impact of imbalanced data. The problem presented is the analysis of sales of newspapers of a local stand in order to classify the probability of the newspaper being Sold Out or Not, given a set of features.
data-analysis data-imbalance data-science logistic-regression machine-learning python sklearn-library sklearn-logistic-regression
Last synced: 18 Apr 2026
https://github.com/mfakhriazhar/us-companies-revenue-dashboard
This project is a data visualization dashboard built using Power BI that highlights lists of the largest companies in the United States by revenue. The goal is to provide an interactive overview of company performance across industries, focusing on revenue, employee metrics, and industry trends.
dashboard data-analysis data-visualization largest-companies-us powerbi revenue united-states
Last synced: 30 Jan 2026
https://github.com/gurpreet17/uc-davis-sql-for-data-science-specialization
Completed the SQL Basics for Data Science Specialization from the University of California, Davis, gaining proficiency in Data Analysis, SQL, Apache Spark, and Delta Lake.
apache-spark bigdata data-analysis data-science delta-lake sqlite
Last synced: 15 Apr 2026
https://github.com/luminati-io/indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 07 Feb 2026
https://github.com/jujulis18/olympicsmedalsdashboard
Olympic Dashboard – Paris 2024 est un tableau de bord interactif permettant d’explorer les performances des athlètes médaillés des Jeux Olympiques d’été de Paris 2024.
dashboard data-analysis data-visualization eda olympic python streamlit
Last synced: 31 Jan 2026
https://github.com/cca/panopto-session-data
analyzing Panopto session data for retention purposes
data-analysis ipython-notebook video
Last synced: 07 Feb 2026
https://github.com/allanotieno254/bank-loan-analysis-dashboard-power-bi
An interactive Power BI dashboard that analyzes bank loan data to provide insights into approval trends, default risks, and customer profiles. Designed to assist financial institutions in making data-driven lending decisions.
bank-loans business-intelligence dashboard data-analysis financial-analysis power-bi risk-assessment
Last synced: 31 Jan 2026
https://github.com/emediongfrancis/unified-data-lake-implementation-gcp-kafka-airflow-snowflake
This project demonstrates the integration of data from multiple sources into a unified data lake. The project showcases the use of Apache Airflow for ETL tasks, Google Cloud Storage as a data lake, Apache Kafka for data movement automation, Snowflake for data warehousing, and Google BigQuery for analysis.
airflow data-analysis data-warehousing etl etl-pipeline gcp-storage kafka snowflake value variety
Last synced: 07 Feb 2026
https://github.com/farzeen-2001/hr_analytics_dashboard_powerbi
HR data analytics using Power BI
data-analysis data-visualization datacleaning hr powerbi
Last synced: 25 Feb 2026
https://github.com/tr41z/machine-learning
machine learning models
ai artificial-intelligence data-analysis data-preprocessing google-colab jupyter-notebook machine-learning models python tensorflow
Last synced: 01 Feb 2026
https://github.com/keneandita/exploratory-data-analysis-eda-
Explore EDA on 5 datasets: Titanic 🚢, Heart Disease ❤️, Wine Quality 🍷, Car Price 🚗, and NBA Players 🏀. Includes data cleaning, preprocessing, and visualizations to uncover insights. Perfect for beginners to learn data analysis with Pandas, Matplotlib, and Seaborn! 🎨📈
data-analysis data-visualization eda matplotlib pandas python seaborn sklearn
Last synced: 15 Apr 2026
https://github.com/devbigboy/excel-power-query-get-transform
Power Query is a feature in Excel that allows you to quickly import data from multiple sources and easily clean, transform, and reshape it to suit your needs.
data-analysis data-science excel
Last synced: 08 Feb 2026
https://github.com/shibbir24/customer-sales-analysis-dashboard-using-tableau
Customer Sales Analysis Dashboard Using Tableau
dashboard data-analysis data-visualization sales-analysis tableau
Last synced: 08 Feb 2026
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 08 Feb 2026
https://github.com/shubham200137/spotify-listening-habits-analytics
Spotify Listening Habits Analytics is a project aimed at analyzing personalized Spotify listening habits and music trends. It involves Exploratory Data Analysis (EDA) with Python Pandas, data processing using SQL Server, and creating visualizations with Power BI. The goal is to uncover insights into listening patterns, track popularity, and artist.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas power-bi-dashboard sqlserver
Last synced: 18 Mar 2026
https://github.com/naninsv/apple-retail-sales-warranty-analysis
An advanced SQL project analyzing over 1 million rows of Apple retail sales data to solve real-world business problems, optimize query performance, and extract actionable insights. The analysis includes sales trends, warranty claims, product performance, and year-over-year growth
business-intelligence data-analysis data-science etl insights retailanalytics sql sqladvance
Last synced: 26 Feb 2026
https://github.com/rajeev2806/netflix-data-analysis
In this project i have implemented ETL . I used netflix dataset to clean and analyze using postgresql and python
data-analysis data-cleaning postgresql python
Last synced: 15 Apr 2026
https://github.com/haroontrailblazer/machine_learning
About This Repository A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.
data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics
Last synced: 16 Apr 2026
https://github.com/shrutiijoshi/crm-sales-analysis
The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.
data-analysis data-visualization dax-functions powerbi powerquery
Last synced: 11 Feb 2026
https://github.com/dhruwsunita/car-sales-dashboard
Car sales dashboard using Tableau visualization tool.
car-sales data-analysis data-visualization excel kpis tableau
Last synced: 27 Feb 2026
https://github.com/thanaraklee/exploring-and-analyzing-data-in-oracle-database
This project focuses on data analysis using SQL with Oracle Database 21c. It aims to familiarize with data management and data analysis using SQL commands and Oracle Database 21c.
data-analysis oracle-database sql sql-developer
Last synced: 12 Feb 2026
https://github.com/nabilshadman/power-bi-essential-training
Exercise files for Power BI Essential Training (2024): datasets and dashboards for hands-on learning
dashboard data-analysis data-science data-visualization power-bi power-bi-dashboard
Last synced: 12 Feb 2026
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 16 Apr 2026
https://github.com/ryan-wong1/nyc-job-postings-data-analysis
City of New York Current Job Postings 2024
data-analysis data-cleaning exploratory-data-analysis sql
Last synced: 13 Feb 2026
https://github.com/kariemseiam/geoegy
An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.
accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development
Last synced: 13 Feb 2026
https://github.com/mananabbasi/dashboard-power-bi
This repository showcases **Power BI projects** focused on data visualization and business intelligence. Each project transforms raw data into interactive dashboards and reports, providing actionable insights for decision-making. The repository includes Power BI files, datasets, and documentation for each project.
data-analysis data-science data-visualization powerbi
Last synced: 13 Feb 2026
https://github.com/muyangli76/covidsql
Global Covid Data analyzed in SQL and visualized in Tableau
data-analysis data-visualization sql tableau
Last synced: 14 Feb 2026
https://github.com/guermoud98/data-analysis-with-python-projects
data-analysis matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/prekshivyas/cis-595-big-data-analytics
Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.
data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping
Last synced: 16 Feb 2026
https://github.com/k-bloch/car-theft-analysis
A dashboard created to inform the public about car theft, providing insights extracted from real-world police stats.
data-analysis maven-analytics tableau
Last synced: 19 Mar 2026
https://github.com/arunesh-tiwari/sales-analysis
Tableau Data Analysis Project.
data-analysis data-visualization tableau
Last synced: 01 Mar 2026
https://github.com/rachit901109/simppl_task
Social Media Analytics Dashboard
dashboard-application data-analysis data-visualization network-graphs social-network-analysis
Last synced: 16 Apr 2026
https://github.com/abeltavares/hotel_performance_analysis
A Power BI project that analyzes the performance of a hotel, including revenue, expenses, customer data, hospitality metrics and financial ratios.
business-intelligence data-analysis expenses financial-analysis hospitality-industry power-bi revenue
Last synced: 02 Mar 2026
https://github.com/dmatking/dtlab
Date Time Lab
csv data-analysis data-quality datetime python timezone
Last synced: 02 Jun 2026
https://github.com/paladitya/cn_term_project
Code for testbed
automated-testing data-analysis reliability-score tcl testbed wrapper-library
Last synced: 02 Mar 2026
https://github.com/elrf3lipes/ramon-s_portfolio
I'm passionate about Cloud and DevOps, and for the moment I'm posting some of my work and personal projects here to showcase that. If its useful for you, feel free to integrate or contribute!
api-integration biopython clinical-trials data-analysis data-extraction data-parsing django docker entrez ipython medline-xml pandas pubmed-parser requests rest-api
Last synced: 27 Mar 2026
https://github.com/anas436/student-performance-analysis
In this project I have constructed a Machine Learning System which will analyis students performance with about their academic records. Note that, this project will work with any students recods which you want to provide.
data-analysis jupyter-notebook matplotlib numpy pandas python3 seaborn
Last synced: 16 Apr 2026
https://github.com/dvaser/world-happiness-expanatory-data-analysis
DATA ANALYSIS
data-analysis data-visualization dataset jupyter jupyter-notebook kaggle python
Last synced: 03 Mar 2026
https://github.com/lintangwisesa/ujian_analyticsvisualization_jcds07
Panduan Soal Ujian Data Analytics & Visualization Job Connector Data Science batch 7
data-analysis data-science data-visualisation exam
Last synced: 04 Mar 2026
https://github.com/dpb24/netflix-global-top-10-performance
Using Machine Learning to predict Netflix Global Top 10 viewership trends (Python & R)
data-analysis data-science data-visualization decision-tree-regression gradient-boosting-regressor machine-learning media netflix predictive-analytics predictive-modeling python r random-forest random-forest-regression regression-models sklearn streaming-video xgboost-regression
Last synced: 16 Apr 2026
https://github.com/santiago-giordano/ahora12project
Excel, SQL and Python processing from excel files
data-analysis excel jupyter-notebook microsoft-sql-server pandas sql sqlalchemy sqlserver
Last synced: 16 Apr 2026
https://github.com/akash-srm/user-engagement-analysis
Analyzed user engagement and feedback data to derive actionable insights for an online learning platform.
analytics-projects data-analysis data-cleaning eda jupyter-notebook pandas python seaborn student-engagement
Last synced: 16 Apr 2026
https://github.com/danpoynor/omdb-api-data-analysis
Gathers data for Oscar-winning movies using their IMDB ids, saves the information to a CSV file, and answers a few data analysis questions about the movies using JupyterLab.
analytics csv data-analysis jupyter-notebook matplotlib omdb-api pandas-dataframe python-dotenv python3 seaborn-plots
Last synced: 16 Apr 2026
https://github.com/yasumorishima/yasumorishima
Manufacturing Engineer & Data Analyst. 17 years exp in MFG. Python, VBA, Automation Specialist. (盛島康徳 / Yasunori Morishima)
automation data-analysis manufacturing portfolio python vba
Last synced: 05 Mar 2026
https://github.com/e1washere/weather-spark-pipeline
Scalable pipeline using Apache Spark to process and analyze weather data.
apache-spark batch-processing big-data data-analysis data-engineering data-pipeline data-processing etl python spark-sql weather-data
Last synced: 17 Apr 2026
https://github.com/vaishnavis03/finlatics_ml_program
This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.
correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn
Last synced: 17 Apr 2026
https://github.com/nathadriele/ifood-data-governance-pipeline
Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.
airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit
Last synced: 02 Apr 2026
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 17 Apr 2026
https://github.com/eliasdehondt/learn-r
Welcome to the Learn-R repository! This is your go-to resource for learning the R programming language, whether you're a beginner or looking to enhance your skills.
data-analysis data-visualization education machine-learning programming r statistics tutorials
Last synced: 03 Apr 2026
https://github.com/atlassandx90/cryptocurrency-volatility-prediction
Cryptocurrency volatility prediction ML pipeline
cryptocurrency data-analysis data-science data-visualization machine-learning
Last synced: 17 Apr 2026
https://github.com/dharininadkar/movies-data-dashboard
Data Analysis of Movies data
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server tableau
Last synced: 04 Apr 2026
https://github.com/sharmas1ddharth/data-analysis-with-python
Freecodecamp's Data Analysis with Python Projects Code
data-analysis data-analysis-with-python freecodecamp-project
Last synced: 03 Jun 2026
https://github.com/victoorv/prediction_covid19
Prédire si un invidu est positif au COVID19 ou non.
classification covid-19-classifier covid-19-data-analysis covid19-data data-analysis data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/victoorv/criminalite_us
Une analyse de la criminalité en fonction de variables socio-économiques a été menée, incluant la sélection et la comparaison de modèles de régression multiple ainsi que des tests d'hypothèses sur les coefficients et la significativité des modèles.
data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/phanchenh/businessdashboard_powerbiproject_pizzadataset
Pizza Business Performance Analysis and Growth Strategies 2015
business-analytics business-intelligence dashboard data-analysis data-visualization insights powerbi python
Last synced: 17 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/pawlo77/airline-performance-data-analysis
Preprocessing of structured data - part of IAD study program, Faculty of Mathematics and Information Science, Warsaw University of Technology
data-analysis data-science visualization
Last synced: 10 May 2026
https://github.com/danpoynor/data-analysis-of-video-game-sales-2000-2015
This analysis reviews sales for the top 100 video games from the years 2000-2015 to gather insights. Within the notebook I use Python’s Pandas, Matplotlib, and Seaborn libraries to interact with the data and create graphs.
data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots video-game-sales
Last synced: 18 Apr 2026
https://github.com/satti-hari-krishna-reddy/data-whisperer
Data Whisperer is an AI-driven tool that automates exploratory data analysis (EDA), generates actionable insights, and enables natural language querying of datasets. it combines the power of AI (Google Gemini) with interactive visualizations and professional reporting.
ai data-analysis data-visualization llm python3 streamlit
Last synced: 18 Apr 2026
https://github.com/jordanconallluthaiswright/purchase-behaviour-data-analysis
This project analyzes Black Friday purchase behavior for Company XYZ, uncovering trends by gender, age, and location. Using data cleaning, statistical analysis, and visualization, it evaluates spending patterns, confidence intervals, and category preferences to provide actionable insights for optimizing marketing strategies and targeting.
business-analytics data-analysis jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/robinmillford/sales-metrics-dashboard-streamlit
This Streamlit dashboard provides an interactive and comprehensive analysis of customer behavior, regional sales trends, and revenue insights. The dashboard enables businesses to identify key performance metrics, customer segments, and revenue drivers, supporting data-driven decision-making.
dashboard data-analysis data-visualization duckdb sales-analysis sales-dashboard streamlit-dashboard
Last synced: 19 Apr 2026
https://github.com/yuvrajsaraogi/unemployment-analysis-with-python
Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.
big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql
Last synced: 19 Apr 2026
https://github.com/kheriberto/linear_regression_ecommerce
Simple project showcasing crafting a linear regression model with SciKit Learn
data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn
Last synced: 19 Apr 2026
https://github.com/vyjayanthipolapragada/data_analytics_medical_appointments
Analyzing the data set which consists of medical appointments to draw insights about patient's no-show scenarios
data-analysis data-analytics data-cleaning data-visualization data-wrangling jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 19 Apr 2026
https://github.com/diegoglezsu/bulletin-fetcher
bulletin-py is python package to easily fetch bulletins and legal acts from a wide variety of sources of Eurpean Union.
data-analysis european-union legal-documents python sparql
Last synced: 19 Apr 2026
https://github.com/edwinrlambert/exploring-airbnb-market-trends
Dive into NYC's Airbnb market trends through detailed analysis of listings data, including prices, types, and review dates. This is a DataCamp project.
airbnb data-analysis jupyter-notebook market-trends python
Last synced: 19 Apr 2026
https://github.com/namratha2301/carprice_analysisandprediction
This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.
data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn
Last synced: 20 Apr 2026
https://github.com/misaghmomenib/soccer-match-analysis
This Project Predicts Football Match Outcomes (Home Win, Away Win, or Draw) Using Historical Match Data. It Involves Data Preprocessing, Exploratory Analysis, and Training a Random Forest Model to Predict Results Based on Features Like Shots, Possession, and Passes.
data-analysis git open-source python
Last synced: 20 Apr 2026
https://github.com/sarthakmishraa/bike_rental_predictor
Bike Sharing Dataset : This dataset contains the hourly and daily count of rental bikes between years 2011 and 2012 in Capital bikeshare system with the corresponding weather and seasonal information.
data-analysis machine-learning python xgboost
Last synced: 20 Apr 2026
https://github.com/ak-pydev/python_practice
Documenting my learning journey from python -> ML -> DL -> LLM/GenAI -> Agents exercises solved daily from Udemy/Kaggle/YouTube.
data-analysis data-science feature-engineering llms machine-learning mlflow mlops-workflow modeling python3 streamlit uvicorn
Last synced: 20 Apr 2026
https://github.com/salfaris/toy-data-analysis
Random toy data projects. For my portfolio data projects, see linked website
Last synced: 20 Apr 2026
https://github.com/danpoynor/pet-shelter-data-analysis-notebook
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data
Last synced: 21 Apr 2026
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL server, ETL processes, data modeling and analyitics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/rahulpatel0615/sales-analysis-project
Sales Data Analysis Dashboard with Python, Pandas, and Matplotlib. Features 12+ visualizations and comprehensive insights.
data data-analysis data-visualization matplotlib pandas portfolio python
Last synced: 21 Apr 2026
https://github.com/maddieemihle/home_sales
A PySpark-powered analysis of real estate trends using home sales data. This project explores average prices by year, room configuration, and property features, while demonstrating SparkSQL, caching, and partitioning techniques in a scalable data pipeline—all within Google Colab
apache-spark caching data-analysis googlecolab parquet pyspark sparksql
Last synced: 21 Apr 2026