Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-27 00:07:21 UTC
- JSON Representation
https://github.com/rudra-g-23/power-bi-custom-visual
A custom Power BI visual that displays a customizable, interactive charts with advanced capabilities.
custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization
Last synced: 02 Jan 2026
https://github.com/tknishh/investing-platform
An investing platform application to help users get information and analyze various foreign currency assets. The investing platform uses an ETL pipeline to insert new batches of Forex data once a day.
data-analysis investing-platform pipeline
Last synced: 18 Mar 2025
https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset
Engineering / Culture / Blogs Data gathered for Educational and Learning purposes from Zomato's Blogs and spreading the better problem solving Methodologies adapted by Modern Unicorns
data-analysis dataset regex selenium webdriver zomato-data-analysis
Last synced: 06 Apr 2025
https://github.com/brevex/hotel-booking-demand-data-analysis
Data analysis in Python of demand for urban hotels and resorts showing their causes and relationships
data-analysis data-science hotel-booking-analysis kaggle python
Last synced: 08 May 2026
https://github.com/tushar2704/hiring-process-analytics
In this project, I am analyzing hiring process data to gain insights from about records of previous hires within a multinational company. By analyzing this data, I am aiming to uncover valuable trends and information about the company's hiring process, which can contribute to making informed decisions and improvements for the future.
data-analysis data-cleaning data-science data-wrangling excel tushar2704
Last synced: 25 Jan 2026
https://github.com/tushar2704/employee-distribution
This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.
data-analysis data-visualization excel postgresql powerbi sql tushar2704
Last synced: 04 Nov 2025
https://github.com/tushar2704/consumables_sales_dashboard
Welcome to the Consumable Sales Dashboard, a powerful and intuitive data visualization tool built using Power BI. This dashboard offers a comprehensive view of sales data for consumable products, allowing you to quickly and easily analyze performance and identify trends.
dashboard data-analysis data-analytics data-science excel postgresql powerbi streamlit-tushar2704 tushar2704
Last synced: 04 Nov 2025
https://github.com/lucalullo/italian-justice-workload
Multidimensional analysis of the Italian justice system workload (2003–2024). A study of civil and criminal proceedings using judicial pressure and litigation indicators.
data-analysis italy judicial-workload justice-system kaggle legal-analytics pandas python time-series
Last synced: 24 May 2026
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/maazie-khan/power-bi-projects
Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.
dashboard data-analysis data-science data-visualization database excel powerbi
Last synced: 02 Jan 2026
https://github.com/zborovskaanna/grosery_store_sales_analysis
Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau
data-analysis data-visualization matplotlib numpy pandas python seaborn tableau
Last synced: 08 Apr 2026
https://github.com/sramalhao/sleep_health_analysis
This repository contains a comprehensive project focused on analyzing various factors influencing sleep health, such as BMI, occupation, gender, age, physical activity, and stress levels.
analytics data-analysis eda matplotlib pandas python seaborn sklearn visualization
Last synced: 13 Apr 2026
https://github.com/nehul1149/olympic-data-analysis
This project is an interactive data visualization and analytics platform for exploring historical Olympic Games data. Built with Python and Streamlit, it offers an in-depth analysis of medal tallies, athlete statistics, and country-wise performance trends, providing users with powerful insights into the world's biggest sporting event.
analysis data-analysis data-science data-visualization matplotlib python streamlit
Last synced: 18 May 2026
https://github.com/thecoderpinar/globalwarmingforecast
🌍 Global Warming Forecast Tool An advanced tool for analyzing and forecasting climate trends using ARIMA and Prophet models, with interactive visualizations and scenario simulations.
arima climate-change data-analysis environmental-science forecasting global-warming machine-learning prophet streamlit time-series-analysis visualization
Last synced: 27 Mar 2025
https://github.com/mae776569/tmdb-data-analysis
Investigating tmdb movies dataset
conclusions data-analysis data-wrangling exploratory-data-analysis
Last synced: 03 Nov 2025
https://github.com/omari-kd/data-analytics
Welcome to my Data Analytics Portfolio, which includes structured projects in both Data Science and Data Analysis, implemented in R and Python.
data-analysis data-analytics data-science machine-learning
Last synced: 12 Aug 2025
https://github.com/jprmaulion/cholera-gedeo-ethiopia-spatial-analysis
Exploratory spatial analysis and visualization of cholera case clusters in Gedeo Zone, Ethiopia that integrates demographic and geographic data to identify environmental risk patterns and inform public health interventions. Includes geospatial mapping of cholera incidence relative to waterways and administrative boundaries.
cholera data-analysis data-analysis-python epidemiology ethiopia openstreetmap python spatial-analysis
Last synced: 12 Apr 2026
https://github.com/bjornmelin/minneanalytics
MinneAnalytics project work.
competitive-programming data-analysis data-visualization r
Last synced: 09 Jul 2025
https://github.com/sondosaabed/data-visualization-in-tableau
data-analysis data-visualization nanodegree plot tableau udacity
Last synced: 08 Sep 2025
https://github.com/darshan1924/house-price-pridiction
This repository contains a machine learning project for predicting house prices based on various features, including geographical coordinates. The project includes data preprocessing steps to handle# House Price Prediction Project
data-analysis data-preprocessing house-prices jupyter-notebook machine-learning prediction
Last synced: 27 Mar 2025
https://github.com/mosalem149/pythonutilities
A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.
data-analysis educational-tools file-io file-manipulation grade-calculation python text-analysis text-processing utility word-counting
Last synced: 15 May 2026
https://github.com/spshah1701/world-development-indicators
Analysis of World Development Indicators (WDI) using big data technologies, specifically Databricks, Apache Spark, and Scala.
apache-spark big-data data-analysis spark-sql
Last synced: 17 Mar 2025
https://github.com/borjamome/explorando-madrid
Exploring Madrid: A Data-driven Analysis with R 🐻🌳
data-analysis data-visualization madrid r
Last synced: 26 Mar 2025
https://github.com/omari-kd/transborder-freight-data-analysis
This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.
data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-science data-visualization powerbi
Last synced: 30 Mar 2025
https://github.com/shreshthvashisht/abc-call-volume-trend-analysis
Customer Experience Analysis
advanced-excel call-centre-analysis call-volume-trend data-analysis data-visualisation experience-analytics pivot-tables
Last synced: 01 Mar 2026
https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics
Cleaned, performed EDA and stored data in MySQL. Queried, and analyzed data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.
data-analysis data-visualization eda powerbi python sql
Last synced: 21 May 2026
https://github.com/aditiagrawal04/netflix-insights-mysql-
SQL-based analytical project exploring Netflix’s dataset to extract insights about content type, genre, ratings, country-based distributions, and release trends. Ideal for understanding business intelligence using SQL.
business-intelligence data-analysis data-exploration mysql netflix sql sql-project
Last synced: 28 Jun 2025
https://github.com/ljadhav25/knn-algorithm-data-science-
This repository contains a project demonstrating the implementation and application of the K-Nearest Neighbors (K-NN) algorithm in Data Science. The objective is to provide a comprehensive understanding of the K-NN algorithm, including data preprocessing, model training, evaluation, and visualization of results. This project is ideal for beginners
data-analysis data-science knn-classification machine-learning matplotlib-pyplot numpy pandas-library seaborn
Last synced: 16 Apr 2026
https://github.com/andryadsm/pizza-sales-report
🍕 Project Pizza Sales Report (MySQL, Tableau)
dashboards data-analysis data-visualization database-management mysql sales sql tableau
Last synced: 14 May 2025
https://github.com/waheed24-03/-ipl-stats-compare
Comparing Stats of Cricketers in IPL
cricket dashboard data-analysis data-visualization ipl python sports-analytics streamlit
Last synced: 28 Jun 2025
https://github.com/mindlessmuse666/iris-knn
Проект демонстрирует применение алгоритма k-ближайших соседей (KNN) для классификации набора данных Iris. Включает загрузку данных, обучение модели, оценку производительности и визуализацию результатов с использованием библиотек Pandas, Scikit-learn, Matplotlib, Seaborn и Plotly.
algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn
Last synced: 17 Aug 2025
https://github.com/mindlessmuse666/apartment-price-predictor
Python-проект по прогнозированию стоимости аренды квартир с помощью линейной регрессии. Практическая работа по теме: "Основы машинного обучения" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
apartment-price-prediction data-analysis data-science linear-regression linear-regression-models machine-learning matplotlib python regression sklearn unit-testing
Last synced: 11 Apr 2026
https://github.com/emmarhoffmann/analysis-of-student-debt-among-first-generation-college-students
Explores the financial landscape of first-generation college students, analyzing patterns in student debt based on factors like median income, net price of attendance, and enrollment size.
data-analysis first-generation-college-students r statistical-models
Last synced: 17 Mar 2025
https://github.com/mmfava/analises-papers
Script base de alguns papers publicados entre 2019 e 2021.
Last synced: 22 May 2026
https://github.com/sunnyrao07/data-analysis-dashboard-in-excel
I implemented a comprehensive data analysis solution using Excel, developing multiple dashboards and tables to visualize and interpret the data. This involved a rigorous data cleaning and preprocessing pipeline followed by data visualization.
dashboard data-analysis excel visualization
Last synced: 03 Feb 2026
https://github.com/al-ghaly/hotel-revenue-excel-analysis
Excel Dashboard to analyze data of a hotel over the past three years.
dashboard data-analysis data-visualization excel excel-analysis
Last synced: 02 Jan 2026
https://github.com/emmarhoffmann/analysis-of-california-real-estate-market-factors-influencing-home-prices
Investigates how home size, number of bedrooms, and bathrooms influence home prices, with comparisons across California, New York, New Jersey, and Pennsylvania.
data-analysis r real-estate statistical-models
Last synced: 17 Mar 2025
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/poglolopez/nesarc_research
Analyzing the relationship between Social Anxiety Disorder (SAD) and family history of behavioral problems using NESARC data. Includes statistical hypothesis testing (ANOVA, Chi-Square, Pearson Correlation, Moderation Analysis). Developed as part of the Data Analysis and Interpretation Specialization from Wesleyan University (Coursera).
anova chi-square coursera-assignment data-analysis hypothesis-testing mental-health moderation-analysis nesarc pandas pearson-correlation python social-anxiety statistical-analysis
Last synced: 14 Apr 2026
https://github.com/chiamakaukwuoma/portfolio
This repository contains various projects I've been privileged to work on outside of work.
aws-rds azure-fabric bigquery data-analysis docker-container elasticsearch excel grafana hadoop looker-studio mssql mysql postgresql powerbi python sql tableau
Last synced: 10 Apr 2026
https://github.com/yrohitha/titanic-data-analysis
Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.
data-analysis machine-learning matplotlib pandas scipy-stats statistical-models
Last synced: 13 Mar 2025
https://github.com/janashanaa/flightanalysis
This Jupyter Notebook presents an exploratory data analysis of data derived from a flight booking website.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 15 May 2026
https://github.com/chingu-voyages/v47-tier3-team-30
An easily accessible tool for calculating electricity-related carbon emissions, along with insights for reducing environmental impact. | Voyage-47 | https://chingu.io/ | Twitter: https://twitter.com/ChinguCollabs
carbon-emissions carbon-footprint data-analysis data-engineering data-science
Last synced: 10 May 2026
https://github.com/fazej99/u.s-climate-and-temperature-analysis
This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.
data-analysis data-science data-visualization gis machine-learning streamlit
Last synced: 22 May 2026
https://github.com/vipulbunny/web-tech-scanner
A Python-based web scraping tool that detects technologies used on a website by analyzing its scripts, meta tags, and HTML content.
beautifulsoup beautifulsoup4 data-analysis data-science python requests technology-detection web-scraping
Last synced: 22 May 2026
https://github.com/satyacoder29/comparison-of-region-based-sales-tableau
The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.
data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions
Last synced: 02 Feb 2026
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/data-edd/california_population_projection
This project demonstrates a population projection analysis for the state of California using MySQL
Last synced: 30 Mar 2025
https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra
Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.
cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python
Last synced: 11 Apr 2026
https://github.com/nymarya/analise-correlacao-sifilis
Código da análise de correlação entre notificações de casos de sífilis e disponibilidade de testes e medicamentos
data-analysis healthcare pandas
Last synced: 03 Jan 2026
https://github.com/abhirajp595/python2
Capstone Project using python(Real-Estate)
data-analysis data-science data-visualization jupyter-notebook machine-learning numpy pandas python statistics
Last synced: 09 Apr 2026
https://github.com/sciencesar-labs/py485-final-project
ROOT-based muon data analysis using Python & Jupyter – final project for PY485E @ CERN
cern computational-physics data-analysis jupyter-notebook muons python root uproot
Last synced: 15 May 2026
https://github.com/merrill007/sql-data-warehouse-project
The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.
architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver
Last synced: 22 Mar 2025
https://github.com/kaushik-puttaswamy/airline-passenger-referral-prediction-using-machine-learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 22 Mar 2025
https://github.com/jonek/pv-city-mastr
Extract and analyze data about photovoltaic systems in Germany
data-analysis germany jupyter-notebook pandas photovolatic-power photovoltaic
Last synced: 11 May 2026
https://github.com/bala-1409/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
correlation data-analysis data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning juypter-notebook machine-learning machine-learning-algorithms numpy pandas predictive-modeling python3 scikitlearn-machine-learning supervised-learning
Last synced: 08 Apr 2026
https://github.com/cyberoctane29/noaa-lightning-analysis
This project explores lightning strike data from the National Oceanic and Atmospheric Administration (NOAA) to identify seasonal trends and analyze strike frequency across months. It demonstrates data manipulation, aggregation, and visualization using Python, providing insights into lightning activity patterns.
data-analysis data-science data-visualization eda python
Last synced: 20 Apr 2026
https://github.com/macnianios/salifort-motors_retention
Google Advanced Data Analytics Capstone: Analyzing customer retention at Salifort Motors.
data-analysis machine-learning pandas python seaborn sklearn
Last synced: 08 Apr 2026
https://github.com/lord3008/instances-of-data-analysis
This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.
Last synced: 03 Mar 2025
https://github.com/l337x911/simulations
data analysis via in silico simulations
data-analysis machine-learning python3
Last synced: 06 Apr 2025
https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression
This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.
data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/valna/mercado-play
Mercado Play (streaming service of @mercadolibre) redesign with Astro.
astro data-analysis data-manipulation data-science data-visualization husky javascript justd mercado-play mit-license prettier react rocketseat typescript
Last synced: 08 Apr 2026
https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure
Data analysis realized in R Shiny and Python about the French electric vehicle and charging station infrastructure
data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny
Last synced: 15 May 2026
https://github.com/borjamome/top-goleadores
Mejores delanteros en Europa según los datos
data-analysis data-visualization football-analytics r
Last synced: 26 Mar 2025
https://github.com/jofaval/iris-flowers
Multilabel Classification of the famous Iris Flowers Dataset from Ronald Aylmer Fisher in 1936
classification data-analysis data-science data-visualization google-colab iris-flowers kaggle machine-learning python scikit-learn xgboost
Last synced: 05 Apr 2026
https://github.com/satyacoder29/crm-analytics
CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊
advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau
Last synced: 03 Mar 2025
https://github.com/oshinrathor/data-science-systems-and-analytics-projects
Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.
dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics
Last synced: 02 Mar 2025
https://github.com/jofaval/daily-california-births
Data Analysis of the Daily AFAB (Assigned Female At Birth) Births in California, 1959
california data-analysis data-science data-visualization deep-learning google-colab machine-learning python tensorflow timeseries timeseries-analysis
Last synced: 28 Jun 2025
https://github.com/abhinav-codealchemist/open-government-data-analysis
Data Analysis Using Pandas
data-analysis data-science jupyter-notebook python
Last synced: 18 May 2026
https://github.com/bilalhameed248/power-bi-learning-and-dev
Power BI Learning And Development
chats data-analysis data-preprocessing dataanalysis dax powerbi statistics visualization
Last synced: 06 Mar 2026
https://github.com/aalkiyumi/project-4-big-data-analysis-with-pyspark-on-weather-data
In this project, I analyzed weather data from the NCEI Global Surface Summary of Day dataset using PySpark in Jupyter Notebook. Tasks included data cleaning, statistical analysis, and forecasting for temperature, wind speed, precipitation, and extreme weather events. The project also predicts future weather patterns for Cincinnati and Florida.
big-data-analytics cs5165 data-analysis data-cleaning data-engineering data-science introduction-to-cloud-computing jupyter-notebook machine-learning precipitation-analysis predictive-modeling pyspark statistical-analysis temperature-forecasting time-series-forecasting uc uc2026 university-of-cincinnati wind-speed-data
Last synced: 17 Mar 2025
https://github.com/namratagulati/fraud_detection
This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.
data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python
Last synced: 04 Jun 2026
https://github.com/arun-data-analyst/finance-reporting-sql
End-to-end SQL project for project/portfolio finance: schema, seed data, validation, data-quality checks, business queries, and KPI views (Power BI–ready).
data-analysis data-modeling data-quality database finance kpi portfolio-management powerbi sql sql-server ssms
Last synced: 18 May 2026
https://github.com/dimits-ts/college_analysis
A statistical study about US college admissions, featuring a full report in LaTeX.
anova data-analysis exploratory-data-analysis linear-regression statistics
Last synced: 25 Jan 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Curso de Python 3 do básico ao avançado - com projetos reais
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 22 May 2026
https://github.com/cyberoctane29/deutsche-bank-customer-churn-prediction-end-to-end-analysis-and-modeling
In this project, I aim to predict customer churn for Deutsche Bank using supervised machine learning. It involves data exploration, feature engineering, and building Naive Bayes, Decision Tree, Random Forest, and XGBoost models. Models are tuned, evaluated, and compared to identify the best approach for churn prediction.
bank-customer-churn churn-analysis churn-prediction customer-churn-analytics data-analysis data-analytics data-visualization decision-tree eda gaussian-naive-bayes machine-learning random-forest supervised-learning xgboost
Last synced: 11 Oct 2025
https://github.com/onome-joseph/flexisaf
Generative AI & Data Science
data-analysis data-science machine-learning
Last synced: 16 Sep 2025
https://github.com/mikhaelmounay/salty-med
Salty Mediterranean - Grade 12 Data Analysis & Visualization Capstone Project
data-analysis data-visualization
Last synced: 02 Feb 2026
https://github.com/farhad-here/adventureworks_interactive_sales_dashboard_powerbi
An interactive Power BI dashboard for Adventure Works sales team to analyze performance, customers, products, and employees. Includes data cleaning, data modeling, DAX measures and advanced visualization features.
business-intelligence chart csv data-analysis data-cleaning data-cleaning-and-preprocessing data-visualization dax powerbi
Last synced: 13 Aug 2025
https://github.com/tushar2704/sql-query
Repository is designed to help you strengthen your SQL query skills by providing a collection of common and interview-based SQL queries for practice.
artificial-intelligence data-analysis data-engineering data-science database database-management database-schema relational-databases sql sql-database sql-query tushar2704
Last synced: 04 Nov 2025
https://github.com/tushar2704/loan-limits-by-country
This project aims to leverage a diverse dataset encompassing economic indicators, demographic factors, and credit history to establish a predictive model. By establishing appropriate loan limits, financial institutions can enhance risk management, ensure responsible lending, and promote financial inclusivity.
artificial-intelligence data-analysis data-science loan project tushar2704
Last synced: 30 Oct 2025
https://github.com/kfrural/dashboard_agro
Dashboard Agro is a technological platform that integrates several components to support Brazilian agribusiness through data analysis, visualization and forecasts. This innovative solution was developed to serve three main groups: farmers, researchers and public managers.
big-data data-analysis predictive-analytics python
Last synced: 15 May 2026
https://github.com/lawwrites/uncovering_fintech_user_insights
Linear Regression and K-Means to obtain user behavior insights for a fintech company
data-analysis data-science kmeans-clustering linear-regression python unsupervised-machine-learning
Last synced: 22 May 2026
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/shreshthvashisht/xyz-ads-airing-report_analysis
Ad data analysis using Advanced Excel
ad-airing-analysis advanced-excel data-analysis data-visualization pivot-tables
Last synced: 18 Feb 2026
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 08 Mar 2026
https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert
Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.
bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization
Last synced: 05 Apr 2025
https://github.com/mmfava/significados-aulas-biologia-quasiexp-2019
Repositório das análises realizadas para o paper "Construção de significados em aulas práticas de laboratório de biologia: uma avaliação por delineamento quase-experimental".
Last synced: 28 Jun 2025
https://github.com/dylanbk/exploring-data
A collection of programs that explore data engineering and analysis.
data-analysis data-engineering matplotlib pandas python
Last synced: 02 Mar 2025
https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends
Last synced: 27 Apr 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/bala-1409/loan-clustering-datascience-projects
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning
Last synced: 22 Mar 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 06 May 2026
https://github.com/aleskandro/r-hadoop-madreduce-examples
A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/beolawork-art/novabank-churn-analysis
NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.
data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql
Last synced: 08 Apr 2026
https://github.com/byte7/fifa17-analysis-and-prediction
⚽ FIFA 17 Analysis and Prediction⚽
data-analysis dreamteam fifa17 fifa17-analysis ultimate-team
Last synced: 26 Mar 2025