Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/sahilmaurya28/youtube-data-analysis
YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.
analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube
Last synced: 13 Apr 2026
https://github.com/nishumehta/british-airways-reviews-analysis
This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.
dashboard data-analysis data-visualization tableau tableau-public
Last synced: 12 Jan 2026
https://github.com/wassimhedfi/exploring-the-evolution-of-linux
Datacamp guided Project
data-analysis data-science ml python
Last synced: 15 May 2026
https://github.com/sreekar0101/-movie-recommendation-system-using-python
The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits.ultilizes python libraries numpy pandas and scikit learn.The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice
data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python
Last synced: 02 Jan 2026
https://github.com/sanafagal/wsp-msg-automation
An intuitive application for managing and analyzing customer and reseller data stored in Google Sheets, providing insights and streamlined data organization.
automation cloud-credentials data-analysis google-sheets-api python
Last synced: 16 Jun 2025
https://github.com/chahelgupta/hospital-readmission-prediction-and-analysis
The Hospital Readmission Prediction project uses clinical data to predict diabetic readmissions. SVM + SMOTE achieved 61.16% accuracy, with key predictors including hospital stay, lab tests, and medications.
data-analysis knn-classification logistic-regression machine-learning prediction prediction-model python random-forest-classifier smote svm-classifier
Last synced: 15 May 2026
https://github.com/amruthadevops/stock-market-analysis
To analyze market trends and predict future market behavior using machine learning techniques
data-analysis data-science jupyter-notebook machine-learning powerbi-desktop python stock-market
Last synced: 15 May 2026
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/mamtapanda088/dataanalaysis-warmup-
Tasks: Create a DataFrame: Convert the dictionary into a pandas DataFrame. Top and Bottom Rows: Display the top 3 bottom ,3 rows of the DataFrame. Summary Statistics: Generate summary statistics for the dataset. Gender Count: Count the occurrences of each gender. Marks Analysis: Calculate the average, maxi, and min marks. Tools Used: Python ,pandas
data-analysis data-science jupyter-notebook visualization
Last synced: 04 Apr 2025
https://github.com/lucashomuniz/project-01
SALES DATA ANALYSIS WITH POWERBI AND PYTHON
business-analytics business-intelligence data-analysis data-science data-visualization excel powerbi python toy-project
Last synced: 30 Mar 2025
https://github.com/lucashomuniz/project-05
[STATISTICAL ANALYSIS] Integrating Automation and Visualization for Optimal Data Analytics
automation data-analysis kolmogorov-smirnov language-r nonparametric-analysis parametric-analysis shapiro-wilk shiny-apps statistics t-test wilcoxon-test
Last synced: 30 Mar 2025
https://github.com/lucashomuniz/project-10
Optimizing Sales Forecast Accuracy: Exploratory Analysis and Insights
data-analysis data-munging data-visualization dax-languague exploratory-data-analysis language-r power-bi sales-forecast statistics-modules
Last synced: 30 Mar 2025
https://github.com/lucashomuniz/project-02
TRACKING USER ACCEPTANCE TESTING WITH POWERBI AND R
data-analysis data-visualization dax-languague powerbi-report powerbi-visuals powerquery r-language shiny-apps visuals
Last synced: 30 Mar 2025
https://github.com/palakjainanalyst/ecommerce-customer-spending-analysis
An end-to-end Ecommerce analytics project uncovering customer spending trends using Excel, Python, SQL, and Power BI. From raw data to interactive dashboards, this project delivers deep insights on spending patterns, high-value customer segments - showcasing a complete data-to-decisions workflow.
data-analysis data-visualization database ecommerce excel jupyter-notebook powerbi python spending sql
Last synced: 06 May 2026
https://github.com/shibbir-ahmad24/a-data-driven-approach-to-food-security-and-supermarket-accessibility
A Data-Driven Approach to Food Security and Supermarket Accessibility
data-analysis matplotlib numpy pandas python3 seaborn
Last synced: 05 Apr 2025
https://github.com/achronus/data-exploration
A repository dedicated to interesting data exploration projects I've completed
data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn
Last synced: 02 Jan 2026
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of a host of projects completed using python and sql.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/lewismakau/portfolio-projects
This repository contains file data and SQL files for projects used for my Portfolio.
data-analysis data-cleaning data-structures data-visualization database google-analytics microsoft-sql-server mysql powerbi tableau
Last synced: 02 Apr 2026
https://github.com/mobutolakecondyle107/sql-server-ddw
🚀 Streamline data management with sql-server-ddw, a powerful tool for efficient queries and seamless integration in SQL Server environments.
backup-and-recovery data-analysis data-integration data-modeling database-management database-security performance-tuning query-optimization reporting-tools sql-scripting sql-server sql-server-express stored-procedures table-design transaction-management
Last synced: 12 Jun 2026
https://github.com/saymyname1337/bachelor-s-thesis
Bachelor's thesis of a student of the MPEI of Shevts G. V.
Last synced: 23 Jul 2025
https://github.com/souravxbera/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 15 May 2026
https://github.com/dina-hosny/retail-store-data-modeling-and-analysis-using-datastage
The project implements a star-schema data warehousing flow, then utilize IBM InfoSphere DataStage to develop efficient ETL pipelines to create data marts and perform some analysis on them.
data-analysis datastage datawarehousing etl extract ibm load transform
Last synced: 06 Mar 2026
https://github.com/sdley/logiciel-de-deliberation-uam-2022
Del-Annuel est logiciel de deliberation annuelle des ecoles superieures ou universités
data-analysis pandas python tkinter-gui
Last synced: 08 May 2026
https://github.com/maazie-khan/austin-housing-insights-powerbi
Worked with a real estate dataset, we will build a tool to evaluate trends and drivers of house prices around Austin, Texas.
dashboard data-analysis data-science data-visualization database powerbi
Last synced: 02 Jan 2026
https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny
🌎Datamart Analysis with Machine Learning
data-analysis data-science dataviz machine-learning ml r retail-analysis rstudio shiny
Last synced: 06 Apr 2025
https://github.com/admacpherson/admacpherson.github.io
This repository hosts my personal website & portfolio. You can find my work experience, endorsements, contact information, and more on it at andrewmacpherson.dev
data-analysis personal-site portfolio website
Last synced: 15 Sep 2025
https://github.com/rudra-g-23/power-bi-custom-visual
A custom Power BI visual that displays a customizable, interactive charts with advanced capabilities.
custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization
Last synced: 02 Jan 2026
https://github.com/sevilaymuni/project-no.2-pandas-tableau-student-mobility
Pandas assisted Feature Engineering on Study Mobility: Tableau Dashboards on Students' Preferences
data-analysis data-extraction data-visualization feature-engineering pandas python tableau-dashboards tableau-desktop tableau-public
Last synced: 03 May 2026
https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset
Engineering / Culture / Blogs Data gathered for Educational and Learning purposes from Zomato's Blogs and spreading the better problem solving Methodologies adapted by Modern Unicorns
data-analysis dataset regex selenium webdriver zomato-data-analysis
Last synced: 06 Apr 2025
https://github.com/revtpark/teamseas_scrapper
Scraping Team Seas for data analysis and visualization.
chartjs data-analysis python webscraping
Last synced: 28 Mar 2025
https://github.com/tushar2704/hiring-process-analytics
In this project, I am analyzing hiring process data to gain insights from about records of previous hires within a multinational company. By analyzing this data, I am aiming to uncover valuable trends and information about the company's hiring process, which can contribute to making informed decisions and improvements for the future.
data-analysis data-cleaning data-science data-wrangling excel tushar2704
Last synced: 25 Jan 2026
https://github.com/tushar2704/employee-distribution
This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.
data-analysis data-visualization excel postgresql powerbi sql tushar2704
Last synced: 04 Nov 2025
https://github.com/tushar2704/consumables_sales_dashboard
Welcome to the Consumable Sales Dashboard, a powerful and intuitive data visualization tool built using Power BI. This dashboard offers a comprehensive view of sales data for consumable products, allowing you to quickly and easily analyze performance and identify trends.
dashboard data-analysis data-analytics data-science excel postgresql powerbi streamlit-tushar2704 tushar2704
Last synced: 04 Nov 2025
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/maazie-khan/power-bi-projects
Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.
dashboard data-analysis data-science data-visualization database excel powerbi
Last synced: 02 Jan 2026
https://github.com/zborovskaanna/grosery_store_sales_analysis
Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau
data-analysis data-visualization matplotlib numpy pandas python seaborn tableau
Last synced: 08 Apr 2026
https://github.com/sramalhao/sleep_health_analysis
This repository contains a comprehensive project focused on analyzing various factors influencing sleep health, such as BMI, occupation, gender, age, physical activity, and stress levels.
analytics data-analysis eda matplotlib pandas python seaborn sklearn visualization
Last synced: 13 Apr 2026
https://github.com/zeh237/superstore-data-analytics
This is a Flask based data analytics project based on the superstore dataset using flask, pandas, sql and python
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025
https://github.com/deliprofesor/customerseg-customer-segmentation-and-shopping-analysis
This project performs data exploration, segmentation, and modeling of wholesale customer data using clustering algorithms, PCA, and decision trees to analyze purchasing behavior and predict customer channel preferences.
clustering customer-segmentation data-analysis data-visualization dbscan decision-tree gmm kmeans machine-learning pca
Last synced: 24 Jun 2025
https://github.com/mae776569/tmdb-data-analysis
Investigating tmdb movies dataset
conclusions data-analysis data-wrangling exploratory-data-analysis
Last synced: 03 Nov 2025
https://github.com/scailfin/rob-webapi-flask
Default RESTful Web API implementation for the Reproducible Open Benchmarks for Data Analysis Platform (ROB) using the Flask web framework.
benchmarks data-analysis reproducibility webapi
Last synced: 17 Mar 2026
https://github.com/fer-aguirre/cookiecutter-data-analysis-extensive
A cookiecutter template for data analysis projects using Python.
cookiecutter data-analysis project-template python
Last synced: 09 Apr 2025
https://github.com/omari-kd/transborder-freight-data-analysis
This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.
data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-science data-visualization powerbi
Last synced: 30 Mar 2025
https://github.com/shreshthvashisht/abc-call-volume-trend-analysis
Customer Experience Analysis
advanced-excel call-centre-analysis call-volume-trend data-analysis data-visualisation experience-analytics pivot-tables
Last synced: 01 Mar 2026
https://github.com/vetrivel07/flight-price-prediction
Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 15 Jun 2025
https://github.com/aditiagrawal04/netflix-insights-mysql-
SQL-based analytical project exploring Netflix’s dataset to extract insights about content type, genre, ratings, country-based distributions, and release trends. Ideal for understanding business intelligence using SQL.
business-intelligence data-analysis data-exploration mysql netflix sql sql-project
Last synced: 28 Jun 2025
https://github.com/andryadsm/pizza-sales-report
🍕 Project Pizza Sales Report (MySQL, Tableau)
dashboards data-analysis data-visualization database-management mysql sales sql tableau
Last synced: 14 May 2025
https://github.com/waheed24-03/-ipl-stats-compare
Comparing Stats of Cricketers in IPL
cricket dashboard data-analysis data-visualization ipl python sports-analytics streamlit
Last synced: 28 Jun 2025
https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera
introduction to data science in python, part of Applied Data Science using Python Specialization from University of Michigan offered by Coursera
data-analysis matplotlib numpy pandas
Last synced: 03 May 2026
https://github.com/cassiofb-dev/fide-rating-analysis
The plot speaks for itself
chess data-analysis fide hans rating
Last synced: 15 Jun 2025
https://github.com/kineticloom/plydb-fun-nfl-analyst
Analyze NFL data with your AI agent
data-analysis football-analytics nfl
Last synced: 15 May 2026
https://github.com/mmfava/analises-papers
Script base de alguns papers publicados entre 2019 e 2021.
Last synced: 22 May 2026
https://github.com/sunnyrao07/data-analysis-dashboard-in-excel
I implemented a comprehensive data analysis solution using Excel, developing multiple dashboards and tables to visualize and interpret the data. This involved a rigorous data cleaning and preprocessing pipeline followed by data visualization.
dashboard data-analysis excel visualization
Last synced: 03 Feb 2026
https://github.com/al-ghaly/hotel-revenue-excel-analysis
Excel Dashboard to analyze data of a hotel over the past three years.
dashboard data-analysis data-visualization excel excel-analysis
Last synced: 02 Jan 2026
https://github.com/ccoolbaugh/individualized_cooling_data_analysis
Matlab code to analyze data collected during a brown adipose tissue individualized cooling protocol.
brown-adipose-tissue cold-exposure data-analysis ibutton matlab skin-temperature thermoregulation
Last synced: 18 Aug 2025
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/poglolopez/nesarc_research
Analyzing the relationship between Social Anxiety Disorder (SAD) and family history of behavioral problems using NESARC data. Includes statistical hypothesis testing (ANOVA, Chi-Square, Pearson Correlation, Moderation Analysis). Developed as part of the Data Analysis and Interpretation Specialization from Wesleyan University (Coursera).
anova chi-square coursera-assignment data-analysis hypothesis-testing mental-health moderation-analysis nesarc pandas pearson-correlation python social-anxiety statistical-analysis
Last synced: 14 Apr 2026
https://github.com/yrohitha/titanic-data-analysis
Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.
data-analysis machine-learning matplotlib pandas scipy-stats statistical-models
Last synced: 13 Mar 2025
https://github.com/srummanf/elnino-anomaly-study
Study on El Niño’s impact on Chennai groundwater sustainability
data-analysis machine-learning python satellite-imagery-analysis
Last synced: 15 May 2026
https://github.com/fazej99/u.s-climate-and-temperature-analysis
This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.
data-analysis data-science data-visualization gis machine-learning streamlit
Last synced: 22 May 2026
https://github.com/vipulbunny/web-tech-scanner
A Python-based web scraping tool that detects technologies used on a website by analyzing its scripts, meta tags, and HTML content.
beautifulsoup beautifulsoup4 data-analysis data-science python requests technology-detection web-scraping
Last synced: 22 May 2026
https://github.com/yassin522/health-insurance-cross-sell-prediction
Prediction of Vehicles Health Insurance
data data-analysis data-science machine-learning plotly python
Last synced: 15 May 2026
https://github.com/data-edd/california_population_projection
This project demonstrates a population projection analysis for the state of California using MySQL
Last synced: 30 Mar 2025
https://github.com/nymarya/analise-correlacao-sifilis
Código da análise de correlação entre notificações de casos de sífilis e disponibilidade de testes e medicamentos
data-analysis healthcare pandas
Last synced: 03 Jan 2026
https://github.com/cyberoctane29/noaa-lightning-analysis
This project explores lightning strike data from the National Oceanic and Atmospheric Administration (NOAA) to identify seasonal trends and analyze strike frequency across months. It demonstrates data manipulation, aggregation, and visualization using Python, providing insights into lightning activity patterns.
data-analysis data-science data-visualization eda python
Last synced: 20 Apr 2026
https://github.com/merrill007/sql-data-warehouse-project
The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.
architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver
Last synced: 22 Mar 2025
https://github.com/kaushik-puttaswamy/airline-passenger-referral-prediction-using-machine-learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 22 Mar 2025
https://github.com/12danielll/neurogenomics_project
This project focuses on analyzing sequencing data to understand molecular mechanisms of neurological diseases and predict the effectiveness of immunotherapy in breast cancer patients. It integrates Python and R scripts for data processing, statistical analysis, and visualization, alongside a comprehensive report detailing methods and findings.
bioinformatics biostatistics clustering clustering-algorithms data-analysis data-visualization deseq2 differential-gene-expression functional-analysis immune-therapy machine-learning neurological-disease neuroscience pca-analysis python r seurat single-cell-analysis
Last synced: 06 Apr 2026
https://github.com/bala-1409/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
correlation data-analysis data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning juypter-notebook machine-learning machine-learning-algorithms numpy pandas predictive-modeling python3 scikitlearn-machine-learning supervised-learning
Last synced: 08 Apr 2026
https://github.com/macnianios/salifort-motors_retention
Google Advanced Data Analytics Capstone: Analyzing customer retention at Salifort Motors.
data-analysis machine-learning pandas python seaborn sklearn
Last synced: 08 Apr 2026
https://github.com/jofaval/teams-assistance-graph
Visualize in a Graph your Teams Attendance Report
charts data-analysis data-visualization data-viz github-copilot highcharts highcharts-js javascript microsoft-teams
Last synced: 08 Sep 2025
https://github.com/l337x911/simulations
data analysis via in silico simulations
data-analysis machine-learning python3
Last synced: 06 Apr 2025
https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression
This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.
data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/valna/mercado-play
Mercado Play (streaming service of @mercadolibre) redesign with Astro.
astro data-analysis data-manipulation data-science data-visualization husky javascript justd mercado-play mit-license prettier react rocketseat typescript
Last synced: 08 Apr 2026
https://github.com/pentalpha/eu-car-emissions-analysis-2015
Analysis of CO² Emissions on Passenger Cars at the E.U. Contries, Year 2015.
data-analysis data-science dataset jupyter-notebook python python3
Last synced: 15 May 2026
https://github.com/pentalpha/bti-performance-study
A series of analysis on a large amount of data about the grades of students in the Technology Information course at UFRN
analysis big-data clustering data-analysis data-science data-visualization ipynb ipython jupyter-notebook performance-analysis plot python python3
Last synced: 15 May 2026
https://github.com/jofaval/daily-california-births
Data Analysis of the Daily AFAB (Assigned Female At Birth) Births in California, 1959
california data-analysis data-science data-visualization deep-learning google-colab machine-learning python tensorflow timeseries timeseries-analysis
Last synced: 28 Jun 2025
https://github.com/harmanveer-2546/movie-industry
Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.
business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python
Last synced: 10 Apr 2026
https://github.com/mtholahan/advanced-mysqlquery-tuning-mini-project
Analyzed EuroCup 2016 data with advanced SQL queries. Imported CSV datasets into MySQL, designed schema with match, player, and referee details, and implemented queries covering match outcomes, penalty shootouts, player stats, bookings, substitutions, and referee activity to explore tournament dynamics.
bootcamp data-analysis data-engineering data-modeling database eurocup football mysql queries soccer sports springboard sql
Last synced: 15 May 2026
https://github.com/namratagulati/fraud_detection
This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.
data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python
Last synced: 04 Jun 2026
https://github.com/dimits-ts/college_analysis
A statistical study about US college admissions, featuring a full report in LaTeX.
anova data-analysis exploratory-data-analysis linear-regression statistics
Last synced: 25 Jan 2026
https://github.com/onome-joseph/flexisaf
Generative AI & Data Science
data-analysis data-science machine-learning
Last synced: 16 Sep 2025
https://github.com/tushar2704/sql-query
Repository is designed to help you strengthen your SQL query skills by providing a collection of common and interview-based SQL queries for practice.
artificial-intelligence data-analysis data-engineering data-science database database-management database-schema relational-databases sql sql-database sql-query tushar2704
Last synced: 04 Nov 2025
https://github.com/tushar2704/loan-limits-by-country
This project aims to leverage a diverse dataset encompassing economic indicators, demographic factors, and credit history to establish a predictive model. By establishing appropriate loan limits, financial institutions can enhance risk management, ensure responsible lending, and promote financial inclusivity.
artificial-intelligence data-analysis data-science loan project tushar2704
Last synced: 30 Oct 2025
https://github.com/lawwrites/uncovering_fintech_user_insights
Linear Regression and K-Means to obtain user behavior insights for a fintech company
data-analysis data-science kmeans-clustering linear-regression python unsupervised-machine-learning
Last synced: 22 May 2026
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/shreshthvashisht/xyz-ads-airing-report_analysis
Ad data analysis using Advanced Excel
ad-airing-analysis advanced-excel data-analysis data-visualization pivot-tables
Last synced: 18 Feb 2026
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 08 Mar 2026
https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert
Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.
bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization
Last synced: 05 Apr 2025
https://github.com/mmfava/significados-aulas-biologia-quasiexp-2019
Repositório das análises realizadas para o paper "Construção de significados em aulas práticas de laboratório de biologia: uma avaliação por delineamento quase-experimental".
Last synced: 28 Jun 2025
https://github.com/eesunmoon/genai_cor-recom
[Project] Outfit Coordination Recommender System using KoAlpaca
data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium
Last synced: 06 Apr 2026
https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends
Last synced: 27 Apr 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/bala-1409/loan-clustering-datascience-projects
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning
Last synced: 22 Mar 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 06 May 2026
https://github.com/aleskandro/r-hadoop-madreduce-examples
A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/beolawork-art/novabank-churn-analysis
NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.
data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql
Last synced: 08 Apr 2026