Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/ansh-info/literaturesurvey
Literature Survey Engine, leverages the powerful Semantic Scholar's Recommendation API to provide you with highly relevant research article recommendations based on your curated lists of articles.
api api-integration automation data-analysis data-visualization docker docker-compose literature-survey machine-learning mysql paper-recommendations python recommendation-system research-tools semantic-scholar streamlit zotero
Last synced: 10 Apr 2026
https://github.com/rohithay/titanic-data-analysis
Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.
data-analysis machine-learning matplotlib pandas scipy-stats statistical-models
Last synced: 15 May 2026
https://github.com/surajsanap/employee-resigning-analysis-powerbi-dashboard-data-analytics
Effortlessly analyze employee resignations with our concise Power BI dashboard. Download the XML file, open the dashboard, and gain quick insights into resignation trends and reasons for departure. Streamlined and effective
dashboard data-analysis data-analytics powerbi python xml-dataset
Last synced: 08 May 2025
https://github.com/davydantoniuk/stackoverflow-graph-analyse-r
data-analysis graph r stackoverflow
Last synced: 13 Mar 2025
https://github.com/mostafa-bashir/investigating_weather_data
data-analysis ipython jupyter-notebook nump pandas python
Last synced: 07 Apr 2026
https://github.com/dionixius7/titanic-disaster-ml-model
This project predicts the survival of passengers on the Titanic by using Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances
data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction
Last synced: 07 Feb 2026
https://github.com/diliprk/smartcityvisualization
Data Wrangling and Data Visualization Works done for Smart City Project at HBK Saar
bokeh data-analysis data-visualization python3
Last synced: 15 May 2026
https://github.com/cadedupont/mlb-data-analysis
Performing analysis on dataset of active MLB players in R
baseball-analytics data-analysis data-science mlb-stats-api r
Last synced: 23 Jun 2026
https://github.com/djccnt15/mathematics
data-analysis data-science linear-algebra python statistics
Last synced: 24 Jun 2025
https://github.com/anandanraju/sql_data_analysis_projects
About This Two projects involves analyzing Pizza Data & Walmart Sales data using SQL to identify insights and trends. The aim is to do data-driven approaches to understand sales performance, identify key factors influencing sales, and provide actionable recommendations for business improvement.
csv data-analysis data-management mysql pizza sql sql-schema walmart
Last synced: 24 Jun 2025
https://github.com/monish-nallagondalla/cement_strength_prediction
The Cement Strength Prediction project uses machine learning to predict the compressive strength of cement based on its components, such as Cement, Fly Ash, Water, Superplasticizer, Coarse Aggregate, Fine Aggregate, and Age. The goal is to forecast compressive strength (MPa) for optimized cement production and quality control.
cement-strength-prediction construction-industry data-analysis data-preprocessing data-science data-visualization feature-engineering machine-learning predictive-modeling python regression-analysis scikit-learn
Last synced: 11 May 2026
https://github.com/nitins17/tableauvisualizations
Visualizations I created while learning to work with Tableau
data-analysis data-science data-visualization tableau visualization
Last synced: 01 Mar 2026
https://github.com/edumoraes1/journey_active_users
Segmentação de base via SQL para jornada de vendedores ativos
bq data-analysis salesforce sql
Last synced: 02 Feb 2026
https://github.com/brownred/python-and-sql
Python and SQL (postgreSQL & mySQL) for data analysis.
data-analysis databases python3 sql
Last synced: 11 May 2026
https://github.com/collins-kimotho/communicate-data-findings
Data Analysis Project: Investigating Factors Contributing to No-Show Appointments in Medical Records
data-analysis data-science data-visualization dataset pandas python
Last synced: 17 May 2026
https://github.com/advestis/adadjust
Package allowing to fit any mathematical function to (for now 1-D only) data.
Last synced: 17 May 2026
https://github.com/gemaquejr/restaurant-orders
Projeto com o objetivo de aplicar os conceitos de POO e trabalhar com Set, Hashmap e Dict. Este projeto foi criado para avaliação final na seção 06 do módulo de ciência da computação do Curso de Desenvolvimento Web na Trybe.
data-analysis dict hashmap poo python set
Last synced: 30 Oct 2025
https://github.com/oubiche-ishak19/stock_evaluation_python
A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.
backtesting-frameworks classification csv-processing data-analysis expert-system finance financial-analysis-tools python rule-based-classifier stock stock-market streamlit tkinter-gui yahoo-finance
Last synced: 15 May 2026
https://github.com/georgehanymilad/end-to-end-shopping-trends-data-analysis
SQL+ Python + Power BI Project for Data Analysis
data-analysis data-visualization datacleaning mssql powerbi python sql
Last synced: 17 May 2026
https://github.com/saob007/tablero_subsidios_servicio_agua
Se construye un dashboard para el análisis de la distribución y asignación de subsidios para agua potable y alcantarillado otorgados por la Secretaría de Planeación de la Alcaldía de Sincelejo en 2020, con el objetivo de identificar patrones en cobertura, consumo, facturación y subsidios, facilitando la toma de decisiones en políticas públicas
dashboard data-analysis data-visualization looker-studio
Last synced: 31 Jan 2026
https://github.com/imnotamr/ai
A collection of machine learning and AI projects implemented in Jupyter notebooks, covering regression, classification, and neural networks
ai classification colab-notebook data-analysis data-preprocessing data-preprocessing-and-cleaning data-visualization deep-learning deep-neural-networks jupyter-notebook machine-learning model-evaluation predictive-modeling project-based-learning python supervised-learning supervised-learning-algorithms supervised-learning-classifiers unsupervised-learning unsupervised-learning-algorithms
Last synced: 17 May 2026
https://github.com/satvikpraveen/pcc-vizforge
🎨 Personal data visualization toolkit generating synthetic datasets across multiple domains (random walks, dice simulations, weather patterns, earthquakes, GitHub analytics) with beautiful Matplotlib & Plotly visualizations. Includes Jupyter notebooks, interactive dashboards & statistical analysis. Perfect for learning data science! 🚀📊
analytics dashboard data-analysis data-generation data-science data-visualization github-analytics interactive-visualization jupyter-notebook matplotlib plotly probability python random-walk scientific-computing seismology statistical-analysis synthetic-data time-series weather-data
Last synced: 17 May 2026
https://github.com/silianpan/python-data-analysis-course
python data analysis course of drotion-lega
data-analysis jupyter-notebook panda
Last synced: 11 Apr 2025
https://github.com/eslamdyab21/a-b-test-to-an-e-commerce-website
A/B test to an e-commerce website
csv data-analysis data-science hypothesis-testing pandas python udacity-data-analyst-nanodegree
Last synced: 17 May 2026
https://github.com/gabrielczar/machine-learning
Repositorio de Analise de Dados and Machine Learning
data-analysis data-science jupyter-kernels jupyter-notebook learning-exercise machine-learning
Last synced: 14 Jul 2025
https://github.com/tinaland101/python-api-challenge
This project involves analyzing weather data from cities around the world using the OpenWeatherMap API and creating visualizations to explore the relationship between weather variables and latitude.
api-integration-and-data-retrieval data-analysis data-collection-and-geospatial-analysis problem-solving-and-decision-making statistical-analysis
Last synced: 03 Mar 2025
https://github.com/brunomontezano/sleep-cognition-and-functioning
💤 Data analysis of a brief communication published in Psychiatry Research Communications journal by Montezano et al (2023).
bipolar-disorder cognition data-analysis data-visualization data-viz depression ggplot2 pelotasrs psychiatry psychology published-article r sleep ucpel
Last synced: 13 Jun 2026
https://github.com/nishumehta/house-sales-analysis
House Sales Analysis Dashboard for King County, Washington, built with Tableau. Features interactive charts and maps to explore sales patterns, price distributions, and property conditions.
dashboard data-analysis data-visualization tableau tableau-dashboards tableau-public
Last synced: 11 Jan 2026
https://github.com/rdrahul123/my_python-codes
Python Programming codes and Notebooks
anaconda data-analysis data-science jupyter-notebook python python3 visual-studio
Last synced: 17 May 2026
https://github.com/arv-anshul/pw-api
Perform data analysis on PW Skills APIs. Made a web app using streamlit. See any course syllabus, analytics, quizzes and assignments.
api course data-analysis ineuron-ai physics-wallah project pw-skills python3 streamlit
Last synced: 18 Apr 2026
https://github.com/lucaspadoni/9-11-hijackers-social-network-analysis
Social Network Analysis focused on the events of 9/11/2001. By examining publicly available data through SNA techniques, we gain insights into the organizational structure of the terrorist network, offering valuable perspectives on key relationships and connections.
9-11 data-analysis data-analytics graph-theory hijacking network-analysis sna social-network-analysis terrorism terrorist-attacks
Last synced: 26 Jun 2026
https://github.com/edjoukou/altip-sales-analysis
It is about Sales data analysis
data-analysis mysql-database sql tableau visualization
Last synced: 20 Jul 2025
https://github.com/marcomadera/test-for-random-numbers
Test for random number between 0 and 1
Last synced: 09 Jul 2025
https://github.com/lauratrigo/codigo_roti
Análise de ROTI é uma ferramenta em MATLAB para processar e visualizar dados ionosféricos (ROTI) de múltiplas estações GNSS. Desenvolvido para pesquisas em geofísica espacial, o script gera gráficos temporais comparativos com filtros de qualidade e tratamento de dados faltantes. 📡
data-analysis geophysics image-processing matlab roti scientific-initiation
Last synced: 24 Jun 2025
https://github.com/sadratehranian/prediction-of-covid-19-diagnosis
Build an algorithm in MATLAB using ML techniques to predict if a person is having COVID-19 or not depending on the existing medical conditions. Further research has been conducted on identifying the most suitable machine learning techniques and increase their prediction accuracy.
covid-19 data-analysis data-science data-visualization machine-learning matlab prediction visualization
Last synced: 11 Sep 2025
https://github.com/thyripian/ibm_data_science_capstone
data-analysis data-science data-visualization python python3
Last synced: 12 May 2026
https://github.com/mahdikh03/custumers_clustering_rmf
A data analysis project to implement RFM (Recency, Frequency, Monetary) analysis for customer segmentation and behavior analysis using the K-Means algorithm.
customer-segmentation data-analysis k-means-clustering unsupervised-learning
Last synced: 09 May 2025
https://github.com/gonzalofuentes28/dpeek
Interactive terminal data viewer for CSV, TSV, JSON, and JSONL files
bubbletea cli csv csv-viewer data-analysis data-viewer golang json json-viewer sqlite terminal tui
Last synced: 06 Apr 2026
https://github.com/niaid/categorical-data-analysis
bcbb-training data-analysis data-science r statistics
Last synced: 24 Jun 2025
https://github.com/riciokzz/covid-19-analysis
Covid-19 Analysis In South Korea
covid-19 data-analysis data-cleaning data-engineering exploratory-data-analysis machine-learning south-korea
Last synced: 20 Jul 2025
https://github.com/niaid/genetic-linkage-analysis
Materials for ACE course on Genetic Linkage Analysis.
ace ace-uganda2020 analysis bcbb-training clinical data-analysis genetics ngs ngs-analysis
Last synced: 24 Jun 2025
https://github.com/faith99/water_pollution_dashboard
A data visualization project exploring water access, contamination and health outcomes
data-analysis data-visualization powerbi public-health publichealth
Last synced: 02 Feb 2026
https://github.com/dzakwanalifi/reglins
regLins is an R package designed for performing linear regression analysis using various optimization methods. It also provides an interactive Shiny application for a more dynamic analysis experience.
data-analysis linear-regression optimization r shiny-app
Last synced: 09 Jul 2025
https://github.com/muneeb1030/webscrapper_altnews
The project utilizes a combination of Python, Scrapy, and Selenium to navigate through the dynamic content of AltNews.in and collect valuable information for analysis and verification.
data-analysis data-collection python3 scrapy scrapy-spider selenium selenium-python
Last synced: 17 May 2026
https://github.com/macorisd/instagram-fake-account-analysis
A project in R focused on detecting fake Instagram accounts. It includes exploratory data analysis, data visualization, and analysis using three techniques: association rules, formal concept analysis, and regression. The results are presented in an interactive Quarto book.
data-analysis data-science data-visualization r
Last synced: 10 Jun 2025
https://github.com/eslamdyab21/apara-data-gui
Custom application for Apara's data wrangling scripts, Technologies used are Qt-designer, PyQt5 for the GUI and Pandas, Numpy for the data work.
csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui
Last synced: 17 May 2026
https://github.com/bhiogade/customer-purchase-analysis
Comprehensive Customer Purchase Analysis Across Multiple Dimensions
data-analysis data-visualization tableau tableau-desktop
Last synced: 02 Feb 2026
https://github.com/imdadmiran17/data_analysis_exercise
data-analysis numpy numpy-arrays numpy-exercises python3
Last synced: 17 May 2026
https://github.com/shaikh-raj/data-science-portfolio
Data Science Portfolio of Raj Shaikh including Case Studies and Articles that I have completed that solve various business problems.
articles case-study data-analysis deep-learning machine-learning nlp statistics
Last synced: 20 Jul 2025
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/arction/lcjs-example-0507-dashboardfiberanalysis
A demo application showcasing using LightningChart JS to visualize fiber analysis data.
area-plot area-series chart charts dashboard data-analysis demo heatmap javascript lcjs lightningchart-js performance visualization webgl
Last synced: 12 Mar 2025
https://github.com/ryan-wong1/72-years-of-shark-incidents-in-california-data-analysis
Shark Incidents in California 1950 - 2022
data-analysis data-cleaning data-visualization exploratory-data-analysis sql
Last synced: 25 Jan 2026
https://github.com/anastasius21/creditcardfrauddetection
This repository contains a Jupyter Notebook for Credit Card Fraud Detection Model and a csv dataset on which it is being trained
credit-card-fraud data-analysis data-science data-visualization fraud-detection logistic-regression machine-learning
Last synced: 16 Jun 2025
https://github.com/soumasish2005/ai-chatbot-using-snowflake
This project is a Streamlit application that allows users to upload a CSV file and ask questions about their data in natural language.
cloud data-analysis data-science data-visualization python snowflake streamlit
Last synced: 17 May 2026
https://github.com/shreeparab1890/india-gdp-rate-1960-to-2021-data-analysis
This ipython notebook is the Exploratory data analysis (EDA) of the India GDP Rate 1960 to 2021.
analysis data-analysis eda exploratory-data-analysis ipython-notebook jyputer-notebook matplotlib matplotlib-pyplot pandas python
Last synced: 06 Mar 2026
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/balajimohan18/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 05 Apr 2026
https://github.com/srinibas-masanta/deloitte-forage-virtual-internship
This repository contains my work from the Deloitte Forage Virtual Internship, where I analyzed factory telemetry data in Tableau to identify machine breakdown patterns and assessed gender pay equality using Excel. From interactive dashboards to insightful classifications, this project showcases hands-on data analysis and visualization skills. 🚀📊
data-analysis data-visualization deloitte excel forage tableau
Last synced: 15 Jan 2026
https://github.com/farzeennimran/apriori-algorithm
Apriori Algorithm for Association Rule Mining
algorithm apriori apriori-algorithm apyori association-rule-mining association-rules data-analysis data-mining data-science numpy pandas python
Last synced: 06 Apr 2026
https://github.com/rohitdusane/interactive-ibd-analysis-dashboard-with-dash-plotly
This repository showcases a project that combines data analysis and visualization through Dash and Plotly. The goal of this project is to offer an efficient and user-friendly way to integrate robust data analysis with an interactive web-based interface.
clinical-research data-analysis exploratory-data-analysis pyhton statistical-reports
Last synced: 24 Jun 2025
https://github.com/sotirismos/pattern-recognition-labs
Lab exercises and quizzes for Pattern Recognition course, Auth winter semester 20-21
classification clustering data-analysis machine-learning pattern-recognition
Last synced: 17 Jun 2025
https://github.com/rachelresende/regressaolinear
Este repositório é destinado as aulas de regressão linear que realizei em um curso da Udemy sobre o assunto em 2025. Sendo um curso de reciclagem, pois estudei esse tratamento também em 2020 em um curso de estatística da Alura.
data-analysis data-science linear-regression
Last synced: 11 Sep 2025
https://github.com/haroontrailblazer/user_behavioral_analysis
Social Media User Engagement Analysis Using Power BI
data-analysis data-science data-visualization database powerbi
Last synced: 29 Mar 2025
https://github.com/mainak-97/pizza-sales-analysis-project
Pizza Sales Analysis Project: This project optimizes a pizza restaurant's operations by analyzing demand patterns, revenue, and efficiency, providing insights to enhance profitability, streamline production, and improve customer satisfaction.
business-analytics business-intelligence dashboards data-analysis operations-optimization peak-hours power-bi restaurant-analysis revenue-analysis
Last synced: 06 Jan 2026
https://github.com/vishal-verma-96/Pre-Owned-Car-Price-prediction-using-Streamlit-App
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 02 Mar 2025
https://github.com/wassimhedfi/exploring-the-evolution-of-linux
Datacamp guided Project
data-analysis data-science ml python
Last synced: 15 May 2026
https://github.com/parth-jatav/ipl-data-analysis-mentorness
This project uses Power BI to analyze IPL cricket data, featuring dashboards with insights on batting averages, strike rates, and player roles. It identifies the top 11 players and includes navigable pages focused on specific roles like Anchors, Finishers, and All-Rounders.
dashboard data-analysis ipl ipl-dashboard powerbi
Last synced: 07 Mar 2026
https://github.com/sanafagal/wsp-msg-automation
An intuitive application for managing and analyzing customer and reseller data stored in Google Sheets, providing insights and streamlined data organization.
automation cloud-credentials data-analysis google-sheets-api python
Last synced: 16 Jun 2025
https://github.com/chahelgupta/hospital-readmission-prediction-and-analysis
The Hospital Readmission Prediction project uses clinical data to predict diabetic readmissions. SVM + SMOTE achieved 61.16% accuracy, with key predictors including hospital stay, lab tests, and medications.
data-analysis knn-classification logistic-regression machine-learning prediction prediction-model python random-forest-classifier smote svm-classifier
Last synced: 15 May 2026
https://github.com/amruthadevops/stock-market-analysis
To analyze market trends and predict future market behavior using machine learning techniques
data-analysis data-science jupyter-notebook machine-learning powerbi-desktop python stock-market
Last synced: 15 May 2026
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of a host of projects completed using python and sql.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/alansteinbarth/eksploracyjna-analiza-danych-o-pasazerach-statku-titanic
🔍 Titanic EDA: odkrywanie wzorców przeżywalności przez analizę danych. Profesjonalny projekt z wizualizacjami i insights
analytics csv data-analysis data-science data-visualization dataset eda exploratory-data-analysis jupyter-notebook kaggle machine-learning matplotlib numpy pandas portfolio python seaborn statistics titanic visualization
Last synced: 11 Apr 2026
https://github.com/syarwinaaa09/analyzing-students-mental-health
data-driven exploration into student mental health trends using survey data
csv-dataset data-analysis education jupyter-notebook mental-health-awareness pandas psychology student-mental-health visualization
Last synced: 11 Sep 2025
https://github.com/rahulsm20/trackbyte
A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste. Built using React, Node.js, Express.js, MySQL and Bootstrap.
bootstrap data-analysis expressjs mysql nodejs reactjs sql
Last synced: 07 Apr 2026
https://github.com/pylena/movies-prediction
This project focuses on clustering movies based on their genres using machine learning techniques. By analyzing genre data, the model groups similar movies together, facilitating recommendations and insights into genre-based patterns.
data-analysis machine-learning render streamlit unsupervised-learning
Last synced: 18 May 2026
https://github.com/judyway2/de-data
A brief analysis on schools ARR data
data-analysis jupyter-notebook
Last synced: 11 May 2025
https://github.com/natanel567/university_machine_learning_project
Machine Learning final project Tel Aviv University
data-analysis jupyter-notebook machine-learning
Last synced: 11 May 2025
https://github.com/victoorv/detection_malwares
L'objectif de ce projet est de développer un classifieur capable de différencier les logiciels malwares des goodwares.
classification data-analysis data-science machine-learning machine-learning-algorithms malware-analysis malware-detection oversampling-algorithms python scikit-learn supervised-learning undersampling-algorithms
Last synced: 28 Apr 2026
https://github.com/prakhar-code/british_airways_review_analysis
Analysis of the British Airways Reviews by Customers, filtered by several different factors such as food, entertainment, services, etc.
data-analysis data-cleaning excel tableau-dashboards tableau-public tableau-visualization
Last synced: 15 Jan 2026
https://github.com/ziaeemehr/neuro_toolbox
Single Header File C++ library for analysis of neurophysiological and simulated data.
data-analysis data-science signal-processing synchronization
Last synced: 21 Jul 2025
https://github.com/rafinha0rafinha/web-analyzer-backend
(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.
azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer
Last synced: 10 Apr 2026
https://github.com/mfakhriazhar/stock-price-prediction
Stock prices are highly volatile and influenced by various factors, making accurate prediction a major challenge in investment decisions.
data-analysis data-science deep-learning python recurrent-neural-networks
Last synced: 18 May 2026
https://github.com/lewismakau/portfolio-projects
This repository contains file data and SQL files for projects used for my Portfolio.
data-analysis data-cleaning data-structures data-visualization database google-analytics microsoft-sql-server mysql powerbi tableau
Last synced: 02 Apr 2026
https://github.com/spring-0/netflix-media-data-analysis
Exploring and analyzing Netflix data to uncover trends through data visualization and statistical analysis.
Last synced: 27 Mar 2025
https://github.com/jasonsu131/cps188-term-project
A data analysis program developed in C to extract information about diabetic patients across Canada from a governmental spreadsheet available online. The program showcases summaries and averages based on the extracted data.
c data-analysis data-statictics file-reading
Last synced: 28 Mar 2025
https://github.com/mobutolakecondyle107/sql-server-ddw
🚀 Streamline data management with sql-server-ddw, a powerful tool for efficient queries and seamless integration in SQL Server environments.
backup-and-recovery data-analysis data-integration data-modeling database-management database-security performance-tuning query-optimization reporting-tools sql-scripting sql-server sql-server-express stored-procedures table-design transaction-management
Last synced: 12 Jun 2026
https://github.com/velut/thesis-sw
Software and datasets used in the "Cost-effective and Scalable Activity Matching using Crowdsourcing" thesis
bpmn cost crowdflower crowdsourcing data-analysis dataset performance-analysis plotting-algorithms r thesis
Last synced: 19 Jun 2025
https://github.com/mae776569/weratedogs-wrangling
Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations
data-analysis data-science data-visualization tweets twitter-api
Last synced: 25 Jan 2026
https://github.com/simranjeet97/covid-19
Covid-19 Data Analysis and Important Topics to be Covered to get the Impact and Solution.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/souravxbera/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 15 May 2026
https://github.com/mfakhriazhar/ecom-qtt-prediction
In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.
data-analysis data-science data-visualization e-commerce-project eda machine-learning python
Last synced: 19 May 2026
https://github.com/kenwuqianghao/scotiabank-datathon-2023
Code and data analysis done for 2023 Scotiabank Datathon
data-analysis fraud-detection jupyter-notebook python
Last synced: 18 May 2026
https://github.com/yash22222/british-airways-data-science-internship
All 2 Task Assigned By British Airways Data Science Virtual Internship Programme
csv data-analysis data-science data-visualization google-colaboratory jyputer-notebook machine-learning microsoft-excel microsoft-powerpoint python
Last synced: 16 May 2026
https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data
Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.
data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping
Last synced: 30 May 2026
https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset
This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations
business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis
Last synced: 07 Apr 2026
https://github.com/annaanastasy/classification-project-student-grades
A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.
catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling
Last synced: 29 Mar 2025
https://github.com/dina-hosny/retail-store-data-modeling-and-analysis-using-datastage
The project implements a star-schema data warehousing flow, then utilize IBM InfoSphere DataStage to develop efficient ETL pipelines to create data marts and perform some analysis on them.
data-analysis datastage datawarehousing etl extract ibm load transform
Last synced: 06 Mar 2026
https://github.com/manuelgil/vscode-data-pack
This extension pack includes the essential extensions for data analysts.
data-analysis data-science data-structures data-visualization vscode-extension
Last synced: 07 Apr 2026