Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/shridhar1504/loan-clustering-datascience-project
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning
Last synced: 08 May 2026
https://github.com/guglielmo/datalab-notebooks
Data analysis at openpolis
data-analysis data-science jupyter-notebooks pandas python3
Last synced: 08 May 2026
https://github.com/nickchristopherson/duluth-tourism-analysis
End-to-End Data Pipeline for Tourism Industry Analysis
data-analysis data-visualization duluth economic-analysis jupyter pandas pdf-extraction python tourism
Last synced: 08 May 2026
https://github.com/mgimond/meteo_waterville
Waterville (Maine) meteorological data
data-analysis data-science exploratory-data-analysis meteorology r
Last synced: 24 Jan 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materiály ku kurzu Jazyk SQL 1 pre Analytikov a Dátových Vedcov
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/allanotieno254/us-largest-companies-by-revenue-web-scraping
A Python project for web scraping and analyzing the largest companies in the United States by revenue from Wikipedia
automation beautifulsoup csv data-analysis data-cleaning data-execution data-extraction pandas python web-scraping
Last synced: 08 May 2026
https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries
🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈
ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics
Last synced: 21 Jun 2026
https://github.com/rogernet/desafio-profissional-produto-data-driven
Ajudar a formar Analistas de Produto, PMs e Gestores de Negócio capazes de tomar decisões estratégicas baseadas em dados.
data-analysis data-science data-visualization product
Last synced: 23 Jun 2026
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/leosimoes/udacity-starbucks
Project 3 of the Udacity Machine Learning Engineer Nanodegree Program. Data analysis and machine learning application to Starbukcs data.
aws-iam aws-s3 aws-sagemaker data-analysis data-science machine-learning python
Last synced: 24 Mar 2025
https://github.com/sustentarea/gs-data-analysis-report-3
📓 Exploring potential associations between childhood undernutrition and the Standardized Precipitation Evapotranspiration Index (SPEI) in Brazilian municipalities (2008–2019)
brazil climate-change data-analysis data-science food-systems global-syndemic ibge malnutrition nutrition obesity r rstats sisvan spei sustainable-eating wasting worldclim
Last synced: 27 Oct 2025
https://github.com/shrawans007/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer's Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments hotel-service ms-excel ms-excel-data-analytics pivot-tables sentiment-analysis tableau tableau-public text-reviews treemap
Last synced: 22 Mar 2025
https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2
Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.
dashboard data data-analysis dax-measures excel powerbi powerbidashboard
Last synced: 23 Jan 2026
https://github.com/rubinlake/rl-academy-data-analytics
Educational data analysis project demonstrating BMW sales data analysis with AI-powered code assistance using Cursor IDE and Jupyter notebooks
cursor-ide data-analysis educational-project jupyter langchain matplotlib numpy pandas python scipy seaborn
Last synced: 09 May 2026
https://github.com/jabhij/fbi_nics-firearm-background-checks
This project is a try to showcase the use of guns across the US.
data-analysis data-analytics data-science data-visualization tableau
Last synced: 23 Feb 2026
https://github.com/dogan-the-analyst/developer_survey_analysis
Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.
data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python
Last synced: 09 May 2026
https://github.com/ipanalytics/vpn-provider-overlap-intelligence
Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.
anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection
Last synced: 25 May 2026
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
recognizing handwritten numbers & comparing the Life Expectancy vs Fertility in 1960 & 2013 of regions
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 12 Apr 2026
https://github.com/viper373/163-buff
爬取网易BUFF平台CS:GO武器皮肤交易数据
163 arima crawler-python csgo data-analysis prediction python
Last synced: 24 Oct 2025
https://github.com/hordiales/redpanal-db-analysis
Analysis of the RedPanal.org music database
creative-commons data-analysis dataset etl machine-learning music music-information-retrieval statistical-analysis
Last synced: 10 Mar 2025
https://github.com/5ekastanx/data-analysis
Extracting data from parsing, for example, like hacking using Python using all sorts of function methods
Last synced: 14 Mar 2025
https://github.com/ayaanjawaid/google_playstore_data_analysis
This project provides an in-depth analysis of Google Play Store apps and user reviews, focusing on understanding app performance, user sentiment, and key trends in app categories. Using Python, I performed data cleaning, feature engineering, and exploratory data analysis (EDA) on app data and reviews.
data-analysis eda html numpy pandas-dataframe plotly python vizualisation
Last synced: 24 Feb 2026
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/datasciencelovers/ai-financial-market-data-analysis
Analyse Financial Market Data of AI companies with Python
ai artificial-intelligence big-data-analytics chatgpt data-analysis data-analytics data-science data-visualization financial-analysis gemini google llama machine-learning market-data-analysis matplotlib-python meta openai pandas-python python
Last synced: 05 May 2026
https://github.com/rkirlew/workoutrecommendationsdataset
This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.
classification data-analysis dataset k-nearest-neighbours machine-learning
Last synced: 08 May 2026
https://github.com/gallillio/unsupervised_clustering_music_recommendation_system
Music Recommendation System using Unsupervised Machine Learning Clustering Methods using K-Means, Fuzzy C Mean DBSCAN, Gaussian Mixture Model, BIRCH and Agglomerative Clustering
affinity-propagation agglomerative-clustering birch-clustering data-analysis data-visualization dbscan-clustering fuzzy-cmeans-clustering gaussian-mixture-models k-means-clustering pca unsupervised-machine-learning
Last synced: 19 Oct 2025
https://github.com/christos99/scraping-project
This project is a Python-based tool for web scraping with a user-friendly GUI. Built with PyQt5 and Selenium, it allows users to scrape online listings by specifying keywords, price ranges, and exclusions. Results are displayed in a table and can be exported to an Excel file.
automation data-analysis excel gui openpyxl pandas pyqt5 python selenium web-scraping
Last synced: 10 May 2026
https://github.com/gabrielmpinho/cs50-sql
Solutions and notes from CS50’s Introduction to Databases with SQL. Covers CRUD operations, data modeling, normalization, joins, views, indexes, and connecting SQL with Python and Java. Begins with SQLite for portability and introduces PostgreSQL and MySQL for scalability.
data-analysis data-structures data-visualization database databases javascript python sql
Last synced: 10 May 2026
https://github.com/phillbertnevinemmanuel/automotivesalesdataanalysis
This marks my inaugural venture into personal data analysis, employing SQL and Python for Correlation Analysis. I've sourced the dataset from Kaggle, specifically focusing on automotive sales. You can find the dataset linked on my website below. I'm excited to share that I've independently managed the majority of tasks involved in this project.
data-analysis dataset microsoft-sql-server python python-lambda sql ssms tsql
Last synced: 14 Mar 2026
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 14 Apr 2026
https://github.com/guruakaashjn/te_project_microsoft_ai
AI based statistical analysis of land-use plastic pollution in India using AI/ML techniques.
artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi
Last synced: 27 Feb 2025
https://github.com/jinkogule/multi-analyst
O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 12 Apr 2026
https://github.com/kyleprotho/analysistoolbox
Analysis Tool Box (i.e. "analysistoolbox") is a collection of tools in Python for data collection and processing, statisitics, analytics, and intelligence analysis.
analytics data-analysis open-source-intelligence python3 r research snippets statistics
Last synced: 22 Aug 2025
https://github.com/shridhar1504/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.
dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides
Last synced: 21 Jan 2026
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/easycris-software/easycris
Professional statistical analysis and RNA-seq for researchers — no coding required
anova bioinformatics data-analysis desktop-app genomics pharmacology research-tools rna-seq statistics tauri
Last synced: 11 May 2026
https://github.com/aaryan-agr/canadian-energy
This project analyzes Canada's energy trade, focusing on imports, exports, and market trends in the energy sector.
data-analysis data-cleaning data-manipulation data-processing data-science data-vizualisation energy-sector time-series-analysis
Last synced: 10 Jun 2025
https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-
# 🎧 Top Hits Spotify (2000 - 2019)Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻
analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits
Last synced: 01 Jul 2025
https://github.com/boardgameanalytics/bga-notebooks
Exploratory notebooks using the BGG dataset
data-analysis data-exploration data-visualization ipython-notebook python
Last synced: 28 Jan 2026
https://github.com/prime-infinity/type-one
Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues
data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/rkolehov/retail-sales-analysis-project
End-to-end e-commerce analysis showcasing SQL and data visualization skills. Tracks sales, customer behavior, product performance, and delivery efficiency. Interactive dashboards provide actionable insights for business decision-making
analytics dashboard data-analysis ecommerce jupyter-notebook postgresql python sql tableau vscode
Last synced: 19 Apr 2026
https://github.com/raad07/sql_project-world_layoffs_dataset
This is a SQL project which comprises the Data Cleaning in the first part and Exploratory Data Analysis (EDA) in the second part.
data-analysis database mysql sql
Last synced: 27 Jan 2026
https://github.com/cr-mao/machine-learning
机器学习笔记
data-analysis data-handling machine-learning math numpy pandas
Last synced: 12 Nov 2025
https://github.com/an1mch1k-theone/project_2_hh_analyze
Анализ вакансий из HeadHunter
data-analysis data-analysis-project postgresql python sql
Last synced: 14 Apr 2026
https://github.com/thatsinewave/radiosonde-data-analyzer
A web-based tool for visualizing and analyzing radiosonde flight data from log files generated by the Radiosonde Decoder by 9A4AM
data-analysis data-analytics data-visualization good-first-contribution good-first-issue good-first-issues good-first-pr good-first-pr-first-contribution good-first-project good-first-prs good-practices html-css-javascript html-css-js radiosonde radiosonde-hunting radiosondes rtl-sdr sdr sdr-tool thatsinewave
Last synced: 15 Oct 2025
https://github.com/listiangr/ecommerce_sales_data_analysis
Proyek ini menganalisis data penjualan e-commerce untuk membantu bisnis memahami tren penjualan, performa produk, dan segmen pelanggan. Tujuan utamanya adalah memberikan wawasan yang dapat meningkatkan strategi pemasaran dan pengelolaan produk.
dashboard data-analysis data-cleaning data-collection data-penjualan data-visualization exploratory-data-analysis microsoft-excel
Last synced: 19 Jan 2026
https://github.com/virajbhutada/google-stock-price-forecasting-lstm
Analyzing and predicting Google's stock prices through detailed data exploration and advanced LSTM models. This project involves data preprocessing, creating time-series sequences, constructing and training LSTM networks, and evaluating their performance to forecast future stock prices utilizing Python and Machine Learning libraries.
data-analysis data-science data-visualization future-prediction google-dataset google-stock-price-prediction google-stocks lstm-model lstm-neural-network machine-learning machine-learning-models matplotlib model-building model-training numpy python stock-forecasting
Last synced: 27 Feb 2025
https://github.com/savinrazvan/heredity
An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.
ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python
Last synced: 27 Feb 2025
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/iguptashubham/pizzahut-analysis-sql
best dataset for data analysis. Pizzahut data analysis done by Shubham Gupta in MySql. This dataset is provided by friend of mine intern at pizzahut. In pizzahut, they used this dataset to train and ask question. This data does not reveal anything about the pizzahut. It is safe to share. data
data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 May 2026
https://github.com/renatomaynard/statistical-modeling-and-regression-analysis-life-expectancy
Statistical Modeling and Regression Analysis for Life Expentancy
data-analysis healthcare linear-regression machine-learning predictive-modeling r regression-analysis statistical-models statitics
Last synced: 23 Mar 2025
https://github.com/cworld1/novel-analysis
A simple project for analyzing Chinese novels
Last synced: 17 Mar 2025
https://github.com/luabagg/worldwide-trends
Worldwide Google Trends visualization and classification
data-analysis data-visualization google-trends trends
Last synced: 03 Feb 2026
https://github.com/cdeweyx/game-of-thrones-s7e1-eda
Exploratory data analysis of scraped tweets related to Game of Thrones S7E1
data-analysis data-visualization python twitter-api
Last synced: 26 Apr 2026
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 26 Apr 2026
https://github.com/vansh-py04/data-extraction-and-text-analysis
The objective of this assignment is to extract textual data articles from the given URL and perform text analysis to compute variables that are explained
data-analysis data-extraction data-science nlp nlp-machine-learning python textanalysis webscraping
Last synced: 24 Apr 2026
https://github.com/webuccinoco/mysql-pivot-tables
Build complex MySQL pivot tables without touching a single line of code. This free PHP tool lets you visually connect your database and map out your data sources with a few simple clicks.
business-analytics business-intelligence crosstab data-analysis data-analytics data-visualization mysql mysql-database mysql-pivot-table mysql-reports mysql-virtualization php php-pivot-table php-reports pivot-tables reporting-tools
Last synced: 04 Feb 2026
https://github.com/leosimoes/uerj-tcc-analisador-dados
Trabalho de conclusão de curso (TCC) em Engenharia de Computação. Aplicativo Web para preparação e análise de dados, criação de gráficos e modelos de regressão linear e logistica.
computer-engineer data-analysis data-science data-visualization linear-logistic linear-regression python streamlit
Last synced: 24 Apr 2026
https://github.com/sivkri/perseus-ms-proteomics-venn
Mass spectrometry Perseus Data analysis
data-analysis mass-spectrometry perseus proteomics proteomics-data proteomics-data-analysis proteomics-data-integration
Last synced: 14 Apr 2026
https://github.com/akarshankapoor7/tensorflow_tutorial
This is an easy and fast tutorial for tensorflow. In data science, TensorFlow is an open-source machine learning framework by Google. It's used for building and training machine learning and deep learning models.
data-analysis data-science deep-learning machine-learning tensorflow
Last synced: 27 Apr 2026
https://github.com/mr-chang95/loan_data_visualization
Data Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook loans python udacity-data-analyst-nanodegree
Last synced: 24 Apr 2026
https://github.com/adrija-debnath/ideas-isi-data-science-internship
Topic of the Project - Predictive Maintenance Analysis, Data Science Internship at IDEAS - Institute of Data Engineering, Analytics and Science Foundation Technology Innovation Hub at Indian Statistical Institute, Kolkata.
data-analysis data-science predictive-analytics predictive-maintenance streamlit
Last synced: 27 Apr 2026
https://github.com/datavil/framex
A light-weight, dataset obtaining library for fast prototyping, tutorial creation, and experimenting.
data-analysis data-fetching data-science dataframe datasets visualization
Last synced: 06 Jun 2026
https://github.com/luminati-io/airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 04 Jan 2026
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 18 Mar 2025
https://github.com/tawfikhammad/data-analysis-projects
Data visualization and analysis
data-analysis data-science data-visualization matplotlib plotly seaborn
Last synced: 14 May 2026
https://github.com/sathyasris27/time-series-and-spectral-analysis-
The aim of this project involves the analyses the data, removing trends and seasonal effects, identifying the underlying process, understanding the dominant frequencies, and using the residuals to make predictions.
data-analysis data-visualization forecasting r spectral-analysis time-series-analysis
Last synced: 07 Jun 2026
https://github.com/jongan69/potion-leaderboard
Start of Entry for potion leaderboard contest
data-analysis leaderboard potion trading
Last synced: 11 Jun 2026
https://github.com/jpotter80/notebook-examples
This repository demonstrates a systematic approach to cleaning and standardizing e-commerce product data using DuckDB. The notebook serves as a detailed walkthrough of our data cleaning methodology, showcasing how we handle common data quality challenges in e-commerce datasets.
data-analysis data-cleaning jupyter-notebook
Last synced: 12 Jun 2025
https://github.com/alxrm/scent-of-literature
Russian literature sentiment analysis in terms of very small dataset
classification data-analysis sentiment-analysis sklearn tf-idf
Last synced: 28 Apr 2026
https://github.com/roboto-ai/robologs-px4-actions
A collection of actions for working with PX4 data
data-analysis data-processing data-transformation drones px4 px4-logs px4-ulog robotics sensors
Last synced: 04 Feb 2026
https://github.com/whitehathackerpr/data-visualization-tool
This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations
data data-analysis data-visualization python python3
Last synced: 05 Sep 2025
https://github.com/allanotieno254/codsoft
This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.
classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression
Last synced: 15 May 2025
https://github.com/vatshayan/hospital-discharge-analysis
Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc
data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn
Last synced: 04 Mar 2025
https://github.com/sivas-2/food-demand
This project aims to predict food demand based on historical data, leveraging various statistical methods to achieve accurate forecasts.
data-analysis data-science dataanalysis food-demand-forecasting statistics
Last synced: 12 Aug 2025
https://github.com/gauranshgoel123/predictive-demand-analysis
Demand Forecasting Project A web application for predicting future demand for part numbers based on historical data. Built with React for the frontend and FastAPI with Python for the backend, this application visualizes demand trends and allows users to input additional data for improved accuracy. In render analyzer is frontend analysis is backend
chartjs data-analysis data-science data-visualization dataset deployment full-stack machine-learning numpy pandas predictive-analysis prophet-model python reactjs render
Last synced: 13 Apr 2026
https://github.com/billy-enrizky/yelpfusion
Finding All restaurants in the Maryland area using YelpFusion API
data-analysis pandas yelp-api yelpfusion
Last synced: 28 Apr 2026
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 06 Oct 2025
https://github.com/sanam2405/chatinfo
Analysing the WhatsApp Chat with my crush over a 6M period
data-analysis data-visualization python
Last synced: 27 Apr 2026
https://github.com/freebirdscrew/dataanlaysis_and_datasets
Data Analysis on the Datasets that are Provided by the Govt., Kaggle and Other Data Source Providers.
data-analysis data-science datanalysis datasets deep-learning govt kaggle kaggle-competition kaggle-dataset kaggledatasets machine-learning machine-learning-algorithms neural-networks
Last synced: 18 Apr 2026
https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection
The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.
banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier
Last synced: 06 Oct 2025
https://github.com/adnanrahin/apache-spark-complete-reference
This repository reflects on all the necessary steps to take before jump in into Big Data.
big-data data-analysis data-science kaggle-dataset machine-learning rdd scala spark
Last synced: 29 Apr 2026
https://github.com/randomshek/Working-With-Excel
Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot
data-analysis excel power-pivot power-query
Last synced: 20 Jul 2025
https://github.com/shsiddhant/memory.fm
A Python library, CLI tool, and web-based dashboard for exploring music listening history from Last.fm and Spotify.
analytics data-analysis data-visualization memories music
Last synced: 04 Apr 2026
https://github.com/pradipece/weather_forecast_data_analysis
Using decision trees and random forest algorithms to solve real-world data analysis. "sklearn_decision_trees_random_forests"
data-analysis data-science data-visualization git github python python3
Last synced: 19 Apr 2026
https://github.com/kentlouisetonino/sw-project-data-analysis
My project for AMA MATH 6200 course.
data-analysis python school-project
Last synced: 28 Feb 2025
https://github.com/vijayjoshi16/credit-card-fraud-detection-using-ml-in-python
Credit Card Fraud Detection Using ML in Python
data-analysis jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python regression seaborn
Last synced: 17 Apr 2026
https://github.com/tynoee/record_company-database
A record company database with multiple query commands using SQL
Last synced: 31 Jan 2026
https://github.com/oliverfanderson/quarto-portfolio
My data science portfolio website. Made with Quarto in R.
data-analysis data-engineering data-science data-visualization database html portfolio-website r yaml
Last synced: 05 Mar 2026
https://github.com/prajakta1321/kaggle-ai-report-2023
A Report describing the trends in emergence of AI over the years !
data-analysis data-visualization python3
Last synced: 28 Jun 2025
https://github.com/paezha/isdas
Companion package for An Introduction to Spatial Data Analysis and Statistics with R
data-analysis gis rstats spatial-analysis spatial-statistics
Last synced: 04 Jan 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/aravindnathan02/sales-and-customer-analytics
This is a repository for sales and customer performance Tableau dashboard.
customer-dashboard dashboard data-analysis data-visualization sales-analysis sales-dashboard tableau
Last synced: 08 Jan 2026
https://github.com/luochang212/weibo-analysis
Data analysis based on sina weibo.
Last synced: 03 Apr 2026
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 27 Feb 2025
https://github.com/airdac/sim-telco_customer_churn
Prediction of customer churn with logistic regression in R. Team project from UPC's Master's Degree in Data Science
classification data-analysis data-science logistic-regression r statistical-models upc
Last synced: 28 May 2026
https://github.com/arielle0222/data_analysis
📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.
autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin
Last synced: 30 Apr 2026