Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/betkh/datascieneinpython
Jupiter Notebook files
data-analysis data-visualization
Last synced: 16 Jun 2025
https://github.com/clemence-g/heat-dome-analysis
atmospheric-science data-analysis geopotential heat-wave jupyter
Last synced: 18 Mar 2025
https://github.com/joe-stifler/llm-sig-playground
This repository is a collaborative space for MSc Earth Science students at Imperial College London to experiment with and apply Large Language Models (LLMs) to real-world Earth Science problems. Follows below the persona playground link.
data-analysis earth-science llms machine-learning research-automation
Last synced: 29 Mar 2025
https://github.com/mansiikumarii/mysql
A curated collection of MySQL scripts covering DDL, DML, and DRL operations. Ideal for beginners to practice and understand core SQL concepts.
backend data-analysis data-modeling database database-integration database-management database-performance database-schema mysql mysql-admin mysql-database orm php-mysql query-optimization rdbms sql sql-query sql-script stored-procedure
Last synced: 19 May 2026
https://github.com/qorah/vic-edu-housing-insights
Analysis of education outcomes and housing affordability in Victoria, Australia.
data-analysis jupyter-notebook
Last synced: 18 Mar 2025
https://github.com/the-pinbo/dimensionalityredux-pca-vs-autoencoders
Comparative study of PCA and Autoencoders for effective dimensionality reduction, assessed through PSNR and SSIM metrics.
autoencoder-mnist autoencoders data-analysis dimensionality-reduction image-compression mnist neural-networks pca psnr ssim
Last synced: 13 May 2025
https://github.com/julie-fliorko/rockbuster-insights-sql-project
Data analysis using PostgreSQL to help Rockbuster Stealth LLC identify revenue trends, customer insights, and rental behavior patterns.
Last synced: 22 Jul 2025
https://github.com/harshindcoder/online_retail_data_clustering_project
This marketing analytics project uses RFM (Recency, Frequency, Monetary) features for customer classification, inspired by the online retail mining paper. The RFM model helps segment customers, identify high-value ones, and optimize marketing strategies.
customer-segmentation data-analysis data-visualization market-analytics
Last synced: 17 Aug 2025
https://github.com/ebrizzzz/data-visualization-project-using-tableau
A data visualization project for the Visual Data Analysis course (Spring Term 2025) at the University of Skövde. This project explores the factors influencing national happiness scores across different global regions from 2005 to 2022.
analytics data data-analysis data-science data-visualization python regression tableau
Last synced: 16 Jun 2025
https://github.com/vigneshrocky262/powersub-demo-1434
🔧 Streamline your workflow with powersub-demo-1434, a simple tool for managing and automating tasks efficiently.
api automation coding-sandbox collaborative-tools data-analysis demo dynamic-programming machine-learning neural-networks performance-testing powersub project-management python software-development visualization
Last synced: 05 May 2026
https://github.com/jhrcook/protein-language-models
Experimenting with protein language model predictions
data-analysis protein-language-model variant-effect-prediction
Last synced: 28 May 2026
https://github.com/amishidesai04/interactive-data-visualisation-tool
A Java-based application leveraging JavaFX to create dynamic and interactive charts, including pie charts, bar charts, and line graphs. Ideal for visualizing various datasets, this tool offers customizable features and a user-friendly interface. Easily input and manage data, customize chart styles, and observe trends and patterns effectively.
charts data-analysis data-visualisation data-visualization-project gui java javafx visualization-tools
Last synced: 17 Apr 2026
https://github.com/iamsainikhil/us-births-analysis
Analysis of US-Births during 1994-2003 based on CDC-NCHS data set.
Last synced: 16 May 2026
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 19 May 2026
https://github.com/tknishh/investing-platform
An investing platform application to help users get information and analyze various foreign currency assets. The investing platform uses an ETL pipeline to insert new batches of Forex data once a day.
data-analysis investing-platform pipeline
Last synced: 18 Mar 2025
https://github.com/andrewzgheib/football-database-analysis
Football database utilizing PostgreSQL and Pandas for data management, with PowerBI for intuitive KPI visualization
data-analysis data-visualization database pandas pgsql postgr powerbi sql
Last synced: 04 Apr 2025
https://github.com/nerooc/device-downtime-detection
Repozytorium dotyczące projektu z przedmiotu "Sztuczne Sieci Neuronowe"
data-analysis detection-model recurrent-neural-networks
Last synced: 22 Mar 2025
https://github.com/timkong21/siemens-mobility-operations-industrial-engineer-simulation
Operations Industrial Engineer job simulation with Siemens Mobility. Includes time study analysis to identify assembly bottlenecks (Task 1) and a proposed layout redesign to improve efficiency without automation (Task 2).
data-analysis forage industrial-engineering job-simulation manufacturing process-improvement production-engineering python siemens time-analysis
Last synced: 19 May 2026
https://github.com/lopez86/datascienceexamples
Examples of various data science & data analysis topics using various sources of data.
data-analysis data-science pandas scikit-learn tutorial visualization
Last synced: 13 Apr 2026
https://github.com/sharduljunagade/human-activity-recognition
This repository contains the code for the Assignment-1 of the course ES 335: Machine Learning 2024 at IIT Gandhinagar taught by Prof. Nipun Batra.
data-analysis data-collection decision-trees groq-api human-activity-recognition jupyter langchain-python machine-learning pandas prompt-engineering python sklearn tsfel
Last synced: 08 Apr 2026
https://github.com/drisskhattabi6/exploratory-data-analysis-projects
This Repo contains My Exploratory Data Analysis Projects for many datasets
data-analysis data-preprocessing data-visualization datasets diabetes-prediction eda exploratory-data-analysis iris-dataset
Last synced: 26 Jun 2025
https://github.com/brevex/hotel-booking-demand-data-analysis
Data analysis in Python of demand for urban hotels and resorts showing their causes and relationships
data-analysis data-science hotel-booking-analysis kaggle python
Last synced: 08 May 2026
https://github.com/nagar2nd/zomato-bangalore-analysis-tableau
Analysing restaurant data in Bengaluru to enhance customer satisfaction by optimizing the restaurant experience. The focus is on improving the popularity of different cuisines, enhancing delivery times, and boosting restaurant ratings. An interactive Tableau dashboard has been developed to help Zomato identify key areas for improvements.
data-analysis data-visualization tableau
Last synced: 05 Mar 2026
https://github.com/shubhamgoyal575/credit-card-fraud-detection
📌 Credit Card Fraud Detection using Machine Learning This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.
adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost
Last synced: 08 Feb 2026
https://github.com/nehul1149/olympic-data-analysis
This project is an interactive data visualization and analytics platform for exploring historical Olympic Games data. Built with Python and Streamlit, it offers an in-depth analysis of medal tallies, athlete statistics, and country-wise performance trends, providing users with powerful insights into the world's biggest sporting event.
analysis data-analysis data-science data-visualization matplotlib python streamlit
Last synced: 18 May 2026
https://github.com/swatisinghit/e-commerce-trend-analysis-for-target
An exploratory and in-depth study of the E-Commerce sales data for a Brazilian store using SQL.
bigquery data-analysis mysql sql
Last synced: 19 May 2026
https://github.com/amarlearning/exploring-the-evolution-of-linux
Data Analysis about the development of the Linux operating system by exploring its Git repository history.
cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux
Last synced: 12 May 2026
https://github.com/imnotamr/datasets-used
A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics
ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models
Last synced: 19 May 2026
https://github.com/mulukensholaye/spark_kafka_streaming_csv
Real-time streaming data analysis pipeline with integrating apache spark's streaming library to read records from kafka topic
apache-kafka apache-spark data-analysis python3 realtime-messaging
Last synced: 19 May 2026
https://github.com/syed-amjad-ali/airbnb-listing-analysis
Analyzing AirBnB listings in Paris to determine the impact of recent regulations
business-intelligence data-analysis jupyter-notebook maven-analytics python
Last synced: 19 May 2026
https://github.com/hawmex/aut_data_and_information_analysis_project
This repository contains the files of my project for the "Data & Information Analysis" course at AUT (Tehran Polytechnic).
data-analysis data-science k-means outlier-detection python
Last synced: 19 May 2026
https://github.com/devexpress-examples/wpf-pivotgrid-how-to-display-underlying-data
This example demonstrates how to obtain the records from the control's underlying data source for a selected cell or multiple selected cells.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 19 May 2026
https://github.com/samir-atra/share-lm_dataset_analysis
Analysis, studies and optimizations on the ShareLM extension dataset
data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas
Last synced: 19 May 2026
https://github.com/thecoderpinar/globalwarmingforecast
🌍 Global Warming Forecast Tool An advanced tool for analyzing and forecasting climate trends using ARIMA and Prophet models, with interactive visualizations and scenario simulations.
arima climate-change data-analysis environmental-science forecasting global-warming machine-learning prophet streamlit time-series-analysis visualization
Last synced: 27 Mar 2025
https://github.com/prakshal0809/sql-data-analysis-project
This project involves analyzing pizza sales data using SQL to address various data analysis questions, providing essential foundational to advanced SQL knowledge.
Last synced: 26 Jun 2025
https://github.com/tusharpandey003/data-science
Data science include Data Analysis, Machine learning , EDA,PCA and Data Structure and Algorithms
algorithms algorithms-and-data-structures data-analysis data-analytics data-cleaning data-science data-structures data-visualization dsa kmeans-clustering machine-learning outlier-detection pca pca-analysis
Last synced: 13 Mar 2025
https://github.com/borjamome/radiografia-madrid
Análisis de Población, Economía y Sociedad de Madrid con R.
data-analysis data-visualization madrid r
Last synced: 17 Jun 2025
https://github.com/bjornmelin/minneanalytics
MinneAnalytics project work.
competitive-programming data-analysis data-visualization r
Last synced: 09 Jul 2025
https://github.com/sondosaabed/data-visualization-in-tableau
data-analysis data-visualization nanodegree plot tableau udacity
Last synced: 08 Sep 2025
https://github.com/sukhitashvili/pca_tutorial
PCA algorithm from scrach, using only matrix-vector multiplications
data-analysis data-science data-visualization machine-learning-algorithms pca
Last synced: 29 Mar 2025
https://github.com/samukiszhsd/alteryx-analytics
Você está trabalhando com dados de transações bancárias do Itaú e precisa fazer algumas análises para ajudar o time de auditoria a detectar padrões incomuns e possíveis transações suspeitas.
alteryx data-analysis data-structures data-visualization etl workflow
Last synced: 18 Feb 2026
https://github.com/prady2309/stock-analysis
Analysis on the stock prices of Apple, Google, Microsoft and Amazon
data-analysis data-science data-visualization python stock-market
Last synced: 19 May 2026
https://github.com/darshan1924/house-price-pridiction
This repository contains a machine learning project for predicting house prices based on various features, including geographical coordinates. The project includes data preprocessing steps to handle# House Price Prediction Project
data-analysis data-preprocessing house-prices jupyter-notebook machine-learning prediction
Last synced: 27 Mar 2025
https://github.com/eve-ning/ppshift
Analyzes maps and scores from 2015
data-analysis data-mining osu osugame
Last synced: 13 Feb 2026
https://github.com/saroshfarhan/irish_hospital_data_anaysis
Irish hospital's patient discharge data for four counties analysis
data-analysis data-science data-visualization healthcare irish-data r-programming-language
Last synced: 18 Feb 2026
https://github.com/mosalem149/pythonutilities
A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.
data-analysis educational-tools file-io file-manipulation grade-calculation python text-analysis text-processing utility word-counting
Last synced: 15 May 2026
https://github.com/sebastianurdaneguibisalaya/colocaciones-de-credito-fondo-mivivienda-peru
Exploro las Colocaciones de Crédito del Fondo MIVIVIENDA S.A. entre 2018 y 2022, con un conjunto de datos descargado del Portal Nacional de Datos Abiertos del Perú. 🏠
data-analysis jupyter-notebook python
Last synced: 24 Feb 2025
https://github.com/ishitaagl20/nyc-taxi_trip_prediction
Taxi Trip Duration Prediction Using the NYC Dataset
data-analysis data-exploration data-visualisation decision-trees matplotlib nyc-taxi-dataset python3 random-forest seaborn xgboost
Last synced: 19 May 2026
https://github.com/spshah1701/world-development-indicators
Analysis of World Development Indicators (WDI) using big data technologies, specifically Databricks, Apache Spark, and Scala.
apache-spark big-data data-analysis spark-sql
Last synced: 17 Mar 2025
https://github.com/jillie-wink/sql-portfolio
SQL Data Analysis Projects
data-analysis data-manipulation portfolio sql sqlite
Last synced: 02 Jan 2026
https://github.com/twistedfrost/best-of-ml-python
Explore the best machine learning libraries in Python. Stay updated with weekly rankings and contributions. Join the community! 🐙🌟
airport airport-simulation awesome breast-cancer-prediction data-analysis data-science data-visualization decision-tree-classifier deep-learning gpt jax nlp random-forest-classifier scikit-learn svm-classifier transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 26 Jun 2025
https://github.com/parthkumarmpatel/sql-exploratory-data-analysis
SQL EDA scripts for sales data warehouse — metrics, insights, and rankings from my data warehouse project.
data-analysis exploratory-data-analysis sql-server
Last synced: 26 Jun 2025
https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics
Cleaned, performed EDA and stored data in MySQL. Queried, and analyzed data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.
data-analysis data-visualization eda powerbi python sql
Last synced: 21 May 2026
https://github.com/adeebkhan25/dataset_suicide_susceptible
The "Student Suicide Risk Factors Dataset" is a comprehensive collection of data aimed at understanding and mitigating the factors contributing to student suicides.
data-analysis dataset machine-learning supervised-learning
Last synced: 24 Dec 2025
https://github.com/lulloooo/bizdata-nexus
Collection of my Business & Data Analysis projects, from professional/academic endeavors to passion-driven explorations 📊
business-analysis data-analysis economics etl excel finance mysql python r risk-analysis
Last synced: 05 Apr 2026
https://github.com/alimiheb/advwokcube-analysis
A comprehensive SSAS cube project based on AdventureWorksDW2019, featuring data cleaning, multidimensional modeling, and visualizations in Power BI and Excel.
adventureworks data-analysis excel powerbi sql-server ssas-multidimensional visualization
Last synced: 26 Jun 2025
https://github.com/ljadhav25/knn-algorithm-data-science-
This repository contains a project demonstrating the implementation and application of the K-Nearest Neighbors (K-NN) algorithm in Data Science. The objective is to provide a comprehensive understanding of the K-NN algorithm, including data preprocessing, model training, evaluation, and visualization of results. This project is ideal for beginners
data-analysis data-science knn-classification machine-learning matplotlib-pyplot numpy pandas-library seaborn
Last synced: 16 Apr 2026
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of 100's of IP Address and using Machine Learning algorithm for detecting threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/nivasharmaa/friskwatch
A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.
data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data
Last synced: 19 May 2026
https://github.com/gbikram/python-data-analysis
The Counted
data-analysis matplotlib python
Last synced: 10 Jul 2025
https://github.com/mindlessmuse666/iris-knn
Проект демонстрирует применение алгоритма k-ближайших соседей (KNN) для классификации набора данных Iris. Включает загрузку данных, обучение модели, оценку производительности и визуализацию результатов с использованием библиотек Pandas, Scikit-learn, Matplotlib, Seaborn и Plotly.
algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn
Last synced: 17 Aug 2025
https://github.com/shellynagar27/marketing-content-performance-analysis
Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.
data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing
Last synced: 26 Jun 2025
https://github.com/roma-glushko/magechurn
churn-analytics data-analysis data-science
Last synced: 06 Apr 2025
https://github.com/kevin-rsj/sectores_economicos_covid-19
Análisis Exploratorio de Datos (EDA): Comportamiento de Sectores Económicos antes, durante y después de la Pandemia de COVID-19 (2019-2022)
data-analysis financial-analysis pandemic-analysis python stock-market time-series visualization yahoo-finance
Last synced: 20 May 2026
https://github.com/ryuzen6/kaggle-series
This is a series of Machine Learning/Deep Learning Models made for practice.
artificial-intelligence data-analysis data-science deep-learning machine-learning python3
Last synced: 20 May 2026
https://github.com/evamaerey/ma206distributions
data-analysis data-science ggplot2 statistics
Last synced: 22 Jul 2025
https://github.com/astrojarhead/irafscripts
IRAF cl scripts
astronomy data-analysis image-processing iraf scripts
Last synced: 12 Jan 2026
https://github.com/badranalyst/restaurant-reviews-sentiment-analysis-nlp-case-study
This project analyzes restaurant reviews using Natural Language Processing (NLP) for sentiment analysis. It covers data exploration, pre-processing (NLTK text cleaning), model building, prediction, and deployment. The goal is to predict sentiment from reviews using Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.
data-analysis data-science eda exploratory-data-analysis matplotlib-pyplot model model-building numpy pandas pre-processing predictive-modeling python seaborn
Last synced: 13 Apr 2026
https://github.com/clchinkc/zombie
Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker
algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation
Last synced: 05 Apr 2026
https://github.com/sharoonjoseph321/e-commerce-eda
Data Analysis on E-commerce ,using pandas, python, matplotlib.
data-analysis data-science data-science-projects data-visualization jupyter-notebook matplotlib pandas pandas-dataframe pandas-python python
Last synced: 06 Apr 2025
https://github.com/sharoonjoseph321/samsung_stock_prediction
Predicting future price of Samsung stock, using machine learning , scikit learn and pandas
algorithms data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms prediction predictive-analytics predictive-modeling python stock-price-prediction supervised-learning
Last synced: 06 Apr 2025
https://github.com/techshot25/graduateadmissions
Looking at the probability of being accepted in a graduate program using a machine learning model
bayesian-regression correlation-matrices data-analysis data-science linear-regression machie-learning random-forest-regression regression ridge-regression
Last synced: 25 Feb 2025
https://github.com/balajimohan18/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
classification-model data-analysis data-analytics data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning jupyter-notebook maachine-learning machine-learning-algorithms predictive-modeling python supervised-learning
Last synced: 20 May 2026
https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard
This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.
business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi
Last synced: 12 Jan 2026
https://github.com/mindlessmuse666/apartment-price-predictor
Python-проект по прогнозированию стоимости аренды квартир с помощью линейной регрессии. Практическая работа по теме: "Основы машинного обучения" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
apartment-price-prediction data-analysis data-science linear-regression linear-regression-models machine-learning matplotlib python regression sklearn unit-testing
Last synced: 11 Apr 2026
https://github.com/yuvrajsaraogi/uber-data-analysis-using-machine-learning
This repository contains Uber Data Analysis using various Machine learning Algorithms
data-analysis data-science exploratory-data-analysis linear-regression logistic-regression machine-learning random-forest uber-data-analysis
Last synced: 24 Aug 2025
https://github.com/kiran-kumar-k3/sales-performance-dashboard
The Sales Performance Dashboard is an interactive Python-based web application that visualizes and analyzes sales data, providing actionable insights through dynamic charts and metrics.
data-analysis python streamlit
Last synced: 20 May 2026
https://github.com/emmarhoffmann/analysis-of-student-debt-among-first-generation-college-students
Explores the financial landscape of first-generation college students, analyzing patterns in student debt based on factors like median income, net price of attendance, and enrollment size.
data-analysis first-generation-college-students r statistical-models
Last synced: 17 Mar 2025
https://github.com/archanakokate/bank_term_deposit_prediction
Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.
data-analysis data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Sep 2025
https://github.com/emmarhoffmann/analysis-of-california-real-estate-market-factors-influencing-home-prices
Investigates how home size, number of bedrooms, and bathrooms influence home prices, with comparisons across California, New York, New Jersey, and Pennsylvania.
data-analysis r real-estate statistical-models
Last synced: 17 Mar 2025
https://github.com/janashanaa/flightanalysis
This Jupyter Notebook presents an exploratory data analysis of data derived from a flight booking website.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 15 May 2026
https://github.com/gui-sitton/carsells
In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 20 May 2026
https://github.com/karanch10/fraudshield
FraudShield is a machine learning credit card fraud detection system that analyzes transaction attributes to identify suspicious activities in real time. Built with Python, SQL, and Django, it provides a user-friendly interface for fraud prediction using OpenBanking APIs and advanced detection techniques. Ideal for businesses and individuals.
data-analysis data-science data-visualization machine-learning python3
Last synced: 20 May 2026
https://github.com/jiteshshelke/codsoft
A repository showcasing three machine learning projects—Titanic Survival Prediction, Movie Rating Prediction, and Iris Flower Classification—completed during CodSoft's Data Science Internship. 🚀
codsoft codsoftinternship data-analysis data-science linear-regression logistic-regression machine-learning machine-learning-algorithms python
Last synced: 20 May 2026
https://github.com/tabibyte/azerbaijani-rapper-lyrics-data-analysis
Lyrics Data Analysis of Azerbaijani Rappers
azerbaijan data-analysis rappers
Last synced: 22 Jul 2025
https://github.com/chingu-voyages/v47-tier3-team-30
An easily accessible tool for calculating electricity-related carbon emissions, along with insights for reducing environmental impact. | Voyage-47 | https://chingu.io/ | Twitter: https://twitter.com/ChinguCollabs
carbon-emissions carbon-footprint data-analysis data-engineering data-science
Last synced: 10 May 2026
https://github.com/patricksferraz/aqw-madrid-data-analysis
Interactive analysis and visualization of Madrid's air quality and weather data (2001-2016) using Python, Dash, and Jupyter. Features interactive maps, statistical analysis, and data visualization tools.
air-quality dash data-analysis data-engineering data-science data-visualization data-wrangling environmental-data environmental-science interactive-dashboard jupyter jupyter-notebook madrid open-data pandas plotly python statistical-analysis time-series weather-data
Last synced: 30 Jan 2026
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/satyacoder29/comparison-of-region-based-sales-tableau
The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.
data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions
Last synced: 02 Feb 2026
https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra
Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.
cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python
Last synced: 11 Apr 2026
https://github.com/iwasakiyuuki/data-analysis-platform-airflow-dag
A collection of Airflow DAGs for automating data collection into our on-premises data analysis platform.
airflow airflow-dags data-analysis data-collection
Last synced: 13 May 2025
https://github.com/steviecurran/prediction-plot
Code to performs machine learning (k-nearest neighbours regression) and plot the predicted versus measured values
astrophysics c data-analysis high-redshift machine-learning pgplot python statistics tensorflow visualization
Last synced: 20 May 2026
https://github.com/sciencesar-labs/py485-final-project
ROOT-based muon data analysis using Python & Jupyter – final project for PY485E @ CERN
cern computational-physics data-analysis jupyter-notebook muons python root uproot
Last synced: 15 May 2026
https://github.com/nikitalpopov/news
v semester project
data-analysis data-science python scikit-learn
Last synced: 20 May 2026
https://github.com/jonek/pv-city-mastr
Extract and analyze data about photovoltaic systems in Germany
data-analysis germany jupyter-notebook pandas photovolatic-power photovoltaic
Last synced: 11 May 2026