Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/priyanshubiswas-tech/e-commerce_data_analysis
Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.
data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python
Last synced: 28 Apr 2026
https://github.com/szapp/candyanalysis
Case study: Analyze the candy power ranking to identify and recommend popular candy characteristics
data-analysis data-visualization feature-selection interaction-terms
Last synced: 28 Apr 2026
https://github.com/gaurav-van/optimizing-rate-of-penetration-in-geothermal-drilling-a-digital-twin-approach
Let’s explore something interesting together. In this project, we developed a machine learning digital twin using Intel-optimized XGBoost and daal4py to simulate and optimize the Rate of Penetration (ROP) in geothermal drilling. We leveraged SHAP for Explainable AI (XAI) to interpret model predictions.
data-analysis data-science digital-twin explainable-ai geothermal geothermal-energy jupyter-notebook machine-learning python shap xai xgboost
Last synced: 28 Apr 2026
https://github.com/bala-1409/titanic-survived-prediction-datascience-classification-project
This project predicts whether a passenger on the Titanic survived, using machine learning algorithms on the given passenger details.
classification-algorithm data-analysis data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning-algorithms matplotlib predictive-modeling python3 seaborn
Last synced: 28 Apr 2026
https://github.com/i-am-uchenna/sql-data-warehouse-project
The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.
architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver
Last synced: 15 May 2026
https://github.com/emircanakyuzz/veri_gorsellestirilmesi_ve_analizi-analysis_and_visualization_of_dataset
In this work, I carried out analysis and visualization using modules that are well known in data science, such as numpy, pandas, seaborn, and matplotlib.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/kawshik-khan/fake-news-analysis
A fake news detection ML model. It utilizes the Bag of Words model for text vectorization and a Multinomial Naive Bayes classifier to predict whether news articles are real or fake. The project covers data preprocessing, model training, and performance evaluation with accuracy metrics and a confusion matrix.
data-analysis data-science machine-learning ml python3
Last synced: 08 Jun 2026
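The Bag of Words plus Multinomial Naive Bayes approach named in the entry above can be sketched in plain Python. This is a minimal illustration, not the repository's code: the function names (`bag_of_words`, `train`, `predict`), the whitespace tokenizer, the add-one smoothing, and the four toy headlines are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def bag_of_words(text):
    """Lowercase the text, split on whitespace, and count each token."""
    return Counter(text.lower().split())


def train(docs, labels, alpha=1.0):
    """Collect per-class token counts for a multinomial Naive Bayes model."""
    word_counts = defaultdict(Counter)  # token counts per class
    class_counts = Counter(labels)      # number of documents per class
    for text, label in zip(docs, labels):
        word_counts[label].update(bag_of_words(text))
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, class_counts, vocab, alpha


def predict(model, text):
    """Return the class with the highest log-posterior for the text."""
    word_counts, class_counts, vocab, alpha = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)  # log prior
        for word, n in bag_of_words(text).items():
            if word in vocab:  # tokens never seen in training are ignored
                # Add-alpha (Laplace) smoothed likelihood of the token.
                p = (word_counts[label][word] + alpha) / (
                    total_words + alpha * len(vocab)
                )
                score += n * math.log(p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training data, invented for illustration only.
docs = [
    "aliens secretly control the moon",
    "miracle pill cures everything overnight",
    "council approves new city budget",
    "city council opens new library",
]
labels = ["fake", "fake", "real", "real"]

model = train(docs, labels)
print(predict(model, "council approves library budget"))
print(predict(model, "miracle pill cures aliens"))
```

In practice a library implementation would usually replace this hand-rolled version; the sketch only shows how token counts feed the classifier.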
https://github.com/jbalooshie/movies-etl
Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, which is then loaded into a PostgreSQL database.
data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy
Last synced: 11 Apr 2026
https://github.com/satyacoder29/crowdfunding-in-sql
Crowdfunding is a method of raising funds for projects or causes by collecting small contributions from a large group of people, usually through online platforms. It enables individuals, startups, and nonprofits to secure funding, offering rewards or recognition in exchange, and helps bring ideas to life without traditional financing.
data-analysis data-cleaning database-management mysql-database quries sql sql-functions sql-server views
Last synced: 29 Apr 2026
https://github.com/anilyigitsel/istanbul-rental-apartments-analysis
This project analyzes the Istanbul Rental Apartments Dataset (2025), which includes rental apartment listings from Istanbul, Turkey.
data-analysis data-visualization jupyter-notebook matplotlib pandas python rental-housing
Last synced: 29 Apr 2026
https://github.com/eco786786/restaurant_orders
This analysis seeks to uncover patterns in customer behaviour by examining restaurant order data.
data-analysis git postgresql tableau
Last synced: 29 Apr 2026
https://github.com/saroshfarhan/kaggle-playground-s4e11
An old Kaggle competition, done just for practice.
data-analysis data-science data-visualization jupiter-notebook python3
Last synced: 29 Apr 2026
https://github.com/lankesathwik7/sql-query-assistant
Natural language to SQL query converter using Groq LLM. Ask questions in plain English and get SQL queries, visualized results, and natural language explanations. Built with Streamlit and PostgreSQL.
data-analysis database groq llm natural-language-processing python sql
Last synced: 29 Apr 2026
https://github.com/taljindergill78/yelp-arizona-analysis
This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.
big-data data-analysis data-engineering distributed-computing pyspark spark sql yelp-dataset
Last synced: 29 Apr 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/farhad-here/textprepx
A Multilingual Text Preprocessing Tool for English and Persian.
cleantext contractions data-analysis deep-learning emoji nlp nltk opp parsivar regex streamlit text-preprocessing textblob
Last synced: 29 Apr 2026
https://github.com/srinibas-masanta/yelp-business-reviews-analysis
This project analyzes Yelp business reviews using Python, Snowflake, and SQL, focusing on efficient data ingestion, transformation, and analysis. We preprocess JSON data, optimize ingestion via Amazon S3, classify sentiments with Python UDFs, and extract insights using SQL queries—showcasing a streamlined end-to-end workflow.
amazon-s3 data-analysis json python snowflake sql
Last synced: 29 Apr 2026
https://github.com/varshan1123/sql-tableau-project
We analyze key indicators for our pizza sales data to gain insights into our business performance - A Data Analysis Project performed on Tableau & SQL.
analysis data-analysis data-science data-visualization excel mysql powerbi sql sql-server tableau tableau-dashboards
Last synced: 29 Apr 2026
https://github.com/shimaa83/eda-repo
Exploratory data analysis of police and retail datasets from Kaggle.
Last synced: 29 Apr 2026
https://github.com/prithviraj-2003/cognifyz-data-science-internship
🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF
data-analysis data-science data-visualization matplotlib numpy pandas python3
Last synced: 29 Apr 2026
https://github.com/nishumehta/supermart-grocery-sales-retails-analytics
Tableau Dashboard Link :
data-analysis data-cleaning data-visualization jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn
Last synced: 29 Apr 2026
https://github.com/theoplayz2/eda-explorer
A Python tool for exploratory data analysis (EDA) and visualization that supports loading CSV and JSON data and has a modular OOP architecture. Practical work on the topic "Discovering and visualizing data to understand its nature" for the course "MDK 13.01: Fundamentals of applying artificial intelligence methods in programming".
analysis battery-life cqrs csharp data-analysis eeg-analysis exploratorydataanalysis json-visualization matplotlib messaging profile-report python verilog visualization
Last synced: 29 Apr 2026
https://github.com/farhad-here/student_performance_analyzer
Student Performance Analyzer in Python, one of my data analysis course projects. It teaches filter(), lambda, and map() in Python.
data-analysis data-visualization filter kaggle kaggle-dataset lambda map pandas python python-tutorial streamlit
Last synced: 29 Apr 2026
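The filter(), lambda, and map() functions named in the entry above can be illustrated with a short sketch. The student records, the pass mark of 60, and the variable names are hypothetical and are not taken from the repository.

```python
# Hypothetical student records, invented for illustration.
students = [
    {"name": "Ada", "score": 91},
    {"name": "Ben", "score": 58},
    {"name": "Cy", "score": 74},
]

# filter() keeps only the records for which the lambda returns True.
passed = list(filter(lambda s: s["score"] >= 60, students))

# map() applies the lambda to every remaining record.
passed_names = list(map(lambda s: s["name"], passed))

print(passed_names)
```

Both filter() and map() return lazy iterators in Python 3, which is why each result is wrapped in list() here.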
https://github.com/codewithmayank-py/covid19-data-analysis-using-python
COVID-19 and Happiness Analysis
data-analysis data-analysis-python data-visualization dataset jupyter-notebooks numpy pandas python3 seaborn
Last synced: 11 Apr 2026
https://github.com/angchekar28/air-quality-index-analysis
This project analyzes Air Quality Index (AQI) data to identify pollution trends, seasonal variations, and the impact of different pollutants. It includes data visualization, correlation analysis, and insights into air quality variations over time.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python
Last synced: 30 Apr 2026
https://github.com/shruti-h/sales_data_analysis
Sales Data Analysis | Pandas & Matplotlib
data-analysis data-science data-vi matplotlib pandas-library python
Last synced: 30 Apr 2026
https://github.com/prady2309/email-spam-detection-with-machine-learning
Implemented using Naive Bayes Algorithm
data-analysis data-science machine-learning python
Last synced: 30 Apr 2026
https://github.com/bachtiarashidiqy/ecommercedashboard
An interactive e-commerce analytics dashboard built with Streamlit, providing visualizations for sales performance, product analysis, geographic insights, and delivery status. Includes date filtering, company branding, and comprehensive documentation.
analytics dashboard data-analysis data-visualization e-commerce matplotlib pandas python seaborn streamlit
Last synced: 30 Apr 2026
https://github.com/srinibas-masanta/ibm-applied-data-science-capstone
This repository contains the work completed for the Applied Data Science Capstone Project offered by IBM on Coursera. The capstone project is the final course in the IBM Data Science Professional Certificate series and serves as an opportunity to apply the skills and knowledge gained throughout the series to a real-world data science problem.
capstone-project data-analysis data-science data-visualization machine-learning python web-scraping
Last synced: 30 Apr 2026
https://github.com/samuelpillai/machine-learning-classification-regression-nlp
A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.
classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining
Last synced: 30 Apr 2026
https://github.com/mfakhriazhar/nlp-movie-recommender-system
This project is a content-based movie recommender system built using Natural Language Processing (NLP) techniques. By extracting and combining important text features from movie metadata, this system suggests movies that are similar to a user's selected title.
data-analysis data-science deep-learning machine-learning natural-language-processing python recommender-system
Last synced: 30 Apr 2026
https://github.com/mitchellharrison/mitchellharrison.github.io
Welcome to my slice of the internet, where I share the knowledge that Duke gave me, so you don't have to spend the mortgage-sized amount to access it. Built with R, Python, Quarto, and love.
ai algorithms-and-data-structures blog data-analysis data-science data-visualization educational machine-learning portfolio portfolio-website quarto r r-language statistics tutorials
Last synced: 30 Apr 2026
https://github.com/abhi227070/ipl-2024-sold-player-data-analysis
This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.
data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3
Last synced: 30 Apr 2026
https://github.com/devag2004/electricity-analysis-using-spark
Electricity analysis project built with Spark.
data-analysis spark spark-mllib
Last synced: 01 May 2026
https://github.com/cdeweyx/bryce-harper-2016-analysis
Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.
data-analysis data-visualization python
Last synced: 01 May 2026
https://github.com/ariyaarka/result-analysis
A simple analysis of results by different factors, shown in figures.
data-analysis jupyter-notebook matplotlib numpy-library pandas-dataframe python seaborn
Last synced: 01 May 2026
https://github.com/filip-kustura/data-warehouse-olympics
This project, part of the elective Advanced Database Systems course, involved building a data warehouse based on the already existing database in PostgreSQL. It focuses on analyzing Olympic Games data across time, covering athletes' performance by discipline, location, and other dimensions. Implemented in Spring 2022.
data-analysis data-warehouse database extract-transform-load olympic-games postgresql sql star-schema university-project
Last synced: 01 May 2026
https://github.com/monish-nallagondalla/sensor_fault_detection
This repo contains sensor data for analysis, focusing on sensor readings, their attributes, and classification (Good/Bad). It includes 500+ sensors with features for predictive modeling, anomaly detection, and sensor failure prediction.
anomaly-detection classification data-analysis data-science machine-learning predictive-modeling python sensor-data
Last synced: 01 May 2026
https://github.com/shibbir24/amazon-product-sales-data-analysis-trends-and-insights
Amazon Product Sales Data Analysis: Trends and Insights
amazon-dataset data-analysis matplotlib numpy pandas seaborn
Last synced: 01 May 2026
https://github.com/rafath0ssain/predihome
Data analysis using economic factors affecting living conditions across Canadian provinces.
data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr
Last synced: 01 May 2026
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability to use pandas and seaborn to shape, transform, and plot data.
data-analysis pandas python seaborn
Last synced: 01 May 2026
https://github.com/ujjwalll/get-that-flair
A repository for a project that detects the flair of Reddit posts from their links. A working model is available at https://get-that-flair.herokuapp.com/
data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling
Last synced: 01 May 2026
https://github.com/more-joao/color-distance-luminance
Data analysis project that aims to establish a relation between the Canberra distance between white and any given color in the RGB colorspace and its luminance.
canberra-distance data-analysis luminance python r rgb
Last synced: 02 May 2026
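The two quantities compared in the entry above can be computed as follows. This is a sketch under stated assumptions: the Canberra distance uses its standard definition, but the luminance formula (Rec. 709 weights applied to channels scaled to [0, 1], with no gamma correction) is an assumption, since the entry does not say which definition the project uses. The function names are hypothetical.

```python
WHITE = (255, 255, 255)


def canberra_distance(p, q):
    """Canberra distance: sum of |p_i - q_i| / (|p_i| + |q_i|).

    Terms where both coordinates are zero are skipped to avoid 0/0.
    """
    total = 0.0
    for a, b in zip(p, q):
        denom = abs(a) + abs(b)
        if denom:
            total += abs(a - b) / denom
    return total


def luminance(rgb):
    """Weighted channel sum; the Rec. 709 weights are an assumption."""
    r, g, b = (c / 255 for c in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


for color in [(255, 255, 255), (255, 0, 0), (0, 0, 0)]:
    print(color, canberra_distance(WHITE, color), luminance(color))
```

With these definitions, white sits at distance 0 from itself and black at the maximum distance of 3, while their luminances are at opposite ends of the [0, 1] range.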
https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast milk production. The model is evaluated using various metrics.
adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends
Last synced: 02 May 2026
https://github.com/suma-aljudaia/my-portfolio
Suma Aljudaia | Portfolio – AI & Data Analysis Enthusiast
ai css data-analysis html machine-learning portfolio
Last synced: 02 May 2026
https://github.com/ronitjariwala/prodigy_ds_04
Prodigy InfoTech Data Science Internship Task-4
data-analysis data-science data-visualization python
Last synced: 02 May 2026
https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor
AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.
data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics
Last synced: 02 May 2026
https://github.com/isaqueiros/motorpremium-predictions-mlpclassifier
This Jupyter Notebook is an initial study of applying sklearn's neural network MLP Classifier model. The model is applied to the MotorPremiums dataset, which is supplied separately in .csv format.
data-analysis data-science machine-learning neural-network python sklearn-library
Last synced: 02 May 2026
https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python
We developed a song recommendation system for the user with the data we received from our Spotify song dataset. Data set and other applications are given in the description. Have a nice day.
data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system
Last synced: 02 May 2026
https://github.com/m0saan/python-for-data-analysis
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,
data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python
Last synced: 02 May 2026
https://github.com/dissorial/prx21_erikz
Analysis of self-tracked data: interactive visualizations & predictive algorithms
analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization
Last synced: 02 May 2026
https://github.com/mehanix/dhrw
🎢 IaaS visual editor to create & deploy data processing pipelines - python, rmq, react, meteorjs
computational-graph computational-graphs data-analysis data-engineering data-pipeline data-pipelines data-processing data-processing-and-analysis data-processing-pipelines data-processing-system data-science data-visualization docker-compose good-first-issue help-wanted meteorjs-application rabbitmq react-flow
Last synced: 02 May 2026
https://github.com/faiyaz-zaman/used-car-market-trends-on-bikroy.com
Used Car Market Trends on Bikroy.com
data-analysis python scraping-websites selenium tableau
Last synced: 02 May 2026
https://github.com/jofaval/red-wine-quality
Data analysis of Portuguese red wine quality in 2009
classification data-analysis data-science data-visualization google-colab kaggle logistic-regression machine-learning python scikit-learn wine-quality xgboost
Last synced: 03 May 2026
https://github.com/mehtadigisha/iris-flower-classification
Iris Flower Classification
accuracy-score classification-report data-analysis data-visualization eda iris-classification machine-learning matplotlib pandas prediction python scikit-learn seaborn svc-model svm-model visualization
Last synced: 03 May 2026
https://github.com/ahmedhosssam/lesser_pandas
Pandas-like Data Analysis library in C++
cpp data-analysis data-science pandas
Last synced: 03 May 2026
https://github.com/monteirooscar98/tarifas-publicas-sp-dieese
Data extraction via web scraping from the Dieese website and analysis of public tariffs in the Municipality of São Paulo.
data-analysis data-visualization python webscraping
Last synced: 03 May 2026
https://github.com/emredemirbas/movie-ratings-analysis
A data analysis project investigating potential bias in movie ratings from 2015, comparing them with ratings from other platforms using Python, pandas, and visualization libraries.
data-analysis matplotlib pandas python seaborn
Last synced: 03 May 2026
https://github.com/vipulbunny/restaurant-insight-analysis
A comprehensive data analysis project exploring restaurant ratings, locations, and customer sentiments. This project includes data preprocessing, descriptive analysis, geospatial mapping, sentiment analysis, and price-rating correlations using Python and visualization tools.
data-analysis data-preprocessing data-visualization folium geospatial geospatial-analysis geospatial-visualization machine-learning nlp pandas python restaurant-insights seaborn sentiment-analysis
Last synced: 03 May 2026
https://github.com/devlucho/modelos-predictivos
Predictive models using the Linear Regression, Logistic Regression, and Decision Tree algorithms.
data-analysis jupyter-notebook python3
Last synced: 03 May 2026
https://github.com/muskanmi/data_analysis_python
Data analysis of a student results dataset using Python libraries.
boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn
Last synced: 03 May 2026
https://github.com/nurulashraf/logistic-regression-loan-prediction
Loan approval prediction using logistic regression based on applicant data, including income, credit history, and property details, after data preparation and feature engineering.
data-analysis data-science loan-prediction logistic-regression machine-learning predictive-modeling python sklearn
Last synced: 03 May 2026
https://github.com/matteospanio/speed-analysis
A project to analyze the internet speed
Last synced: 03 May 2026
https://github.com/samruddhi3012/screen-time-analysis
Hi! This repo demonstrates a Python project on Screen Time Analysis.
data-analysis data-visualization python
Last synced: 04 May 2026
https://github.com/xiaohan2012/myunisport
Visualize your Unisport annual training records
data-analysis data-visualization pandas pygal sports-stats tikzposter
Last synced: 04 May 2026
https://github.com/sanchittechnogeek/rental-data-visualization_python
Statistics and visualization of rental data with Python
data-analysis data-science data-visualization statistics
Last synced: 04 May 2026
https://github.com/aaaa-source/us-stock-market-analysis-and-prediction
US Stock Market Analysis and Prediction
artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks classification clustering data-analysis finance financial-analysis python
Last synced: 09 Jun 2026
https://github.com/sweta-kaundilya/python_for_data_analysis
Learning Python and its relevant libraries for the data field.
cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn
Last synced: 04 May 2026
https://github.com/hilalguleryuz/northwind_data_analysis_capstone_project
Northwind Capstone Project
capstone-project dashboard data-analysis data-visualization dax jupyter-notebook matplotlib northwind northwind-database pandas postgresql powerbi python seaborn sql
Last synced: 04 May 2026
https://github.com/abhinav330/911-emergency-calls-analysis
This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.
data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python
Last synced: 09 Jun 2026
https://github.com/josewebdev2000/us-violent-crime-data-analysis
Analyzing Violent Crime in the United States of America from 1960 to 2019
data-analysis data-science data-visualization interactive-visualizations jupyter-notebook pandas plotly python
Last synced: 04 May 2026
https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies
Web scraping the top 250 movies from the IMDb site.
data-analysis numpy pandas python
Last synced: 04 May 2026
https://github.com/georgehanymilad/plantycare-app
Graduation Project - Fayoum Center
ai backend cnn-classification colab-notebook data-analysis deep-learning diagrams front-end java kaggle machine-learning native ui-design
Last synced: 04 May 2026
https://github.com/yokawaiik/data_science
Time series forecasting with future prediction.
data-analysis keras lstm neural-network predict-future python python-3 rnn time-series-forecast visualization
Last synced: 05 May 2026
https://github.com/tasosfotiadis/time-series-analysis-and-forecasting-of-cryptocurrency-prices
Forecasted Cardano (ADA) cryptocurrency prices using time series analysis. The project involved data preprocessing, trend and seasonality analysis, and model building with ARIMA, SARIMA, and LSTM. Models were evaluated using metrics like MAE and MAPE, providing insights for financial decision-making.
applied-st classical-statistical-models data-analysis deep-learning lstm machine-learning neural-network python r time-series
Last synced: 05 May 2026
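The MAE and MAPE metrics mentioned in the entry above follow their standard definitions and can be sketched as below. The function names and the sample prices are hypothetical and are not results from the project.

```python
def mae(actual, predicted):
    """Mean absolute error: the average of |actual - predicted|."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent.

    Requires every actual value to be non-zero.
    """
    return 100 * sum(
        abs((a - p) / a) for a, p in zip(actual, predicted)
    ) / len(actual)


# Invented values for illustration only.
actual = [100, 200, 400]
predicted = [110, 180, 400]

print(mae(actual, predicted))
print(mape(actual, predicted))
```

MAE is reported in the units of the series, while MAPE is scale-free, which makes it easier to compare across assets with very different price levels.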
https://github.com/matt-ags/jornada-python
Repository with the projects completed during the "Jornada Python" week - 01/2025
artificial-intelligence automation data-analysis jupyter-notebook machine-learning python
Last synced: 05 May 2026
https://github.com/beaprogrammer02345/python_data_analysis
Sales Analysis using Python
data-analysis data-visualization python
Last synced: 05 May 2026
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 05 May 2026
https://github.com/kuranez/eu-energy-map
Dashboard visualizing renewable energy trends in the European Union.
dashboard dashboards data-analysis data-visualization energy-data european-union geopandas green-energy interactive-map map pandas plotly python renewable-energy renewables web-app
Last synced: 05 May 2026
https://github.com/mohitsai/boston-housing-data-analysis
Data Analysis Project for the City of Boston Government, providing insights into the effect of property renovations and remodeling on housing availability in the city
data-analysis data-science matplotlib numpy pandas python
Last synced: 05 May 2026
https://github.com/akotronis/qualitycontrol
HRH Quality Control app
data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3
Last synced: 05 May 2026
https://github.com/ayaatmohammed/amazon-sales-analysis-pyspark
In-depth analysis of the Olist E-commerce dataset from Kaggle using PySpark for customer segmentation (RFM) and market basket analysis.
big-data big-data-analytics customer-segmentation data-analysis data-science ecommerce jupyter-notebook kaggle pyspark python rfm-analysis
Last synced: 05 May 2026
https://github.com/caesaredia/ymusic-project
Exploratory data analysis (EDA) of music streaming behavior in two fictional cities using Python, Pandas, and Jupyter Notebook. It explores user behavior, genre preferences, and listening patterns throughout the week.
data-analysis eda pandas python
Last synced: 05 May 2026
https://github.com/hms75/movie_rating_analysis
A movie rating analysis which identifies trends amongst a dataset of 5000 movies.
data-analysis data-visualization matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/iamrajmani/sentimental-analysis
Sentimental Analysis - Final Year College Project
data-analysis data-visualization machine-learning python pytorch
Last synced: 06 May 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/yashpaneliya/bank-loan-default-analysis
Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.
data-analysis loan-default-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/ankitwalimbe/sentiment-analysis
Sentiment analysis of Amazon Fashion reviews using VADER and a baseline ML model (TF-IDF + SGDClassifier). Includes visualizations, reproducible notebook, and recruiter-ready documentation.
data-analysis machine-learning matplotlib nlp pandas python seaborn sentiment-analysis sklearn
Last synced: 06 May 2026
https://github.com/ksm26/ml-ai-data-science-jobs-in-canada
Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.
ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics
Last synced: 06 May 2026
https://github.com/syarwinaaa09/visualizing-the-history-of-nobel-prize-winners
Analysis and visualization of Nobel Prize winners
data-analysis data-visualization jupyter-notebook machine-learning matplotlib nobel-prize pandas python
Last synced: 06 May 2026
https://github.com/deanlogan/data-analysis-course
Code created when completing the Data Analysis with Python Course on freecodecamp.org
course data-analysis numpy pandas python python3
Last synced: 06 May 2026
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 06 May 2026
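Both ideas in the entry above can be sketched with the standard library. This is one possible approach, not necessarily the repository's: `dict_product` is a hypothetical helper built on `itertools.product`, and the lookup table assumes the list values are unique and hashable.

```python
from itertools import product


def dict_product(options):
    """Expand {key: [values]} into a list of dicts, one per combination."""
    keys = list(options)
    return [
        dict(zip(keys, combo))
        for combo in product(*(options[k] for k in keys))
    ]


# Hypothetical options, invented for illustration.
grid = dict_product({"color": ["red", "blue"], "size": ["S", "M", "L"]})
print(len(grid), grid[0], grid[-1])

# list.index scans the list on every call; a precomputed value-to-position
# dict answers each lookup in constant time on average.
sizes = ["S", "M", "L"]
position = {value: i for i, value in enumerate(sizes)}
print(position["L"], sizes.index("L"))
```

The dict lookup pays a one-time cost to build the table, so it is worthwhile mainly when many lookups are made against the same list.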
https://github.com/vimlesh-gupta/blinkit_data_analytics_project
End-to-end Blinkit data analytics project using Python, SQL Server & Power BI
blinkit data-analysis eda pandas powerbi python sql-server
Last synced: 06 May 2026
https://github.com/karlyndiary/coffee-shop-sales-analysis
Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.
data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard
Last synced: 07 May 2026
https://github.com/badranalyst/exploratory-data-analysis-on-salaries-dataset
Performing EDA on a dataset related to salaries, exploring relationships between factors like job titles, industries, and locations. Insights are visualized with plots to identify trends and disparities in salary data.
data-analysis dataset eda exploratory-data-analysis pandas python
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting
This project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.
data-analysis data-visualization mechine-learing plotly python streamlit
Last synced: 07 May 2026
https://github.com/biginformatics/git-basics
Hands-on Git and GitHub lessons for analysts and statisticians
data-analysis git github public-health training
Last synced: 10 Jun 2026