Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/cassiofb-dev/projetos-intensivao-python
Projetos do evento intensivão de Python da Hashtag treinamentos.
automation data-analysis data-science data-visualization jupyter-notebook machine-learning python webscraping
Last synced: 03 May 2026
https://github.com/baggiponte/pyconpt-polars
@pola-rs talk @pyconpt
apache-arrow data-analysis data-science etl polars python
Last synced: 03 May 2026
https://github.com/vipulbunny/restaurant-insight-analysis
A comprehensive data analysis project exploring restaurant ratings, locations, and customer sentiments. This project includes data preprocessing, descriptive analysis, geospatial mapping, sentiment analysis, and price-rating correlations using Python and visualization tools.
data-analysis data-preprocessing data-visualization folium geospatial geospatial-analysis geospatial-visualization machine-learning nlp pandas python restaurant-insights seaborn sentiment-analysis
Last synced: 03 May 2026
https://github.com/mohnish88/e-commerce-data-analysis
I analyzed sales data to identify trends and patterns, which significantly enhanced decision-making processes. Additionally, I created interactive visualizations to present these insights clearly and effectively, facilitating better understanding and communication of the data's implications.
data-analysis data-cleaning jupyter-notebook pandas plotly python python-library sales sales-analysis visulaization
Last synced: 03 May 2026
https://github.com/devlucho/modelos-predictivos
Modelos predictivos utilizando los algoritmos de Regresión Lineal, Regresión Logística y Árboles de Decisión.
data-analysis jupyter-notebook python3
Last synced: 03 May 2026
https://github.com/codeslash21/tmdb_data_analysis
We analysed TMDB dataset which contains around 11000 movies details. We analyzed to find some interesting facts about the dataset.
data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie
Last synced: 03 May 2026
https://github.com/muskanmi/data_analysis_python
Data analysis on students result dataset using python libraries.
boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn
Last synced: 03 May 2026
https://github.com/joelfaldin/data-analysis
A collection of data-analysis projects I've built over time! ✨⛏️
Last synced: 03 May 2026
https://github.com/ljadhav25/swiggy-restaurant-analysis
This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots
Last synced: 03 May 2026
https://github.com/iguptashubham/ev-market-exploration
So, market size analysis is a crucial aspect of market research that determines the potential sales volume within a given market
data-analysis data-analysis-projects data-science-project forecast projects python
Last synced: 03 May 2026
https://github.com/nathadriele/world-marathon-run-majors-analytics-challenge
This project presents a complete data engineering, analytics, machine learning, and Streamlit dashboard pipeline focused on the Abbott World Marathon Majors: Tokyo, Boston, London, Berlin, Chicago, and New York City. Covering the 2018 to 2025 seasons, it analyzes more than 628,000 runner records and 86 verified winner entries.
challenge data-analysis data-pipeline gradient-boosting lasso-regression linear-regression machine-learning models predictive-modeling python random-forest ridge-regression run-analytics world-marathon
Last synced: 09 Jun 2026
https://github.com/bpkaur/whats-in-a-name
Exploring dataset of first names of babies born in the US in order to uncover interesting stories
data-analysis datacamp numpy pandas python3
Last synced: 04 May 2026
https://github.com/mindlessmuse666/titanic-data-visualization
Проект по визуализации данных о пассажирах Титаника с использованием библиотек Python Matplotlib, Seaborn и Plotly.
data-analysis data-visualization matplotlib pandas plotly python seaborn titanic
Last synced: 04 May 2026
https://github.com/nickenshidqia/uber-new-york-data-analysis
Analyze Uber pickups on New York to get insight from this data
data-analysis data-analyst exploratory-data-analysis python
Last synced: 04 May 2026
https://github.com/xiaohan2012/myunisport
Visualize your Unisport annual training records
data-analysis data-visualization pandas pygal sports-stats tikzposter
Last synced: 04 May 2026
https://github.com/fatihilhan42/the-office-eda
Data analysis study of my favorite sitcom, The Office (US).
data-analysis data-science data-visualization fatihilhan office python sitcom
Last synced: 04 May 2026
https://github.com/soham7998/data-analysis-projects
My Data Analysis Projects which are completed by me and gain a hands on Experience from each project. the project showcase different Concepts , Visualization and many things.
data data-analysis data-science machine-learning nlp python soham visualization
Last synced: 04 May 2026
https://github.com/damisparks/become_data_analyst
Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.
data data-analysis data-visualization matplotlib numpy pandas-tutorial
Last synced: 04 May 2026
https://github.com/ibrahimm7004/supermarket-sales-analysis
This project focuses on Data Mining techniques to gather inisights about customer behaviour regarding Supermarket Sales. Includes: Association Rule Mining, Temporal Patterns in customer behavior, Sequential Pattern Mining, Classification, Regression, and Outlier Detection.
apriori association-rules data-analysis data-mining data-science data-visualization fpgrowth python sales-analysis supermarket-sales
Last synced: 04 May 2026
https://github.com/mr-chang95/sf_data_visualization
In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.
business data-analysis data-visualization jupyter-notebook pandas python san-francisco
Last synced: 04 May 2026
https://github.com/abhijais4896/belarus-car-price-prediction
Belarus-car-price-prediction
data-analysis datacleaning macine-learning numpy pandas python
Last synced: 04 May 2026
https://github.com/sweta-kaundilya/python_for_data_analysis
Learning Python and all the relevant libraries in python for Data field.
cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn
Last synced: 04 May 2026
https://github.com/fatihilhan42/book-recommendation-system-with-python
In this project, we are making a book recommendation system that recommends similar books according to the genres or ratings that the user enters, using a large book dataset. The link of the dataset is given below. Happy reading...
books data-analysis data-science data-visualization kaggle python recommendation-engine recommendation-system
Last synced: 04 May 2026
https://github.com/hyperplasma/olympic-visualization-analysis
Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.
data-analysis data-visualization matplotlib numpy pandas python wordcloud
Last synced: 04 May 2026
https://github.com/halyusa16/e-commerce-analysis
This project analyzes a public e-commerce dataset to uncover valuable insights and answer critical business questions. The dataset contains customer, product, order, and transaction details, providing a comprehensive view of the e-commerce platform's operations.
data-analysis data-cleaning data-exploration data-visualization self-project
Last synced: 09 Jun 2026
https://github.com/rtgrt5645/numpy-lab
🧮 Explore, manipulate, and visualize data with NumPy to enhance your Python skills in scientific computing and data analysis.
array-operations data-analysis data-science jupyter-notebook machine-learning numerical-computing numpy numpy-arrays numpy-library numpy-python python python3 scientific-computing
Last synced: 04 May 2026
https://github.com/jatin-mehra119/flight-price-prediction
This study aims to analyze flight booking data from "Ease My Trip" website, using statistical tests and linear regression to extract insights. By understanding this data, valuable information can be gained to benefit passengers using the platform.
data-analysis datacleaning datavisualization machine-learning preprocessing-data python sklearn-pipeline sklearn-regression-algorithm streamlit-webapp
Last synced: 04 May 2026
https://github.com/josewebdev2000/us-violent-crime-data-analysis
Analyzing Violent Crime in the United States of America from 1960 to 2019
data-analysis data-science data-visualization interactive-visualizations jupyter-notebook pandas plotly python
Last synced: 04 May 2026
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 04 May 2026
https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies
Web scraping the top 250 movies in IMDB site.
data-analysis numpy pandas python
Last synced: 04 May 2026
https://github.com/drod75/nyc-arrests-analysis
This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.
data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn
Last synced: 04 May 2026
https://github.com/surayasumona/test_bowlers_analysis
Data Analysis with Python
data-analysis data-manipulation data-preprocessing numpy pandas
Last synced: 04 May 2026
https://github.com/yokawaiik/data_science
Time series forecasting with future predict.
data-analysis keras lstm neural-network predict-future python python-3 rnn time-series-forecast visualization
Last synced: 05 May 2026
https://github.com/zobayerakib/credit-card-fraud-analysis__data-analysis-project
credit-card data-analysis decision-trees fraud-detection gradient-descent knn-classification logistic-regression machine-learning machine-learning-algorithms naive-bayes-classifier random-forest-classifier
Last synced: 05 May 2026
https://github.com/tasosfotiadis/time-series-analysis-and-forecasting-of-cryptocurrency-prices
Forecasted Cardano (ADA) cryptocurrency prices using time series analysis. The project involved data preprocessing, trend and seasonality analysis, and model building with ARIMA, SARIMA, and LSTM. Models were evaluated using metrics like MAE and MAPE, providing insights for financial decision-making.
applied-st classical-statistical-models data-analysis deep-learning lstm machine-learning neural-network python r time-series
Last synced: 05 May 2026
https://github.com/jcm-ai/personal-data-science-projects
This page contains all of my personal data science projects. 📊📈📉👨💻
data-analysis data-visualization exploratory-data-analysis jupyter-notebooks machine-learning-algorithms matplotlib-pyplot numpy-library pandas-python personal-project predictive-modeling programming python3 scikit-learn scipy seaborn statistical-analysis
Last synced: 05 May 2026
https://github.com/dhruvsrikanth/basic-data-science
A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!
analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn
Last synced: 05 May 2026
https://github.com/demon-2-angel/product-customer-acquisition-analysis-using-behaviour
The database encompasses eight tables with varied attributes and rows. Key analyses include product restocking needs, top VIP customers' contributions, and an average customer profit of $39,039.59. Recommendations emphasize strategic marketing to new customers and incentives for existing VIP clients based on acquisition costs and profit insights.
customer-products customer-segmentation data-analysis database sqlite
Last synced: 05 May 2026
https://github.com/jacktheprogrammer/time-series-forecasting-and-analysis
My personal project consisting of my personally created notebooks to work with time series forecasting and analysis. In these projects, I've used deep learning using tensorflow, xgboost, statsmodels and scipy libraries of python. The series were of weather, energy consumption and that of stocks.
data-analysis data-science deep-neural-networks energy-consumption machine-learning portfolio prophet-facebook prophet-model python python3 scipy statsmodels stocks tensorflow time-series time-series-analysis timeseries-forecasting weather xgboost
Last synced: 05 May 2026
https://github.com/zafir100100/cancer-stage-prediction
This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.
cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn
Last synced: 05 May 2026
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 05 May 2026
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 May 2026
https://github.com/akotronis/qualitycontrol
HRH Quality Control app
data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3
Last synced: 05 May 2026
https://github.com/ayaatmohammed/amazon-sales-analysis-pyspark
In-depth analysis of the Olist E-commerce dataset from Kaggle using PySpark for customer segmentation (RFM) and market basket analysis.
big-data big-data-analytics customer-segmentation data-analysis data-science ecommerce jupyter-notebook kaggle pyspark python rfm-analysis
Last synced: 05 May 2026
https://github.com/kammarah/data-sample
I designed a database website 🌐 that can be uploaded easily for use 📤. You can check my website 👀.
data-analysis data-visualization database deploy deployment library-management-system panaversity streamlit webapp
Last synced: 05 May 2026
https://github.com/nkamilla/titanic-eda
Exploratory Data Analysis of the Titanic dataset using Python (Pandas, NumPy, Matplotlib). Includes data cleaning, visualizations, correlations, and key business insights.
data-analysis eda jupyter-notebook matplotlib numpy pandas python titanic-dataset
Last synced: 05 May 2026
https://github.com/nimbostratos/titanic-survival-prediction
Machine learning project predicting Titanic survival using AdaBoost with feature engineering and hyperparameter optimization
data-analysis data-science data-science-projects kaggle machine-learning machine-learning-models python scikit-learn
Last synced: 05 May 2026
https://github.com/caesaredia/ymusic-project
Exploratory data analysis (EDA) of music streaming behavior in two fictional cities using Python, Pandas, and Jupyter Notebook. It explores user behavior, genre preferences, and listening patterns throughout the week.
data-analysis eda pandas python
Last synced: 05 May 2026
https://github.com/donmaruko/python-eda-toolkit
CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.
data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud
Last synced: 06 May 2026
https://github.com/benjaminrose/data-analysis-book
A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis
book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course
Last synced: 06 May 2026
https://github.com/iamrajmani/sentimental-analysis
Sentimental Analysis - Final Year College Project
data-analysis data-visualization machine-learning python pytorch
Last synced: 06 May 2026
https://github.com/ibrahimceyisakar/hotel-finder
Hotel finder system with Python includes data gathering, analyzing, and visualization.
data-analysis data-gathering data-visualization pandas plotly python selenium streamlit
Last synced: 06 May 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/yashpaneliya/bank-loan-default-analysis
Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.
data-analysis loan-default-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/ankitwalimbe/sentiment-analysis
Sentiment analysis of Amazon Fashion reviews using VADER and a baseline ML model (TF-IDF + SGDClassifier). Includes visualizations, reproducible notebook, and recruiter-ready documentation.
data-analysis machine-learning matplotlib nlp pandas python seaborn sentiment-analysis sklearn
Last synced: 06 May 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/friendotjava/air-quality-analysis
Data Analysis for Air Quality in different District in Beijing.
data-analysis explanatory-data-analysis exploratory-data-analysis jupyter-notebook pytho streamlit
Last synced: 06 May 2026
https://github.com/deanlogan/data-analysis-course
Code created when completing the Data Analysis with Python Course on freecodecamp.org
course data-analysis numpy pandas python python3
Last synced: 06 May 2026
https://github.com/superpandas-ai/superpandas
Adding LLM integration to Pandas library
Last synced: 06 May 2026
https://github.com/marknature/oibsip
AICTE Oasis Infobyte Data Science Internship
data-analysis data-science data-visualization github google-sheets jupyter-notebook linkedin machine-learning project-management python
Last synced: 06 May 2026
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 06 May 2026
https://github.com/edanur-y/variable-analysis-of-banks-ratio-data
Testing variables for multicollinearity, multivariate normality and analyzing outliers and missing values. ⭕SPSS 🔵R
data-analysis log-transformation missing-values-analysis multicollinearity normality-test r spss
Last synced: 10 Jun 2026
https://github.com/fbarffmann/home_sales
Analyzed 25,000+ home sales using PySpark and SparkSQL. Identified pricing trends by year built, home features, and view rating. Optimized query run-time by 70% using caching.
aws big-data data-analysis home-sales parquet pyspark python spark spark-sql sql
Last synced: 06 May 2026
https://github.com/rlalpha49/anisearch-model
AniSearchModel leverages Sentence-BERT (SBERT) models to generate embeddings for synopses, enabling the calculation of semantic similarities between descriptions. This allows users to find the most similar anime or manga based on a given description.
anime api data-analysis data-merging embeddings flask hugging-face-datasets kaggle-datasets machine-learning manga natural-language-processing nlp python sentence-bert similarity-search
Last synced: 06 May 2026
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/karlyndiary/coffee-shop-sales-analysis
Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.
data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard
Last synced: 07 May 2026
https://github.com/suhas-005/jovian-data-analysis-course-assignment
These are my assignments for Data Analysis : Zero to Pandas course by Jovian.ai
data-analysis data-analytics numpy pandas python
Last synced: 07 May 2026
https://github.com/aicorsair/python-case-study-ab-testing-for-lunartech-homepage-cta-button
This repository contains a detailed case study on an A/B test of LunarTech's homepage CTA button, using proxy data structured similarly to the company's real data.
ab-testing click-through-rate confidence-intervals data-analysis data-analytics data-exploration data-science data-visualization hypothesis-testing matplotlib normal-distribution numpy pandas practical-significance python statistical-analysis statistical-significance z-critical z-statistic z-test
Last synced: 07 May 2026
https://github.com/whis99/data_analysis_journey
A repositories of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/rohansoni45/whatsapp-chat-analysis
This project involves analyzing WhatsApp chat data to extract valuable insights. Using Python and various libraries like Pandas and Matplotlib, the project processes and visualizes chat statistics such as message frequency, most active participants, and sentiment analysis.
chat-analysis data-analysis data-science matplotlib pandas python sentiment-analysis streamlit visualization web-app word-cloud
Last synced: 07 May 2026
https://github.com/warazkhan/airplane-crashes-and-fatalities-since-1908-
This project analyzes airplane crash data (1908 - 2008)✈️📊 to uncover trends in aviation accidents, fatalities, and safety improvements. Using exploratory data analysis (EDA) and data visualization, we examine key factors influencing crashes, identify high-risk regions, and explore advancements in aviation safety.
data-analysis data-visualization exploratory-data-analysis
Last synced: 10 Jun 2026
https://github.com/devexpress-examples/winforms-pivot-change-the-field-value-header-appearance-backcolor
This example handles the CustomDrawFieldValue event to fill the header's color.
data-analysis dotnet pivot-grid-for-winforms winforms xtrapivotgrid-suite
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/mahmoudnamnam/fc-barcelona-reports
FC Barcelona Reports: An interactive web application to analyze and visualize FC Barcelona's match data. Built with Streamlit, it scrapes match data from WhoScored, stores it in MongoDB, and presents insights through interactive visualizations like pass networks, shot maps, and player statistics.
data-analysis data-visualization football-analytics mplsoccer pandas streamlit web-scraping
Last synced: 07 May 2026
https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021
Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.
advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python
Last synced: 07 May 2026
https://github.com/pedrosfaria2/fugascomhelicoptero
Meu primeiro uso do Jupyter Notebook em um projeto
analise-de-dados data-analysis jupyter-notebook matplotlib pandas python
Last synced: 07 May 2026
https://github.com/biginformatics/git-basics
Hands-on Git and GitHub lessons for analysts and statisticians
data-analysis git github public-health training
Last synced: 10 Jun 2026
https://github.com/lucalullo/global-emissions-and-temperature-1950-2024
Global climate analysis covering 75 years of CO₂, greenhouse gas emissions and mean surface temperatures across countries (1950–2024). Built with Pandas, Matplotlib, Seaborn and Plotly.
climate-change co2-emissions data-analysis greenhouse-gas temperature
Last synced: 10 Jun 2026
https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation
GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim
content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream
Last synced: 07 May 2026
https://github.com/satyam4229/identify-employee-attrition
This is the model where we predict the attrition of the employees of the company by checking there records and all. In the given dataset, we have the features like salary, environment, age, gender and their experience.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 08 May 2026
https://github.com/otonomee/against-the-clock-transcript-analysis
This repository contains code and analysis for exploring the transcripts of the various "Against The Clock" videos featured on the FACT Magazine YouTube channel. The goal is to uncover insights, patterns, and trends across the different artists and their creative process under time constraints.
against-the-clock ai-analysis audio-processing creative-ai creative-process data-analysis fact-magazine machine-learning music-production natural-language-processing nlp text-mining yt-dlp
Last synced: 08 May 2026
https://github.com/devexpress-examples/wpf-pivot-grid-group-date-time-values
This example shows how to group date-time values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 08 May 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026
https://github.com/vanshuchaudhary/flightpriceanalysis-
The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.
business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis
Last synced: 08 May 2026
https://github.com/dhruwsunita/zomato-data-analysis-project
Zomato data analysis project using Python.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas-dataframe python
Last synced: 08 May 2026
https://github.com/ayushsiloiya619/online-food-orders-analysis
Data Analytics with Python
data-analysis data-visualization matplotlib pandas-dataframe python3 seaborn-python
Last synced: 08 May 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/l1ght14/customer-churn-prediction
Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.
churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom
Last synced: 09 May 2026
https://github.com/mrunmayee3108/financial-chatbot
A Python chatbot for analyzing financial data of companies with revenue, income, assets, cash flow, and debt ratio queries
chatbot data-analysis jupyter-notebook pandas python python3
Last synced: 09 May 2026
https://github.com/drod75/burger_king_analysis
A simple analysis on a burger king dataset.
data-analysis data-visualization jupyter-notebook pandas python seaborn
Last synced: 09 May 2026
https://github.com/suhas-005/eda-indian-startup-funding
Exploratory Data Analysis on Indian Startup Funding (2015-2020)
data-analysis data-analytics data-science data-visualization exploratory-data-analysis matplotlib pandas python seaborn startup-funding
Last synced: 09 May 2026
https://github.com/emanoelcampos/python-onemonth
This repository contains educational materials and projects developed during a Python course offered by OneMonth. It covers Python basics, intermediate concepts, web development with Flask, and data analysis with pandas. The course is structured into weeks, each focusing on a different aspect of Python programming and its applications.
data-analysis flask jupyter-notebook onemonth python python3
Last synced: 09 May 2026
https://github.com/tsbarr/toronto-open-data
Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses
api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics
Last synced: 09 May 2026