Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/marcinz20/anomaly-detection-in-credo-dataset
University project, which goal is to build a system, that detects anomalies in CREDO dataset
credo data-analysis data-science encoder-decoder-model jupiter-notebook pca-analysis python3
Last synced: 29 Apr 2026
https://github.com/nivasharmaa/spiderverse
A comprehensive Java program for analyzing and managing events and data points within a fictional spiderverse. Features event handling, anomaly detection, cluster management, and robust file I/O operations.
advanced-algorithms anomaly-detection clustering data-analysis file-io object-oriented-programming
Last synced: 29 Apr 2026
https://github.com/vanshuchaudhary/zomato
This Jupyter Notebook contains an exploratory data analysis (EDA) of Zomato restaurant data. It includes data cleaning, visualization, and insights into restaurant ratings, pricing, cuisine distribution, and location-based trends.
business-analytics data-analysis data-mining data-science data-visualization datascience matplotlib pandas-dataframe pandas-python python python-3 python-library
Last synced: 29 Apr 2026
https://github.com/kasraskari/learn-r-codes
A learning repository for R programming, covering data manipulation, visualization, and statistical analysis. (Work in progress!) 🚧
data-analysis data-analysis-r data-visualization r r-examples r-graphics r-statistics statistics
Last synced: 08 Jun 2026
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the json files scrapped by the scrapper medium_scrapper_post.py. To extract information from json files scrapped by medium_scrapper_tag_archive.py (scrapping from tags archive) then use Data_Extraction_Archive_Tags.py
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 29 Apr 2026
https://github.com/eco786786/restaurant_orders
This analysis seeks to uncover patterns in customer behaviour by examining restaurant order data.
data-analysis git postgresql tableau
Last synced: 29 Apr 2026
https://github.com/saroshfarhan/kaggle-playground-s4e11
Kaggle old competirion just for practice
data-analysis data-science data-visualization jupiter-notebook python3
Last synced: 29 Apr 2026
https://github.com/i7t5/sentimentnlp
Sentiment analysis for COMP 435 Introduction to Machine Learning, Spring 2025
data-analysis jupyter-notebook machine-learning nlp python sentiment-analysis
Last synced: 29 Apr 2026
https://github.com/prince-pastakiya/human-resources-tableau-project
👥 Interactive Tableau dashboard for HR analytics — includes workforce overview, demographics, income analysis, and detailed employee records with full filtering.
chatgpt data-analysis data-visualization human-resources numpy python python-faker tableau-dashboards tableau-public
Last synced: 18 Apr 2026
https://github.com/findmyway/dataframe-in-julia
A quick introduction of DataFrame in Julia for users from Python
data-analysis dataframe julia jupyter-notebook
Last synced: 29 Apr 2026
https://github.com/lankesathwik7/sql-query-assistant
Natural language to SQL query converter using Groq LLM. Ask questions in plain English and get SQL queries, visualized results, and natural language explanations. Built with Streamlit and PostgreSQL.
data-analysis database groq llm natural-language-processing python sql
Last synced: 29 Apr 2026
https://github.com/taljindergill78/yelp-arizona-analysis
This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.
big-data data-analysis data-engineering distributed-computing pyspark spark sql yelp-dataset
Last synced: 29 Apr 2026
https://github.com/carlos-edulira/mbabigdata-projeto
Entrega do projeto MBA Unipe Big Data BI
data-analysis delta minio python spark
Last synced: 29 Apr 2026
https://github.com/mr-dhan/eda-sales-customer-transactions
Dalam dunia bisnis ritel yang kompetitif, pemahaman mendalam terhadap perilaku pelanggan merupakan fondasi penting untuk pengambilan keputusan strategis. Namun, data transaksi pelanggan seringkali berjumlah besar dan kompleks, sehingga memerlukan proses analisis yang efektif untuk mengungkap insight yang berharga.
dashboard data data-analysis data-analysis-python data-science data-visualization eda python
Last synced: 29 Apr 2026
https://github.com/mfakhriazhar/python-data-analyst-tutorial
A collection of My Python learning files for Data Analyst purposes. Covers fundamental to advanced topics such as data exploration, visualization, statistical analysis, and the use of popular libraries like Pandas, NumPy, Matplotlib, and Seaborn. Suitable for personal documentation or shared learning references.
data-analysis data-science data-visualization exploratory-data-analysis portfolio python
Last synced: 29 Apr 2026
https://github.com/farhad-here/textprepx
A Multilingual Text Preprocessing Tool for English and Persian.
cleantext contractions data-analysis deep-learning emoji nlp nltk opp parsivar regex streamlit text-preprocessing textblob
Last synced: 29 Apr 2026
https://github.com/jofaval/melbourne-temperature-timeseries
Timeseries Data Analysis and Forecasting of the daily min temperature in Melbourne from 1981 to 1990
data-analysis data-science data-visualization deep-learning google-colab melbourne python temperature tensorflow timeseries timeseries-analysis
Last synced: 29 Apr 2026
https://github.com/chandantech2023/sales-trend-analysis
This repository features the Superstore Sales Analysis project, demonstrating data cleaning and analysis using Python and SQL, along with interactive visualization in Power BI. .
data-analysis data-science dax kaggle powerbi-desktop python3 sql
Last synced: 29 Apr 2026
https://github.com/sdley/cas_pratique-del_annuel
Del-Annuel est logiciel de deliberation annuelle des ecoles superieures ou universités
data-analysis pandas python tkinter-gui
Last synced: 29 Apr 2026
https://github.com/alunera-data/sql-use-cases
Practical SQL use cases for Business Intelligence and IT Service Management (BI & ITSM)
business-intelligence dashboards data-analysis data-quality eda itsm kpis postgresql process-monitoring query reporting sql sqlserver
Last synced: 29 Apr 2026
https://github.com/meinhere/ta-pendat
Proyek Akhir Mata Kuliah Penambangan Data - Klasifikasi Trauma Pasien Menggunakan Metode Naive Bayes
data-analysis data-mining naive-bayes-classifier python trauma
Last synced: 29 Apr 2026
https://github.com/varshan1123/sql-tableau-project
We analyze key indicators for our pizza sales data to gain insights into our business performance - A Data Analysis Project performed on Tableau & SQL.
analysis data-analysis data-science data-visualization excel mysql powerbi sql sql-server tableau tableau-dashboards
Last synced: 29 Apr 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/shimaa83/eda-repo
Exploratory data analysis for Police and retail dataset in kaggle
Last synced: 29 Apr 2026
https://github.com/dindagustiayu/data-processing
The digital text book to interpreting characterisation results.
characterisation data-analysis gitbook latex-package myst qualitative-analysis quantitative-analysis
Last synced: 08 Jun 2026
https://github.com/sakan811/stress-pattern-occurrence-in-english-words
This project is intended to provide English learners with data that allows them to make a data-driven guess when encountering words that they aren't sure where to stress
data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals
Last synced: 20 Jun 2026
https://github.com/al-ghaly/e-commerce-a-b-testing
A Statistical Analysis project in which I Performed an A/B test to analyze the effect of changing the user interface for an E-Commerce company's Website.
data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics
Last synced: 29 Apr 2026
https://github.com/yimethan/basics-of-data-analysis
2023-2 Basics of Data Analysis
data-analysis numpy pandas python
Last synced: 29 Apr 2026
https://github.com/nishumehta/supermart-grocery-sales-retails-analytics
Tableau Dashboard Link :
data-analysis data-cleaning data-visualization jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn
Last synced: 29 Apr 2026
https://github.com/akorablov/project_data_jobs
Looking into the data job market across Czechia, Slovakia, and the US.
data-analysis data-science git job-market jupyter learn-to-code matplotlib pandas python seaborn skills visual-studio-code visualisation
Last synced: 29 Apr 2026
https://github.com/ahshah322/world-happiness-report-2025
Data analysis and visualization of the World Happiness Report 2025 using Python (pandas, seaborn, matplotlib). Explores how GDP, health, freedom, generosity, and corruption perception influence global happiness.
data-analysis data-science matplotlib numpy pandas python seaborn worldhappiness
Last synced: 29 Apr 2026
https://github.com/fanisgl/video-games-sales-data
Data Analysis of Sales Dataset using Python.
data-analysis data-science data-visualization dataset jupyter-notebooks matplotlib numpy pandas poisson-distribution python python3 sales statistics
Last synced: 29 Apr 2026
https://github.com/farhad-here/student_performance_analyzer
Student Performance Analyzer with python, it is on of my data analysis course project. I teach you about filter(),lambda,map() in python
data-analysis data-visualization filter kaggle kaggle-dataset lambda map pandas python python-tutorial streamlit
Last synced: 29 Apr 2026
https://github.com/alam025/invoice-generator
Processed 500+ invoices with automated payment reminders and multi-currency PDF generation
api data-analysis finance fintech nextjs pdfkit prisma python stripe
Last synced: 08 Jun 2026
https://github.com/dcs-training/intro-to-statistics
Intro to Statistics workshop. In this repo, you are going to find the code and files we are going to use for the practical part of the workshop, together with the ppt associated with this training. Go to the readme file
data-analysis data-visualisation data-wrangling r statistics
Last synced: 20 Jun 2026
https://github.com/jelhamm/model-ensembles-bagging-in-machine-learning
"This repository contains implementations of Boosting method, aimed at improving predictive performance by combining multiple models. by using titanic database."
bagging bagging-ensemble data-analysis database-analysis datamining datamining-algorithms ensemble-machine-learning ensemble-model jupyter-notebook machine-learning-models machine-learning-projects matplotlib module numpy-library pandas-library python sklearn-library
Last synced: 30 Apr 2026
https://github.com/marknature/machine-learning-intern
Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset
data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn
Last synced: 10 Apr 2026
https://github.com/angchekar28/air-quality-index-analysis
This project analyzes Air Quality Index (AQI) data to identify pollution trends, seasonal variations, and the impact of different pollutants. It includes data visualization, correlation analysis, and insights into air quality variations over time.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python
Last synced: 30 Apr 2026
https://github.com/ykbintang/ecommerce-dashboard
data-analysis python streamlt-cloud-deploy
Last synced: 30 Apr 2026
https://github.com/aidan-zamfir/advt-analysis
Web scrapping project. Will eventually use character/episode data for NLP & networking/ data analysis .
data-analysis nlp python selen webscraping
Last synced: 23 Aug 2025
https://github.com/prady2309/email-spam-detection-with-machine-learning
Implemented using Naive Bayes Algorithm
data-analysis data-science machine-learning python
Last synced: 30 Apr 2026
https://github.com/avazasgarov/soccer-hypothesis-testing
Statistical analysis comparing goal-scoring patterns in Men’s vs. Women’s FIFA World Cups using hypothesis testing.
data-analysis eda hypothesis-testing matplotlib-pyplot pandas pingouin python scipy
Last synced: 30 Apr 2026
https://github.com/aishwaryagade02/loan-funnel-optimization-analysis
Tracks how loan applications move through each stage, helps spot where people drop off, and gives clear insights to improve approval strategies and overall performance.
ab-testing data-analysis data-creation hypothesis-testing python reporting sql statistical-methods streamlit
Last synced: 30 Apr 2026
https://github.com/praveendecode/analytics-for-hospitals-health-care-data
Analytics for Hospitals' Health-Care Data
covid-19 data-analysis data-visualization exploratory-data-analysis ibm-cognos-analytics ibm-watson medical-domain python
Last synced: 30 Apr 2026
https://github.com/srinibas-masanta/ibm-applied-data-science-capstone
This repository contains the work completed for the Applied Data Science Capstone Project offered by IBM on Coursera. The capstone project is the final course in the IBM Data Science Professional Certificate series and serves as an opportunity to apply the skills and knowledge gained throughout the series to a real-world data science problem.
capstone-project data-analysis data-science data-visualization machine-learning python web-scraping
Last synced: 30 Apr 2026
https://github.com/samuelpillai/machine-learning-classification-regression-nlp
A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.
classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining
Last synced: 30 Apr 2026
https://github.com/celineboutinon/laplace-immo
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Projet 3
data-analysis data-analytics data-structures database-design database-schema databases mysql-connector-python mysql-workbench python sql
Last synced: 30 Apr 2026
https://github.com/shishirshekhar/diabetes-prediction
This is a early diabetes prediction web app
data-analysis data-visualization decision-tree-classifier machine-learning streamlit streamlit-application streamlit-dashboard streamlit-web streamlit-webapp visualization
Last synced: 30 Apr 2026
https://github.com/mfakhriazhar/nlp-movie-recommender-system
This project is a content-based movie recommender system built using Natural Language Processing (NLP) techniques. By extracting and combining important text features from movie metadata, this system suggests movies that are similar to a user's selected title.
data-analysis data-science deep-learning machine-learning natural-language-processing python recommender-system
Last synced: 30 Apr 2026
https://github.com/celineboutinon/little-lemon
Meta Database Engineer Professional Certificate - Capstone Project
data-analysis data-analytics data-structures data-visualisation database-design database-schema databases mysql-connector-python mysql-workbench python sql tableau-dashboards
Last synced: 30 Apr 2026
https://github.com/aniketmondal/dataanalysis
Contains cleaning, transformation, and exploratory analysis of various data sets using Python Pandas, NumPy, re, random, etc.
analysis data-analysis data-science pandas python
Last synced: 30 Apr 2026
https://github.com/busra-deveci/kaggle-iris_data_analysis
Exploratory data analysis and visualization of the Iris dataset using Python.
data-analysis iris-dataset kaggle pandas python seaborn visualization
Last synced: 30 Apr 2026
https://github.com/fbarffmann/credit-risk-classification
Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.
classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn
Last synced: 30 Apr 2026
https://github.com/devexpress-examples/wpf-pivot-grid-provide-custom-summary-values
This example demonstrates how to determine the value type when you calculate custom summary values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 01 May 2026
https://github.com/fazatholomew/marlboroplan
In order to contribute to a more inclusive sustainable energy program in Massachusetts, this project is part of my work for a nonprofit organization called All In Energy and undergraduate thesis for my degree.
data-analysis data-visualization energy jupyter-notebook massachusetts python
Last synced: 01 May 2026
https://github.com/falakrana/data-analysis-visualization
This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.
data-analysis data-visualization python tableau-public
Last synced: 01 May 2026
https://github.com/devag2004/electricity-analysis-using-spark
electricity analysis project made using spark
data-analysis spark spark-mllib
Last synced: 01 May 2026
https://github.com/mmfava/lonomia-host-plants-2024
This project investigates the relationship between Lonomia achelous and Lonomia obliqua caterpillars and their host plants. The project uses Docker for a consistent environment and R for statistical analysis, with detailed processes documented in Jupyter notebooks.
data-analysis host-plants lonomia lonomism r
Last synced: 01 May 2026
https://github.com/aonurakman/data-analysis-and-ml-algorithms
An exploration of data analysis techniques and standard ML algorithms on QSAR oral toxicity dataset. - 2021 - Yıldız Technical University
classification clustering data-analysis data-mining isolation-forest python regression
Last synced: 20 Jun 2026
https://github.com/virajbhutada/music-store-data-analysis-sql
Hands-on SQL data analysis project for music store. Enhance proficiency with database queries. Ideal for practitioners seeking real-world analytics experience. Gain insights into customer behavior, revenue trends, and genre preferences, empowering strategic decision-making in the music industry. Explore the project for a rich learning experience.
data-analysis data-insights data-science database genre-prediction music-industry music-store postgresql postgresql-database query-optimization revenue-trends sql sql-queries
Last synced: 01 May 2026
https://github.com/bpkaur/a-network-analysis-of-game-of-thrones
A Network analysis of Game of Thrones: To analyze the co-occurrence network of the characters in the Game of Thrones books
data-analysis data-science machine-learning networkx python3
Last synced: 01 May 2026
https://github.com/filip-kustura/data-warehouse-olympics
This project, part of the elective Advanced Database Systems course, involved building a data warehouse based on the already existing database in PostgreSQL. It focuses on analyzing Olympic Games data across time, covering athletes' performance by discipline, location, and other dimensions. Implemented in Spring 2022.
data-analysis data-warehouse database extract-transform-load olympic-games postgresql sql star-schema university-project
Last synced: 01 May 2026
https://github.com/sairupeshl/leo-orbital-congestion-analysis
Geospatial data analysis of the UCS Satellite Database using Python to map active LEO space assets, validate orbital parameters, and isolate mega-constellation traffic bottlenecks.
aerospace-engineering data-analysis geospatial-analysis orbital-mechanics pandas python satellite-data seaborn
Last synced: 08 Jun 2026
https://github.com/manjit-baishya-datascience/flipkart-laptop-listing-eda
This project analyzes laptop price data from Flipkart using AutoScraper for web scraping. It includes data loading, EDA, cleaning, statistical analysis, and visualization. The goal is to derive insights for pricing strategies and market positioning. Explore the repository for detailed documentation and code.
data-analysis ecommerce-platform flipkart laptop python
Last synced: 08 Jun 2026
https://github.com/pablo1785/receipt-rs
Receipt processing backend built with Shuttle.rs, Axum and Azure Form Recognizer API
api-rest axum azure backend cognitive-services computer-vision data-analysis rust shuttle-rs sqlx
Last synced: 01 May 2026
https://github.com/nel-zi/city_logistics
Built an automated, scalable Azure cloud data infrastructure for City Logistics, integrating market trends to optimize operations and enhance decision-making.
azure azure-cloud-services data-analysis data-automation data-cleaning data-engineering data-transformation
Last synced: 01 May 2026
https://github.com/pratanup/solar-power-generation-prediction
A solar power generation company wants to optimize solar power production and needs the prediction model to predict ‘Clearsky DHI’, ‘Clearsky DNI’, ‘Clearsky GHI’.
anaconda data-analysis data-science google-colab jupiter-notebook machine-learning machine-learning-algorithms machinelearning-python prediction prediction-model python
Last synced: 01 May 2026
https://github.com/fbarffmann/project1
Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.
box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn
Last synced: 01 May 2026
https://github.com/codesaadumair/data-science-monorepo
Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.
data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python
Last synced: 01 May 2026
https://github.com/kavicastelo/soil-fertilizer-analysis-colab
This repository includes a data analysis and model training practical Jupyter notebooks using a soil fertilizer dataset. (use 4th edition)
data-analysis jupyter-notebook python
Last synced: 01 May 2026
https://github.com/abdoomohamedd/python-data-analysis-projects
A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 01 May 2026
https://github.com/rafath0ssain/predihome
Data analysis using economic factors affecting living conditions across Canadian provinces.
data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr
Last synced: 01 May 2026
https://github.com/leftcoastnerdgirl/introduction_to_pandas
This project introduces the use of Python in a JupyterNotebook.
analytics budget-analysis budget-planner-tool budget-planning data-analysis dataframes jupyter-notebook pandas pandas-python python
Last synced: 01 May 2026
https://github.com/rodrigojunqueiradev/100-days-of-code-bootcamp
100 Days of Code: The Complete Python Pro Bootcamp
data-analysis data-science python python-3 python-library python-script python3
Last synced: 20 Jun 2026
https://github.com/celineboutinon/client-segmentation
CentraleSupélec/OpenClassrooms Data Scientist 2024-2025 - Projet 5
aws client-segmentation cloud-architecture data-analysis data-science data-visualization database e-commerce marketing marketing-analytics marketplace-solution
Last synced: 01 May 2026
https://github.com/vedantshi/stock-price-prediction-for-maang-companies
This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.
data-analysis data-visualization lstm machine-learning python stock-price-prediction
Last synced: 01 May 2026
https://github.com/guptakushal03/whatsapp-chat-analyser
The WhatsApp Chat Analyzer is a Python-based tool built with Streamlit for analyzing WhatsApp chat data. It provides insights such as total messages, word count, media shared, links shared, monthly activity timeline, most active users, activity maps, and word clouds.
chat-analysis data-analysis data-visualization python streamlit text-processing whatsapp word-cloud
Last synced: 01 May 2026
https://github.com/bhoyarapurva23399/mini-erp-inventory-billing
Lightweight ERP inventory and billing web app built using Python Flask and SQLite — featuring product, customer, and dashboard management.
backend data-analysis erp flask inventory-billing mini-project python sqlite
Last synced: 01 May 2026
https://github.com/ujjwalll/get-that-flair
It is a repository for project detecting the flair of reddit post through their links. You can find the working model of it at - https://get-that-flair.herokuapp.com/
data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling
Last synced: 01 May 2026
https://github.com/sanikamal/machine-learning-atoz
Beginner-friendly machine learning tutorials and mini-projects.
collaborative-filtering data-analysis data-visualization decision-trees kmeans-clustering knn machine-learning machine-learning-algorithms recommender-system regression svm
Last synced: 08 Jun 2026
https://github.com/bheemisme/icc-t20-world-cup-dashboard
2024 icc t20 world cup dashboard
dashboard data-analysis data-analytics data-science data-visualization matplotlib pandas seaborn
Last synced: 02 May 2026
https://github.com/haseebn19/urban-housing-demand
A full-stack web application for visualizing housing and labour market data
data-analysis data-visualization docker full-stack gradle statistics web webapp
Last synced: 22 Jun 2026
https://github.com/maxwelllzh/linearizer
Linearizing parameters for linear regression
data-analysis machine-learning scikit-learn
Last synced: 02 May 2026
https://github.com/harshindcoder/salifort_motors_project
This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.
data-analysis data-science data-visualization hr-analytics machine-learning tree-models
Last synced: 02 May 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends
Last synced: 02 May 2026
https://github.com/teja-1403/ignosis-tech-ml-assignment
Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.
customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python
Last synced: 02 May 2026
https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Unicorns of India till Sep 2022.
analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 02 May 2026
https://github.com/asadiahmad/word-counter-spark
Word counter with spark
data-analysis nlp spark word-counter
Last synced: 02 May 2026
https://github.com/suma-aljudaia/my-portfolio
Suma Aljudaia | Portfolio – AI & Data Analysis Enthusiast
ai css data-analysis html machine-learning portfolio
Last synced: 02 May 2026
https://github.com/hamzazafar10/movie-recommendation-system
Content based movie recommendation system using cosine similarity.
cosine-similarity data-analysis data-preprocessing data-science data-structures data-visualization jupyter-notebook machine-learning movie-recommendation python
Last synced: 02 May 2026
https://github.com/amishidesai04/emergency-calls-data-analysis-project
Welcome to the Emergency Calls Data Analysis project repository. This project is dedicated to extracting, processing, and visualizing data from the "Emergency – 911 Calls, Montgomery County" dataset, sourced from Kaggle. The main objective is to analyze trends in emergency calls in Montgomery County, Pennsylvania, spanning multiple years.
analysis data-analysis data-extraction data-processing data-science data-visualization numpy pandas python seaborn
Last synced: 02 May 2026
https://github.com/rorrell/employmentdata
A Jupyter Notebook where I use group by to analyze the average unemployment rate by year
data-analysis data-visualization jupyter-notebook python3
Last synced: 02 May 2026
https://github.com/isaqueiros/motorpremium-predictions-mlpclassifier
This Jupyter Notebooks is an initial study of the application of sklearn neural network MLP Classifier model. The model is applied to dataset MotorPremiums, which is supplied separately in .csv format.
data-analysis data-science machine-learning neural-network python sklearn-library
Last synced: 02 May 2026
https://github.com/chuxinh/our-data-manual
All in one place for our data science learning journey by Chuxin and Melody
data-analysis data-science machine-learning python
Last synced: 09 Jun 2026
https://github.com/neuro-mechatronics-interfaces/matlab_analyses
Tools for analysis, statistics, and/or simulation in Matlab.
data-analysis data-visualization matlab matlab-codes matlab-functions matlab-gui matlab-scripts neuroscience weber-lab
Last synced: 09 Jun 2026
https://github.com/leticiamilan/santander-tech-data-science
Este repositório contém os projetos desenvolvidos durante o curso de Data Science, uma parceria entre a Ada Tech e o Santander Open Academy. O curso é dividido em vários módulos, cada um focado em um aspecto fundamental da ciência de dados.
ada-tech calculus data-analysis data-science letscode python santander
Last synced: 09 Jun 2026
https://github.com/oyebamiji-micheal/data-analysis-with-python-zero-to-pandas
This repository contains all assignments and project completed when I took a course, "Data Analysis with Python: Zero to Pandas", on Jovian
data-analysis numpy pandas python
Last synced: 10 Apr 2026
https://github.com/bhaveshbhakta/gold-price-prediction-using-ml
Gold Price Prediction
data-analysis data-visualization gold-price-prediction machine-learning python
Last synced: 02 May 2026