Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/theveryhim/frequent-item-sets-and-lsh
A practice on finding frequent item sets and similar items in pysaprk framework
big-data data-analysis frequent-itemset-mining locality-sensitive-hashing pyspark text-processing
Last synced: 03 Jul 2025
https://github.com/fbarffmann/mycitibike
Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.
citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping
Last synced: 07 Jul 2025
https://github.com/dulajkavinda/pandas-exploring-data-ml
🐼 Exploring data with pandas library.
data-analysis machine-learning pandas python
Last synced: 09 May 2026
https://github.com/ernanej/data-science-dca0131
Files, developed throughout the 2024.1 semester of the Data Science discipline taught at the Federal University of Rio Grande do Norte by the Department of Computer Engineering and Automation (DCA). 📚
big-data data-analysis data-science ia
Last synced: 30 Mar 2025
https://github.com/chen0040/spark-tabular-analytics
Spark statistical inference framework for performing column pair-wise data analytics for large data table
anova chi-square-test confidence-intervals data-analysis hypothesis-testing spark statistical-inference tabular-data
Last synced: 07 Jul 2025
https://github.com/masamallow/jupyterlab-my-local
Configuration to run my personal JupyterLab on my local.
data-analysis jupyter jupyter-notebook jupyterlab
Last synced: 26 Mar 2025
https://github.com/deliprofesor/behavioral-insights-and-data-exploration
This project analyzes Spanish speech data, focusing on acoustic features and demographics. It includes data cleaning, outlier detection, clustering, and time series modeling (ARIMA, Holt-Winters) to uncover patterns in speech duration and word frequency.
acoustic-features arima clustering data-analysis holt-winters k-means machine-learning speech-analysis time-series-analysis
Last synced: 10 Apr 2025
https://github.com/matt-ags/jornada-python
Repositório com os projetos realizados durante a semana "Jornada Python" - 01/2025
artificial-intelligence automation data-analysis jupyter-notebook machine-learning python
Last synced: 05 May 2026
https://github.com/noodleslove/house-of-representatives-analysis-ii
In this project, we want to estimate if a transaction will have capital gains exceeding $200 using the provided dataset.
coursework data-analysis data-science eda feature-engineering pandas python3
Last synced: 12 Apr 2026
https://github.com/prajjwol09/data-cleaning-project
This project is dedicated to cleaning, standardizing a dataset, dealing with null values from a CSV file named "layoffs" using MySQL, with MySQL Workbench as the workspace environment. The goal is to prepare the data for analysis.
cleaning-data columns data-analysis database duplicates mysql rows standard
Last synced: 20 Apr 2026
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/jnnjenga/video-game-analysis
Performs exploratory data analysis on a video game dataset, examining titles, release dates, teams, ratings, genres, and user engagement metrics to uncover trends, popular genres, and developer insights.
data-analysis data-science dataset eda exploratory-data-analysis game-analysis games genres jupyter-notebook pandas python trends user-engagement video-game visualization
Last synced: 13 Apr 2026
https://github.com/codesaadumair/pandas_exercises_personal
Personalized enhancements to pandas exercises with comprehensive solutions and practical insights for mastering data analysis in Python.
data-analysis data-science pandas python
Last synced: 09 May 2026
https://github.com/paphada1103/data-analysis-with-python
📊 Analyze data efficiently using Python’s top libraries. Learn to explore, clean, and visualize data for meaningful insights in your projects.
carpentries data-analysis data-carpentry data-visualisation dataframe-api dataset english hacktoberfest ibm jovian lsl machine-learning matplotlib programming python realtime social-sciences spark
Last synced: 09 May 2026
https://github.com/pranavsp108/time-series-forcasting
A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.
data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost
Last synced: 10 Apr 2026
https://github.com/ankitwalimbe/ecommerce-funnel-analysis
SQL-based analysis of the Olist e-commerce dataset — building an order funnel (purchase → approval → delivery) with breakdowns by payment type, product category, region, and monthly trend. Includes insights, CSV exports, and Tableau dashboard.
bigquery business-intelligence data-analysis ecommerce funnel-analysis sql tableau-public
Last synced: 05 Oct 2025
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/jimartskenya/ai-code-context
🤖 Automate code documentation with AI to enhance understanding and streamline your workflow, saving time on unfamiliar codebases and projects.
ai claude-code codebase-analysis context-management data-analysis dependency-analysis gemini intellij-plugin jupyterlab-extension llm-integration machine-learning mcp-server open-source pandas prompt-engineering streamlit-component token-reduction vibe-coding
Last synced: 08 May 2026
https://github.com/madhurragarwal/advertising-data-set---eda-and-ml
Logistic Regression and EDA done on Advertising Data set
data-analysis machine-learning
Last synced: 13 May 2026
https://github.com/manganite/vibespin
VibeSpin is a Python framework for simulating and analyzing 2D lattice spin systems (Ising, XY, and q-state Clock models) with Numba-accelerated Monte Carlo dynamics, correlation/structure diagnostics, and reproducible benchmarking workflows.
clock-model critical-phenomena data-analysis ising-model lattice-models monte-carlo-simulation phase-transitions physics-simulation python scientific-computing spin-models spin-systems statistical-mechanics xy-model
Last synced: 29 Jun 2026
https://github.com/satvikpraveen/rsvp_case_study
A comprehensive IMDB dataset analysis using SQL. Includes database setup, advanced queries, and actionable insights. Organized with files for database creation, queries, and solutions. Features an Entity-Relationship Diagram (ERD), executive summary, and SQL scripts. Perfect for SQL workflows and business intelligence in the film industry.
aggregate-functions business-intelligence common-table-expressions data-analysis data-driven-decisions data-querying database-design entity-relationship-diagram imdb-dataset relational-database sql subqueries-and-joins
Last synced: 11 Jan 2026
https://github.com/davifeliciano/modern_physics_experiments
Collection of data analysis and visualization scripts developed in Python around some modern physics experiments
data-analysis data-visualization modern-physics physics physics-experiments
Last synced: 18 Jan 2026
https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit
Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.
analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics
Last synced: 08 May 2026
https://github.com/nuriadevs/informes-powerbi
Este repositorio contiene informes elaborados con Power BI.
Last synced: 18 Feb 2026
https://github.com/tomijuarez/lemmatisation
Lemmatisation fully implemented in Java.
algorithms data-analysis data-science java-8 lemmatization oop
Last synced: 08 Apr 2025
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/saiteja-talluri/data-analytics-assignement
Report on World Happiness Data (Data Analysis and Visualisation of the data)
data-analysis data-visualization ipynb-jupyter-notebook
Last synced: 20 Jan 2026
https://github.com/shourya1997/boston_housing
In this project, you will apply basic machine learning concepts on data collected for housing prices in the Boston, Massachusetts area to predict the selling price of a new home.
boston-housing-dataset data-analysis jupyter-notebook machine-learning python unsupervised-machine-learning
Last synced: 18 May 2026
https://github.com/celineboutinon/bookworms
OpenClassrooms Data Analyst 2022-2023 - Projet 6
apriori-algorithm data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot mlxtend numpy pandas python scikit-learn scikit-posthocs scikitlearn seaborn statsmodels
Last synced: 05 May 2026
https://github.com/omarsolieman/socialgiveawaydataanalysis
This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis
data-analysis data-science data-visualization instagram scraping threejs
Last synced: 14 May 2026
https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset
This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.
data-analysis data-visualization excel project
Last synced: 18 Jan 2026
https://github.com/jacktheprogrammer/time-series-forecasting-and-analysis
My personal project consisting of my personally created notebooks to work with time series forecasting and analysis. In these projects, I've used deep learning using tensorflow, xgboost, statsmodels and scipy libraries of python. The series were of weather, energy consumption and that of stocks.
data-analysis data-science deep-neural-networks energy-consumption machine-learning portfolio prophet-facebook prophet-model python python3 scipy statsmodels stocks tensorflow time-series time-series-analysis timeseries-forecasting weather xgboost
Last synced: 05 May 2026
https://github.com/seif-elkateb/dataset-analysis-r
cu-boulder data data-analysis datamodeling datascience ms-ds msds434 r
Last synced: 01 Apr 2025
https://github.com/pranavsp108/market_basket_analysis-instacart
Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.
customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/ashwin331133/hospital_allpatients_waitinglist_data
This Power BI project analyzes patient waiting lists across various medical specialties and case types (Day Case, Inpatient, Outpatient). The goal is to gain insights to improve healthcare management and resource allocation.
data-analysis data-visualization powerbi
Last synced: 03 Jan 2026
https://github.com/aniruddha-biswas/jpmorgan-chase-excel-internship
JPMorgan Chase & Co.'s Excel Skills on Forage Virtual Internship
conditional-formatting data-analysis data-cleaning data-visualization excel excel-dashboard macos pivot-tables power-query shortcuts storytelling vba-excel
Last synced: 01 Apr 2025
https://github.com/alexquilis1/spanish-fuel-stations-analysis
Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons
data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse
Last synced: 08 Oct 2025
https://github.com/inddrsingh/e-commerce_orders
ETL project, with Python for Data cleaning and MySQL for Data analysis
data-analysis etl-pipeline mysql python
Last synced: 18 Apr 2026
https://github.com/hari7261/data-visualization
Python-based application built using CustomTkinter for the graphical user interface (GUI) and Matplotlib for data visualization. It allows users to import datasets, perform real-time data visualization, and analyze data using various chart types and machine learning techniques.
data-analysis data-visualization export hari7261 import python realtime-visualization
Last synced: 17 Jun 2025
https://github.com/porimol/employee-turnover-prediction
Employee turover prediction using machine learning
data-analysis data-mining data-science data-visualization datascience machine-learning prediction predictive-modeling
Last synced: 23 Feb 2026
https://github.com/jlee9503/telecommunication-churn
Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.
data-analysis data-visualization matplotlib pandas python scikit-learn
Last synced: 18 Jan 2026
https://github.com/faisal-khann/ipl-analysis
The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.
data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn
Last synced: 08 May 2026
https://github.com/wardenkenny/data-analyst-portfolio
A repository I have created to show and explore data analytics.
data-analysis excel r spreadsheets sql tableau
Last synced: 02 Apr 2025
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 05 May 2026
https://github.com/marianamartiyns/api-logisticregression
Data analysis, modeling, and deployment of a logistic regression model for churn prediction, integrating a FastAPI backend and a Streamlit frontend.
data-analysis data-science fastapi logistic-regression pyhton streamlit
Last synced: 29 Apr 2026
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 May 2026
https://github.com/hilalguleryuz/excel_supermarket_data_analysis_project
Supermarket Data Analysis with Excel
dashboard data-analysis data-visualization excel microsoft-excel supermarket supermarket-data-analysis supermarket-dataset
Last synced: 06 Jan 2026
https://github.com/alokthedataguy/financial-friend-web-app
Financial Friend is a privacy-first web app that takes a user’s payment statement (PhonePe, GPay, bank CSV/PDF), cleans and understands it, and then talks back like a friend—giving simple, human answers (plus a few tiny visuals) to questions people actually care about.
data-analysis data-science data-visualization fastapi finance-management financial-analysis financial-data insights personal-finance-and-data-anlaysis python react
Last synced: 14 Apr 2026
https://github.com/kathkoeh/pimaindian-kk
Logistic regression analysis of diabetes risk using the Pima Indians dataset. Includes prevalence analysis, modeling, ROC/AUC evaluation, and patient testing in Python.
data-analysis diabetes epidemiology logistic-regression machine-learning public-health python
Last synced: 28 Apr 2026
https://github.com/atiqisrak/py
This repository houses the code and resources for the **100 Days of Python Challenge** – an intensive learning journey designed to propel you from beginner to a a confident Python programmer in just 100 days.
data-analysis data-science machine-learning python3
Last synced: 10 Oct 2025
https://github.com/sabdikay/analysis-of-biodiversity
This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.
data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 14 Apr 2026
https://github.com/jatin-s16/digital-marketing
This repository contains raw data for Marketing analysis along with key business questions. I performed data cleaning using Python and its libraries and extracted meaningful insights. The results were then visualised using Tableau to enhance business understanding.
data-analysis data-science python3 tableau
Last synced: 16 Mar 2025
https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 14 May 2026
https://github.com/suhailsallam/tips_dashboard
Dashboard using Python & Streamlit
dashboard data-analysis data-analytics data-science data-scientist data-visualization python streamlit streamlit-dashboard streamlit-webapp
Last synced: 21 Jan 2026
https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project
My project aims to practice Data Analysis and Data Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python seaborn
Last synced: 04 May 2026
https://github.com/3rd-son/movie-streaming-service-analysis
Exploratory Data Analysis of the Streaming Services like Neflix, Hulu, Disney+ etc
data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python seaborn
Last synced: 18 Apr 2026
https://github.com/scarlet-enlight/ml_project
Comparison of different classifiers (KNN, Naive Bayes, Decision Tree) on Sleep Health and Lifestyle Dataset
data-analysis machine-learning
Last synced: 13 Mar 2026
https://github.com/shrunga92/restaurant_order_analysis_sql
This project is a structured SQL-based analysis of restaurant orders, aimed at deriving key insights from transactional data.
Last synced: 03 Jul 2025
https://github.com/jiwookseo/natural_language_analysis
api sample for google natural language and ECOS(한국은행 경제통제시스템)
data-analysis google-natural-language-api text-analysis
Last synced: 11 Oct 2025
https://github.com/arthurosipyan/jamascript
JamaScript, your personal data assistant
blueprint data-analysis data-mining data-science exceldatareader jama jama-api python python-3 python-script python3
Last synced: 03 Sep 2025
https://github.com/npodlozhniy/podlozhnyy-module
One place for the most useful methods for work
data-analysis data-science pypi
Last synced: 21 Jan 2026
https://github.com/donmaruko/python-eda-toolkit
CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.
data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud
Last synced: 06 May 2026
https://github.com/zenithclown/finfolio
A Personal Finance Management Tool for the Developers, by the Developer
data-analysis data-science finance finance-application finance-management good-habits personal-finance portfolio
Last synced: 04 Feb 2026
https://github.com/ak-abhilash/super-market-sales-data-analysis-and-forecasting-using-power-bi
Power BI project to visualize sales data of a supermarket.
dashboard data-analysis powerbi salesdata visualization
Last synced: 05 Feb 2026
https://github.com/ndomah1/learning-probability-and-statistics
This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.
correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics
Last synced: 18 Jan 2026
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/eubrunoo/beer-consumption-predictor
An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.
data-analysis data-science data-visualization machine-learning r statistical-analysis statistics
Last synced: 02 Apr 2025
https://github.com/comsavvy/water-analysis-project
Project on water analysis
data-analysis data-visualization predictive-analytics python water-analysis
Last synced: 21 May 2026
https://github.com/sorebit/pdrpy-pd-2
Data analysis of various stackechange.com archives.
data-analysis stackexchange time-travel university-project
Last synced: 08 Oct 2025
https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report
This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.
dashboard data-analysis data-visualization financial-analysis powerbi statistics
Last synced: 21 Jan 2026
https://github.com/sarvesh2304/stellarator_simulation
A comprehensive Julia package for stellarator fusion reactor physics analysis featuring 3D magnetic field calculations, neoclassical transport modelling, quasi-isodynamic optimisation algorithms, and interactive 3D visualisations. Includes tokamak comparison framework and high-resolution plotting capabilities for fusion research.
3d-visualisation data-analysis field-line-tracing fusion-physics fusion-research interactive-3d julia magnetic-confinement magnetic-field-calculations magnetic-surfaces matplotlib neoclassical-transport numerical-methods optimisations physics-simulation plasma-physics plotly quasi-isodynamic stellarator stellarator-optimization
Last synced: 09 Oct 2025
https://github.com/ibromeat/market-orders-analysis
Data analysis of CRM market orders dataset
data-analysis jupyter-notebook machine-learning pandas python visualization
Last synced: 01 May 2026
https://github.com/amish5ingh/cricket-data-analytics-ipl
Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.
data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn
Last synced: 09 May 2026
https://github.com/ryuzen6/bangalore-real-estate-price-prediction
This is a Data Science Project which predicts the cost of Real Estate in Bangalore. Requirements: Jupyter Notebook (for Data Cleaning and creating the Linear Regression using various python libraries) , Pycharm (python IDE for creating Python Flask Server), Visual Studio Code (to create the UI with HTML, CSS and Javascript).
css3 data-analysis data-science html5 javascript jupyter-notebook machine-learning python3
Last synced: 06 May 2026
https://github.com/jhaayush2004/churncast
Fusion of deep Data Science, Machine Learning and MLOps...
aws data-analysis data-science data-visualization deep-neural-networks docker machine-learning mlops-workflow
Last synced: 09 Oct 2025
https://github.com/syarwinaaa09/exploring-nyc-public-school-test-result-scores
📊 analyzing NYC school test scores with python 🐍 to spot top performers 🏆 & trends 📈
data-analysis education pandas python visualization
Last synced: 06 May 2026
https://github.com/priyanshubiswas-tech/priyanshubiswas-tech
SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB
apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql
Last synced: 21 Jan 2026
https://github.com/mateib20/proiect-achizi-ia-i-prelucrarea-datelor
Procesarea semnalului, analiza datelor și analiza spectrală pentru semnal sonor
c c-language c-programming c-programming-language data-analysis data-engineering data-science data-visualization datascience python python-lambda python-library signal-analysis signal-processing
Last synced: 11 Jun 2025
https://github.com/leosimoes/digitalinnovationone-analise-datasets
Projeto prático "Análise de dados com Python e Pandas" do Bootcamp "Banco Carrefour Data Engineer" da Digital Innovation One.
data-analysis data-science python
Last synced: 24 Mar 2025
https://github.com/anandu-jpg/coffee-shop-sales-analysis
This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.
business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton
Last synced: 18 May 2026
https://github.com/yashpaneliya/bank-loan-default-analysis
Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.
data-analysis loan-default-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/ankitwalimbe/sentiment-analysis
Sentiment analysis of Amazon Fashion reviews using VADER and a baseline ML model (TF-IDF + SGDClassifier). Includes visualizations, reproducible notebook, and recruiter-ready documentation.
data-analysis machine-learning matplotlib nlp pandas python seaborn sentiment-analysis sklearn
Last synced: 06 May 2026
https://github.com/imrandil/excel_learning_dir
Excel learning practice with some data, the doing
Last synced: 27 Jan 2026
https://github.com/2013xile/sheethub
Organize, import, export, concatenate sheet files on web application.
data-analysis data-wrangler excel sheets
Last synced: 08 Apr 2025
https://github.com/andersoncrs/prediccion-del-precio-de-vehiculos-un-enfoque-con-regresion-lineal-y-regularizacion
Este proyecto tiene como objetivo predecir el precio de vehículos usados utilizando técnicas de regresión lineal y regularización Lasso. A través del análisis y procesamiento de datos, se construye un modelo predictivo preciso e interpretable basado en las características más relevantes de cada vehículo.
data-analysis data-exploration lasso-regression machine-learning polinomial-regression regularization-methods
Last synced: 03 Jul 2025
https://github.com/jrdnbradford/the-office-us
Data concerning NBC's mockumentary series The Office (U.S. version)
csv data-analysis json the-office xml
Last synced: 19 Jan 2026
https://github.com/mituskillologies/aiml-pcp-jul25
Programs conducted at AI-ML Training Program at Pimpri Chinchwad Polytechnic, Pune in Jul 2025
artificial-intelligence classification clustering data-analysis data-visualization machine-learning matplotlib pandas regression scikit-learn supervised-learning unsupervised-learning
Last synced: 03 May 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python
Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.
data-analysis data-visualization numpy pandas python3 sns visualization
Last synced: 03 May 2026
https://github.com/vineet416/eda-hr-analytics
EDA on HR-Analytics by PW Skills Data Analytics course
data-analysis data-analysis-python data-analytics data-preprocessing data-processing data-visualization exploratory-data-analysis jupyter-notebook matplotlib-pyplot numpy pandas python seaborn statistical-analysis
Last synced: 14 Apr 2026
https://github.com/dhruvil-26/tableau-projects
This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, Team and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns
analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public
Last synced: 19 Jan 2026
https://github.com/1401dev/iowa-liquor-retail-sales-analysis
This repository contains the analysis of Iowa liquor retail sales data, aimed at uncovering sales trends and forecasting future sales patterns. The project involves data cleaning, preparation, and advanced time series analysis using Microsoft SQL Server and Google Colab.
customer-behavior data-analysis data-cleaning data-science data-visualization exploratory-data-analysis forecasting google-colab machine-learning microsoft-sql-server pandas prophet python retail-analytics retail-sales sales-forecasting sales-performance sql statsmodels time-series-analysis
Last synced: 16 Feb 2026
https://github.com/codesaadumair/exploratory-data-analysis
A centralized repository showcasing various Exploratory Data Analysis (EDA) projects using Jupyter notebooks, visualizations, and accompanying documentation.
data-analysis data-science data-visualization eda jupyter-notebook jupyterlab python
Last synced: 24 Mar 2025
https://github.com/silvermete0r/sdu_hackathon_uss_db_analysis
Smart Data Ukimet Hackathon - "Data Modeling" case Solution - Topic: Store Analysis based on Unified Star Schema
data-analysis data-modeling postgresql python sql unified-star-schema
Last synced: 14 Apr 2026
https://github.com/navp7/pizzasales_powerbi
This project involves creating a comprehensive sales performance dashboard using Power BI to visualize and analyze the sales data of an Italian pizza company.
data-analysis ms-sql-server ms-word powerbi visualization
Last synced: 13 Mar 2026
https://github.com/analysisbyvivek/road-accident
Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.
data-analysis data-visualization eda jupyter-notebook kaggle tableau-public
Last synced: 19 Jun 2026
https://github.com/dzakwanalifi/stadata-x
Terminal UI untuk menjelajahi dan mengunduh data BPS Indonesia secara interaktif
bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui
Last synced: 20 Jan 2026
https://github.com/deanlogan/data-analysis-course
Code created when completing the Data Analysis with Python Course on freecodecamp.org
course data-analysis numpy pandas python python3
Last synced: 06 May 2026