Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/celineboutinon/bookworms
OpenClassrooms Data Analyst 2022-2023 - Projet 6
apriori-algorithm data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot mlxtend numpy pandas python scikit-learn scikit-posthocs scikitlearn seaborn statsmodels
Last synced: 05 May 2026
https://github.com/beaprogrammer02345/python_data_analysis
Sales Analysis using Python
data-analysis data-visualization python
Last synced: 05 May 2026
https://github.com/13anush/python-libraries-
A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.
data-analysis matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/zafir100100/cancer-stage-prediction
This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.
cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn
Last synced: 05 May 2026
https://github.com/sajjad425/edaipl
The dataset covers the Indian Premier League (IPL) with details on matches (date, teams, venue, results), player stats (runs, wickets), team stats (wins, losses), season summaries, and umpire info. The EDA reveals patterns and insights, highlighting dominant teams, star players, and trends across seasons.
data-analysis eda exploratory-data-analysis ipl python
Last synced: 05 May 2026
https://github.com/pcanadas/weather_scraper
Este proyecto automatiza la recopilación y el procesamiento de datos meteorológicos históricos y previsionales. Utiliza Selenium para extraer información de sitios web de clima, procesa los datos con Pandas y los almacena en archivos CSV limpios. Es ideal para análisis climáticos, visualización de datos o integración en otros sistemas.
beautifulsoup data-analysis pandas python selenium
Last synced: 05 May 2026
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 May 2026
https://github.com/akash-47-tank/personalized-e-commerce-review-summarizer
Personalized E-commerce Product Review Summarizer: A Streamlit app that summarizes product reviews (e.g., from a CSV) using T5-small and tailors summaries to user preferences (price, durability, etc.) with NLP and lightweight ML.
data-analysis e-commerce machine-learning nlp personalization portfolio python scikit-learn sentiment-analysis streamlit t5 transformers web-app
Last synced: 05 May 2026
https://github.com/akotronis/qualitycontrol
HRH Quality Control app
data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3
Last synced: 05 May 2026
https://github.com/ayaatmohammed/amazon-sales-analysis-pyspark
In-depth analysis of the Olist E-commerce dataset from Kaggle using PySpark for customer segmentation (RFM) and market basket analysis.
big-data big-data-analytics customer-segmentation data-analysis data-science ecommerce jupyter-notebook kaggle pyspark python rfm-analysis
Last synced: 05 May 2026
https://github.com/kammarah/data-sample
I designed a database website 🌐 that can be uploaded easily for use 📤. You can check my website 👀.
data-analysis data-visualization database deploy deployment library-management-system panaversity streamlit webapp
Last synced: 05 May 2026
https://github.com/nimbostratos/titanic-survival-prediction
Machine learning project predicting Titanic survival using AdaBoost with feature engineering and hyperparameter optimization
data-analysis data-science data-science-projects kaggle machine-learning machine-learning-models python scikit-learn
Last synced: 05 May 2026
https://github.com/meinhere/dicoding-analisis-data
Submission Analisis Data dengan tema E-Commerce Streamlit App
data-analysis data-mining e-commerce python streamlit
Last synced: 05 May 2026
https://github.com/hms75/movie_rating_analysis
A movie rating analysis which identifies trends amongst a dataset of 5000 movies.
data-analysis data-visualization matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/iamrajmani/sentimental-analysis
Sentimental Analysis - Final Year College Project
data-analysis data-visualization machine-learning python pytorch
Last synced: 06 May 2026
https://github.com/ryuzen6/bangalore-real-estate-price-prediction
This is a Data Science Project which predicts the cost of Real Estate in Bangalore. Requirements: Jupyter Notebook (for Data Cleaning and creating the Linear Regression using various python libraries) , Pycharm (python IDE for creating Python Flask Server), Visual Studio Code (to create the UI with HTML, CSS and Javascript).
css3 data-analysis data-science html5 javascript jupyter-notebook machine-learning python3
Last synced: 06 May 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/syarwinaaa09/exploring-nyc-public-school-test-result-scores
📊 analyzing NYC school test scores with python 🐍 to spot top performers 🏆 & trends 📈
data-analysis education pandas python visualization
Last synced: 06 May 2026
https://github.com/yashpaneliya/bank-loan-default-analysis
Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.
data-analysis loan-default-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda
Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.
data-analysis data-science data-visualization eda matplolib numpy pandas python
Last synced: 06 May 2026
https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.
data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 06 May 2026
https://github.com/ankitwalimbe/sentiment-analysis
Sentiment analysis of Amazon Fashion reviews using VADER and a baseline ML model (TF-IDF + SGDClassifier). Includes visualizations, reproducible notebook, and recruiter-ready documentation.
data-analysis machine-learning matplotlib nlp pandas python seaborn sentiment-analysis sklearn
Last synced: 06 May 2026
https://github.com/abhinav330/customer-behavior-analysis-linear-regression
This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.
data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis
Last synced: 06 May 2026
https://github.com/mikma03/datascience_python_datacamp
DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.
data-analysis data-science datacamp datascience numpy pandas python
Last synced: 06 May 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/friendotjava/air-quality-analysis
Data Analysis for Air Quality in different District in Beijing.
data-analysis explanatory-data-analysis exploratory-data-analysis jupyter-notebook pytho streamlit
Last synced: 06 May 2026
https://github.com/deanlogan/data-analysis-course
Code created when completing the Data Analysis with Python Course on freecodecamp.org
course data-analysis numpy pandas python python3
Last synced: 06 May 2026
https://github.com/superpandas-ai/superpandas
Adding LLM integration to Pandas library
Last synced: 06 May 2026
https://github.com/marknature/oibsip
AICTE Oasis Infobyte Data Science Internship
data-analysis data-science data-visualization github google-sheets jupyter-notebook linkedin machine-learning project-management python
Last synced: 06 May 2026
https://github.com/josepablodmg/python--linear-regression-advertising
A linear regression analysis to predict sales based on advertising spending across TV, radio, and newspaper channels. The project includes exploratory data analysis, model training, coefficient visualization, and residual analysis.
advertising data-analysis exploratory-data-analysis linear-regression machine-learning python regression scikit-learn visualization
Last synced: 06 May 2026
https://github.com/edanur-y/variable-analysis-of-banks-ratio-data
Testing variables for multicollinearity, multivariate normality and analyzing outliers and missing values. ⭕SPSS 🔵R
data-analysis log-transformation missing-values-analysis multicollinearity normality-test r spss
Last synced: 10 Jun 2026
https://github.com/korniichuk/pydatan-homework
Python Data Analysis course homework
course data-analysis data-analysis-python python python3
Last synced: 06 May 2026
https://github.com/dhruwsunita/iphones-eda-analysis
EDA analysis on apple products.
data-analysis data-visualization eda matplotlib numpy pandas plotly python seaborn
Last synced: 06 May 2026
https://github.com/fbarffmann/home_sales
Analyzed 25,000+ home sales using PySpark and SparkSQL. Identified pricing trends by year built, home features, and view rating. Optimized query run-time by 70% using caching.
aws big-data data-analysis home-sales parquet pyspark python spark spark-sql sql
Last synced: 06 May 2026
https://github.com/rlalpha49/anisearch-model
AniSearchModel leverages Sentence-BERT (SBERT) models to generate embeddings for synopses, enabling the calculation of semantic similarities between descriptions. This allows users to find the most similar anime or manga based on a given description.
anime api data-analysis data-merging embeddings flask hugging-face-datasets kaggle-datasets machine-learning manga natural-language-processing nlp python sentence-bert similarity-search
Last synced: 06 May 2026
https://github.com/urbanekda/upwork_dashboard
A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.
data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit
Last synced: 07 May 2026
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/aicorsair/python-case-study-ab-testing-for-lunartech-homepage-cta-button
This repository contains a detailed case study on an A/B test of LunarTech's homepage CTA button, using proxy data structured similarly to the company's real data.
ab-testing click-through-rate confidence-intervals data-analysis data-analytics data-exploration data-science data-visualization hypothesis-testing matplotlib normal-distribution numpy pandas practical-significance python statistical-analysis statistical-significance z-critical z-statistic z-test
Last synced: 07 May 2026
https://github.com/badranalyst/exploratory-data-analysis-on-salaries-dataset
Performing EDA on a dataset related to salaries, exploring relationships between factors like job titles, industries, and locations. Insights are visualized with plots to identify trends and disparities in salary data.
data-analysis dataset eda exploratory-data-analysis pandas python
Last synced: 07 May 2026
https://github.com/whis99/data_analysis_journey
A repositories of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/rohansoni45/whatsapp-chat-analysis
This project involves analyzing WhatsApp chat data to extract valuable insights. Using Python and various libraries like Pandas and Matplotlib, the project processes and visualizes chat statistics such as message frequency, most active participants, and sentiment analysis.
chat-analysis data-analysis data-science matplotlib pandas python sentiment-analysis streamlit visualization web-app word-cloud
Last synced: 07 May 2026
https://github.com/jjkay03/discord-call-extractor
Collect HTML data from Discord group/DM to create database of calls
data-analysis database discord discord-tool
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/syarwinaaa09/modeling-car-insurance-claim-outcomes
a data analysis project on car insurance trends using Python and Jupyter Notebook
car-insurance classic-cars data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python seaborn visualization
Last synced: 07 May 2026
https://github.com/joseph-pabian/life-expectancy-
Statistical analysis of life expectancy in developed vs developing countries using SQL and Python
data-analysis duckdb public-health python sql statistics
Last synced: 07 May 2026
https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021
Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.
advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python
Last synced: 07 May 2026
https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting
This project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.
data-analysis data-visualization mechine-learing plotly python streamlit
Last synced: 07 May 2026
https://github.com/pedrosfaria2/fugascomhelicoptero
Meu primeiro uso do Jupyter Notebook em um projeto
analise-de-dados data-analysis jupyter-notebook matplotlib pandas python
Last synced: 07 May 2026
https://github.com/biginformatics/git-basics
Hands-on Git and GitHub lessons for analysts and statisticians
data-analysis git github public-health training
Last synced: 10 Jun 2026
https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation
GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim
content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream
Last synced: 07 May 2026
https://github.com/anshulkansal121/music_store_analysis_sql
SQL project to analyze online music store data
beginner-friendly contributions-welcome data-analysis database mysql sql
Last synced: 07 May 2026
https://github.com/satyam4229/identify-employee-attrition
This is the model where we predict the attrition of the employees of the company by checking there records and all. In the given dataset, we have the features like salary, environment, age, gender and their experience.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 08 May 2026
https://github.com/bnvulpe/regression-and-time-series
This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.
colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting
Last synced: 08 May 2026
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homers Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 08 May 2026
https://github.com/blladerunner/customer-churn-dashboard
Customer Churn Dashboard — SQL + Python analytics project exploring customer retention patterns, churn rate by demographics and services, and key insights for telecom business strategy.
business-intelligence churn-analysis customer-retention dashboard data-analysis data-analytics data-science pandas powerbi python sql sqlite telecom
Last synced: 08 May 2026
https://github.com/devexpress-examples/wpf-pivot-grid-group-date-time-values
This example shows how to group date-time values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 08 May 2026
https://github.com/prakashjha1/stock-investment-analysis
Stock Investment Analysis Project can help investor to select the better performing stocks.
data-analysis data-science numpy pandas pandas-datareader parallel-programming python
Last synced: 08 May 2026
https://github.com/phanchenh/adventureworkdataset-rfm-analysis-sqlproject
RFM Analysis Using SQL on the AdventureWorks Dataset (2011-2014)
business-analytics business-intelligence data-analysis mssql rfm-analysis sql
Last synced: 10 Jun 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026
https://github.com/0290192029/apartment-price-predictor
Python-проект по прогнозированию стоимости аренды квартир с помощью линейной регрессии. Практическая работа по теме: "Основы машинного обучения" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
apartment-price-prediction apartments-for-rent api correios-api data-analysis feature-engineering feature-enginering linear-regression linear-regression-models mlops numpy prediction-model r seaborn
Last synced: 08 May 2026
https://github.com/sumit-sinha9/sales-analysis
Analyzing 12 months worth fo Sales data
data-analysis pandas python visualization
Last synced: 08 May 2026
https://github.com/vanshuchaudhary/flightpriceanalysis-
The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.
business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis
Last synced: 08 May 2026
https://github.com/dhruwsunita/zomato-data-analysis-project
Zomato data analysis project using Python.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas-dataframe python
Last synced: 08 May 2026
https://github.com/satvikpraveen/numpymasterpro
A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.
broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization
Last synced: 08 May 2026
https://github.com/ayushsiloiya619/online-food-orders-analysis
Data Analytics with Python
data-analysis data-visualization matplotlib pandas-dataframe python3 seaborn-python
Last synced: 08 May 2026
https://github.com/deepanshkhurana/udacityproject-prediciting-boston-housing-prices
This is a Udacity Project for the Machine Learning Nanodegree. Here, we are trying to predict Boston Housing Prices using sklearn.
data-analysis data-science machine-learning python scikit-learn udacity
Last synced: 08 May 2026
https://github.com/cnoret/ibm-data-analyst-professional
Final project & Courses Notebooks
analyzing-data data-analysis data-analyst data-manipulation data-science data-visualization ibm ibm-certificate ibm-data-analyst-professional ibm-datascience-certification pandas python
Last synced: 09 May 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/l1ght14/customer-churn-prediction
Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.
churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom
Last synced: 09 May 2026
https://github.com/ranagaballah/true-fake-news
True Fake News Detector NLP model
data-analysis data-science data-visualization deployment machine-learning matplotlib nlp numpy pandas python
Last synced: 09 May 2026
https://github.com/drod75/burger_king_analysis
A simple analysis on a burger king dataset.
data-analysis data-visualization jupyter-notebook pandas python seaborn
Last synced: 09 May 2026
https://github.com/rizkipragustono/data_analysis_spark
Exploration: Data Analysis using Spark
apache-spark data-analysis pyspark python spark-sql sql
Last synced: 09 May 2026
https://github.com/emanoelcampos/python-onemonth
This repository contains educational materials and projects developed during a Python course offered by OneMonth. It covers Python basics, intermediate concepts, web development with Flask, and data analysis with pandas. The course is structured into weeks, each focusing on a different aspect of Python programming and its applications.
data-analysis flask jupyter-notebook onemonth python python3
Last synced: 09 May 2026
https://github.com/itsharshparmar/uber_data_analysis
Data analysis on Uber ride data using Python and visualization libraries.
analytics-projects business-analysis colab-notebook data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis matplotlib pandas python python-project real-world-data seaborn time-series-analysis transportation uber uber-data-analysis
Last synced: 09 May 2026
https://github.com/urmesthamondal/data_analysis_projects
Portfolio Data analysis projects built using Excel, Python, SQL and for visualization used Power bi .
data-analysis pivot-tables powerbi python sql sql-server visualisation
Last synced: 09 May 2026
https://github.com/guptaachin/airline-sentiment-analysis-from-twitter-feeds
Analyses of the airline service providers' sentiment from twitter feeds
classification data-analysis data-science jupyter-notebook machine-learning natural-language-processing pandas pca python sklearn-library tf-idf visualization
Last synced: 09 May 2026
https://github.com/abhroroy365/market_analysis
This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.
clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis
Last synced: 09 May 2026
https://github.com/lotfiferaga/energeiahub
data-analysis data-visualization energy-consumption python streamlit
Last synced: 09 May 2026
https://github.com/magnus0969/black-friday-sales-analysis
An in-depth analysis of Black Friday sales data to uncover trends, customer behavior, and product insights. Utilizing Python, data visualization, and machine learning techniques, this project provides key business intelligence to optimize sales strategies.
analysis data-analysis data-science python sales-analysis
Last synced: 09 May 2026
https://github.com/zxjahid/matplotlib
A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊
candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx
Last synced: 09 May 2026
https://github.com/noeldevelops/stem-degrees-analysis-cpp
C++ Data Analysis, I/O - takes an external data file for processing, performs some statistical analysis, and displays the results in the console
Last synced: 29 May 2026
https://github.com/rahmamohammad/retail_project
Retail & Data analytics: KPIs, sales trends, Excel planning pack, forecasting & inventory tracking.
data-analysis data-visualization ecommerce excel jupyter-notebook matplotlib python retail-analytics storytelling
Last synced: 17 May 2026
https://github.com/j-wu1/analyse_ventes_jeuxvideo_python
Analyse Exploratoire de Données (EDA) sur les ventes de jeux vidéo avec Python, Pandas, Matplotlib et Seaborn dans un Jupyter Notebook.
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 19 Aug 2025
https://github.com/jailsonsb2/kit-analise-de-dados
🚀 Um kit de ferramentas Python para acelerar a análise de dados. Carregue arquivos de forma inteligente (CSV, Excel, etc.) e converta notebooks Jupyter para scripts de produção sem esforço.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados automation automations dados data-analysis data-cleaning etl etl-automation jupyter-notebook pandas powerquery python toolkit
Last synced: 29 Apr 2026
https://github.com/apostolis-bloutsos-data/employee-data-eda
Mini EDA project on synthetic employee records using Python, pandas, and matplotlib
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 09 May 2026
https://github.com/rugwiroparfait/alx_sql
This repo is where I save my queries and learning materials in Data Science program from ALX
anaconda data data-analysis jupyter-notebook sql
Last synced: 19 Aug 2025
https://github.com/alikhalajii/text-classification-life-sciences
Text classification of Life Science apps
data-analysis data-science datasets feature-importance jupyter-notebook pandas sbert scikit-learn word2vec
Last synced: 05 May 2026
https://github.com/mjshubham21/ny_yellow_taxi_python_da_project
A data analysis project of New York Yellow Taxi (Feb of 2025) using Python and its libraries for analytics like : NumPy, MatPlotLib, Pandas and Seaborn.
data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/cyberoctane29/epa-air-quality-aqi-analysis
This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.
data-analysis numpy pandas python statistics
Last synced: 10 Apr 2026
https://github.com/jedrzej-wydra/competition-cooperation
Competition, cooperation, and parental effects in larval aggregations formed on carrion by communally breeding beetles Necrodes littoralis (Staphylinidae: Silphinae)
data-analysis non-linear-regression r
Last synced: 20 Aug 2025
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 08 May 2026
https://github.com/puspacempaka/superstore-analysis-with-sql
This repository showcases various data analyses on the popular Superstore dataset using SQL queries. The analyses cover a range of business insights, including sales performance, customer segmentation, and product profitability. Each analysis is documented with the SQL queries used and explanations of the steps involved.
business-intelligence data-analysis sales-analysis sql superstore-dataset
Last synced: 09 Mar 2026
https://github.com/rubyyy1118/share-price-analysis
The assignment in my MSc Business Analytics course
data-analysis data-preprocessing data-science data-visualization matplotlib numpy pandas python seaborn
Last synced: 10 Apr 2026
https://github.com/shriansh8619/sql_eda
Explored relational databases using SQL to perform comprehensive Exploratory Data Analysis (EDA), covering database exploration, segmentation, trend analysis, and performance ranking. Developed reusable SQL scripts to analyze dimensions, measures, and time-based metrics, helping uncover key business insights.
data-analysis exploratory-data-analysis mysql
Last synced: 20 Aug 2025
https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application
The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.
data-analysis data-science data-visualization python python3 streamlit
Last synced: 18 Apr 2026
https://github.com/myriamba/neuraview
AI-Powered Data Insights and Visualization Generator
data-analysis data-engineering data-insights data-visualization generative-ai llm
Last synced: 21 Aug 2025