Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/cagandemirmr/google-play-yorum-analizi
Türkiyede 2024 yılında en çok beğenilen My Supermarket Simulator 3D oyununa ait yorumların duygu durumu,yorumların beğeni sayısını,Firmanın geri dönüşleri ve kullanıcı nicknameleri gibi değişkenleri analiz ederek içgörü topladım.
bert data-analysis data-science nlp
Last synced: 10 Jun 2026
https://github.com/md-emon-hasan/data_analytics_project
Data analytics tasks and solutions, featuring hands-on exercises for data cleaning, visualization, and analysis using Python libraries.
cars-dataset census-data covid19-data data-analysis london-house-price police-data weather-data
Last synced: 08 May 2026
https://github.com/lostvikx/fintech-pg
Files related to my FinTech course
analytics data-analysis data-science data-visualization finance fundamental-analysis numpy pandas python technical-analysis
Last synced: 14 Apr 2026
https://github.com/maugus0/sats-flight-data-fetcher
A simple Python tool to fetch and analyze flight data for 15+ major airlines using the AirLabs API.
airline-data cli-tool data-analysis flight-data python3
Last synced: 17 Mar 2026
https://github.com/phammings/sales-management-analysis
Sales management analysis and Power BI dashboard for sample business request and user stories
data-analysis excel powerbi sql
Last synced: 01 Feb 2026
https://github.com/akshat0427/python_youtube_history
a bunch of data science operations performed on youtube history data
data-analysis data-science extracting-features
Last synced: 10 Jun 2026
https://github.com/rubinlake/rl-academy-data-analytics
Educational data analysis project demonstrating BMW sales data analysis with AI-powered code assistance using Cursor IDE and Jupyter notebooks
cursor-ide data-analysis educational-project jupyter langchain matplotlib numpy pandas python scipy seaborn
Last synced: 09 May 2026
https://github.com/victor-antoniassi/junior_data_analyst_test_01
Solution developed for a technical assessment that analyzed video game sales data to support gaming partnership decisions.
asses assessment-project data-analysis data-analysis-project data-analyst duckdb etl prefect python
Last synced: 01 Jun 2026
https://github.com/dogan-the-analyst/developer_survey_analysis
Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.
data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python
Last synced: 09 May 2026
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 12 Apr 2026
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 23 Mar 2025
https://github.com/avijit-jana/redbus-data_scraping_and_filtering_with_streamlit_app
A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.
automation dashboard data-analysis data-visualization datadrivendecisions python3 redbus selenium-python streamlit-application webscrapping
Last synced: 15 Mar 2025
https://github.com/anurag-kumar-molankala/blinkit-grocery-sales-dashboard
The BlinkIT Grocery Sales Dashboard is an interactive Power BI dashboard that provides insights into grocery sales performance. It includes key KPIs, sales trends, and outlet performance analysis.
business-intelligence dashboards data-analysis data-visualization dax excel kpi-dashboard power-bi powerquery slicers-kpi-card-multirow-card sql-server ssis ssms ssrs-reports
Last synced: 09 Apr 2025
https://github.com/elkronos/stat_py
Statistics functions for python
assumption-check data-analysis data-visualization python regression statistical-analysis statistical-inference statistical-models statistical-tests statistics
Last synced: 10 Oct 2025
https://github.com/vhawk19/ambaan
just wants the average analyst to be happi
data-analysis duckdb-wasm sql vue
Last synced: 01 Mar 2026
https://github.com/kentlouisetonino/sw-project-data-analysis
My project for AMA MATH 6200 course.
data-analysis python school-project
Last synced: 28 Feb 2025
https://github.com/jpotter80/notebook-examples
This repository demonstrates a systematic approach to cleaning and standardizing e-commerce product data using DuckDB. The notebook serves as a detailed walkthrough of our data cleaning methodology, showcasing how we handle common data quality challenges in e-commerce datasets.
data-analysis data-cleaning jupyter-notebook
Last synced: 12 Jun 2025
https://github.com/yonatanadam/film-success-prediction
Analyzing Hollywood movie success based on genre, target audience, and runtime using machine learning
data-analysis ipynb machine-learning
Last synced: 25 Jan 2026
https://github.com/cosmoduende/r-twitter
Explore your Twitter activity with R: Sentiment Analysis and Data Visualization. How to analyze your Twitter account (or any account), discover your habits and sentiments with the "rtweet" package and NLP.
data-analysis data-visualization lemmatization nlp nlp-library nlp-resources nltk nltk-library r-package r-programming r-studio rtweet stemming twitter twitter-api twitter-data twitter-data-analysis twitter-data-extraction twitter-sentiment-analysis udpipe
Last synced: 10 Oct 2025
https://github.com/ivanildobarauna-dev/currency-quote
Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 27 Oct 2025
https://github.com/easycris-software/easycris
Professional statistical analysis and RNA-seq for researchers — no coding required
anova bioinformatics data-analysis desktop-app genomics pharmacology research-tools rna-seq statistics tauri
Last synced: 11 May 2026
https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2
Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.
dashboard data data-analysis dax-measures excel powerbi powerbidashboard
Last synced: 23 Jan 2026
https://github.com/jabhij/fbi_nics-firearm-background-checks
This project is a try to showcase the use of guns across the US.
data-analysis data-analytics data-science data-visualization tableau
Last synced: 23 Feb 2026
https://github.com/selcuk05/forbes_top_100_celebrities_data_analysis
Forbes Top 100 Celebrities since 2005 Data Analysis and Visualization
Last synced: 11 Oct 2025
https://github.com/as16082023/atliq-hospitality-analysis
This project presents an overview of AtliQ Grands' performance in the hospitality industry using Power BI.
atliqgrand codebasicsresumeprojectchallenge data-analysis data-visualization powerbi revenueinsights
Last synced: 23 Jan 2026
https://github.com/rupav/fifa17-detailed-analysis
⚽ FIFA 17 data analysis using various Machine Learning Algorithms. ⚽
data-analysis data-visualization fifa17 machine-learning-algorithms radar-chart
Last synced: 16 May 2026
https://github.com/beallio/wherewolf
Wherewolf is a production-grade, local SQL workbench designed for data engineers and analysts to query local files (CSV, Parquet, JSON) with ease. Built with Streamlit, it provides a unified interface to execute SQL against either DuckDB or PySpark engines without requiring complex setup.
big-data data-analysis data-engineering etl parquet performance pyspark python spark-sql sql uv
Last synced: 28 Apr 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026
https://github.com/eshaagarwa/sales_insight_project
Sales insights project using Powerbi and SQL
data-analysis data-visualization databse datacleaning datamodeling microsoft-power-bi mysql-database powerbi sales-insights sql
Last synced: 08 Aug 2025
https://github.com/shreeparab1890/flipkart-laptops-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Laptops listed on Flipkart.
data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 14 Apr 2026
https://github.com/iguptashubham/pizzahut-analysis-sql
best dataset for data analysis. Pizzahut data analysis done by Shubham Gupta in MySql. This dataset is provided by friend of mine intern at pizzahut. In pizzahut, they used this dataset to train and ask question. This data does not reveal anything about the pizzahut. It is safe to share. data
data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 May 2026
https://github.com/mariam-badr-mb/gtc-ml-project1-hotel-bookings
The goal of this project is to build a robust data preprocessing pipeline for a hotel booking cancellation prediction model. The focus is not on training the final machine learning model but on ensuring that the dataset is clean, consistent, and ML-ready.
cleaning-data data-analysis exploratory-data-analysis
Last synced: 05 Sep 2025
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 21 Jan 2026
https://github.com/albamerdani/iot_air_quality_ml
IoT Project for Air Quality and Data Analysis with Machine Learning
air-quality aqi data-analysis data-science decision-tree iot machine-learning-algorithms prediction random-forest raspberry-pi-3 sensors
Last synced: 27 Jan 2026
https://github.com/apache/cloudberry-devops-release
DevOps and Release for Apache Cloudberry (Incubating)
ai big-data cloudberry data-analysis data-warehouse database devops distributed-database greenplum mpp olap postgres postgresql
Last synced: 04 Sep 2025
https://github.com/sandergi/designbuildfly
Useful tools made for Design Build Fly at UW, hosted on Glitch so teammates can easily access. Check out our optimization tools here: https://github.com/JPaonaskar/DBF-Optimization
data-analysis inav-blackbox webapp
Last synced: 01 Apr 2025
https://github.com/akankshaaa013/30-day-machine-learning-deep-learning
To practically Learn, Explore, and Share my Insights on the Libraries and Tools that power Machine Learning.
data-analysis machine-learning python
Last synced: 15 Mar 2025
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 14 Apr 2026
https://github.com/silvano315/med-physics
This would be a repository about medical physics. It will based on 4 paths: medical data to analyse, SOTA programs for medical purposes, computer vision and eXplainability.
computer-vision data-analysis data-science explainable-ai medical-imaging medical-physics medical-tool
Last synced: 24 Mar 2025
https://github.com/alejo1630/chicago_crimes
A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium
data-analysis data-visualization folium pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/vruddhi18/e-commerce_data_analysis_powerbi_dashboard
The E-Commerce Data Analysis project leverages Power BI to analyze sales and customer insights from Blinkit, Zepto, Myntra, and Flipkart, providing interactive dashboards to enhance e-commerce strategies.
Last synced: 27 Feb 2026
https://github.com/akash1070/freecodecamp-data-analysis-with-python-
contains study notes and assignments from freecodecamp of Data Analysis With Python
data-analysis demographic-analysis mean-variance-standard-calculator medical-data-visualisation numpy-library pandas-library python3 sea-level-predictor time-series-analysis
Last synced: 01 May 2026
https://github.com/mindgamesnl/yanderestats
https://mindgamesnl.github.io/YandereStats/
data-analysis reporting-pipeline yandere yandere-sim
Last synced: 18 Jun 2026
https://github.com/lebrancconvas/how-much-love-in-thai-song
How much Love song among the Thai Songs?
data-analysis side-project web-scraping
Last synced: 19 Jun 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/dcs-training/intronetworkanalysis
This is a repository for the Introduction to Network Analysis course provided by Brian Wong for the CDCS. Within the repository there are files with sample datasets and a guide to building datasets. It will be updated before each section. Go to the Readme file
data-analysis data-visualisation gephi network-analysis text-analysis
Last synced: 27 Jan 2026
https://github.com/robcyberlab/linear-regression-application
🔢Linear Regression Application💻
artificial-intelligence data-analysis data-science data-visualization linear-regression machine-learning python python-programming regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/guildofcalamity/balanceact
A simple expense item tracking application with statistics.
banking csharp data-analysis data-visualization dependency-injection dotnet money-management mvvm visual-studio windows-app-sdk windows-sdk winui winui3 winuicommunity xaml xaml-ui
Last synced: 05 Feb 2026
https://github.com/manvendra747/customer-segmentation
Customer segmentation using Python and PowerBI
customer-segmentation dashboard data-analysis data-science data-visualization powerbi python rfm-analysis
Last synced: 28 Apr 2025
https://github.com/rekha0suthar/e-commerce-shopper-s-behaviour-understanding
Understand the online shopper purchasing pattern through Machine learning
data-analysis data-preprocessing data-visualization logistic-regression machine-learning numpy pandas python3 scikit-learn seaborn-plots
Last synced: 12 Apr 2026
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/vyjayanthipolapragada/early_sepsis_detection_ml
An end-to-end project leveraging clinical datasets (PhysioNet, MIMIC-IV, MIMIC-IV-ED) to develop and compare ML and LSTM-based models for early sepsis prediction.
data-analysis data-visualization deep-learning healthcare jupyter-notebook keras-tensorflow lstm-neural-networks machine-learning neural-network python
Last synced: 28 Apr 2026
https://github.com/arnikz/piqmie
Proteomics Identifications & Quantitations Data Management & Integration Service
data-analysis data-management data-visualisation mass-spectrometry peptide-identification protein-inference protein-quantification proteomics silac web-application
Last synced: 03 Feb 2026
https://github.com/alxrm/scent-of-literature
Russian literature sentiment analysis in terms of very small dataset
classification data-analysis sentiment-analysis sklearn tf-idf
Last synced: 28 Apr 2026
https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-
# 🎧 Top Hits Spotify (2000 - 2019)Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻
analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits
Last synced: 01 Jul 2025
https://github.com/abhijais4896/belarus-car-price-prediction
Belarus-car-price-prediction
data-analysis datacleaning macine-learning numpy pandas python
Last synced: 04 May 2026
https://github.com/fatihilhan42/book-recommendation-system-with-python
In this project, we are making a book recommendation system that recommends similar books according to the genres or ratings that the user enters, using a large book dataset. The link of the dataset is given below. Happy reading...
books data-analysis data-science data-visualization kaggle python recommendation-engine recommendation-system
Last synced: 04 May 2026
https://github.com/hyperplasma/olympic-visualization-analysis
Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.
data-analysis data-visualization matplotlib numpy pandas python wordcloud
Last synced: 04 May 2026
https://github.com/ljadhav25/logistic-regression-data-science-
Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.
data-analysis data-science data-visualization logestic-regression machine-learning
Last synced: 04 May 2026
https://github.com/bishopce16/pyber_analysis
The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.
data-analysis data-visualization jupyter-notebook matplotlib pandas python
Last synced: 04 May 2026
https://github.com/surayasumona/test_bowlers_analysis
Data Analysis with Python
data-analysis data-manipulation data-preprocessing numpy pandas
Last synced: 04 May 2026
https://github.com/flytomarsz/bike-sharing-system-analysis
This analysis project aim to identify bike rental's behavior in 2012 from Capital Bikeshare system, Washington D.C., USA. This project is part of my Data Analysis study at Dicoding.
data-analysis data-visualization jupyter-notebook python streamlit
Last synced: 04 May 2026
https://github.com/tasosfotiadis/time-series-analysis-and-forecasting-of-cryptocurrency-prices
Forecasted Cardano (ADA) cryptocurrency prices using time series analysis. The project involved data preprocessing, trend and seasonality analysis, and model building with ARIMA, SARIMA, and LSTM. Models were evaluated using metrics like MAE and MAPE, providing insights for financial decision-making.
applied-st classical-statistical-models data-analysis deep-learning lstm machine-learning neural-network python r time-series
Last synced: 05 May 2026
https://github.com/dhruvsrikanth/basic-data-science
A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!
analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn
Last synced: 05 May 2026
https://github.com/sajjad425/edaipl
The dataset covers the Indian Premier League (IPL) with details on matches (date, teams, venue, results), player stats (runs, wickets), team stats (wins, losses), season summaries, and umpire info. The EDA reveals patterns and insights, highlighting dominant teams, star players, and trends across seasons.
data-analysis eda exploratory-data-analysis ipl python
Last synced: 05 May 2026
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 May 2026
https://github.com/akotronis/qualitycontrol
HRH Quality Control app
data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3
Last synced: 05 May 2026
https://github.com/anushkundu/crime-pattern-analysis
Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)
data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/nimbostratos/titanic-survival-prediction
Machine learning project predicting Titanic survival using AdaBoost with feature engineering and hyperparameter optimization
data-analysis data-science data-science-projects kaggle machine-learning machine-learning-models python scikit-learn
Last synced: 05 May 2026
https://github.com/meinhere/dicoding-analisis-data
Submission Analisis Data dengan tema E-Commerce Streamlit App
data-analysis data-mining e-commerce python streamlit
Last synced: 05 May 2026
https://github.com/panoschatzi/healthcare_and_bioinformatics_analyses
Healthcare and Bioinformatics data analysis projects with Python and SQL.
data-analysis data-cleaning data-visualisation jupyter matplotlib mysql pandas plotly python seaborn sql
Last synced: 06 May 2026
https://github.com/abhinav330/customer-behavior-analysis-linear-regression
This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.
data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis
Last synced: 06 May 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/harryrlk/data_analysis_showcase
This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.
data-analysis data-science data-visualization excel portfolio python r tableau
Last synced: 06 May 2026
https://github.com/abdelmajidlh/cours
Cours Data engineering et data analyse.
apache-spark big-data data-analysis data-engineering docker jupyter-notebook pyspark
Last synced: 06 May 2026
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 06 May 2026
https://github.com/vimlesh-gupta/blinkit_data_analytics_project
End-to-end Blinkit data analytics project using Python, SQL Server & Power BI
blinkit data-analysis eda pandas powerbi python sql-server
Last synced: 06 May 2026
https://github.com/dhruwsunita/iphones-eda-analysis
EDA analysis on apple products.
data-analysis data-visualization eda matplotlib numpy pandas plotly python seaborn
Last synced: 06 May 2026
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/aicorsair/python-case-study-ab-testing-for-lunartech-homepage-cta-button
This repository contains a detailed case study on an A/B test of LunarTech's homepage CTA button, using proxy data structured similarly to the company's real data.
ab-testing click-through-rate confidence-intervals data-analysis data-analytics data-exploration data-science data-visualization hypothesis-testing matplotlib normal-distribution numpy pandas practical-significance python statistical-analysis statistical-significance z-critical z-statistic z-test
Last synced: 07 May 2026
https://github.com/whis99/data_analysis_journey
A repositories of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/jjkay03/discord-call-extractor
Collect HTML data from Discord group/DM to create database of calls
data-analysis database discord discord-tool
Last synced: 07 May 2026
https://github.com/satyam4229/identify-employee-attrition
This is the model where we predict the attrition of the employees of the company by checking there records and all. In the given dataset, we have the features like salary, environment, age, gender and their experience.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 08 May 2026
https://github.com/otonomee/against-the-clock-transcript-analysis
This repository contains code and analysis for exploring the transcripts of the various "Against The Clock" videos featured on the FACT Magazine YouTube channel. The goal is to uncover insights, patterns, and trends across the different artists and their creative process under time constraints.
against-the-clock ai-analysis audio-processing creative-ai creative-process data-analysis fact-magazine machine-learning music-production natural-language-processing nlp text-mining yt-dlp
Last synced: 08 May 2026
https://github.com/sumit-sinha9/sales-analysis
Analyzing 12 months worth fo Sales data
data-analysis pandas python visualization
Last synced: 08 May 2026
https://github.com/satvikpraveen/numpymasterpro
A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.
broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization
Last synced: 08 May 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/ranagaballah/true-fake-news
True Fake News Detector NLP model
data-analysis data-science data-visualization deployment machine-learning matplotlib nlp numpy pandas python
Last synced: 09 May 2026
https://github.com/emanoelcampos/python-onemonth
This repository contains educational materials and projects developed during a Python course offered by OneMonth. It covers Python basics, intermediate concepts, web development with Flask, and data analysis with pandas. The course is structured into weeks, each focusing on a different aspect of Python programming and its applications.
data-analysis flask jupyter-notebook onemonth python python3
Last synced: 09 May 2026
https://github.com/zxjahid/matplotlib
A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊
candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx
Last synced: 09 May 2026
https://github.com/tyriek-cloud/nyc-mobility-survey-analysis
An end-to-end data engineering project in which five NYC DOT datasets were modified in an ETL process and analyzed for insights.
aws aws-athena aws-glue aws-glue-crawler aws-quicksight aws-s3 data-analysis data-engineering etl-pipeline json python
Last synced: 09 May 2026
https://github.com/yeopster/datascience_notebook
Compilation of my Notebook based on Kaggle Dataset
data-analysis data-science kaggle notebook python
Last synced: 10 May 2026
https://github.com/shridhar1504/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 10 May 2026
https://github.com/devexpress-examples/winforms-pivot-create-user-folders-within-the-customization-form
This example demonstrates how to organize the Customization Form fields in folders.
data-analysis dotnet pivot-grid-for-winforms winforms xtrapivotgrid-suite
Last synced: 10 May 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/shridhar1504/titanic-survivor-prediction-datascience-classification-project
This projects predicts whether a passenger on the titanic survived or not using machine learning algorithms with the given details of the passenger data.
classification-algorithm data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning machine-learning-algorithms matplotlib naive-bayes-classifier predictive-modeling python3 scikit-learn seaborn supervised-learning
Last synced: 11 May 2026
https://github.com/szuzick/hr-analytics-pipeline
End-to-end HR analytics solution using PostgreSQL, dbt, and Power BI
data-analysis data-visualization database-maintenance dbt hr-analytics insights postgresql powerbi sql
Last synced: 10 Jun 2026