Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/satyacoder29/e-commerce-sales-analysis
Performed E-commerce Sales Analysis to identify trends, optimize sales, and improve decision-making. Analyzed customer patterns, seasonal trends, and product performance using Python, SQL, and Power BI. Delivered actionable insights to enhance revenue, streamline inventory management, and boost customer engagement.
data-analysis data-visualization datacleaning msexcel pivottables powerquerym visualisation vlookups
Last synced: 05 Mar 2026
https://github.com/pizofreude/divvybikes-share-success
Developing data-driven marketing campaign for Divvy to convert casual riders into annual members. Divvy is a bike-share program of the Chicago Department of Transportation (CDOT).
airflow bi-analytics data-analysis data-engineering data-visualization database dbt docker etl jupyterlab python r redshift s3
Last synced: 17 Apr 2026
https://github.com/dina-hosny/analyze-and-model-airline-system
Analyzing Airline System and Building Data Warehouse Model to Store the Data and Answer Some Business Questions
data-analysis data-modeling data-warehouse datawarehousing dwh plsql sql
Last synced: 05 Mar 2026
https://github.com/shashwat9kumar/us-accidents-data-analysis
Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle
accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python
Last synced: 17 Apr 2026
https://github.com/vaishnavis03/finlatics_ml_program
This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.
correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn
Last synced: 17 Apr 2026
https://github.com/kheriberto/knn_project
This is a simple project that uses dummie data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/jabercrombia/video-game-data
This project integrates FastAPI as the backend and Next.js as the frontend to create a full-stack web application. It processes and displays vides game sales data, enabling seamless API communication while maintaining a scalable and efficient architecture.
data-analysis nextjs nintendo playstation python typescript video-game
Last synced: 02 Apr 2026
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 17 Apr 2026
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 17 Apr 2026
https://github.com/eliasdehondt/learn-r
Welcome to the Learn-R repository! This is your go-to resource for learning the R programming language, whether you're a beginner or looking to enhance your skills.
data-analysis data-visualization education machine-learning programming r statistics tutorials
Last synced: 03 Apr 2026
https://github.com/jhrcook/checkplease
Analysis of an immune checkpoint-blockade screen.
bayesian-statistics data-analysis pymc3 python python3 r
Last synced: 17 Apr 2026
https://github.com/coder36459/fcc-projects
freeCodeCamp projects
bash bootstrap c-sharp css d3 data-analysis html javascript matplotlib numpy pandas postgresql programming python react seaborn sql topojson xml xslt
Last synced: 03 Apr 2026
https://github.com/shimazadeh/ft_linear_regression
Implementing a modular linear regression from scratch to predict the price of cars using a gradient descent algorithm.
data-analysis data-science hyperparameter-tuning linear-regression predictive-modeling
Last synced: 03 Jun 2026
https://github.com/dharininadkar/movies-data-dashboard
Data Analysis of Movies data
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server tableau
Last synced: 04 Apr 2026
https://github.com/ridemountainpig/education-level-data-analysis
An analysis of the relationship between education levels, unemployment rates, and credit card spending in Taiwan's six major cities.
data-analysis matplotlib pandas-python
Last synced: 17 Apr 2026
https://github.com/nathaliacosim/migration-patrim
Automação para extração, conversão e migração de dados patrimoniais para o sistema patrimônio cloud da betha sistemas. O projeto garante um fluxo estruturado e seguro de transferência de informações, utilizando C# (.NET Framework), PostgreSQL e integração via API.
conversion-tool data-analysis data-conversion data-transformation dotnet dotnet-code dotnet-console-app migration-tool
Last synced: 17 Apr 2026
https://github.com/kgotsosm/fcc-data-analysis
Notebooks created for the Data Analysis Course on freeCodeCamp
data-analysis data-visualization matplotlib pandas seaborn
Last synced: 17 Apr 2026
https://github.com/macnianios/retail_sales_analysis
final data science project on techpro academy data science stream
anova clustering colab-notebook data-analysis data-science data-science-projects linear-regression numpy pandas python
Last synced: 17 Apr 2026
https://github.com/victoorv/prediction_covid19
Prédire si un invidu est positif au COVID19 ou non.
classification covid-19-classifier covid-19-data-analysis covid19-data data-analysis data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/victoorv/criminalite_us
Une analyse de la criminalité en fonction de variables socio-économiques a été menée, incluant la sélection et la comparaison de modèles de régression multiple ainsi que des tests d'hypothèses sur les coefficients et la significativité des modèles.
data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/royungar/automotive_sales_insights_dashboard
Data visualization project analyzing automotive sales, recalls, and customer sentiment using IBM Cognos Analytics. Features KPIs, treemaps, heatmaps, and advanced visual storytelling techniques.
automotive-industry business-intelligence cognos-analytics csv customer-sentiment dashboard data-analysis data-engineering data-visualization eda excel heatmap ibm kpi recall-analysis sales-data treemap
Last synced: 04 Jun 2026
https://github.com/vitornegromonte/eda_stroke
Exploratory data analysis in the stroke prediction dataset
data-analysis data-science exploratory-data-analysis kaggle-dataset visualization
Last synced: 17 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/santos-k/fashion-recommender-dashboard
The project is a neural network-based fashion recommendation system built using Python. The model used for this system is Resnet50, which is a deep learning model used for image recognition. The data used for training the model is scraped from Flipkart, with a total of 65,000 images.
ann cnn dash dashboard data-analysis data-science deep-learning eda gcp heroku kera machine-learning nueral-networks plolty python tensorflow
Last synced: 04 Apr 2026
https://github.com/q-viper/blog-notebooks
This is the repo to store most of my blogs in dataqoil.com and q-viper.github.io.
data-analysis data-science machine-learning-algorithms timeseries
Last synced: 04 Apr 2026
https://github.com/sanam2405/ahs
This contains the analysis of result of AHS Madhyamik Examination 2022
data-analysis data-visualization jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/nicovandenhooff/kaggle-competitions
A repository that contains my Kaggle projects.
data-analysis data-visualization deep-learning exploratory-data-analysis kaggle machine-learning matplotlib modeling neural-network numpy pandas seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/rajeev2806/retail-order-data-analysis
Dataset downloaded from kaggle api and then data cleaning and analysis is performed
data-analysis data-cleaning postgresql
Last synced: 18 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/danpoynor/data-analysis-of-video-game-sales-2000-2015
This analysis reviews sales for the top 100 video games from the years 2000-2015 to gather insights. Within the notebook I use Python’s Pandas, Matplotlib, and Seaborn libraries to interact with the data and create graphs.
data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots video-game-sales
Last synced: 18 Apr 2026
https://github.com/wang-q/tva
tva: Tab-separated Values Assistant
cli command-line-tool csv data-analysis data-processing etl high-performance rust streaming tabular-data tsv unix-philosophy
Last synced: 05 Apr 2026
https://github.com/prakhar-ff13/finding-donors-for-charityml
Udacity Machine Learning Engineer Nanodegree project 2
data-analysis data-science machine-learning supervised-learning udacity udacity-machine-learning-nanodegree udacity-nanodegree
Last synced: 05 Apr 2026
https://github.com/mtimma001/clinical-trial-data-tool
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 05 Apr 2026
https://github.com/satti-hari-krishna-reddy/data-whisperer
Data Whisperer is an AI-driven tool that automates exploratory data analysis (EDA), generates actionable insights, and enables natural language querying of datasets. it combines the power of AI (Google Gemini) with interactive visualizations and professional reporting.
ai data-analysis data-visualization llm python3 streamlit
Last synced: 18 Apr 2026
https://github.com/kathisnehith/analyst_snehith_portfolio
Hello! This is My Portfolio Website
azure big-data data-analysis data-mining matplotlib mysql-database outlier-detection pandas-python powerbi python sql tableau validation
Last synced: 18 Apr 2026
https://github.com/jordanconallluthaiswright/purchase-behaviour-data-analysis
This project analyzes Black Friday purchase behavior for Company XYZ, uncovering trends by gender, age, and location. Using data cleaning, statistical analysis, and visualization, it evaluates spending patterns, confidence intervals, and category preferences to provide actionable insights for optimizing marketing strategies and targeting.
business-analytics data-analysis jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/vl1507/data_science_pro_course
Курс "Аналитик данных PRO (PRO DA-6)"
da data-analysis data-science ds jupyter-notebook machine-learning ml pro-da python
Last synced: 18 Apr 2026
https://github.com/mi7773/advanced_sql_data_analytics_project
A hands-on SQL project simulating data analysis using fact and dimension tables, covering trends over time, cumulative metrics, performance breakdowns, segmentation, and reporting via SQL.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics database query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 Apr 2026
https://github.com/robinmillford/sales-metrics-dashboard-streamlit
This Streamlit dashboard provides an interactive and comprehensive analysis of customer behavior, regional sales trends, and revenue insights. The dashboard enables businesses to identify key performance metrics, customer segments, and revenue drivers, supporting data-driven decision-making.
dashboard data-analysis data-visualization duckdb sales-analysis sales-dashboard streamlit-dashboard
Last synced: 19 Apr 2026
https://github.com/shyamkumarnagilla/ai-powered-forecasting-for-agricultural-productivity
AI Powered Forecasting for Agricultural Productivity is a project that utilizes machine learning to predict crop yields and optimize farming practices. By harnessing historical and real-time data, this model empowers farmers with data-driven insights to enhance productivity and sustainability in agriculture.
data-analysis data-visualization deep-learning flask neural-network
Last synced: 19 Apr 2026
https://github.com/scanf-s/basic_dataanalysis
data-analysis jupyter-notebook matplotlib pandas python
Last synced: 19 Apr 2026
https://github.com/yuvrajsaraogi/unemployment-analysis-with-python
Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.
big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql
Last synced: 19 Apr 2026
https://github.com/tsffarias/my-books
Exploratory analysis of my Dataset 'All_the_Books_I_read' which contains all the books I've read
books data-analysis python tableau
Last synced: 19 Apr 2026
https://github.com/kheriberto/linear_regression_ecommerce
Simple project showcasing crafting a linear regression model with SciKit Learn
data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn
Last synced: 19 Apr 2026
https://github.com/decepticon-ts/cap-ai-studio
Description: A modern, powerful web application for advanced image analysis and batch processing, featuring real-time AI-powered image captioning, comprehensive reporting, and an intuitive user interface. Built with Streamlit and Google's Gemini API.
artificial-intelligence batch-processing computer-vision data-analysis gemini-api image-processing image-processing-python python streamlit streamlit-webapp threading
Last synced: 19 Apr 2026
https://github.com/mugambi645/eda-projects
A list of EDA projects
data-analysis eda matplotlib numpy pandas plotly seaborn webscraping
Last synced: 19 Apr 2026
https://github.com/diegoglezsu/bulletin-fetcher
bulletin-py is python package to easily fetch bulletins and legal acts from a wide variety of sources of Eurpean Union.
data-analysis european-union legal-documents python sparql
Last synced: 19 Apr 2026
https://github.com/edwinrlambert/exploring-airbnb-market-trends
Dive into NYC's Airbnb market trends through detailed analysis of listings data, including prices, types, and review dates. This is a DataCamp project.
airbnb data-analysis jupyter-notebook market-trends python
Last synced: 19 Apr 2026
https://github.com/samwhaaa/superfoodsmax
A customer demographic & spending trend analysis on the fictional SuperFoodsMax grocery chain
data-analysis data-analytics data-visualization jupyter jupyter-notebook python
Last synced: 20 Apr 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/nikolaos-mavromatis/etf-data-analysis-dashboard
Insights into SPY ETF performance with an interactive Streamlit dashboard powered by Alpha Vantage data.
api data-analysis data-visualization financial-analysis pandas plotly python streamlit
Last synced: 20 Apr 2026
https://github.com/dthung1602/goodread-bestbook-prediction
Data analysis - trying to predict the result of Goodreads Choice Adward
data-analysis goodreads pca python r xgboost
Last synced: 20 Apr 2026
https://github.com/natnaelhhaile/text-similarity-analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 20 Apr 2026
https://github.com/jerinpious/movie-recommendation-system
A content-based movie recommendation system built using Python. The system processes movie data, extracts relevant features, and provides recommendations based on user preferences
content-based-recommendation data-analysis jupyter-notebook machine-learning pandas python streamlit
Last synced: 20 Apr 2026
https://github.com/jbalooshie/school_district_analysis
Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.
data-analysis data-science dataframes jupyter-notebook numpy pandas python
Last synced: 20 Apr 2026
https://github.com/hugo-hattori/customer_profile_analysis
Data Analysis Project.
data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python plotly plotly-express plotly-io python
Last synced: 20 Apr 2026
https://github.com/sarthakmishraa/bike_rental_predictor
Bike Sharing Dataset : This dataset contains the hourly and daily count of rental bikes between years 2011 and 2012 in Capital bikeshare system with the corresponding weather and seasonal information.
data-analysis machine-learning python xgboost
Last synced: 20 Apr 2026
https://github.com/abinashsahoo007/project-bankruptcy-prevention
The project is to create a classification model that predicts the chances of a business facing bankruptcy based on the key feature like Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, Operating Risk.
data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit
Last synced: 20 Apr 2026
https://github.com/salfaris/toy-data-analysis
Random toy data projects. For my portfolio data projects, see linked website
Last synced: 20 Apr 2026
https://github.com/robinmillford/hr-analytics-employee-performance-analysis
HR Analytics: Unveiling Employee Performance - A comprehensive exploration of employee data using SQL and Power BI, uncovering key insights for strategic HR decision-making.
data-analysis data-visualization jupyter-notebook powerbi python3 sql
Last synced: 20 Apr 2026
https://github.com/william-franco/fuzzy-logic
data-analysis data-science rust rust-application rust-lang terminal-app
Last synced: 04 Jun 2026
https://github.com/profasem/logistics-performance-analysis
Power BI dashboard analyzing logistics performance, delivery delays, carrier efficiency, and regional risk.
business-intelligence dashboard data-analysis logistics powerbi python supply-chain
Last synced: 21 Apr 2026
https://github.com/mahmoudwal27/e-commerce-data-analysis
A collection of data analysis and visualization projects focused on ecommerce datasets. Using Python in Google Colab for analysis and Excel for exploration, these projects uncover key insights and trends, showcasing expertise in data manipulation and visualization to inform business decisions.
analytics data-analysis data-analysis-python data-set google-cloud python
Last synced: 21 Apr 2026
https://github.com/danpoynor/pet-shelter-data-analysis-notebook
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data
Last synced: 21 Apr 2026
https://github.com/rachel-xmr/data-analysis-in-health-set-csc3062
CSC3062 Data Analysis and visualization
classification-algorithm data-analysis data-visualization model-evaluation nmf pca python svm t-sne visualization
Last synced: 05 Jun 2026
https://github.com/martinkalema/power-distribution-modelling
Power Distribution Modelling for cea and cel algorithms
data-analysis python synthetic-dataset
Last synced: 21 Apr 2026
https://github.com/rahulpatel0615/sales-analysis-project
Sales Data Analysis Dashboard with Python, Pandas, and Matplotlib. Features 12+ visualizations and comprehensive insights.
data data-analysis data-visualization matplotlib pandas portfolio python
Last synced: 21 Apr 2026
https://github.com/meerantajalli/networksecuritydefense
This Network Security defense systems acts as an indicator against SMP Floods, UDP Floods, ICMP Floods. This model is trained using packets from wireshark and can easily differentiate between normal network traffic and traffic that has been targetted on the machine by an attacker using the rate of packets transfer and using the source IP.
anomaly-detection classification cyber-security data-analysis ddos-detection icmp-flood intrusion-detection machine-learning network-security packet-analysis python random-forest security smp-flood udp-flood wireshark
Last synced: 21 Apr 2026
https://github.com/danpoynor/data-analysis-spotify-songs-2010-2019
Spotify data analysis for songs between 2010 and 2019 using Jupyter Notebooks including pandas and Seaborn plots.
data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots spotify
Last synced: 22 Apr 2026
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks done testing out JetBrains DataLore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 22 Apr 2026
https://github.com/robinmillford/optimizing-treatment-plans-through-data-analysis
The primary focus was on understanding customer health, treatment, and associated charges over multiple years.
data-analysis data-visualization healthcare mysql powerbi sql
Last synced: 22 Apr 2026
https://github.com/prgermux/yield-reporter
This Python application provides a graphical user interface (GUI) for analyzing and visualizing production data from various machines. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data.
automation data-analysis python reporting
Last synced: 22 Apr 2026
https://github.com/thinogueiras/jornada-python
Jornada Python - Hashtag Programação.
data-analysis data-science inteligencia-artificial python rpa
Last synced: 22 Apr 2026
https://github.com/kgotsosm/epl-analysis
Preparing data for machine learning algorithms to predict English Premier League match winners.
data-analysis data-cleaning data-modeling
Last synced: 22 Apr 2026
https://github.com/leabrodyheine/california-schools-data-visualization
This front-end project provides interactive visualizations of learning models adopted by California schools during the pandemic. Using D3.js and Mapbox, it dynamically presents data through bar charts, bubble charts, heatmaps, and geographic maps, allowing users to explore trends across school types, sizes, and districts.
d3-visualization d3js data-analysis data-visualization mapbox openai plotly
Last synced: 22 Apr 2026
https://github.com/ayushi-gajendra/restaurant-order-analysis-sql
End-to-end SQL analysis of 12,266 restaurant transactions to identify high-performing menu items, revenue concentration, bulk ordering behavior, and strategic growth opportunities.
analytics-portfolio business-intelligence case-study customer-segmentation data-analysis data-analytics database-analysis menu-engineering mysql revenue-analysis sql sql-project
Last synced: 05 Jun 2026
https://github.com/ayushi-gajendra/buenos-aires-subway-statistics
A comprehensive data analysis of the Buenos Aires subway system ridership using Python and Pandas. This project identifies peak-hour congestion patterns, explores hourly passenger distributions, and utilizes the 95th percentile to isolate extreme traffic conditions for urban mobility insights.
95th-percentile buenos-aires data-analysis data-science-portfolio data-visualization matplotlib pandas python statistical-analysis subway-ridership transit-data urban-mobility
Last synced: 05 Jun 2026
https://github.com/floffah/my-listening
Various ways to analyse your Spotify extended streaming history data
convex data-analysis listening-history spotify
Last synced: 23 Apr 2026
https://github.com/tranngoca5039/bigquery-a5y
📊 Streamline your data analysis with bigquery-a5y, a powerful tool for optimizing BigQuery performance and improving query efficiency.
analytics api big-data bigquery cloud-computing data-analysis data-integration data-management data-pipeline data-visualization data-warehouse google-cloud machine-learning serverless sql
Last synced: 05 Jun 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/shudhanshurp/adidas-us-data-analysis
This Power BI project analyzes Adidas sales data across different regions, retailers, and product categories in the U.S. The dashboards provide insights into sales performance, operational metrics, and future forecasts to support data-driven decision-making.
data-analysis data-transformation data-visualization forecasting powerbi python retail-analytics
Last synced: 24 Apr 2026
https://github.com/ihnokim/datk
Data Analysis Toolkit (DATK)
data-analysis data-engineering data-science deep-learning image-processing pandas signal-processing
Last synced: 24 Apr 2026
https://github.com/strixion/demoversion_ai
The demoversion of StrixionAI
ai csv data-analysis data-analytics json python txt
Last synced: 24 Apr 2026
https://github.com/datalopes1/bank_marketing
Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 24 Apr 2026
https://github.com/voidnire/redditviralmysteryposts
Análise de posts de subreddits de mistério. O que define um post viral neste tipo de sub?
data-analysis data-visualization mysteries mystery nlms python-3 reddit
Last synced: 24 Apr 2026
https://github.com/muthukumar0908/youtube-data-harvesting-and-warehousing-using-sql-mongodb-and-streamlit
Create a simple and intuitive user interface using Streamlit, From the youtube getting and extracting the data by using API key. That data stored in database.
data-analysis mongodb-atlas python sqldatabase streamlit-webapp youtube-api
Last synced: 24 Apr 2026
https://github.com/manisharora96/data-analysis-of-smartwatch
The project is structured with sample data, step-by-step Jupyter notebooks, and modular Python scripts for automated analysis
data-analysis data-visualization jupyter-notebook python smartwatch-analysis
Last synced: 24 Apr 2026
https://github.com/cyberoctane29/python-for-data-analysis
A repository dedicated to learning Python for data analysis, data science, and data analytics. This collection of Jupyter notebooks covers practical exercises and concepts from the Google Advanced Data Analytics Professional Certificate program.
data data-analysis data-analytics data-science python
Last synced: 24 Apr 2026
https://github.com/amlanmohanty1/zepto-sql-data-analysis-project
Complete Data Analysis on Zepto Inventory data using SQL
data-analysis database inventory-management postgresql sql zepto
Last synced: 24 Apr 2026
https://github.com/gnodux/adb-link
An MCP server that connects to multiple databases. Supports access control and dynamic SQL query tool registration and invocation.
agent ai-tools data-analysis database-gateway go mcp mcp-server
Last synced: 06 Jun 2026
https://github.com/mariann95/sql_data_warehouse_and_analytics_project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics. This repository also contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
data-analysis data-analytics data-cleaning data-engineering data-lakehouse data-science data-science-portfolio data-warehouse data-warehousing datalake datawarehouse datawarehousing etl etl-job etl-pipeline medallion-architecture sql sql-query sql-server sqlserver
Last synced: 06 Jun 2026
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/rubix982/product-quality-classification
This is an implementation for the CIKM AnalytiCup 2017, around the topic of "Product Title Quality". The goal is to take SKUs and rank its title's clarity and conciseness. Referenced papers are attached to this repository. And as such, the aim is to craft ensemble models that either try to replicate results or find new methods for classification.
data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp
Last synced: 25 Apr 2026
https://github.com/tmoulik/bikeshare-python
Analysis of Bikeshare data from three major cities
data-analysis data-visualization python udacity-nanodegree
Last synced: 25 Apr 2026
https://github.com/xjwllmsx/hacker-news-engagement
Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.
data data-analysis jupyter python
Last synced: 25 Apr 2026
https://github.com/m-biriulova/python-job-market-analysis
Web scraping, data analysis, and visualization of Python developer vacancies in Czech Republic.
automation beautifulsoup data-analysis data-visualization portfolio-project python selenium web-scraping
Last synced: 25 Apr 2026
https://github.com/viniciusds2020/streamlit_app_adult
Protótipo APP - Machine learning - Streamlit
app data-analysis data-science front-end joblib machine-learning python streamlit
Last synced: 25 Apr 2026