Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/sumit0ubey/internship
This repository showcases the tasks and projects I completed during various internships. It includes work across diverse domains such as: Data Analysis: Exploratory data analysis, data visualization, and insights generation using Python and libraries like Pandas, Matplotlib, and Seaborn. Backend Development: Designing and implementing RESTful API
backend-development data-analysis python-developer
Last synced: 05 Sep 2025
https://github.com/yixin0829/web-data-scraping-project
Python web scraping script and MySQL database design :bug:
beautifulsoup4 data-analysis data-visualization database gender-api matplotlib python website
Last synced: 19 Apr 2026
https://github.com/mradovic38/cnn-dominant-color-recognition
Detecting a dominant color of Tulip images using various convolutional neural networks.
artificial-intelligence cnn computer-vision data-analysis deep-learning dominant-color kmeans-clustering machine-learning neural-network pytorch regression resnet resnet-18 squeeze-and-excitation
Last synced: 01 Jul 2026
https://github.com/tszon/data-science-projects
Included are all the worth-noting Data Science projects in my learning journey with DataCamp.
data-analysis data-science exploratory-data-analysis feature-engineering machine-learning modelling preprocessing-data scikit-learn supervised-learning
Last synced: 15 Mar 2025
https://github.com/itskshitija/lego-set-explorer
As a part of the Maven Analytics Lego challenge, I developed an interactive Power BI dashboard exploring the evolution of LEGO sets from 1970 to 2022.
data-analysis data-science data-visualization dataanalysis dataset powerbi powerbi-desktop powerbi-report
Last synced: 12 Jun 2025
https://github.com/fbarffmann/nosql-challenge
Analyzed 28,000+ UK restaurant records using MongoDB and PyMongo. Queried hygiene scores, location data, and customer ratings.
data-analysis data-cleaning database-analysis json mongodb nosql pymongo python restaurant-data
Last synced: 13 Apr 2026
https://github.com/hugo-hattori/watercraft_values_ai_prediction
Data Science Project.
ai-model artificial-intelligence artificial-intelligence-algorithms data-analysis data-analytics data-science jupyter jupyter-notebook machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas pandas-dataframe pandas-python python seaborn sklearn sklearn-library sklearn-metrics
Last synced: 23 Aug 2025
https://github.com/junpenglao/jaefa
Just Another Eye-movement Filtering Algorithm
data-analysis eye-movement-data eye-tracking
Last synced: 12 Jan 2026
https://github.com/sasanthns/sql_data_warehouse_project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver
Last synced: 24 Mar 2025
https://github.com/avratanubiswas/fluorpenplugin
A matlab user interface for analysing OJIP curve datasets from FluorPen instrument. That is, serving as an additional plug in for "quick categorical analysis".
data-analysis fluorpen ojip-curve
Last synced: 18 Mar 2026
https://github.com/ginalamp/covid_dashboard_twitternews
Corona Dashboard & report based on Twitter media outlet news.
dashboard data-analysis data-visualization twitter
Last synced: 28 Jan 2026
https://github.com/syed-amjad-ali/-bank-churn-ml
Predicting bank customer churn using machine learning. This project includes exploratory data analysis (EDA), feature engineering, classification models (Logistic Regression, Random Forest), and customer segmentation using K-Means clustering.
classification data-analysis data-science eda jupyter-notebook k-means-clustering machine-learning ml python segmentation
Last synced: 09 Mar 2025
https://github.com/robertochiosa/automatic-powerpoint-report-rmd
Automatically generate good looking powerpoint presentations from a csv dataset
data-analysis data-science medium medium-article python r
Last synced: 19 Apr 2026
https://github.com/kheriberto/linear_regression_ecommerce
Simple project showcasing crafting a linear regression model with SciKit Learn
data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn
Last synced: 19 Apr 2026
https://github.com/akashvarma26/data-analysis-on-olympics-csv-dataset
Data Analysis on Olympics dataset of csv format using re and Pandas in Jupyter notebook.
data-analysis jupyter-notebook pandas regex
Last synced: 02 May 2026
https://github.com/anniefib/otherprojects
Powering Data Dreams: From Orchestration to Analytics with Cloud Precision
airflow-etl-orchestration aws cloud-native-data-solutions data-analysis data-visualization database datamodelling datawarehousing eda end-to-end-data-pipelines machine-learning-models pgadmin4 spark-analytics sql
Last synced: 07 May 2026
https://github.com/decepticon-ts/cap-ai-studio
Description: A modern, powerful web application for advanced image analysis and batch processing, featuring real-time AI-powered image captioning, comprehensive reporting, and an intuitive user interface. Built with Streamlit and Google's Gemini API.
artificial-intelligence batch-processing computer-vision data-analysis gemini-api image-processing image-processing-python python streamlit streamlit-webapp threading
Last synced: 19 Apr 2026
https://github.com/reddyprasade/r-program
R is a programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis.
data-analysis data-science r-programming
Last synced: 11 Apr 2026
https://github.com/alphatwirl/qtwirl
qtwirl (quick-twirl), one-function interface to AlphaTwirl
alphatwirl data-analysis data-frame pandas r root-cern
Last synced: 11 Apr 2026