Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/stkisengese/numpy-data-fundamentals
A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.
data data-analysis numpy pre-processing
Last synced: 16 May 2026
https://github.com/aakk23/netflix_sql_project
This SQL project provides an analytical overview of Netflix's movies and TV shows dataset, uncovering key insights related to content types, ratings, release trends, and geographic distribution. It helps explore patterns in content availability, audience targeting, and regional preferences to support data-driven decisions.
data-analysis netflix-data-analysis postgresql sql
Last synced: 10 Apr 2025
https://github.com/carlosvinimsouza/jupyter-notebook-basic
Armazenado todos os trabalhos referentes a Ciência de Dados.
data-analysis data-science programas-jupyter-notebook python
Last synced: 11 May 2026
https://github.com/mboula/mboula.github.io
GitHub portfolio + interactive resume | Showcasing data projects in civil rights (housing), cannabis, and analytics
cannabis case-study civil-rights compliance dashboards data-analysis data-cleaning data-vizualization excel google-data-analytics housing open-data pattern-analysis portfolio pro-se public-data r sql tableau
Last synced: 10 Jul 2025
https://github.com/adriens/declaration-solennelle-louis-mapou-2024-06-08-data
Dataviz DECLARATION SOLENNELLE DU 2024-06-08 - LOUIS MAPOU
data-analysis datavisualization dataviz dataviz-tools gephi gephi-visualizations new-caledonia nouvelle-caledonie storytelling
Last synced: 18 Feb 2026
https://github.com/s-narasimman/zepto_inventory_sql_data_analysis
This project focuses on data cleaning, exploration, and analysis of product information from the Zepto dataset using SQL. It provides actionable insights into pricing, stock availability, discounts, and category-level performance.
aggregation categorization csv data-analysis data-cleaning kaggle postgresql sql zepto
Last synced: 16 May 2026
https://github.com/deborangueira/campeonado_kaggle_2025
Desenvolvimento de um modelo de machine learning para prever o sucesso de startups. O objetivo é identificar quais empresas têm maior probabilidade de se tornarem casos de sucesso no mercado.
computacao data-analysis desafio kaggle modulo3 ponderada
Last synced: 16 May 2026
https://github.com/pabi1234810/data_analysis_zepto
A comprehensive SQL-based business intelligence solution for analyzing grocery store product data, inventory management, and pricing strategies. This project demonstrates end-to-end data analysis workflow from raw data exploration to actionable business insights.
analytics csv data-analysis data-science database excel kaggle kaggle-dataset mathematics pgadmin4 sql utf-8 zepto
Last synced: 01 Nov 2025
https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm
Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.
artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression
Last synced: 01 Jan 2026
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 02 Nov 2025
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/katarinatmb/serbia-protest-analysis
This project analyzes the frequency, regional distribution, and group characteristics of protests that emerged across Serbia following the fatal collapse of the Novi Sad train station roof in November 2024. The analysis explores how different communities responded in the aftermath of the disaster, using data visualization in RStudio
data-analysis data-visualization r r-mark rstudio
Last synced: 10 Jul 2025
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 16 May 2025
https://github.com/edanur-y/abalone-age-prediction-with-regression-models
Comparing the performances of simple linear, multiple linear, multi-layer perceptron and k-nearest neighbors regressions on abalone data to predict the age.
data-analysis hyperparameter-tuning missing-values-analysis outlier-analysis python recursive-feature-elimination
Last synced: 20 May 2026
https://github.com/ksharma67/eda-on-ipl
In this python notebook, analysis of IPL matches from 2008 to 2020 is done using python packages like pandas, matplotlib and seaborn.
data-analysis data-science eda matplotlib numpy pandas python seaborn
Last synced: 07 May 2026
https://github.com/colindean/allegheny_voter_reg_analysis
Allegheny County Voter Registration Analysis Tools
data-analysis data-science elections pandas polars python voting
Last synced: 16 May 2026
https://github.com/nuraj250/datainsighthub
A Node.js backend application that processes and analyzes personal user data to generate personalized insights and recommendations. It features secure user authentication, data upload and storage, custom algorithms for data analysis, and optional real-time notifications and third-party API integrations. Perfect for showcasing backend development
api-development backend-development bcrypt data-analysis data-analytics data-insights dotenv express jwt-authentication mongodb nodejs passport secure-api user-authentication
Last synced: 09 Apr 2026
https://github.com/gaurav-van/data-analysis-projects
Collections of Projects that involves Data Analysis and Informed Decision Making
data-analysis database powerbi sql
Last synced: 06 Sep 2025
https://github.com/ManuMoolimani/Data-Analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 10 Jul 2025
https://github.com/swouf/ntds_imdb_team4
data-analysis data-visualization datascience graph-theory
Last synced: 13 May 2025
https://github.com/aicorsair/python-case-study-365-data-science-customer-segmentation-in-marketing
This repository contains a detailed case study on the segmentation of 365 Data Science customers using real-world data from an onboarding survey.
customer-segmentation data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-selection hierarchical-clustering k-means-clustering machine-learning marketing marketing-analytics matplotlib pandas python scikit-learn social-media social-media-analysis
Last synced: 13 Apr 2026
https://github.com/lotfiferaga/sig_explore
3d-graphics api data-analysis data-visualization openstreetmap python
Last synced: 06 Mar 2026
https://github.com/adrianlardies/multi-asset-financial-analysis
Comparative analysis of bitcoin, gold and S&P 500 in relation to macroeconomic indicators (VIX, interest rate, CPI). We explore the evolution of a $100 monthly investment in these assets, presenting visualizations to evaluate their performance and potential as financial diversification tools.
data-analysis data-science matplotlib pandas python seaborn
Last synced: 09 May 2026
https://github.com/hemant-kumar786/heart-disease-prediction
Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.
correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis
Last synced: 02 Nov 2025
https://github.com/engraulleite/local-data-warehousing-with-docker
Creating a DW from 0 to hero. Starting with logical and physical modeling to valuable reports.
airbyte data-analysis datawarehouse docker etl-pipeline metabase pgadmin4 postgresql
Last synced: 01 May 2026
https://github.com/datalopes1/fifa21_datacleaning
Neste projeto será feito o processo de limpeza e manipulação a partir do dataset FIFA 21 messy, raw dataset for cleaning/ exploring, que pode ser encontrado no Kaggle, com licensa CC0: Public Domain e enviado por Rachit Toshniwal.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/kunalkumar2001/data-analytics-python-project
Data Analyst Python Project for Portfolio
data-analysis data-anaytics matplotlib numpy pandas python seaborn
Last synced: 19 Apr 2026
https://github.com/martachesnova/sql
Performing data modeling (ERD) and data engineering. Then, writing series of SQL queries to analyze Employee Database of a company.
data-analysis data-engineering data-modeling erd postgresql sql
Last synced: 16 May 2026
https://github.com/hassanislam463/british-airways-data-science
Analyze Skytrax reviews to uncover customer sentiments and key themes while predicting booking behavior using machine learning. This repository includes data collection, analysis, and modeling scripts alongside concise, visualized insights to improve customer experience and operational efficiency.
data-analysis data-science data-visualization
Last synced: 28 Mar 2025
https://github.com/waynejz/heart-disease-analysis
COMP9321 19T1 Assignment 3
data-analysis machine-learning web-application
Last synced: 04 Apr 2025
https://github.com/erseco/ugr_tratamiento_inteligente_datos
Repositorio de trabajo de la asignatura Tratamiento Inteligente de Datos del Máster en Ingeniería Informática de la Universidad de Granada (UGR)
Last synced: 26 Apr 2026
https://github.com/kelvintechnical/web-scraper
Tableau Book Price Analysis
data data-analysis data-science tableau tableau-public
Last synced: 25 Jan 2026
https://github.com/errea/vet_clinic_database
For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.
data data-analysis data-structures data-visualization database
Last synced: 21 May 2026
https://github.com/kashirin-alex/thither.direct-onamove
an android skeleton-example application for using data from Thither.Direct platform on mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 27 Apr 2026
https://github.com/habiburrahman-mu/data-wrangling
Data Wrangling is the process of converting data from the initial format to a format that may be better for analysis.
data-analysis data-mining data-science
Last synced: 21 May 2026
https://github.com/nurulashraf/linear-regression-spotify
Data Science - Spotify Linear Regression Analysis
data-analysis data-preprocessing data-visualization dataset-exploration feature-selection linear-regression machine-learning matplotlib mean-squared-error model-evaluation multiple-regression music-analytics numpy predictive-modeling python regression-analysis root-mean-squared-error scikit-learn seaborn spotify-data
Last synced: 01 May 2026
https://github.com/andersoncrs/regularizacion_lasso_en_modelos_de_regresion_lineal
Este repositorio contiene un análisis detallado sobre la implementación de la regularización Lasso en modelos de regresión lineal para predecir el precio de vehículos. Se parte de un conjunto de datos limpio y se aplican diversas transformaciones y modelados para mejorar la precisión de las predicciones.
data-analysis data-science data-visualization jupyter-notebook linear-regression regularization-methods seaborn sklearn
Last synced: 16 May 2026
https://github.com/hassanislam463/sentiment_analysis_of_financial_news_headlines_and_affect_on_stock_price_prediction
This project analyzes financial news sentiment using a fine-tuned RoBERTa model and integrates it with stock data to predict price movements using LSTM and GRU. It highlights the role of sentiment in enhancing stock market forecasting.
data-analysis data-science data-visualization deep-learning lstm-neural-networks nlp-machine-learning
Last synced: 28 Mar 2025
https://github.com/dcs-training/regressionandmixedeffectsmodelling
This course will introduce you to regression and linear mixed-effects models (LMMs). It will help to develop your theoretical understanding and practical skills for running such models in R. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 25 Feb 2025
https://github.com/dcs-training/introtodatabases
This repository host the material connected to a training developed by Dave Elsmore (Edina) for CDCS. Go to the readme file
data-analysis data-wrangling databases sql
Last synced: 10 Jun 2026
https://github.com/dongdong7048/newtaipei-housing-trend
新北市房價趨勢分析專案
data-analysis housing new-taipei python real-estate
Last synced: 28 Mar 2025
https://github.com/chrisrobertsjr/chrisrobertsjr
Welcome to my Github Profile!
data data-analysis java r sql statistics
Last synced: 03 May 2026
https://github.com/srikarveluvali/dataanalysis
The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.
css data-analysis html javascript matplotlib numpy pandas python seaborn tableau
Last synced: 09 Apr 2026
https://github.com/jihoonerd/national_health_insurance_sharing_service_project
국민건강보험 데이터를 활용한 EDA
data-analysis exploratory-data-analysis health insurance
Last synced: 18 Jul 2025
https://github.com/teja-1403/forage-tata-data-visualisation-empowering-business-with-effective-insights
This repository contains solutions to the 4 different tasks that must be performed during the Data Visualisation: Empowering Business with Effective Insights virtual internship provided by TATA via Forage.
analysis-and-reporting analytics analytics-and-decision-science charts communications dashboards data-analysis data-cleanup data-interpretation data-storytelling data-visualizations graph insights power-bi visual-basic visualizations
Last synced: 18 Feb 2026
https://github.com/daniel-jcvv/daniel-jcvv
👨💻 Data Engineer | 3+ years enterprise experience with Telcel & Citi Banamex Develop ETL pipelines, data governance, and cloud solutions. Building scalable data architectures and automated workflows for Fortune 500 clients. Tech Stack: Python, SQL Server, Oracle, Apache Airflow, PySpark
agentic-ai apache-airflow apache-kafka apache-spark automation business-intelligence citi-bank-apis data-analysis data-engineering data-lake data-warehouse etl-pipeline medallion-architecture mlops n8n-workflow python rag sql-server
Last synced: 15 Apr 2026
https://github.com/satyacoder29/smartfinance-dynamic-financial-dashboard
SmartFinance: Dynamic Financial Dashboard is an interactive tool designed to visualize key financial metrics like revenue, expenses, and profit. It features real-time data updates, charts, slicers, and navigation for easy analysis. This dashboard helps businesses make data-driven decisions and optimize financial performance.
data-analysis data-cleaning data-modeling data-visualization powerbi powerbi-desktop powerbi-visuals powerquerym
Last synced: 13 Feb 2026
https://github.com/lucalullo/italian-justice-workload
Multidimensional analysis of the Italian justice system workload (2003–2024). A study of civil and criminal proceedings using judicial pressure and litigation indicators.
data-analysis italy judicial-workload justice-system kaggle legal-analytics pandas python time-series
Last synced: 24 May 2026
https://github.com/mvharsh/blinkit-sales-dashboard
An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.
blinkitdashboard data-analysis data-visualization powerbi
Last synced: 25 Jan 2026
https://github.com/shafaq-aslam/data-gathering
A hands on collection of notebooks exploring multiple techniques of data gathering, from reading CSV, Excel, JSON, and SQL files to exporting data in various formats and fetching real time data through APIs. This repository documents my complete learning journey of data ingestion, preparation, and extraction for data analysis workflows.
api data-analysis data-export data-gathering data-import data-science jupyter-notebook machine-learning pandas python python3
Last synced: 21 May 2026
https://github.com/nick-peter-marcus/chocolate-bar-analysis
Analyzing Chocolate Bar Features and Ratings - Data Visualization, Decision Trees, Random Forest
data-analysis data-visualization decision-trees python random-forest seaborn sklearn
Last synced: 10 May 2026
https://github.com/as16082023/global-electronics-retailer
Analyzed Maven Electronics' performance data to identify factors driving revenue decline since 2020.
advanced-excel data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/anonymo2239/big-data-churn-analyzer
Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.
big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark
Last synced: 21 May 2026
https://github.com/karishmagupta05/e-commerce-sales-dashboard
This project is an interactive E-Commerce Sales Dashboard built using Power BI. It provides key insights into sales, profit, and customer behavior through visually engaging charts and graphs.
data-analysis data-visualization powerbi
Last synced: 09 Feb 2026
https://github.com/fealt/databricks_incremental_data_project
Databricks project showcasing incremental data ingestion with industry best practices.
data-analysis data-engineering data-ingestion databricks delta-lake etl lakehouse medallion-architecture python spark spark-sql sql streaming-data
Last synced: 08 May 2026
https://github.com/dcs-training/much-ado-about-nothing-missing-data-in-research
Repo for the Much ado about nothing workshop. Go to the Readme file
data-analysis data-cleaning data-wrangling r
Last synced: 15 Jun 2025
https://github.com/gui-sitton/bank-loans
In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/abhipatel35/moviematcher-movie-recommender-system
A robust movie recommendation system using the MovieLens dataset, employing Collaborative Filtering, Matrix Factorization, and Hybrid Models to enhance recommendation accuracy and diversity.
collaborative-filtering content-based-filtering data-analysis eda hybrid-models machine-learning matrix-factorization movie-recommendations movielens-dataset python recommender-system surprise-library
Last synced: 21 May 2026
https://github.com/leabrodyheine/ml-kaggle-cirrhosis-data
This project showcases skills in machine learning, data preprocessing, and model evaluation using Python libraries such as scikit-learn, XGBoost, and Optuna. It involves implementing various machine learning models, handling imbalanced data, and employing imputation techniques to enhance model performance for predicting cirrhosis outcomes.
data-analysis data-pre imbalanced-data imputation machine-learning optuna pipeline scikit-learn xgboost
Last synced: 14 May 2026
https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights
The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.
chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization
Last synced: 17 Feb 2026
https://github.com/mahmoudwal27/brazilian_ecommerce
This project explores and cleans the Olist Brazilian E-Commerce dataset using Python (Pandas) to prepare it for Power BI visualization. The process includes loading data, performing exploratory analysis, handling missing values and duplicates, formatting key columns, and exporting clean datasets.
analytics data-analysis data-analysis-python google-cloud python
Last synced: 16 May 2026
https://github.com/natnaelhhaile/Text-Similarity-Analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 11 Apr 2025
https://github.com/tapas-gope/pizza-sales
This project analyzes Pizza Sales Data to provide insights into customer preferences and sales performance. Key metrics include total revenue, orders, and average order value, with a breakdown by pizza category and size. The dashboard identifies peak sales periods and top-selling items, supporting data-driven business decisions.
business-intelligence dashboard data-analysis data-visualization dax powerbi sales-analysis
Last synced: 02 Jan 2026
https://github.com/kaushik-puttaswamy/food-delivery-time-prediction-using-machine-learning
The Food Delivery Time Prediction Model estimates delivery times using regression algorithms, with XGBoost as the best performer, and is deployed as a real-time application via Streamlit.
data-analysis data-science delivery food-delivery geolocation machine-learning modeldeployment predictive-modeling python realtimeproject regression-models streamlit xgboost
Last synced: 16 Apr 2026
https://github.com/netcodez/analysing-unicorn-companies---sql
Analysing Unicorn Companies using SQL
data-analysis data-structures database postresql sql
Last synced: 16 May 2026
https://github.com/shivani8136/bellabeat-smart-device-data-analysis
This project analyzes smart device fitness data to uncover insights into user behavior, engagement, and wellness patterns. Conducted for Bellabeat, a high-tech company specializing in health-focused smart products for women, this analysis supports strategic decisions around product development and feature prioritization.
data-analysis data-visualization r-programming-language
Last synced: 08 Feb 2026
https://github.com/tathithienthanh/majorproject_womenfashionproductrecommendationsystem
Build a recommendation system for recommending woman fashion's products on e-commerce platforms
content-based data-analysis data-collection data-processing data-visualization dfd e-commerce erd jupyter-notebook lazada nlp python recommender-system scraping-websites sql system-design tiki vietnamese visualization
Last synced: 20 Mar 2025
https://github.com/rajesh9943/visualizing-global-development-trends-an-animated-analysis-of-life-expectancy-and-fertility-rates
To clean and analyze data to find trends in global population, fertility, and life expectancy from 1960 to 2016. This idea was inspired by hans rosling . To analyze the data, I used a scatter bubble chart, which clearly shows how's the population increased and the fertility rate decreased from 1960 to 2016.
data-analysis data-cleaning-and-preprocessing data-exploration expolatory-data-analysis identify-patterns reporting vizualisation
Last synced: 08 Oct 2025
https://github.com/grindelfp/two-data-manipulative-tasks
Two simple tasks on data analysis and processing.
Last synced: 17 Feb 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/touradbaba/multi-page_dash_application
This repository contains a Multi-Page Dash Application designed to provide interactive visualizations of geo-spatial data, focusing on population and GDP. The app offers insights into demographic and economic trends through interactive maps and various types of charts. It is built with Python, using Plotly and Dash, and is deployed on Heroku.
dash dashboard data-analysis data-visualization exploratory-data-analysis heroku-deployment plotly pythonanywhere
Last synced: 27 Jul 2025
https://github.com/balajimohan18/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides
Last synced: 08 Mar 2026
https://github.com/michael-angelo-mootoo/quanta-app
Quanta is an open source statistical package app / toolkit for neuroscience and general computational descriptive and inferential statistics.
computational-statistics customtkinter data-analysis descriptive-statistics gui-application inferential-statistics neuroscience python r statistical-analysis statistics tkinter-python
Last synced: 16 May 2026
https://github.com/kheriberto/logistic_regression_project
A project that analyses dummie data from an advertising company using logistic regression
data-analysis logistic-regression pandas python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/gutow/langmuir_trough
Code to run homebuilt Langmuir Trough using Jupyter and Python. Link below for API docs:
data-acquisition data-analysis jupyter langmuir-trough plotting
Last synced: 11 Aug 2025
https://github.com/pdiegel/currencytracker
A Python application that fetches real-time currency exchange rates from an API, securely stores the data in an SQLite database, and includes error handling, logging, and good programming practices for reliable and periodic data capturing.
analysis api currency data-analysis data-capture logging python python3 sqlite3 tracker
Last synced: 09 Sep 2025
https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn
Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation
data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis
Last synced: 08 Apr 2026
https://github.com/ggarciajavier/udacity-dalf-project3-test-perceptual-phenomenom
Work performed for the 3rd project of Udacity Data Analyst Nanodegree: statistical testing of a perceptual phenomenom (Stroop task).
data-analysis python statistical-inference udacity-data-analyst-nanodegree
Last synced: 18 May 2026
https://github.com/l0rd-inquisit0r/data-analytics
A repository of data analytics implementations in Python
ai data-analysis data-analysis-python data-analytics
Last synced: 18 Jun 2025
https://github.com/rajkumargara/bike_rental_data_analysis
Chicago bike rental data analysis for business insights using R programming
data-analysis data-visualization data-wrangling large-dataset machine-learning-algorithms
Last synced: 11 Aug 2025
https://github.com/ejw-data/tableau-drug-study
Brief analysis of drug treatments that were also analyzed with pandas
Last synced: 02 Jan 2026
https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard
This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.
dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit
Last synced: 18 Apr 2026
https://github.com/gkn-tech/brisecheck_website
Web Crawler, Visualizations and Game
choropleth-map contact-form data-analysis data-visualization game-development pygame python-flask scatter-plot web-crawler web-scraping
Last synced: 25 Feb 2025
https://github.com/smehra1208/certifications
data-analysis data-visualization excel postgres powerbi python sql
Last synced: 14 May 2026
https://github.com/bpkaur/exploring-67-years-of-lego
Exploring 67 years of LEGO
data-analysis datacamp pandas python3
Last synced: 10 May 2026
https://github.com/faizantkhan/automated-eda
This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.
automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz
Last synced: 18 Apr 2026
https://github.com/anamakarevich/suicide_rates_factors
Female suicide rates analysis for Udacity Hacathon
data-analysis data-cleaning linear-regression suicide
Last synced: 21 May 2026
https://github.com/javorraca/unsupervised-ml
A short exercise using R to perform unsupervised machine learning (clustering) on a sample data set.
ade4 clustering clustering-algorithm clustering-analysis data-analysis data-analytics data-science dplyr jupyter k-means-clustering machine-learning machinelearning ml r r-programming sse unsupervised-machine-learning
Last synced: 05 Apr 2025
https://github.com/yasir-arafah/nyc-trip-fare-prediction-using-tcn
"NYC Trip Fare Prediction Using Temporal Convolutional Networks (TCN)" is a Data Analytics Project where the trip and fare data of NYC taxi are combined and then analyzed using Pyspark and visualized using Matplotlib library. The project predicts the fare by using Temporal Convolutional Neural Network.
colab data-analysis matplotlib nyc-taxi-dataset pyspark python
Last synced: 29 Apr 2026
https://github.com/puspacempaka/hackerrank-sql-challenges-intermediate
This repository features solutions to various intermediate-level SQL challenges from HackerRank. It includes efficient SQL queries, problem-solving techniques, and well-documented scripts. Explore these solutions to understand different SQL problems and enhance your skills.
challenges data-analysis database hackerrank-solutions queries sql sql-intermediate-level
Last synced: 02 Jan 2026
https://github.com/simranshaikh20/credit-card-dashboard
A Data Visualization Project using Microsoft Power bi
data-analysis data-visualization powerbi
Last synced: 02 Jan 2026
https://github.com/mmzong/gee_lifestyleeffectsonhypertension
Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.
aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots
Last synced: 29 Jul 2025
https://github.com/erayagdogan/simplecharts
Simple Charts is a chart maker compose app with material 3 design. Charts are created using the lets-plot-compose library.
android android-app charts data-analysis data-visualization jetpack-compose lets-plot-kotlin material-3 viewmodel
Last synced: 29 Jun 2026
https://github.com/iliyasalve/cyclistic_case_study
Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"
bike-sharing data data-analysis data-visualisation r
Last synced: 06 Apr 2025
https://github.com/josericodata/josericodata
Adding a cool README file
big-data data-analysis data-science dublin hadoop hadoop-mapreduce hadoop-spark ireland jobsearch jobseeker portfolio portfolio-data-science portfolio-website python sql
Last synced: 26 Aug 2025
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/ginga1402/demand-supply-analysis
Demand- Supply Analysis using Python
data-analysis data-science demand-supply-management driver-rider-relationship
Last synced: 30 Mar 2025
https://github.com/ginga1402/car_price_prediction
Predict the price of a car using MS Excel.
college-project data-analysis excel linear-regression
Last synced: 30 Mar 2025
https://github.com/abhishekyadav915/diwali_sales_analysis
This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python
Last synced: 05 Apr 2025