Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/matt-ags/jornada-python
Repositório com os projetos realizados durante a semana "Jornada Python" - 01/2025
artificial-intelligence automation data-analysis jupyter-notebook machine-learning python
Last synced: 05 May 2026
https://github.com/pedrosfaria2/analisetitulosnetflix
Estudo de popularidade dos filmes da Netflix no IMDB.
analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 14 Apr 2026
https://github.com/sanjayankur31/20181206-neurofedora
Slides for my NeuroFedora seminar at the UH Biocomputaiton group's weekly seminar
computational-neuroscience data-analysis neurofedora neuroimaging neuroscience open-science
Last synced: 19 Feb 2026
https://github.com/virajbhutada/hollywood-insights-tableau
Strategic cinematic insights through Hollywood's data landscape. Tableau-driven analytics for genre, studio profitability, and audience dynamics. Uncover trends, assess audience reception, and navigate through years of film data, elevating your understanding of the cinematic world.
analystics business-intelligence dashboard data-analysis data-visualization entertainment hollywood storytelling tableau tableau-desktop visualization
Last synced: 05 Feb 2026
https://github.com/kunalpisolkar24/winequalityprediction
Predicting wine quality using machine learning with matplotlib, numpy, pandas, and seaborn for insightful data analysis. 🍇🤖📊
data-analysis data-science data-visualization machine-learning prediction-model
Last synced: 16 Oct 2025
https://github.com/ashithapallath/r-lab
This repository offers a collection of exercises, assignments, and projects designed for the R Programming course. It focuses on utilizing R for data analysis, statistical modeling, and visualization tasks.
data-analysis exploratory-data-analysis machine-learning r-language visualization
Last synced: 16 Oct 2025
https://github.com/sngr0x0/ranklytics-kr
OP.GG Scraping
data-analysis league-of-legends matplotlib opgg playwright-python scraping visualization
Last synced: 16 Oct 2025
https://github.com/mysftz/numerical-methods-in-matlab
Multiple MatLab scripts over multiple data analysis assignments.
data-analysis data-science matlab university university-assignment
Last synced: 14 May 2025
https://github.com/aishwaryahastak/ipl_analysis
Analysis of IPL dataset using PySpark
Last synced: 16 Oct 2025
https://github.com/bhaveshbhakta/fish-weight-prediction-using-ml
Fish Weight Prediction
data-analysis data-visualization fish-weight-prediction gradient-boosting machine-learning
Last synced: 16 Oct 2025
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/supertetelman/coursera-exdata-09
This repo contains several R scripts that were used to analyze, plot, and clean data from various datasets. These projects were part of the Coursera course, Exploratory Data Analysis. The end results of the analysis are included.
big-data course coursera data-analysis r
Last synced: 16 Oct 2025
https://github.com/luminati-io/target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/dhruvsrikanth/basic-data-science
A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!
analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn
Last synced: 05 May 2026
https://github.com/mattdelaune/excel_sales_dashboard
Interactive Excel Dashboard for Coffee Sales Analysis: This project leverages Excel to analyze sales data, uncover seasonal trends, regional preferences, and customer behaviors, providing actionable insights for optimizing inventory and marketing strategies.
data-analysis excel pivot-tables sales-dashboard sales-data
Last synced: 27 Jan 2026
https://github.com/kathisnehith/austin-crime-report-analysis
Data analysis and visualization of crime trends in Austin
crime-reporting data-analysis data-visual database reporting sql tableau
Last synced: 25 Feb 2026
https://github.com/carlosvinimsouza/full-tutorial-python
My tutorial Python completed
data-analysis data-science data-structures django django-framework fastapi fastapi-framework flask flask-web frameworks learn-to-code learning python python3 roadmap tutorial tutorial-code
Last synced: 10 Apr 2026
https://github.com/bhaveshbhakta/diamond-price-prediction-using-xgboost
Diamond Price Prediction
data-analysis data-visualization diamond-prices-predictions ensemble-learning machine-learning xgboost
Last synced: 27 Oct 2025
https://github.com/codeslash21/communicate_data_findings
Analyze and visualize Bay Wheel system data which contains 2.5M individual trips data. And communicate the data findings from the dataset in the form notebook slide.
bay-wheel data-analysis data-visualization explanatory-data-visualization exploratory-data-analysis
Last synced: 22 Jan 2026
https://github.com/shrikantnaidu/greyatom-projects
GreyAtom Projects.
data-analysis data-science greyatom machine-learning portfolio
Last synced: 24 Jul 2025
https://github.com/singhs05/global-youtube-trends
Understand the impact of Likes, comments, dislikes on the video consumption for the videos that were trending.
data-analysis mssqlserver query sql
Last synced: 18 Mar 2026
https://github.com/soypete/example-go-dataframes-parser
example of https://godoc.org/github.com/kniren/gota/dataframe
data-analysis data-science datastructures golang-examples ml
Last synced: 12 Sep 2025
https://github.com/eesunmoon/machine_learning
[Spring 2021] Machine Learning
data-analysis kaggle machine-learning ml python scikit-learn sklearn
Last synced: 14 Apr 2026
https://github.com/lucashomuniz/Project-03
Data-Driven Decision Making: Selecting the Best Regression Model for E-commerce Sales
benchmark-framework data-analysis data-driven data-visualization e-commerce-project language-python lasso-regression linear-regression-models machine-learning python ridge-regression
Last synced: 20 Oct 2025
https://github.com/lucashomuniz/Project-02
Data Analysis and Machine Learning Techniques for Liver Disease Prediction
classification-model data-analysis decision-tree healthcare-application knn-algorithm liver-disease-prediction logistic-regression machine-learning python-language python-script random-forest supervised-learning svm-model
Last synced: 20 Oct 2025
https://github.com/ashwin331133/sql-pizza-outlet-sales-analysis
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
Last synced: 24 Feb 2026
https://github.com/tomkyle/binning
Determine optimal number of bins 𝒌 for histogram creation and optimal bin width 𝒉 using various statistical methods.
binning data-analysis distributions doanes-rule freedman-diaconis histogram histogram-binning math php-math rice-rule scotts-rule square-root statistics sturges-rule terrell-scotts-rule
Last synced: 21 Oct 2025
https://github.com/saisurajmatta/nashville-housing-data-cleaning-project
Clean and standardize Nashville Housing dataset using SQL queries for improved data quality and structure.
azure-data-studio data-analysis mssql mysql sql sql-data-cleaning sql-queries sql-server-management-studio
Last synced: 23 Jan 2026
https://github.com/devanshsahu47/talentscape-glassdoor-analysis
TalentScape is an end-to-end Python project that cleans and analyzes a comprehensive Glassdoor Jobs dataset. It features robust data wrangling and 20 insightful visualizations to uncover trends in job titles, salary ranges, company ratings, and more—providing actionable recommendations to optimize recruitment and compensation strategies.
business-intelligence data-analysis data-vizualisation jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/nischay002/us-honey-production-analysis
Analysis of US honey production (1995–2021) using Python & data visualization. Identifies trends in honey yield, pricing, and colony distribution across states.
data-analysis data-visualization exploratory-data-analysis honey-production matplotlib pandas python seaborn us-agriculture
Last synced: 26 Feb 2025
https://github.com/poglolopez/prueba_tecnica_inlaze
Este repositorio muestra mis habilidades en análisis de datos a través de una prueba técnica para Inlaze. Incluye flujos de trabajo con Python, SQLite y Power BI para analizar el comportamiento de jugadores, depósitos y rendimiento de fuentes de tráfico, destacando eficiencia operativa e información estratégica.
data-analysis data-v etl jupyter powerbi python sqlite
Last synced: 26 Feb 2025
https://github.com/shubhammittal-data/hr_dashboard_tableau
An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.
chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public
Last synced: 17 May 2026
https://github.com/mahdi-meyghani/movie-recommendation-system
A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.
data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning
Last synced: 23 Jan 2026
https://github.com/zeshanfareed/graduation_admission_prediction_ml_django_project
predict file is Django Frameework code file
data-analysis data-visualization datasets-csv django-framework machine-learning machinelearning-python python
Last synced: 23 Jan 2026
https://github.com/scbirlab/hts-tools
🏮 Parsing and analysing platereader absorbance and fluorescence data.
assay-analysis data-analysis fluorescence high-throughput high-throughput-screening platereader
Last synced: 23 Jan 2026
https://github.com/rtlich/sap-sustainable-management
Project for the ERP & BI course at Esprit School of Engineering. It optimizes resource and operations management in an agri-food company using SAP MM & PM, focusing on sustainability, CO₂ reduction, and predictive maintenance.
angular business-intelligence data-analysis flask machine-learning ocr powerbi python sql-server talend
Last synced: 05 May 2026
https://github.com/omr5221/bi-scripts
Example of DI BI tool scripting
automation configuration-files data-analysis data-warehousing dimensional-analysis diver etl-pipeline lookup modeling perl-script python shell-script sql summarization
Last synced: 15 Mar 2026
https://github.com/jigyasag18/bird-strikes-in-aviation-project
This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.
bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database
Last synced: 09 May 2026
https://github.com/gunifiri/duckdb-ghw
🦆 Accelerate analytics with DuckDB's integration for GitHub workflows, enabling efficient data handling and processing directly within your repositories.
analytics analytics-engine big-data columnar-storage data-analysis data-science database duckdb in-memory-database open-source parquet python query-planner r sql
Last synced: 29 Apr 2026
https://github.com/browndwarf/contracosta
Wavelength dependent starspot contrast with Kepler/K2 and TESS
Last synced: 23 Jan 2026
https://github.com/albertobarrago/sentinel
A contribute for the research of Corrado Malanga and Filippo Biondi
Last synced: 24 Oct 2025
https://github.com/sugumarsrinivasan/sql-datawarehouse-project
Building Mordern datawarehouse with SQL Server, including ETL Processes, data modeling, and data analytics.
data-analysis data-analytics data-engineering data-lake data-science data-warehouse datawarehousing etl etl-pipeline medallion-architecture sql sql-query sql-server
Last synced: 19 Jun 2026
https://github.com/luminati-io/walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/jofaval/sonar
Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987
data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci
Last synced: 09 Apr 2026
https://github.com/mohamed-khaled0/covid-data-exploration.sql
Covid-19 data
covid19-data data-analysis datacleaning microsoft-sql-server sql
Last synced: 06 Feb 2026
https://github.com/ragedunicorn/mantisx-notebook
A repository for Jupyter notebooks analysing mantisx data
data-analysis data-visualization mantis mantisx shooting training
Last synced: 24 Jul 2025
https://github.com/janiavdv/data-spirits
Analysis of alcohol and sports betting data, including a correlation investigation.
correlation data-analysis data-science machine-learning
Last synced: 11 Nov 2025
https://github.com/a26nine/kortext-usage-dashboard
An interactive data visualisation dashboard built using Tableau software to understand the value of digital resources issued on Kortext platform at Middlesex University, London.
data-analysis data-science data-visualization knime tableau
Last synced: 01 Feb 2026
https://github.com/sehgal-vishal/sql-nyc-collision-analysis
this analysis is based on the Collisions(Accidents) happend in New York City. I have used Sql Server For EDA(Exploratory Data Analysis
data-analysis database eda sql-server
Last synced: 06 Feb 2026
https://github.com/gjjvdburg/veld
Easy command line analytics
cli command-line-tool data-analysis data-science data-visualization statistics
Last synced: 26 Oct 2025
https://github.com/ljadhav25/linear_regression_data_science
Linear regression analysis is used to predict the value of a variable based on the value of another variable. The variable you want to predict is called the dependent variable. The variable you are using to predict the other variable's value is called the independent variable.
data-analysis data-science linear-regression machine-learning
Last synced: 26 Oct 2025
https://github.com/alaminsframe/weather-dengue-trends-in-dhaka
Analyzing weather-Dengue correlation in Dhaka (2020–2025)
beautifulsoup4 data-analysis data-scraping pandas public-health selenium tableau time-series-analysis
Last synced: 26 Oct 2025
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/campagnucci/exercitando_pandas
Exercícios práticos de pandas com dados abertos da educação de São Paulo
data-analysis data-science education-data exercises pandas-tutorial
Last synced: 28 Jan 2026
https://github.com/limatix/limatix
Limatix datacollect and processtrak tools
data-analysis python scientific-workflows
Last synced: 23 Jan 2026
https://github.com/badranalyst/residential-unit-prices-data-analysis-application
Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.
data-analysis data-visualization dataset matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/farzeen-2001/blinkit-sales-analysis-using-powerbi
The project provides an overview about the BlinkIt Sales performances
data-analysis data-visualization datacleaning excel powerbi
Last synced: 24 Jan 2026
https://github.com/code-jl/nfl-kicker-predictor
A sophisticated Python application that provides real-time NFL kicker statistics and performance analysis with an intuitive graphical interface.
beautifulsoup data-analysis data-visualization espn football gui nfl prediction python real-time-analytics real-time-data sport-analytics sports-data statistics tkinter web-scraping
Last synced: 01 Jun 2026
https://github.com/alunera-data/alunera-data
Hi, I’m Yvonne – building data solutions at the intersection of BI, SQL & Service Management
business-intelligence data-analysis data-engineering data-science github-profile portfolio rstats sql
Last synced: 28 Jan 2026
https://github.com/georgehanymilad/amazon-prime-content-analysis-dashboard-using-power-bi
Power BI Project for Data Analysis
dashboard data-analysis data-visualization dataanalyst movies-rate powerbi
Last synced: 06 Feb 2026
https://github.com/diegopino/publibdata_codexhackathon
Public Library Data processing/analysis codex hackathon attempt
data-analysis data-visualization libraries public
Last synced: 24 Jan 2026
https://github.com/isabelleysseric/data-analysis
Data analysis with R
data-analysis data-processing data-science-projects graph graph-algorithms r
Last synced: 28 Jan 2026
https://github.com/obirikan/u.s.-county-commute-data-analysis
This project extracts and analyzes U.S. county-level commuting data from the 2020 American Community Survey (ACS 5-Year Estimates) via the U.S. Census Bureau API.
Last synced: 28 Jun 2025
https://github.com/hdgiacon/power_bi_projects
Repositório contendo cursos, dashboards e projeto relacionados à análise de dados e Power BI.
data-analysis data-engineering data-visualization microsoft-power-bi
Last synced: 24 Jan 2026
https://github.com/leftcoastnerdgirl/excel_crowdfunding_analysis
This project demonstrates the use of MS Excel for data cleansing & formatting to prepare for data analysis and visualization.
bar-charts conditional-formatting data-analysis data-analytics data-analytics-excel data-preparation data-preprocessing data-visualization excel line-graph
Last synced: 06 Feb 2026
https://github.com/snigdho8869/numerical-data-analysis-projects
Exploring numerical data analysis with credit card churn, fraud detection, health predictions and more.
adaboost cnn data-analysis deep-learning dnn ensemble-learning exploratory-data-analysis gradient-boosting-classifier keras logistic-regression machine-learning ml numeric numerical-analysis pandas python3 random-forest scikit-learn support-vector-machines tensorflow
Last synced: 15 Apr 2026
https://github.com/arjunraj77/analysis-hub
International Fraud Group Hackathon
data-analysis data-visualization hackathon-project kibana-cluster kibana-dashboard
Last synced: 30 Mar 2025
https://github.com/annnieglez/fraud-detection-eda
Fraud Detection - Exploratory Data Analysis (EDA). Analyzing financial transactions to detect fraud patterns using Python and Tableau. Libraries: Pandas, Seaborn and Matplotlib. Key Focus: Data cleaning, fraud trends, high-risk transactions, time-based patterns
data-analysis data-science data-visualization eda fraud-detection fraud-prevention matplotlib seaborn
Last synced: 28 Jan 2026
https://github.com/wassimhd/pwc-switzerland-power-bi-in-data-analytics-virtual-case-experience
The Project helps to build a foundation in data analysis and Power BI software which is provided by PWC virtual internship
data-analysis data-visualization datastorytelling powerbi
Last synced: 28 Jan 2026
https://github.com/hess125/data-visualizations
A repository of data visualization projects
data data-analysis data-science data-visualization powerbi projects sql sqlite tableau
Last synced: 31 Aug 2025
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
https://github.com/srimantapal205/dataengineerwireframedesigns
Data Engineer Wireframe Designs are essential for planning and visualizing data pipelines, architecture, and workflows before implementation.
data-analysis data-engineering dataflow dataflow-programming datapipeline dataprocessing development visualization
Last synced: 29 Jan 2026
https://github.com/karlyndiary/adidas-sales-analysis
Analyzed Adidas' product sales performance, top retailers, monthly trends, yearly growth, regional distribution, and pricing insights. Performed ETL from Python (Pandas) to SQL Server, extracted data with SQL, and visualized key insights in Excel.
adidas-sales-analysis adidas-sales-dashboard dashboard data-analysis data-cleaning data-pipeline data-visualization etl excel-dashboard microsoft-excel microsoft-sql-server python
Last synced: 10 Feb 2026
https://github.com/andreicirciumaru/best-of-breed
CSV fundamentals screener: schema validation + market-cap weights
csv data-analysis finance pandas python screener
Last synced: 15 Apr 2026
https://github.com/anmolian/data_analysis_facebook_api_ads
Big Data Analytics
data-analysis data-visualization pyspark sql
Last synced: 24 Feb 2026
https://github.com/wareflowx/excel-toolkit
A powerful command-line toolkit for Excel and CSV data manipulation, analysis, and transformation.
data-analysis data-wrangling excel pandas python uv
Last synced: 29 Jan 2026
https://github.com/mattdelaune/powerbi_healthcare_dashboard
Interactive Hospital Insights Dashboard built with Power BI, showcasing comprehensive analysis of patient demographics, treatment outcomes, and hospital performance.
data-analysis healthcare power-bi visualization
Last synced: 29 Jan 2026
https://github.com/abhi227070/medical-insurance-predictor
This project implements a machine learning regression model to predict medical insurance charges based on user-provided details such as smoking status, number of children, gender, and age. The user-friendly interface allows individuals to estimate their average insurance price before purchasing medical insurance.
data-analysis machine-learning machine-learning-algorithms machinelearning python3 regression-models
Last synced: 04 May 2026
https://github.com/isaqueiros/newspapersoldout-predictions-logistic_regression
This notebook is a study of the application of sklearn Logistic Regression model and analysis of metric quality with a focus on the impact of imbalanced data. The problem presented is the analysis of sales of newspapers of a local stand in order to classify the probability of the newspaper being Sold Out or Not, given a set of features.
data-analysis data-imbalance data-science logistic-regression machine-learning python sklearn-library sklearn-logistic-regression
Last synced: 18 Apr 2026
https://github.com/data-forge-notebook/ohlc-aggregation-example
An example of aggregating OHLC stock data using Data-Forge Notebook
algorithmic-trading data data-aggregation data-analysis ohlc quantitative-finance share-market stock-market trading
Last synced: 30 Jan 2026
https://github.com/shrutiijoshi/marketing-campaign-report
The dataset includes information on campaign types, recipient segments, interactions (clicks, opens, bounces, etc.), and conversion metrics.
dashboard data-analysis data-visualization tableau-public
Last synced: 25 Feb 2026
https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models
This repo contains a model for determining the rainfall patterns and another for medical expense prediction model
data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning
Last synced: 30 Aug 2025
https://github.com/jacktheprogrammer/time-series-forecasting-and-analysis
My personal project consisting of my personally created notebooks to work with time series forecasting and analysis. In these projects, I've used deep learning using tensorflow, xgboost, statsmodels and scipy libraries of python. The series were of weather, energy consumption and that of stocks.
data-analysis data-science deep-neural-networks energy-consumption machine-learning portfolio prophet-facebook prophet-model python python3 scipy statsmodels stocks tensorflow time-series time-series-analysis timeseries-forecasting weather xgboost
Last synced: 05 May 2026
https://github.com/mfakhriazhar/us-companies-revenue-dashboard
This project is a data visualization dashboard built using Power BI that highlights lists of the largest companies in the United States by revenue. The goal is to provide an interactive overview of company performance across industries, focusing on revenue, employee metrics, and industry trends.
dashboard data-analysis data-visualization largest-companies-us powerbi revenue united-states
Last synced: 30 Jan 2026
https://github.com/touchesir/twitter_physicalactivity
Companion Data / Analysis for "Monitoring Physical Activity Levels using Social Media Data"
Last synced: 30 Jan 2026
https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-
This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms
data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier
Last synced: 15 Apr 2026
https://github.com/aygp-dr/values-compass
Tools for exploring and analyzing Anthropic's Values-in-the-Wild dataset for AI ethics research
ai-ethics anthropic-claude data-analysis nlp values
Last synced: 25 Feb 2026
https://github.com/ahnaf19/rokomari_price_analysis
This was a job hiring assignment given my rokomari.com. The data was small, obviously a generated one for test purpose. I tried to describe myself while diving deep as much as possible.
data-analysis data-cleaning data-visualization etl
Last synced: 30 Aug 2025
https://github.com/gurpreet17/uc-davis-sql-for-data-science-specialization
Completed the SQL Basics for Data Science Specialization from the University of California, Davis, gaining proficiency in Data Analysis, SQL, Apache Spark, and Delta Lake.
apache-spark bigdata data-analysis data-science delta-lake sqlite
Last synced: 15 Apr 2026
https://github.com/auliannee/new-york-uber-pickups-analysis
This repository contains the projects related to data collecting, quality check, manipulation, analyzing, and visualizations.
data-analysis data-science ipython-notebook jupyter-notebook python
Last synced: 07 Feb 2026
https://github.com/sajjad425/edaipl
The dataset covers the Indian Premier League (IPL) with details on matches (date, teams, venue, results), player stats (runs, wickets), team stats (wins, losses), season summaries, and umpire info. The EDA reveals patterns and insights, highlighting dominant teams, star players, and trends across seasons.
data-analysis eda exploratory-data-analysis ipl python
Last synced: 05 May 2026
https://github.com/jujulis18/olympicsmedalsdashboard
Olympic Dashboard – Paris 2024 est un tableau de bord interactif permettant d’explorer les performances des athlètes médaillés des Jeux Olympiques d’été de Paris 2024.
dashboard data-analysis data-visualization eda olympic python streamlit
Last synced: 31 Jan 2026
https://github.com/amishidesai04/flipkart-mobile-sales-analysis
Flipkart Mobile Sales Analysis is a Tableau project that visualizes mobile sales data from Flipkart. It highlights trends in brand performance, pricing, ratings, and customer preferences. The interactive dashboard helps users explore key insights for data-driven decisions in e-commerce and retail.
dashboard data-analysis data-visualization storyboard tableau
Last synced: 31 Jan 2026
https://github.com/traore-07/fedex-sales-analysis
Analysis of the FedEx Sales Transaction
data-analysis data-visualization sales-analysis tabeau
Last synced: 31 Jan 2026
https://github.com/jaseel342/hr_analytics_dashboard
This project showcases the use of Power Query and DAX Query to analyze employee details, add new measures and columns, and create a dashboard using Power BI.
data-analysis dax-query power-query powerbi
Last synced: 03 Jan 2026
https://github.com/cca/panopto-session-data
analyzing Panopto session data for retention purposes
data-analysis ipython-notebook video
Last synced: 07 Feb 2026
https://github.com/allanotieno254/bank-loan-analysis-dashboard-power-bi
An interactive Power BI dashboard that analyzes bank loan data to provide insights into approval trends, default risks, and customer profiles. Designed to assist financial institutions in making data-driven lending decisions.
bank-loans business-intelligence dashboard data-analysis financial-analysis power-bi risk-assessment
Last synced: 31 Jan 2026