Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/dcs-training/r-visualisation-and-stats
This repository contains material from a 8 classes course on Data Visualisation and statistics with R
data-analysis data-visualisation data-wrangling intro-to-programming r statistics
Last synced: 20 Jun 2026
https://github.com/evanmathew/northwind-traders
SQL-powered analysis of sales, employee performance, and customer behavior using PostgreSQL window functions. This project uncovers key business insights to optimize decision-making.
case-study data-analysis jupyter-notebook northwind-traders postgresql python-postgresql sql
Last synced: 20 Jun 2026
https://github.com/haseebn19/urban-housing-demand
A full-stack web application for visualizing housing and labour market data
data-analysis data-visualization docker full-stack gradle statistics web webapp
Last synced: 22 Jun 2026
https://github.com/katiebuntic/research_methods
Data Science Research Methods
analysis data-analysis data-science python research-project
Last synced: 23 Jun 2026
https://github.com/engusseus/warframe-market-set-profit-analyzer
Python tool that analyzes Warframe Market data to find profitable item sets to trade
api data-analysis python trading waframe
Last synced: 23 Jun 2026
https://github.com/manditacaos/hypefemme-analise-vendas
Projeto de análise de dados e visualização no Power BI da loja fictícia Hype Femme.
data-analysis jupyter-notebook portfolio powerbi python
Last synced: 10 Apr 2025
https://github.com/emaleckova/emaleckova.github.io
My personal website created with Quarto
biology data-analysis data-viz quarto r
Last synced: 23 Jun 2026
https://github.com/nimomach/cafe-sales
This analysis focuses on evaluating the sales performance of a cafe by examining key metrics such as total revenue, sales by product category, peak sales times, and many more.
cafe data-analysis data-visualization sales
Last synced: 12 Mar 2026
https://github.com/subratamondal1/heart-attack-prediction
Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.
data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/chen0040/spark-tabular-analytics
Spark statistical inference framework for performing column pair-wise data analytics for large data table
anova chi-square-test confidence-intervals data-analysis hypothesis-testing spark statistical-inference tabular-data
Last synced: 07 Jul 2025
https://github.com/aalekhpatel07/statcan
StatCAN dataset fetcher and cleaner.
census data-analysis data-science statcan
Last synced: 02 Apr 2025
https://github.com/masamallow/jupyterlab-my-local
Configuration to run my personal JupyterLab on my local.
data-analysis jupyter jupyter-notebook jupyterlab
Last synced: 26 Mar 2025
https://github.com/deliprofesor/behavioral-insights-and-data-exploration
This project analyzes Spanish speech data, focusing on acoustic features and demographics. It includes data cleaning, outlier detection, clustering, and time series modeling (ARIMA, Holt-Winters) to uncover patterns in speech duration and word frequency.
acoustic-features arima clustering data-analysis holt-winters k-means machine-learning speech-analysis time-series-analysis
Last synced: 10 Apr 2025
https://github.com/deliprofesor/k-means-clustering-for-retail-data-analysis
This project uses K-Means clustering to segment wholesale customers based on their spending habits. The data is preprocessed, scaled, and clustered into four groups. The Elbow and Silhouette methods determine the optimal number of clusters, and results are visualized using boxplots and scatter plots to uncover spending patterns.
clustering-visualisation data-analysis elbow-method k-means k-means-clustering r silhouette-score
Last synced: 10 Apr 2025
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/georgehanymilad/sales-and-profit-analysis-using-excel
Excel Project for Data Analysis
dashboard data-analysis data-visualization excel excel-dashboard interactivedashboard pivot-tables pivotcharts profit sales-analysis visuzalization
Last synced: 05 Feb 2026
https://github.com/sabelomkhwanzi/data-alchemist-boot-camp
Built on Covalent's Unified API, Increment has the full historical data set for 40+ chains including every smart contract, event, transaction, address, etc. With access to all this data you can find:
covalent data-analysis increment
Last synced: 11 Mar 2026
https://github.com/edanur-y/airline-customer-satisfaction-prediction-with-multiple-logistic-regression
Performing multiple logistic regression analysis on airline and customer data to predict the satisfaction. 🔵R
data-analysis missing-values-analysis multiple-logistic-regression optimal-cut-off-points r
Last synced: 09 Jun 2026
https://github.com/vdoninav/real_estate_analysis
real estate analysis
data data-analysis data-analysis-python data-science pandas pandas-dataframe pandas-python plotly plotly-express scipy seaborn streamlit streamlit-application streamlit-dashboard streamlit-webapp
Last synced: 12 Apr 2026
https://github.com/luciocolonna/cyclistic-bikesharing-2023
Case study on public data from Chicago's Divvy bikeshare, using R
bikesharing capstone-project cyclistic cyclistic-bikshare data-analysis data-visualization geojson ggplot2 google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional leaflet r sf tidyverse
Last synced: 02 Apr 2025
https://github.com/jnnjenga/video-game-analysis
Performs exploratory data analysis on a video game dataset, examining titles, release dates, teams, ratings, genres, and user engagement metrics to uncover trends, popular genres, and developer insights.
data-analysis data-science dataset eda exploratory-data-analysis game-analysis games genres jupyter-notebook pandas python trends user-engagement video-game visualization
Last synced: 13 Apr 2026
https://github.com/ilke-kas/multivariate-data-analysis
A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.
classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics
Last synced: 05 Oct 2025
https://github.com/seblehner/feldprakt
Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.
data-analysis data-visualization jupyter-notebook python
Last synced: 05 Oct 2025
https://github.com/paphada1103/data-analysis-with-python
📊 Analyze data efficiently using Python’s top libraries. Learn to explore, clean, and visualize data for meaningful insights in your projects.
carpentries data-analysis data-carpentry data-visualisation dataframe-api dataset english hacktoberfest ibm jovian lsl machine-learning matplotlib programming python realtime social-sciences spark
Last synced: 09 May 2026
https://github.com/harkishen/Agriculture-DS
An Agricultural based Mtech project, on Data Science, which predicts the growth of crops based on previous year records.
Last synced: 11 Dec 2025
https://github.com/joyceannie/sql-data-with-danny-case-studies
Case study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
8-week-sql-challenge 8weeksqlchallenge case-study data-analysis data-analytics postgresql sql
Last synced: 05 Oct 2025
https://github.com/ankitwalimbe/ecommerce-funnel-analysis
SQL-based analysis of the Olist e-commerce dataset — building an order funnel (purchase → approval → delivery) with breakdowns by payment type, product category, region, and monthly trend. Includes insights, CSV exports, and Tableau dashboard.
bigquery business-intelligence data-analysis ecommerce funnel-analysis sql tableau-public
Last synced: 05 Oct 2025
https://github.com/manishkaa/google_data_analytics_capstone_case_study
This case study is a part of Google Data Analytics Capstone Project
bigquery data-analysis sql tableau
Last synced: 05 Oct 2025
https://github.com/davifeliciano/modern_physics_experiments
Collection of data analysis and visualization scripts developed in Python around some modern physics experiments
data-analysis data-visualization modern-physics physics physics-experiments
Last synced: 18 Jan 2026
https://github.com/sora468/best-of-ml-python
🏆 Discover top-ranked Python libraries for machine learning, updated weekly to help you find the best tools for your projects.
airport airport-simulation chatgpt configuration data-analysis data-science data-visualization data-visualizations gpt keras machine-learning nlp python scikit-learn tensorflow transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 09 May 2026
https://github.com/ilaxi/lomicontadores
data management tool in reference to number of actions per day in a year
data-analysis gdscript godot godot4 python
Last synced: 19 Apr 2026
https://github.com/nuriadevs/informes-powerbi
Este repositorio contiene informes elaborados con Power BI.
Last synced: 18 Feb 2026
https://github.com/thejvdev/ml-from-scratch
Repository for Implementing ML Models from Scratch in Python
classification data-analysis data-mining data-science deep-learning jupyter-notebook machine-learning matplotlib neural-networks numpy pandas prediction python regression seaborn sklearn visualization
Last synced: 02 Apr 2026
https://github.com/prarthana-singh/bangalore-house-price-predictor
🏡 Bangalore House Price Prediction – A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.
data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn
Last synced: 19 Apr 2026
https://github.com/npodlozhniy/podlozhnyy-module
One place for the most useful methods for work
data-analysis data-science pypi
Last synced: 21 Jan 2026
https://github.com/ak-abhilash/super-market-sales-data-analysis-and-forecasting-using-power-bi
Power BI project to visualize sales data of a supermarket.
dashboard data-analysis powerbi salesdata visualization
Last synced: 05 Feb 2026
https://github.com/ndomah1/learning-probability-and-statistics
This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.
correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics
Last synced: 18 Jan 2026
https://github.com/bhaveshbhakta/student-performance-prediction-using-ml
Student Performance Prediction
data-analysis data-visualization linear-regression machine-learning student-performance-analysis student-performance-prediction
Last synced: 08 Oct 2025
https://github.com/dcs-training/exploratory-data-analysis-and-visualisation-with-observable-plot
This two-hour workshop will teach you how to follow an exploratory data analysis pipeline with Observable Plot, a new JavaScript library based on the Grammar of Graphics, that proposes a simple yet expressive interface to create powerful graphics easily shareable on the web. Go to the Readme file
d3 data-analysis data-visualisation javascript observable-notebook
Last synced: 17 May 2026
https://github.com/inddrsingh/e-commerce_orders
ETL project, with Python for Data cleaning and MySQL for Data analysis
data-analysis etl-pipeline mysql python
Last synced: 18 Apr 2026
https://github.com/debjyotisaha/power-bi-projects-phase-2
Created interactive dashboards and reports using Power BI to visualize complex datasets. Demonstrated proficiency in data modelling, DAX calculations, and storytelling through data to provide actionable insights.
dashboards data-analysis data-modeling data-visualisation power-query powerbi
Last synced: 18 Jan 2026
https://github.com/dhruvalbhinsara1/influencer-and-platform-data-analysis
Exploratory analysis of synthetic influencer marketing data — engagement, revenue, and ROI.
campaign-analytics data-analysis data-analysis-project data-analysis-python eda influencer-marketing jupyter-notebook jupyter-notebooks marketing-analytics matplotlib pandas python seaborn synthetic-data visualization
Last synced: 09 May 2026
https://github.com/jhaayush2004/churncast
Fusion of deep Data Science, Machine Learning and MLOps...
aws data-analysis data-science data-visualization deep-neural-networks docker machine-learning mlops-workflow
Last synced: 09 Oct 2025
https://github.com/takshshah-16/pizza_sales_sql
SQL-powered pizza sales analytics project using MySQL Workbench to derive business insights through data exploration and queries.
business-intelligence data-analysis database-management mysql sql
Last synced: 09 Oct 2025
https://github.com/mirwais-farahi/data-visualization-with-tableau-specialization
The Specialization provides Tableau for data visualization and business intelligence. The series covers skills like assessing data quality, designing visualizations and dashboards, and combining data sources to create compelling, data-driven stories.
dashboard data-analysis geospatial map tableau visualization
Last synced: 16 Feb 2026
https://github.com/adithya2369/safa_public
AI-powered customer feedback analyzer that uses generative AI to transform customer reviews into actionable business insights. Upload review data, get instant summaries, satisfaction scores, detailed reports, and improvement suggestions—all in an easy-to-deploy Docker container.
data-analysis data-visualization docker-containerization full-stack-development generative-ai langchain langchain-groq web-development
Last synced: 10 Oct 2025
https://github.com/ninadpatil09/hospital_emergency_room_analysis
This comprehensive analysis delves into the performance and characteristics of the hospital's emergency room over the past year. By scrutinizing key metrics and patient demographics, this study aims to provide valuable insights for optimizing patient care, resource allocation, and overall operational efficiency.
data-analysis tableau-public visualization
Last synced: 15 Feb 2026
https://github.com/samuelsoaress/wkd-default-reduction
reduction of default from 35% to 25% or less with machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 10 Oct 2025
https://github.com/loaiwalid07/automation_data_overviwe
This is Streamlit app that gives an overview for a dataset you upload
automation data data-analysis data-exploration data-science data-transformation data-visualization
Last synced: 19 May 2026
https://github.com/brooks-code/toulouse-biblio-chronicle
Snapshot of Toulouse public library customer habits — cleaning raw, messy datasets of musical, cinematic, and literary checkouts; includes data-cleaning steps, analysis notebook revealing cultural tastes in the Pink City.
data-analysis data-cleaning data-cleaning-and-preprocessing data-quality exploratory-data-analysis jupyter-notebook library-data misaligned-data mojibake tutorial
Last synced: 10 Oct 2025
https://github.com/filipe-rds/bi-atividade-1
Atividade de análise de dados para a disciplina de Inteligência Empresarial
data-analysis jupyter-notebook python
Last synced: 15 May 2026
https://github.com/amirreza81/kaggle-pandas-course-solutions
Kaggle Pandas Course - Solved exercises in another way of sample solution
data data-analysis data-cleaning data-manipulation data-science dataframe jupyter-notebook kaggle machine-learning open-source pandas
Last synced: 14 Apr 2026
https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project
My project aims to practice Data Analysis and Data Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python seaborn
Last synced: 04 May 2026
https://github.com/cyberoctane29/diamonds-anova-analysis
This project uses ANOVA in Python to analyze how diamond color and cut affect pricing. By testing for statistical significance and running post hoc comparisons, it reveals key pricing patterns. Built with pandas, statsmodels, and Seaborn, the findings help inform diamond valuation and purchasing decisions.
anova-test data-analysis data-analytics data-science diamonds-dataset regression-analysis statistical-analysis tukey-hsd
Last synced: 10 Oct 2025
https://github.com/scarlet-enlight/ml_project
Comparison of different classifiers (KNN, Naive Bayes, Decision Tree) on Sleep Health and Lifestyle Dataset
data-analysis machine-learning
Last synced: 13 Mar 2026
https://github.com/mysto-007/dog-vision-a-dog-breed-recognizer-kaggle-competition
A solution for Dog Breed Identification on Kaggle competition
colab-notebook data-analysis data-science data-visualization jupyter-notebook kaggle kaggle-competition python
Last synced: 09 May 2026
https://github.com/pranav016/exploratory-data-analysis-of-google-app-store-dataset
This is a data analysis done on the Google app store dataset to answer a few questions related to the data through data visualization techniques.
Last synced: 11 Oct 2025
https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python
Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.
data-analysis data-visualization numpy pandas python3 sns visualization
Last synced: 03 May 2026
https://github.com/mouadtaoussi/capmpingi-employee-reviews
Analysis of Capmpingi employee reviews using Python/Pandas and Power BI
data-analysis data-science kaggle pandas powerbi python python3
Last synced: 14 Apr 2026
https://github.com/vineet416/eda-hr-analytics
EDA on HR-Analytics by PW Skills Data Analytics course
data-analysis data-analysis-python data-analytics data-preprocessing data-processing data-visualization exploratory-data-analysis jupyter-notebook matplotlib-pyplot numpy pandas python seaborn statistical-analysis
Last synced: 14 Apr 2026
https://github.com/vinay-jose/territorial-sales-dashboard
EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.
data-analysis data-visualization powerbi-desktop sql
Last synced: 11 Oct 2025
https://github.com/bakulwani/data-mart-weekly-sales
Cleaned and analyzed weekly sales data using SQL to build a business-focused data mart with KPIs, customer segmentation, and platform insights.
customer-segmentation data-analysis data-cleaning etl kpi-analysis mysql sales-analysis sql
Last synced: 21 Feb 2026
https://github.com/thinzarhninyu/dap
Notes and Projects for Data Analysis with Python course from FreeCodeCamp.org
data-analysis data-analysis-python ipynb jupyter-notebook python
Last synced: 18 Feb 2026
https://github.com/abeltavares/postql
Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.
cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper
Last synced: 19 Jan 2026
https://github.com/treasarose/us_candy_distribution_analysis_project
This project focuses on advanced data analysis and optimization using SQL. It includes queries for analyzing sales, product margins, and shipping efficiency for a US candy distributor.
data-analysis entity-relationship mssql optimization query sql-server sqlproject us-candy-distributor
Last synced: 12 Oct 2025
https://github.com/jeffbrennan/analysis-templates
Templates of commonly used graphics/functions/settings to help focus on the bigger picture
Last synced: 12 Oct 2025
https://github.com/rodrigojunqueiradev/the-ultimate-mysql-bootcamp
The Ultimate MySQL Bootcamp
data-analysis data-engineering data-science data-visualization database dataset mysql sql
Last synced: 14 Apr 2026
https://github.com/zulhaditya/web-scraping-python
A repository that stores various source code and web scraping methods using Python.
data-analysis python3 webscraping
Last synced: 12 Oct 2025
https://github.com/chirlmin-joo-lab/papylio
Single-molecule fluorescence trace extraction and analysis
biophysics data-analysis fluorescence fret single-molecule sparxs
Last synced: 12 Oct 2025
https://github.com/veronsheva/hr_dashboards
Interactive HR dashboard using Tableau & MySQL – explore employee trends, performance, attrition, and salary insights.
calculated-fields charts cte dashboards data-analysis data-cleaning design eda mysql queries tableau window-functions
Last synced: 24 Jan 2026
https://github.com/agb2k/twitter-analyzer
Project to extract tweets based on searches, analyze it's data and autocorrect potentially incorrect words
data-analysis python tweepy twitter
Last synced: 13 Oct 2025
https://github.com/madhursinghbhadoriya/atliq_salesdata_analysis-powerbi
AtliqH_SalesData Analysis - Power
dashboard data-analysis powerbi
Last synced: 21 Jan 2026
https://github.com/louisfernando1204/websocket-benchmark
A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.
benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws
Last synced: 09 Apr 2026
https://github.com/stefagnone/-employee-salary-analysis-and-insights
Predictive analysis of employee salary determinants for an anonymized dataset, highlighting key factors influencing salary and providing insights for salary policy improvements.
business-intelligence data-analysis data-science employee-salary-analysis excel gender-pay-gap predictive-insights regression-modeling spss statistical-analysis
Last synced: 23 Feb 2026
https://github.com/wassimhd/pwc-switzerland-power-bi-in-data-analytics-virtual-case-experience
The Project helps to build a foundation in data analysis and Power BI software which is provided by PWC virtual internship
data-analysis data-visualization datastorytelling powerbi
Last synced: 28 Jan 2026
https://github.com/yash1882/music-store-data-analysis
A project focuses on analyzing music store data using SQL ♬
begineer-friendly data-analysis music music-store-data music-store-data-analysis sql-project
Last synced: 28 Jan 2026
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
https://github.com/anurag-ghosh-12/library_management_system_sql
This project showcases the development of a comprehensive Library Management System utilizing Structured Query Language (SQL). It demonstrates a practical application of relational database principles to efficiently manage library resources, member information, and borrowing/returning transactions.
data-analysis data-visualisation dbms-project sql
Last synced: 29 Jan 2026
https://github.com/andreicirciumaru/best-of-breed
CSV fundamentals screener: schema validation + market-cap weights
csv data-analysis finance pandas python screener
Last synced: 15 Apr 2026
https://github.com/anmolian/data_analysis_facebook_api_ads
Big Data Analytics
data-analysis data-visualization pyspark sql
Last synced: 24 Feb 2026
https://github.com/wareflowx/excel-toolkit
A powerful command-line toolkit for Excel and CSV data manipulation, analysis, and transformation.
data-analysis data-wrangling excel pandas python uv
Last synced: 29 Jan 2026
https://github.com/abhi227070/medical-insurance-predictor
This project implements a machine learning regression model to predict medical insurance charges based on user-provided details such as smoking status, number of children, gender, and age. The user-friendly interface allows individuals to estimate their average insurance price before purchasing medical insurance.
data-analysis machine-learning machine-learning-algorithms machinelearning python3 regression-models
Last synced: 04 May 2026
https://github.com/smahala02/magnetism-lab
This repository contains Python scripts and data for analyzing inductance in toroidal coils to calculate the magnetic permeability of ferrite materials. The project helps classify materials as soft or hard magnets based on experimental data.
data-analysis inductance jupyter-notebook magnetism python toroids
Last synced: 29 Jan 2026
https://github.com/data-forge-notebook/ohlc-aggregation-example
An example of aggregating OHLC stock data using Data-Forge Notebook
algorithmic-trading data data-aggregation data-analysis ohlc quantitative-finance share-market stock-market trading
Last synced: 30 Jan 2026
https://github.com/shrutiijoshi/marketing-campaign-report
The dataset includes information on campaign types, recipient segments, interactions (clicks, opens, bounces, etc.), and conversion metrics.
dashboard data-analysis data-visualization tableau-public
Last synced: 25 Feb 2026
https://github.com/joannescode/regex_with_py
Learning by practicing with Regex (Python)
Last synced: 30 Jan 2026
https://github.com/mfakhriazhar/us-companies-revenue-dashboard
This project is a data visualization dashboard built using Power BI that highlights lists of the largest companies in the United States by revenue. The goal is to provide an interactive overview of company performance across industries, focusing on revenue, employee metrics, and industry trends.
dashboard data-analysis data-visualization largest-companies-us powerbi revenue united-states
Last synced: 30 Jan 2026
https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-
This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms
data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier
Last synced: 15 Apr 2026
https://github.com/aygp-dr/values-compass
Tools for exploring and analyzing Anthropic's Values-in-the-Wild dataset for AI ethics research
ai-ethics anthropic-claude data-analysis nlp values
Last synced: 25 Feb 2026
https://github.com/jcaperella29/jc_bioinformatics_hub
A personal hub to showcase my bioinformatics applications including RNA-Seq, ATAC-Seq, and miRNA-Seq analysis tools. Powered by simple HTML, CSS, and JavaScript with a biotech-themed design.
atac-seq bioinformatics biotech data-analysis github-pages portal rna-seq webapp
Last synced: 25 Feb 2026
https://github.com/jaseel342/ecommerce_sales_dashboard
The E-commerce Sales Dashboard project offers a comprehensive view of e-commerce sales performance using interactive Power BI dashboards. It focuses on key metrics like YTD Sales, YTD Profit, YTD Profit Margin, and Quantity of Products sold, analyzing data by product categories, states, and regions.
data-analysis data-modelling dax-expression excel power-query powerbi visualization
Last synced: 07 Feb 2026
https://github.com/gurpreet17/uc-davis-sql-for-data-science-specialization
Completed the SQL Basics for Data Science Specialization from the University of California, Davis, gaining proficiency in Data Analysis, SQL, Apache Spark, and Delta Lake.
apache-spark bigdata data-analysis data-science delta-lake sqlite
Last synced: 15 Apr 2026
https://github.com/luminati-io/indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 07 Feb 2026
https://github.com/tralahm/parliament-2017-dataset
Concise, Clean data sets of the 2017 Kenyan General Election results for the Members of the Senate and National Assembly Composition
csv-parsing data-analysis data-visualization datasets election-data ipynb-jupyter-notebook kaggle-dataset kenya-constituencies kenya-counties matplotlib python3 tralahtek
Last synced: 31 Jan 2026
https://github.com/traore-07/fedex-sales-analysis
Analysis of the FedEx Sales Transaction
data-analysis data-visualization sales-analysis tabeau
Last synced: 31 Jan 2026