Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/edjoukou/altip-sales-analysis
Sales data analysis.
data-analysis mysql-database sql tableau visualization
Last synced: 20 Jul 2025
https://github.com/rdrahul123/my_python-codes
Python programming code and notebooks
anaconda data-analysis data-science jupyter-notebook python python3 visual-studio
Last synced: 17 May 2026
https://github.com/kunalkumar2001/sales-project-using-excel-and-sql
Comprehensive sales analysis using SQL, Excel, and PowerPoint to uncover insights on top-sellers, peak times, and branch performance.
data-analysis data-analytics excel mssql sql
Last synced: 03 Nov 2025
https://github.com/eslamdyab21/a-b-test-to-an-e-commerce-website
A/B test for an e-commerce website
csv data-analysis data-science hypothesis-testing pandas python udacity-data-analyst-nanodegree
Last synced: 17 May 2026
https://github.com/imnotamr/ai
A collection of machine learning and AI projects implemented in Jupyter notebooks, covering regression, classification, and neural networks
ai classification colab-notebook data-analysis data-preprocessing data-preprocessing-and-cleaning data-visualization deep-learning deep-neural-networks jupyter-notebook machine-learning model-evaluation predictive-modeling project-based-learning python supervised-learning supervised-learning-algorithms supervised-learning-classifiers unsupervised-learning unsupervised-learning-algorithms
Last synced: 17 May 2026
https://github.com/borjamome/visualization_supermarkets_with_r
Visualization using R and OpenStreetMaps
data-analysis datavisualization openstreetmap r
Last synced: 02 Jan 2026
https://github.com/myke003/data-analysis-projects
This repository serves as a collection of all my projects.
data-analysis jupyter-notebook powerbi
Last synced: 14 Mar 2025
https://github.com/gemaquejr/restaurant-orders
Project aimed at applying OOP concepts and working with Set, Hashmap, and Dict. This project was created for the final assessment of section 06 of the computer science module of the Web Development Course at Trybe.
data-analysis dict hashmap poo python set
Last synced: 30 Oct 2025
https://github.com/collins-kimotho/communicate-data-findings
Data Analysis Project: Investigating Factors Contributing to No-Show Appointments in Medical Records
data-analysis data-science data-visualization dataset pandas python
Last synced: 17 May 2026
https://github.com/cano1998/data-visualization-project
A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.
bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot
Last synced: 17 Jul 2025
https://github.com/finnishcancerregistry/directadjusting
Compute estimates of weighted averages with confidence intervals.
biostatistics confidence-intervals data-analysis direct-adjusting direct-adjustment epidemiology health-statistics r r-package statistical-adjusting statistical-adjustment weighted-analysis weighted-average weighted-averages
Last synced: 26 Mar 2025
https://github.com/brownred/python-and-sql
Python and SQL (PostgreSQL & MySQL) for data analysis.
data-analysis databases python3 sql
Last synced: 11 May 2026
https://github.com/ishmal793/basic-python-
Beginner-friendly Python code examples and exercises – a strong foundation for aspiring data analysts.
data-analysis data-analytics learning-python-code problem-solving python-basics python-for-beginners
Last synced: 23 Jul 2025
https://github.com/shrutiii1109/diwali-sales-analysis-through-python
Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.
data-analysis jupyer-notebook matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/sahilmaurya28/youtube-data-analysis
YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.
analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube
Last synced: 13 Apr 2026
https://github.com/edumoraes1/journey_active_users
SQL-based segmentation of the user base for the active sellers journey
bq data-analysis salesforce sql
Last synced: 02 Feb 2026
https://github.com/sreekar0101/-movie-recommendation-system-using-python
The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits. It utilizes the Python libraries NumPy, pandas, and scikit-learn. The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice.
data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python
Last synced: 02 Jan 2026
https://github.com/nitins17/tableauvisualizations
Visualizations I created while learning to work with Tableau
data-analysis data-science data-visualization tableau visualization
Last synced: 01 Mar 2026
https://github.com/djccnt15/mathematics
data-analysis data-science linear-algebra python statistics
Last synced: 24 Jun 2025
https://github.com/dionixius7/titanic-disaster-ml-model
This project predicts the survival of passengers on the Titanic using the Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Several machine learning algorithms have been applied to build a predictive model that estimates each passenger's survival chances.
data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction
Last synced: 07 Feb 2026
https://github.com/mamtapanda088/dataanalaysis-warmup-
Tasks: Create a DataFrame: Convert the dictionary into a pandas DataFrame. Top and Bottom Rows: Display the top 3 and bottom 3 rows of the DataFrame. Summary Statistics: Generate summary statistics for the dataset. Gender Count: Count the occurrences of each gender. Marks Analysis: Calculate the average, max, and min marks. Tools Used: Python, pandas.
data-analysis data-science jupyter-notebook visualization
Last synced: 04 Apr 2025
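The warm-up tasks listed in this entry can be sketched with pandas roughly as follows. This is a minimal illustration, not the repository's code: the dictionary contents and the column names (`name`, `gender`, `marks`) are hypothetical.

```python
import pandas as pd

# Hypothetical sample data; the repository's actual dictionary is not shown in the listing.
data = {
    "name": ["Asha", "Ben", "Chitra", "Dev", "Esha", "Farid"],
    "gender": ["F", "M", "F", "M", "F", "M"],
    "marks": [82, 67, 91, 74, 58, 88],
}

df = pd.DataFrame(data)                       # create a DataFrame from the dictionary
top_rows = df.head(3)                         # top 3 rows
bottom_rows = df.tail(3)                      # bottom 3 rows
summary = df.describe()                       # summary statistics for numeric columns
gender_counts = df["gender"].value_counts()   # occurrences of each gender
marks_stats = df["marks"].agg(["mean", "max", "min"])  # average, max, and min marks

print(top_rows, bottom_rows, summary, gender_counts, marks_stats, sep="\n\n")
```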
https://github.com/lucashomuniz/project-01
SALES DATA ANALYSIS WITH POWERBI AND PYTHON
business-analytics business-intelligence data-analysis data-science data-visualization excel powerbi python toy-project
Last synced: 30 Mar 2025
https://github.com/lucashomuniz/project-05
[STATISTICAL ANALYSIS] Integrating Automation and Visualization for Optimal Data Analytics
automation data-analysis kolmogorov-smirnov language-r nonparametric-analysis parametric-analysis shapiro-wilk shiny-apps statistics t-test wilcoxon-test
Last synced: 30 Mar 2025
https://github.com/lucashomuniz/project-02
TRACKING USER ACCEPTANCE TESTING WITH POWERBI AND R
data-analysis data-visualization dax-languague powerbi-report powerbi-visuals powerquery r-language shiny-apps visuals
Last synced: 30 Mar 2025
https://github.com/davydantoniuk/stackoverflow-graph-analyse-r
data-analysis graph r stackoverflow
Last synced: 13 Mar 2025
https://github.com/achronus/data-exploration
A repository dedicated to interesting data exploration projects I've completed
data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn
Last synced: 02 Jan 2026
https://github.com/surajsanap/employee-resigning-analysis-powerbi-dashboard-data-analytics
Effortlessly analyze employee resignations with our concise Power BI dashboard. Download the XML file, open the dashboard, and gain quick insights into resignation trends and reasons for departure. Streamlined and effective
dashboard data-analysis data-analytics powerbi python xml-dataset
Last synced: 08 May 2025
https://github.com/nemat-al/aviation-accidents
Aviation Accidents Analysis
aviation data-analysis data-science data-visualization plotly python
Last synced: 17 May 2026
https://github.com/hemangsharma/bookingdataanalysisreport
The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.
analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard
Last synced: 14 May 2026
https://github.com/saymyname1337/bachelor-s-thesis
Bachelor's thesis by MPEI student Shevts G. V.
Last synced: 23 Jul 2025
https://github.com/arv-anshul/pw-experience-portal
Data Analysis on PW Skills and Ineuron.ai experience/internship portal.
data-analysis experience ineuron-ai internship physics-wallah portal pw-skills python3
Last synced: 16 Apr 2026
https://github.com/sdley/logiciel-de-deliberation-uam-2022
Del-Annuel is annual deliberation software for higher-education schools or universities
data-analysis pandas python tkinter-gui
Last synced: 08 May 2026
https://github.com/thbaylson/datascience
All of my past data science assignments put into one singular notebook. Most of this comes from my Machine Learning course.
data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn
Last synced: 09 May 2026
https://github.com/prakashjha1/new-analysis-using-llm-locally
An interactive news analysis tool built with Streamlit and local LLMs. This app allows users to analyze and gain insights from the latest news articles using advanced language models, all running locally. Explore trends, sentiment, and key topics with an intuitive interface.
artificial-intelligence data-analysis data-science llms ollama python streamlit
Last synced: 14 Mar 2025
https://github.com/maazie-khan/austin-housing-insights-powerbi
Working with a real estate dataset, we build a tool to evaluate trends and drivers of house prices around Austin, Texas.
dashboard data-analysis data-science data-visualization database powerbi
Last synced: 02 Jan 2026
https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny
🌎Datamart Analysis with Machine Learning
data-analysis data-science dataviz machine-learning ml r retail-analysis rstudio shiny
Last synced: 06 Apr 2025
https://github.com/admacpherson/admacpherson.github.io
This repository hosts my personal website & portfolio. You can find my work experience, endorsements, contact information, and more on it at andrewmacpherson.dev
data-analysis personal-site portfolio website
Last synced: 15 Sep 2025
https://github.com/rudra-g-23/power-bi-custom-visual
A custom Power BI visual that displays customizable, interactive charts with advanced capabilities.
custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization
Last synced: 02 Jan 2026
https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset
Engineering, culture, and blog data gathered from Zomato's blogs for educational and learning purposes, to spread the better problem-solving methodologies adopted by modern unicorns
data-analysis dataset regex selenium webdriver zomato-data-analysis
Last synced: 06 Apr 2025
https://github.com/sangampaudel530/bhutan-rainfall-explorer
Interactive dashboard to explore, analyze, and forecast rainfall trends in Bhutan (2021–2025) using Streamlit, Plotly, and Prophet.
bhutan climate-change data-analysis prophet-facebook rainfall-prediction streamlit visualization
Last synced: 17 May 2026
https://github.com/tushar2704/employee-distribution
This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.
data-analysis data-visualization excel postgresql powerbi sql tushar2704
Last synced: 04 Nov 2025
https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas
SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.
data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase
Last synced: 18 May 2026
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/maazie-khan/power-bi-projects
Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.
dashboard data-analysis data-science data-visualization database excel powerbi
Last synced: 02 Jan 2026
https://github.com/sramalhao/sleep_health_analysis
This repository contains a comprehensive project focused on analyzing various factors influencing sleep health, such as BMI, occupation, gender, age, physical activity, and stress levels.
analytics data-analysis eda matplotlib pandas python seaborn sklearn visualization
Last synced: 13 Apr 2026
https://github.com/wesleych3n/my-work-log
A personal project to record and analyze work check-in/check-out times in Google Sheets with a Telegram bot.
data-analysis telegram-bot worklog
Last synced: 20 Jul 2025
https://github.com/mikeesto/ausvotes19
:bird: A collection of 67,284 public tweets published on the night of the 2019 Australian election
australia data-analysis data-visualization elections open-data twitter
Last synced: 06 Apr 2025
https://github.com/lfariello/atmospheric_reentry
Matlab code for the determination of the reentry trajectory, deceleration profiles, and heat flux of the ARD capsule during orbital reentry into Earth's atmosphere.
data-analysis heat-flux-prediction heat-transfer hypersonic hypersonic-capsule matlab-programming trajectory-prediction
Last synced: 23 Mar 2025
https://github.com/deliprofesor/2024-salary-analysis-for-machine-learning-engineers
This project analyzes a salary dataset to explore factors like experience, company size, remote work ratio, and country. It includes data cleaning, group analysis, visualizations, and machine learning models (linear regression and Random Forest) to predict salaries and identify key features.
data-analysis data-cleaning data-visualization ggplot2 linear-regression machine-learning plotly r-programming random-forest salary-prediction salary-trends
Last synced: 07 Mar 2026
https://github.com/aelmah/ibm-applied-ds
A collection of projects I completed throughout the Applied DS Specialization.
applied-data-science-capstone beautifulsoup data-analysis data-visualization machine-learning python-for-ai-and-data-science web-scraping
Last synced: 11 Sep 2025
https://github.com/fatihilhan42/wnba-draft-player-dataanalysis-1997-2022-with-python
In this project, the statistics of the players in the WNBA drafts from 1997 to 2022 were examined. The data in the dataset, which you can find in the repo, was first organized using data cleaning algorithms. These cleaned data were then graphically extracted using data visualization algorithms.
data-analysis data-analysis-python data-visualization jupyter-notebook python
Last synced: 17 May 2026
https://github.com/aliiahmadi/data-manipulation
Algorithms and solutions
algorithm algorithms data-analysis data-science
Last synced: 30 Jun 2025
https://github.com/shreshthvashisht/abc-call-volume-trend-analysis
Customer Experience Analysis
advanced-excel call-centre-analysis call-volume-trend data-analysis data-visualisation experience-analytics pivot-tables
Last synced: 01 Mar 2026
https://github.com/aditiagrawal04/netflix-insights-mysql-
SQL-based analytical project exploring Netflix’s dataset to extract insights about content type, genre, ratings, country-based distributions, and release trends. Ideal for understanding business intelligence using SQL.
business-intelligence data-analysis data-exploration mysql netflix sql sql-project
Last synced: 28 Jun 2025
https://github.com/dhruvil-26/sql-projects
This repository contains SQL projects focusing on data analysis and insights. Currently, it includes: 1. RSVP Movies Analysis - SQL queries to analyze movie trends, ratings, and genres. 2. Pizza Sales Analysis - SQL queries to explore sales patterns, customer behavior, and profitability in a pizza business.
analysis data-analysis database mysql pizza-sales-analysis rdbms rsvp sql
Last synced: 17 May 2026
https://github.com/waheed24-03/-ipl-stats-compare
Comparing Stats of Cricketers in IPL
cricket dashboard data-analysis data-visualization ipl python sports-analytics streamlit
Last synced: 28 Jun 2025
https://github.com/leoz0214/foodhygieneanalysis
Data analysis regarding Food Hygiene Ratings in England, Wales and Northern Ireland.
data-analysis food-hygiene-ratings pandas python
Last synced: 17 May 2026
https://github.com/iamber12/stack-overflow-analysis-using-stack-exchange-api
This Python-based project utilizes the Stack Exchange API to analyze StackOverflow data, focusing on the 'R' and 'Dot Net' programming tags.
data-analysis data-visualization python stack-exchange-api
Last synced: 20 Jul 2025
https://github.com/sunnyrao07/data-analysis-dashboard-in-excel
I implemented a comprehensive data analysis solution using Excel, developing multiple dashboards and tables to visualize and interpret the data. This involved a rigorous data cleaning and preprocessing pipeline followed by data visualization.
dashboard data-analysis excel visualization
Last synced: 03 Feb 2026
https://github.com/al-ghaly/hotel-revenue-excel-analysis
Excel Dashboard to analyze data of a hotel over the past three years.
dashboard data-analysis data-visualization excel excel-analysis
Last synced: 02 Jan 2026
https://github.com/dsite42/simple_data_visualizer
This is a simple tool to visualize data for a quick Exploratory Data Analysis (EDA). You can create various plot types as seaborn or plotly plot via a GUI in multiple windows (RelPlot, PairPlot, JointPlot, DisPlot, CatPlot, LmPlot, 3DPlot).
data-analysis data-science data-visualisation data-visualization eda exploratory-data-analysis plotly seaborn
Last synced: 12 May 2026
https://github.com/poglolopez/nesarc_research
Analyzing the relationship between Social Anxiety Disorder (SAD) and family history of behavioral problems using NESARC data. Includes statistical hypothesis testing (ANOVA, Chi-Square, Pearson Correlation, Moderation Analysis). Developed as part of the Data Analysis and Interpretation Specialization from Wesleyan University (Coursera).
anova chi-square coursera-assignment data-analysis hypothesis-testing mental-health moderation-analysis nesarc pandas pearson-correlation python social-anxiety statistical-analysis
Last synced: 14 Apr 2026
https://github.com/tolumie/rfm-marketing-analysis
This project focuses on RFM (Recency, Frequency, and Monetary) Analysis, a powerful customer segmentation technique used in marketing and business analytics. The analysis helps businesses identify their most valuable customers, potential loyalists, at-risk customers, and churned users.
business-analytics customer-behavior-analysis customer-loyalty customer-retention customer-segmentation-analysis data-analysis data-driven-decisions ecommerce marketing-analytics python
Last synced: 18 May 2026
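The RFM idea described in this entry can be illustrated with a short standard-library sketch. It is not taken from the repository: the transaction tuples, the `snapshot` reference date, and the `rfm_table` helper are all hypothetical, and real projects usually go on to bin these raw values into scores and segments.

```python
from collections import defaultdict
from datetime import date

# Hypothetical transactions: (customer_id, purchase_date, amount).
transactions = [
    ("c1", date(2024, 1, 5), 40.0),
    ("c1", date(2024, 3, 1), 60.0),
    ("c2", date(2023, 11, 20), 15.0),
    ("c3", date(2024, 2, 14), 200.0),
    ("c3", date(2024, 2, 28), 50.0),
    ("c3", date(2024, 3, 10), 75.0),
]
snapshot = date(2024, 3, 31)  # assumed reference date for measuring recency


def rfm_table(rows, as_of):
    """Return {customer: (recency_days, frequency, monetary)}."""
    last_seen = {}
    frequency = defaultdict(int)
    monetary = defaultdict(float)
    for customer, day, amount in rows:
        # Recency is based on the most recent purchase per customer.
        last_seen[customer] = max(last_seen.get(customer, day), day)
        frequency[customer] += 1      # number of purchases
        monetary[customer] += amount  # total spend
    return {
        c: ((as_of - last_seen[c]).days, frequency[c], monetary[c])
        for c in last_seen
    }


rfm = rfm_table(transactions, snapshot)
for customer, (recency, freq, spend) in sorted(rfm.items()):
    print(customer, recency, freq, spend)
```

A low recency with high frequency and spend (like the hypothetical `c3`) would typically mark a valuable customer, while a high recency with a single small purchase (like `c2`) would suggest an at-risk or churned one.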
https://github.com/sharoonjoseph321/social_media_eda
Data analysis of social media apps using pandas, Python, and Matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/preciousclement/maternal-experiences-in-nigeria
This repository contains a Python-based project that generates realistic synthetic data simulating the maternal health journey of 5,000 women in Nigeria.
data-analysis data-generation maternal-health nigeria public-health python
Last synced: 08 May 2025
https://github.com/fazej99/u.s-climate-and-temperature-analysis
This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.
data-analysis data-science data-visualization gis machine-learning streamlit
Last synced: 22 May 2026
https://github.com/mh0386/motorcycle_data_analysis
Data analysis applied to motorcycle dataset.
Last synced: 19 Jul 2025
https://github.com/hemangsharma/job-tracker
A comprehensive Streamlit application for tracking and analyzing job applications.
data-analysis python streamlit-dashboard streamlit-webapp
Last synced: 15 Mar 2025
https://github.com/alexjackson1/commons-indicative-votes
A cluster analysis of the House of Commons' Indicative Brexit Voting Process on 27 March 2019
Last synced: 19 Jul 2025
https://github.com/bhushan148/finance-domain-bank-loan-report-tableau
I analyzed 🏦 bank loan data to reveal trends, KPIs, and insights. Using Tableau 📈 for dashboards and SQL 🗃️ for data extraction, I visualized loan applications, borrower profiles, and repayment behaviors 💡.
bussiness-intelligence dashboard-design data-analysis data-visualization excel figma sql sqlqueries tableau
Last synced: 08 Apr 2025
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/merrill007/sql-data-warehouse-project
The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.
architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver
Last synced: 22 Mar 2025
https://github.com/kaushik-puttaswamy/airline-passenger-referral-prediction-using-machine-learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 22 Mar 2025
https://github.com/bala-1409/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
correlation data-analysis data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning juypter-notebook machine-learning machine-learning-algorithms numpy pandas predictive-modeling python3 scikitlearn-machine-learning supervised-learning
Last synced: 08 Apr 2026
https://github.com/manel15279/datamining-project
A university project that aims to explore various data mining techniques like Data Exploration, Association Rule Mining, Supervised and Unsupervised Learning, applied to real-world datasets, focusing on soil fertility analysis and COVID-19 cases evolution over time.
covid-19 data-analysis data-mining data-visualization datascience gradio machine-learning python soil-properties
Last synced: 10 Jun 2025
https://github.com/macnianios/salifort-motors_retention
Google Advanced Data Analytics Capstone: Analyzing customer retention at Salifort Motors.
data-analysis machine-learning pandas python seaborn sklearn
Last synced: 08 Apr 2026
https://github.com/l337x911/simulations
data analysis via in silico simulations
data-analysis machine-learning python3
Last synced: 06 Apr 2025
https://github.com/valna/mercado-play
Mercado Play (streaming service of @mercadolibre) redesign with Astro.
astro data-analysis data-manipulation data-science data-visualization husky javascript justd mercado-play mit-license prettier react rocketseat typescript
Last synced: 08 Apr 2026
https://github.com/abhinavhariyal/diwali-sales-analysis
This project covers data visualization and analysis of Diwali sales data using Python and Jupyter Notebook.
data-analysis data-visualization jupyter python
Last synced: 19 May 2026
https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm
The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.
algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script
Last synced: 17 May 2026
https://github.com/theveryhim/massive-text-processing
Cleaning, processing, and analysis of a papers dataset using the PySpark (RDD) framework
big-data data-analysis frequent-itemsets massive-datasets pyspark text-preprocessing
Last synced: 18 Jul 2025
https://github.com/jofaval/daily-california-births
Data Analysis of the Daily AFAB (Assigned Female At Birth) Births in California, 1959
california data-analysis data-science data-visualization deep-learning google-colab machine-learning python tensorflow timeseries timeseries-analysis
Last synced: 28 Jun 2025
https://github.com/theveryhim/web-scraping-and-statistical-tests
Crawling the web for data and performing statistical tests to verify judgments
data-analysis hypothesis-testing web-scraping
Last synced: 18 Jul 2025
https://github.com/b-varun-reddy/fairwai-bias-detection
Submission for the FairwAI Hospitality Intern Challenge. This project analyzes bias signals in Yelp hospitality reviews using open-source data, Python, and fairness-focused keyword detection.
bias-detection data-analysis ethical-ai fairness hospitality machine-learning natural-language-processing python social-impact yelp-dataset
Last synced: 19 Apr 2025
https://github.com/dimits-ts/college_analysis
A statistical study about US college admissions, featuring a full report in LaTeX.
anova data-analysis exploratory-data-analysis linear-regression statistics
Last synced: 25 Jan 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Python 3 course from basic to advanced, with real projects
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 22 May 2026
https://github.com/onome-joseph/flexisaf
Generative AI & Data Science
data-analysis data-science machine-learning
Last synced: 16 Sep 2025
https://github.com/atharvapathak/rsvp_movies_case_study
SQL queries performed on IMDb database to provide recommendations to RSVP Movies based on insights.
data-analysis data-cleaning data-science imdb-dataset rsvp-movies sql
Last synced: 28 Jan 2026
https://github.com/tushar2704/sql-query
Repository is designed to help you strengthen your SQL query skills by providing a collection of common and interview-based SQL queries for practice.
artificial-intelligence data-analysis data-engineering data-science database database-management database-schema relational-databases sql sql-database sql-query tushar2704
Last synced: 04 Nov 2025
https://github.com/cyblx/clustering
This project explores clustering techniques and supervised learning applied to World Cup team performance analysis. The methodologies include K-Means, DBSCAN, K-Nearest Neighbors, Gaussian Mixture Models (GMM), and Agglomerative Clustering.
clustering data-analysis dbscan gmm kmeans supervised-learning unsupervised-learning world-cup
Last synced: 18 Jul 2025
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/shreshthvashisht/xyz-ads-airing-report_analysis
Ad data analysis using Advanced Excel
ad-airing-analysis advanced-excel data-analysis data-visualization pivot-tables
Last synced: 18 Feb 2026
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 08 Mar 2026
https://github.com/mmfava/significados-aulas-biologia-quasiexp-2019
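Repository of the analyses performed for the paper "Construção de significados em aulas práticas de laboratório de biologia: uma avaliação por delineamento quase-experimental" ("Meaning-making in practical biology laboratory classes: a quasi-experimental design evaluation").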
Last synced: 28 Jun 2025
https://github.com/nikbarb810/covid_growth_rate_390.51
Exploring Covid Growth Rate of European Population using genetic data analysis
bioinformatics data-analysis r rcpp
Last synced: 07 Apr 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed using SQL to obtain a proper dataset with valid information.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
This project analyzes the sales and expenses data of a famous fast-food restaurant. It focuses on gaining insights that will boost future sales and on business strategies to improve profit margins. Tools used: SQL, Python, Power BI, and MS Office tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 06 May 2026
https://github.com/aleskandro/r-hadoop-madreduce-examples
Many examples of using R with Hadoop for MapReduce, with and without libraries such as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/gabrieladados/analise-ecommerce
SQL Analysis for E-commerce: Growth Strategies to Boost Sales
bigquery data-analysis ecommerce sql
Last synced: 31 Mar 2025