An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/wei-rongrong2/openfoodfactclustering

A project that explores clustering food products based on nutritional attributes using K-Means, Fuzzy C-Means, and DBSCAN algorithms, with a Streamlit dashboard for visualizing results.

clustering dashboard data-analysis dbscan food-products fuzzy-cmeans k-means machine-learning nutrition nutrition-clustering open-food-facts streamlit

Last synced: 28 Apr 2026

https://github.com/gaurav-van/optimizing-rate-of-penetration-in-geothermal-drilling-a-digital-twin-approach

Let’s explore something interesting together. In this project, we developed a machine learning digital twin using Intel-optimized XGBoost and daal4py to simulate and optimize the Rate of Penetration (ROP) in geothermal drilling. We leveraged SHAP for Explainable AI (XAI) to interpret model predictions.

data-analysis data-science digital-twin explainable-ai geothermal geothermal-energy jupyter-notebook machine-learning python shap xai xgboost

Last synced: 28 Apr 2026

https://github.com/kisaa-fatima/data-visualization-with-tableauleu

Conducted Exploratory Data Analysis (EDA) on the Berkeley Earth Dataset (large scale dataset), which features high-resolution land and ocean time series data. Created interactive dashboards using Tableau to effectively visualize and highlight trends and patterns within the data.

data-analysis data-science exploratory-data-analysis insights python tableau visualizations

Last synced: 29 Apr 2026

https://github.com/thanaraklee/pyspark-dataframe-operations

This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.

data-analysis dataframe pyspark python

Last synced: 29 Apr 2026

https://github.com/devexpress-examples/winforms-visualize-pivot-grid-data-in-chart

The following example shows how to integrate the Pivot Grid with the Chart control.

charting data-analysis dotnet pivot-grid-for-winforms winforms

Last synced: 29 Apr 2026

https://github.com/satyacoder29/crowdfunding-in-sql

Crowdfunding is a method of raising funds for projects or causes by collecting small contributions from a large group of people, usually through online platforms. It enables individuals, startups, and nonprofits to secure funding, offering rewards or recognition in exchange, and helps bring ideas to life without traditional financing.

data-analysis data-cleaning database-management mysql-database quries sql sql-functions sql-server views

Last synced: 29 Apr 2026

https://github.com/i7t5/sentimentnlp

Sentiment analysis for COMP 435 Introduction to Machine Learning, Spring 2025

data-analysis jupyter-notebook machine-learning nlp python sentiment-analysis

Last synced: 29 Apr 2026

https://github.com/khushi-sabarad/web_scraping

This project is a Python-based web scraper that extracts the menu from a cafe and saves it to an Excel file. It was created to automate the process of retrieving and updating menu prices, a task that was observed to be done manually at the hostel.

beautifulsoup data-analysis data-visualization market-analysis pandas python requests web-scraping wordcloud

Last synced: 29 Apr 2026

https://github.com/hardikk-7/election-analysis-project

A data analytics project exploring the 2024 Indian General Election results using Python. Includes party-wise, state-wise, and vote share analysis with visualizations.

data-analysis data-science election-analysis jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/srinibas-masanta/yelp-business-reviews-analysis

This project analyzes Yelp business reviews using Python, Snowflake, and SQL, focusing on efficient data ingestion, transformation, and analysis. We preprocess JSON data, optimize ingestion via Amazon S3, classify sentiments with Python UDFs, and extract insights using SQL queries—showcasing a streamlined end-to-end workflow.

amazon-s3 data-analysis json python snowflake sql

Last synced: 29 Apr 2026

https://github.com/sdley/cas_pratique-del_annuel

Del-Annuel est logiciel de deliberation annuelle des ecoles superieures ou universités

data-analysis pandas python tkinter-gui

Last synced: 29 Apr 2026

https://github.com/theoplayz2/eda-explorer

Инструмент на Python для разведочного анализа данных (EDA) и визуализации, поддерживающий загрузку данных CSV и JSON, с модульной архитектурой ООП. Практическая работа по теме: "Обнаружение и визуализация данных для понимания их сущности" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".

analysis battery-life cqrs csharp data-analysis eeg-analysis exploratorydataanalysis json-visualization matplotlib messaging profile-report python verilog visualization

Last synced: 29 Apr 2026

https://github.com/farhad-here/student_performance_analyzer

Student Performance Analyzer with python, it is on of my data analysis course project. I teach you about filter(),lambda,map() in python

data-analysis data-visualization filter kaggle kaggle-dataset lambda map pandas python python-tutorial streamlit

Last synced: 29 Apr 2026

https://github.com/aishwaryagade02/loan-funnel-optimization-analysis

Tracks how loan applications move through each stage, helps spot where people drop off, and gives clear insights to improve approval strategies and overall performance.

ab-testing data-analysis data-creation hypothesis-testing python reporting sql statistical-methods streamlit

Last synced: 30 Apr 2026

https://github.com/celineboutinon/laplace-immo

ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Projet 3

data-analysis data-analytics data-structures database-design database-schema databases mysql-connector-python mysql-workbench python sql

Last synced: 30 Apr 2026

https://github.com/fbarffmann/credit-risk-classification

Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.

classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn

Last synced: 30 Apr 2026

https://github.com/devexpress-examples/wpf-pivot-grid-provide-custom-summary-values

This example demonstrates how to determine the value type when you calculate custom summary values in Pivot Grid for WPF.

data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf

Last synced: 01 May 2026

https://github.com/devag2004/electricity-analysis-using-spark

electricity analysis project made using spark

data-analysis spark spark-mllib

Last synced: 01 May 2026

https://github.com/bpkaur/a-network-analysis-of-game-of-thrones

A Network analysis of Game of Thrones: To analyze the co-occurrence network of the characters in the Game of Thrones books

data-analysis data-science machine-learning networkx python3

Last synced: 01 May 2026

https://github.com/sairupeshl/leo-orbital-congestion-analysis

Geospatial data analysis of the UCS Satellite Database using Python to map active LEO space assets, validate orbital parameters, and isolate mega-constellation traffic bottlenecks.

aerospace-engineering data-analysis geospatial-analysis orbital-mechanics pandas python satellite-data seaborn

Last synced: 08 Jun 2026

https://github.com/pablo1785/receipt-rs

Receipt processing backend built with Shuttle.rs, Axum and Azure Form Recognizer API

api-rest axum azure backend cognitive-services computer-vision data-analysis rust shuttle-rs sqlx

Last synced: 01 May 2026

https://github.com/fbarffmann/project1

Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.

box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026