An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/guruakaashjn/te_project_microsoft_ai

AI based statistical analysis of land-use plastic pollution in India using AI/ML techniques.

artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi

Last synced: 27 Feb 2025

https://github.com/viper373/163-buff

爬取网易BUFF平台CS:GO武器皮肤交易数据

163 arima crawler-python csgo data-analysis prediction python

Last synced: 24 Oct 2025

https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel

This script takes a single audio file and collect all the sound features in it including (Frequency - mean - variance - minimum value - maximum value - Median - Kurtosis - Skewness) and save it in an array and import this array in a row in the datasheet.

automation data-analysis data-science dataset-generation excel-import signal-processing

Last synced: 18 Apr 2026

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 10 Apr 2026

https://github.com/tunjis/global-superstore_dashboard_tableau

Tableau dashboard with 4 different types of visualisations

charts dashboard data-analysis data-visualisation excel tableau

Last synced: 23 Jan 2026

https://github.com/ankit21111/filmilytics

This repository contains data and analysis on RSVP Movie House Production, focusing on past performance metrics and audience trends. Our goal is to derive actionable insights that can guide future productions for greater success. Explore the data, analysis scripts, and recommendations to understand how RSVP can thrive in the film industry.

data-analysis database database-design database-schema erdiagram sql

Last synced: 13 Jun 2025

https://github.com/chen0040/pyspark-advanced-algorithms

Samples of Advanced Algorithms and Data Analysis implemented in pyspark

advanced-algorithms data-analysis map-reduce pyspark

Last synced: 12 Jan 2026

https://github.com/realorangeone/docker-cyberchef

A containerized deployment of CyberChef, with additional protections

cyberchef data-analysis data-manipulation docker encoding

Last synced: 24 Aug 2025

https://github.com/RickContreras/StudentPerformancePredictionSaberPro

Modelo de clasificación para predecir el desempeño de estudiantes en las Pruebas Saber Pro en Colombia. Incluye análisis exploratorio de datos, preprocesamiento y modelos de machine learning.

classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance

Last synced: 24 Oct 2025

https://github.com/as16082023/atliq-hospitality-analysis

This project presents an overview of AtliQ Grands' performance in the hospitality industry using Power BI.

atliqgrand codebasicsresumeprojectchallenge data-analysis data-visualization powerbi revenueinsights

Last synced: 23 Jan 2026

https://github.com/karthikmprakash/911-call-dataanalysis

Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA

911-call-analysis data-analysis data-visualization python3 united-states-data

Last synced: 10 May 2026

https://github.com/santiagohermo/data.tools

Convenience functions for data manipulation

data-analysis data-cleaning data-science data-wrangling

Last synced: 24 Oct 2025

https://github.com/ondrejhruby/countries-of-the-world

Explore global data with this repository, featuring insights, visualizations, and Python code examples on countries worldwide—perfect for enhancing your data analysis and visualization skills.

data-analysis data-science data-visualization geography jupyter-notebook machine-learning matplotlib pandas python statistics

Last synced: 16 Apr 2026

https://github.com/dcs-training/null-hypothesis-testing-with-r

This two-class course will focus on developing theoretical and practical skills for null hypothesis testing in R. Go to the readme file

data-analysis data-wrangling r statistics

Last synced: 24 Oct 2025

https://github.com/jakubkorytko/data-graphs

Transform raw data into captivating visual stories with this app, effortlessly craft stunning data charts that unveil insights and trends

charts data-analysis mit-license open-source

Last synced: 14 May 2026

https://github.com/ayenpure/stockmeup

This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.

data-analysis data-mining data-science java mapreduce mapreduce-java

Last synced: 24 Oct 2025

https://github.com/allanotieno254/british-airways-review-dashboard-using-tableau

An interactive dashboard built using Tableau, visualizing British Airways customer review data. This project analyzes customer sentiment, satisfaction levels, and other key metrics to provide insights into passenger experiences and identify improvement areas for British Airways.

customer-satisfaction-analysis data-analysis data-visualisation tableau-dashboards tableau-public

Last synced: 19 Mar 2026

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/alphonsg/bimana

Package for performing automated bio-image analysis tasks.

bioimage-analysis bioinformatics data-analysis deep-learning edge-detection image-analysis image-processing

Last synced: 25 Oct 2025

https://github.com/jabhij/fbi_nics-firearm-background-checks

This project is a try to showcase the use of guns across the US.

data-analysis data-analytics data-science data-visualization tableau

Last synced: 23 Feb 2026

https://github.com/gholamrezadar/favourite-youtube-channels

this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!

data-analysis data-visualization python

Last synced: 16 Jan 2026

https://github.com/mkk-1817/adhd-prediction

This project focuses on leveraging machine learning techniques to predict Attention-Deficit/Hyperactivity Disorder (ADHD) in children. Accurate and early diagnosis is crucial for effective intervention and support.

adhd data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms prediction python

Last synced: 09 May 2026

https://github.com/luminati-io/Airbnb-dataset-samples

A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.

airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping

Last synced: 09 Apr 2025

https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2

Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.

dashboard data data-analysis dax-measures excel powerbi powerbidashboard

Last synced: 23 Jan 2026

https://github.com/antonijn/polyfit

Fits a polygon to a given data input

c data-analysis linear-algebra toy

Last synced: 16 Jul 2025

https://github.com/luabagg/worldwide-trends

Worldwide Google Trends visualization and classification

data-analysis data-visualization google-trends trends

Last synced: 03 Feb 2026

https://github.com/shubham5027/kisanai--the-ultimate-ai-ml-powered-platform-smart-farming-platform

KisanAI – The Ultimate AI/ML-Powered Smart Farming Platform KisanAI leverages AI/ML to optimize farming practices, enhance crop yields, and empower small-scale farmers with data-driven insights.

ai api aws chatbot crm data-analysis deep-learning deplyment farming llm mapping ml nodejs predictive-modeling reactjs supabase sustainability

Last synced: 30 May 2026

https://github.com/olob0/badwords-pt-br

💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"

badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist

Last synced: 06 Jan 2026

https://github.com/carusel02/sequential-data-processing-and-analysis

Sequential data processing and analysis using linked-list in C

data-analysis data-processing linked-list

Last synced: 26 Oct 2025

https://github.com/erickkhosasi/thelook-data_analysis

Final project for my SQL mini bootcamp. This project explores an e-commerce dataset to uncover key business insights. Data insights were queried in Google BigQuery and visualized with Google Sheets.

bigquery data-analysis e-commerce sql

Last synced: 05 Oct 2025

https://github.com/sunnybibyan/call_centre_power_bi_dashboard

Create a dashboard in Power BI to visualize relevant KPIs and metrics that will help the call center manager understand trends.

call-centre-analysis dashboard data-analysis data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/abdul-wahab-318/pakistani-news-sentiment-analysis

This project involves performing sentiment analysis on Pakistani news articles collected over the past month (August-September 2024). The primary goal is to understand media sentiments regarding various topics and events covered in the news. A total of 800+ articles were scraped from multiple news sources.

data-analysis machine-learning pakistan pakistani-politics sentiment-analysis

Last synced: 26 Oct 2025

https://github.com/saeun-park/lg-aimers-4th

MQL 데이터 기반 B2B 영업기회 창출 예측 모델 개발

b2b data-analysis data-science machine-learning mql

Last synced: 08 Apr 2025

https://github.com/kyleprotho/analysistoolbox

Analysis Tool Box (i.e. "analysistoolbox") is a collection of tools in Python for data collection and processing, statisitics, analytics, and intelligence analysis.

analytics data-analysis open-source-intelligence python3 r research snippets statistics

Last synced: 22 Aug 2025

https://github.com/komailmk/instagram-reach-forecasting

This repository provides a Python-based solution for forecasting Instagram reach using historical data and SARIMA modeling techniques.

data-analysis data-visualizations machine-learning

Last synced: 05 Oct 2025

https://github.com/faisal-khann/diwali-sales-analysis

The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future

csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/john-science/data_science_by_example

Examples of Data Science Tools & Libraries

data-analysis data-science ipython pandas

Last synced: 12 May 2025

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 21 Jan 2026

https://github.com/ivanildobarauna-dev/currency-quote

Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 27 Oct 2025

https://github.com/titanscouting/tra-analysis

Titan Robotics 2022 Strategy Team Analysis Repository

data-analysis frc frc-scouting hacktoberfest python

Last synced: 29 Jan 2026

https://github.com/amstuta/cpp-neural-network

Simple implementation of a feedforward neural network in c++

data-analysis deep-learning machine-learning neural-network

Last synced: 08 Apr 2025

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/cintia0528/data_cleaning_and_analytics-python

Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.

colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn

Last synced: 08 Jan 2026

https://github.com/allanotieno254/codsoft

This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.

classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression

Last synced: 15 May 2025

https://github.com/priboy313/pandasflow

A set of custom python modules for friendly workflow on pandas

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 20 Jan 2026

https://github.com/rayyan9477/household-transactions-analysis-and-clustering

This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.

customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning

Last synced: 27 Feb 2025

https://github.com/parmeetbhamrah/air-quality-india-analysis

Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.

air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/jpotter80/notebook-examples

This repository demonstrates a systematic approach to cleaning and standardizing e-commerce product data using DuckDB. The notebook serves as a detailed walkthrough of our data cleaning methodology, showcasing how we handle common data quality challenges in e-commerce datasets.

data-analysis data-cleaning jupyter-notebook

Last synced: 12 Jun 2025

https://github.com/solrikk/pictrace-web

PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image

automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api

Last synced: 12 Apr 2026

https://github.com/foggy-projects/foggy-data-mcp-bridge

MCP Data Bridge for Java. Enabling safe Text-to-Query via a semantic layer, making enterprise data accessible to AI Agents.

agent data-analysis java llm mcp semantic-layer spring-boot text-to-sql

Last synced: 16 Mar 2026

https://github.com/muhammadhilmyputrarisma/ab-test

Python code for A/B testing on Cookie Cats game data. This project analyzes the impact of moving the first gate from level 30 to level 40 on player retention and game rounds, helping to evaluate if delaying the gate improves player engagement and gameplay experience.

ab-testing cookie-cats data-analysis data-visualization game-analytics python statistics

Last synced: 18 May 2026

https://github.com/savinrazvan/heredity

An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.

ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python

Last synced: 27 Feb 2025

https://github.com/virajbhutada/google-stock-price-forecasting-lstm

Analyzing and predicting Google's stock prices through detailed data exploration and advanced LSTM models. This project involves data preprocessing, creating time-series sequences, constructing and training LSTM networks, and evaluating their performance to forecast future stock prices utilizing Python and Machine Learning libraries.

data-analysis data-science data-visualization future-prediction google-dataset google-stock-price-prediction google-stocks lstm-model lstm-neural-network machine-learning machine-learning-models matplotlib model-building model-training numpy python stock-forecasting

Last synced: 27 Feb 2025

https://github.com/victoriapm/analyze_a-b_test_results

Understand the results of an A/B test run by an e-commerce website.

ab-testing data-analysis ecommerce-website

Last synced: 06 Oct 2025

https://github.com/aryansharma5/data-visualization-and-thorough-analysis

comprehensive guide for data analysis and visualization

data-analysis data-visualization

Last synced: 18 Mar 2025

https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection

The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.

banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier

Last synced: 06 Oct 2025

https://github.com/an1mch1k-theone/project_2_hh_analyze

Анализ вакансий из HeadHunter

data-analysis data-analysis-project postgresql python sql

Last synced: 14 Apr 2026

https://github.com/emredurukn/data-analysis

Example notebooks for analyzing data

data-analysis data-visualization python

Last synced: 12 May 2026

https://github.com/neerajcodes888/whatsapp-chat-analyzer

A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱

analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud

Last synced: 12 Apr 2026

https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system

The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes product price prediction system using regression techniques.

beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application

Last synced: 18 Apr 2026

https://github.com/odeyiany2/flit-apprenticeship-data-science-projects

This repo contains all my projects for my FLiT Apprenticeship

data-analysis data-science data-visualization machine-learning sql

Last synced: 17 May 2026

https://github.com/ajimaulana123/e-commerce-data-analis

Analisis dataset e-commerce guna menjawab kebutuhan product mana yang paling laris dibeli customer

data-analysis

Last synced: 28 Apr 2026

https://github.com/silvano315/med-physics

This would be a repository about medical physics. It will based on 4 paths: medical data to analyse, SOTA programs for medical purposes, computer vision and eXplainability.

computer-vision data-analysis data-science explainable-ai medical-imaging medical-physics medical-tool

Last synced: 24 Mar 2025

https://github.com/bretsw/beds

Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists

analytics data-analysis data-analytics data-science

Last synced: 31 Mar 2025

https://github.com/abdelhakim-gh/machine-learning_data-analysis_project

recognizing handwritten numbers & comparing the Life Expectancy vs Fertility in 1960 & 2013 of regions

data-analysis jupyter-notebook machine-learning python r r-studio

Last synced: 12 Apr 2026

https://github.com/chelseammatta/nopd-cad-data-analysis

Analysis of 911 call data from New Orleans' 3rd & 4th police districts (2019-2022) using BigQuery

911-calls 911-data bigquery cad-data crime-analysis data-analysis emergency-response new-orleans public-safety sql

Last synced: 01 Jul 2025

https://github.com/m-faizan-mahmood/detailed-exploratory-data-analysis-eda-marketing-recomendations.

This project focuses on cleaning, preprocessing, and analyzing data using Pandas and NumPy. Key steps include handling missing values, removing outliers, feature engineering, and exploratory data analysis (EDA). Visualizations with Matplotlib and Seaborn highlight trends in customer spending, campaign performance, and product sales.

big-data data-analysis data-processing data-science eda exploratory-data-analysis numpy pandas python

Last synced: 11 Apr 2026

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 26 Oct 2025

https://github.com/ryanfranklin237/data-visualization-python

A tool that allows you to visualize data from a csv or excel file in a graph or charts form

data-analysis data-science data-visualization matplotlib pandas-dataframe python

Last synced: 11 Jun 2026

https://github.com/anurag-kumar-molankala/blinkit-grocery-sales-dashboard

The BlinkIT Grocery Sales Dashboard is an interactive Power BI dashboard that provides insights into grocery sales performance. It includes key KPIs, sales trends, and outlet performance analysis.

business-intelligence dashboards data-analysis data-visualization dax excel kpi-dashboard power-bi powerquery slicers-kpi-card-multirow-card sql-server ssis ssms ssrs-reports

Last synced: 09 Apr 2025

https://github.com/viztruth/google-play-store-data-analysis

This repository contains all the materials of my final project 'Google Play store Data Analysis' for the 'Telling Stories with Data' course at PES University.

data-analysis data-visualization

Last synced: 21 Aug 2025

https://github.com/rajnish93/jpandas

A lightweight JavaScript library for working with tabular data, inspired by Pandas in Python. Built with TypeScript, it provides an intuitive API for data manipulation and analysis.

data-analysis data-analytics data-manipulation data-science dataframe javascript pandas stream-processing table typescript

Last synced: 11 Jun 2025

https://github.com/happybono/sonatasmooth

Provides three different noise reduction algorithms for smoothing out data : Rectangular Averaging, Binomial Median Filtering, and Binomial Averaging. It processes data from a list and displays the results in another list.

algorithms average binomial binomial-coefficient binomial-theorem calibration csharp data-analysis data-calibration dynamic-noise-reduction median noise-algorithms noise-reduction noise-reduction-kernel outliers rectangular-averaging windows-desktop windows-desktop-application windows-forms winforms

Last synced: 30 Oct 2025

https://github.com/harshals499/ecosecure-visualization

Data visualization project using Qlik to analyze sales performance for EcoSecure Systems.

business-intelligence data-analysis data-visualization qlik-sense sales-analysis

Last synced: 12 Jun 2026

https://github.com/siddharthbadal/sql-case-studies-data-analysis

Data Analysis case studies on various databases using SQL . Demonstrating proficiency in solving diverse business problems. Projects cover sales, orders, products, finance, healthcare and other sectors, and highlight my ability to analyze complex datasets through SQL queries, data manipulation, and visualization techniques.

data-analysis sql sql-query sql-server sqlserver

Last synced: 08 Jan 2026

https://github.com/kentlouisetonino/sw-project-data-analysis

My project for AMA MATH 6200 course.

data-analysis python school-project

Last synced: 28 Feb 2025

https://github.com/vipul2001/cousera-courses

This repo covers the solution to the assignments of various courses on algorithm,deep learning and data Analytics

coursera-courses data-analysis data-analytics-ibm deep-learning divide-and-conquer neural-network

Last synced: 29 May 2026

https://github.com/derrickbaruga7/mapping-median-age-europe

An R project that creates an interactive map of the median age across European regions using Eurostat data and spatial visualization packages.

data-analysis data-science data-visualization datascience european-union mapping r

Last synced: 25 Mar 2025

https://github.com/raccoon-hero/gender-equality-tracker

A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.

chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api

Last synced: 07 May 2026

https://github.com/ifibla/adsdb-project

Algorithms, Data Structures and Databases Project

data-analysis data-engineering python

Last synced: 12 Apr 2026

https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa

Menginput data Mahasiswa Yang Melakukan Pelanggran yang siap di data dan di hukum Dan juga siap Terkena Sanksi

aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website

Last synced: 02 May 2026