An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation

A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.

data-analysis data-analysis-python machine-learning python random-forest

Last synced: 18 Mar 2026

https://github.com/nagar2nd/jenson-usa-mysql-analysis

We are analyzing Jenson USA's dataset to gain valuable insights into customer behavior, staff performance, inventory management, and store operations. By crafting advanced SQL queries, the analysis explores key metrics such as product sales, customer spending, and order patterns, ultimately guiding strategic decision-making and operations.

data-analysis problem-solving sql

Last synced: 01 Feb 2026

https://github.com/vishnu-vamshii/data-science-jobs-salaries

Created an interactive dashboard to analyze data science jobs salaries in different regions of the world, experience levels, average salaries in USD and type of employment along with a geographical visual.

data-analysis data-science data-visualization tableau tableau-dashboard

Last synced: 01 Feb 2026

https://github.com/yeuner/file-analysis-sql-demo

Streamlit-based application that leverages pandas, sqlite3, and file handling libraries (OpenPyXL and PyArrow) to practice SQL queries, analyze datasets, and export results. A personal project to enhance Python and SQL skills.

data-analysis dataset pandas sql sqlite streamlit vizualization

Last synced: 15 Apr 2026

https://github.com/khanovico/python-stock-analyzer

This is a Webapp implemented by python and several data science frameworks, enabling online stock trend analyzing.

amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn

Last synced: 02 Feb 2026

https://github.com/vladimiracunadev-create/python-data-science-program

Python Data Science Program — 197 clases en 9 partes. Pauta avanzada derivada de Géron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseñanza y mejora continua.

bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 Jun 2026

https://github.com/shubham200137/customer-churn-analysis

In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.

data-analysis data-visualization numpy-python pandas-python sqlite tableau

Last synced: 15 Apr 2026

https://github.com/devbigboy/excel-power-query-get-transform

Power Query is a feature in Excel that allows you to quickly import data from multiple sources and easily clean, transform, and reshape it to suit your needs.

data-analysis data-science excel

Last synced: 08 Feb 2026

https://github.com/pawlo77/smarty

End-to-End Data Science tool

data-analysis data-processing pandas pipeline

Last synced: 08 May 2026

https://github.com/sroman0/data-analytics

Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.

data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics

Last synced: 15 Apr 2026

https://github.com/grindelfp/datasets-analysis

The Machine Learning and Data Analysis course task dedicated to training skills of data normalizing and preprocessing.

data-analysis datasets ipynb mlda

Last synced: 05 Mar 2026

https://github.com/mdaltamashalam/uber-fare-prediction-models

Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.

catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm

Last synced: 26 Feb 2026

https://github.com/josericodata/statisticsapp

Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.

alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test

Last synced: 26 Feb 2026

https://github.com/jweinst1/xenon

A processing based language

data-analysis interpreter reactive-programming

Last synced: 15 Apr 2026

https://github.com/shibbir24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/an1mch1k-theone/project_1_hh_analyze

Проект: анализ резюме из HeadHunter

data-analysis data-analysis-project python

Last synced: 15 Apr 2026

https://github.com/barraharrison/airbnb-price-trends

Looking at how Airbnbs differ in price when it comes to location, room type and host activity

data-analysis data-science pandas plotly python streamlit

Last synced: 09 Feb 2026

https://github.com/allanotieno254/powerbi-dax-filter-context

This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.

business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization

Last synced: 08 Jan 2026

https://github.com/dipeshgoyal013/crypto-currency-dashboard

This project analyzes historical cryptocurrency data and builds an interactive Power BI dashboard. It includes time-series forecasting of Bitcoin and Ethereum using ARIMA and Power BI’s forecasting model.

data-analysis excel power-bi python

Last synced: 15 Jun 2026

https://github.com/nulltea/kicksware-scrapebot

Web scraping tool to retrieve sneaker details & images from web store sites

bot data-analysis pandas python sneakers web-scraping

Last synced: 15 Apr 2026

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 15 Jun 2026

https://github.com/haroontrailblazer/machine_learning

About This Repository A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics

Last synced: 16 Apr 2026

https://github.com/shruti23-ui/blinkit-powerbi-dashboard

A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.

data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql

Last synced: 09 Feb 2026

https://github.com/wadeChriestenson/Main_Application

A Django application to host my personal resume.

data-analysis data-visualization django plotly python ui-design

Last synced: 11 Mar 2025

https://github.com/anderson-andre-p/uber-data-analysis

This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.

data-analysis data-science data-visualization python

Last synced: 15 Jun 2026

https://github.com/gnneto/nf-analyzer

Script Python para extrair dados de Notas Fiscais Eletrônicas (XML) e gerar Excel consolidado, com foco na extração de informações financeiras, como vencimentos e valores, para uma análise mais detalhada e eficiente. mantendo formatação numérica.

data-analysis excel finance nf-analyzer pandas python xlm

Last synced: 16 Apr 2026

https://github.com/bcko/ud-da-eda-redwinequality

Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : Red Wine Quality dataset

data-analysis data-analyst-nanodegree exploratory-data-analysis r-markdown rstudio udacity udacity-data-analyst-nanodegree udacity-nanodegree

Last synced: 10 Feb 2026

https://github.com/prateekbisht23/inventory_management

This project is an Inventory Management System built using Python (Pandas, NumPy, SciPy) and Jupyter Notebook. It allows efficient tracking of stock, performing data analysis, and generating useful statistical insights (mean, standard error, confidence intervals) to support better decision-making.

data-analysis jupyter-notebook management python3

Last synced: 11 Feb 2026

https://github.com/chinmayee4/sales-analysis-for-ferns-n-petals

Analyzed Data By Creating Interactive Dashboard Using MS Excel

data-analysis data-cleaning data-visualization excel pivot-tables powerquery

Last synced: 11 Feb 2026

https://github.com/haonamnguyen/data-science-job-analysis

Evaluate the factors influencing salary trends in the data science industry, including experience levels, job titles, employment types, company sizes, and remote work arrangements, to help HR teams and hiring managers make data-driven decisions regarding compensation packages and recruitment strategies.

data-analysis data-science data-visualization jupyter-notebook python

Last synced: 16 Apr 2026

https://github.com/shrutiijoshi/crm-sales-analysis

The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.

data-analysis data-visualization dax-functions powerbi powerquery

Last synced: 11 Feb 2026

https://github.com/dhruwsunita/car-sales-dashboard

Car sales dashboard using Tableau visualization tool.

car-sales data-analysis data-visualization excel kpis tableau

Last synced: 27 Feb 2026

https://github.com/praveen-devknight/event-registration-analytics-dashboard

This project presents an interactive and visually-rich Power BI dashboard that analyzes registration data from a college-level technical and non-technical event, Teciton. The dashboard provides comprehensive insights into participant demographics, event preferences, food choices, and time-based trends.

data-analysis data-visualization excel powerbi sql

Last synced: 11 Feb 2026

https://github.com/chdre/data-analyzer

A small package to analyze and preprocess data.

data-analysis python

Last synced: 28 Jun 2026

https://github.com/rodrigojunqueiradev/python-exercises

Repositório para armazenar exercícios realizados na linguagem Python / Repository to organize exercises with Python language

data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics

Last synced: 16 Apr 2026

https://github.com/sharmas1ddharth/mode_of_transport_analysis

This project requires you to understand what mode of transport employees prefers to commute to their office. The data includes employee information about their mode of transport as well as their personal and professional details like age, salary, and work exp. We need to predict whether or not an employee will use private transport. Also, which variables are a significant predictor behind this decision.

data-analysis r-programming

Last synced: 11 Feb 2026

https://github.com/thlindustries/mortalidade_neonatal_python_react

Uma plataforma de visualização de dados montada utilizando Python e React com a library de visualização do Plotly

data-analysis data-visualization plotly python python3 react reactjs

Last synced: 16 Apr 2026

https://github.com/thanaraklee/exploring-and-analyzing-data-in-oracle-database

This project focuses on data analysis using SQL with Oracle Database 21c. It aims to familiarize with data management and data analysis using SQL commands and Oracle Database 21c.

data-analysis oracle-database sql sql-developer

Last synced: 12 Feb 2026

https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance

Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.

bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse

Last synced: 12 Feb 2026

https://github.com/andimashkulli/vpms

Vehicle Parking Management System for Gjon Buzuku Gymnasium

backend-api data-analysis databases frontend-react mongodb nodejs software

Last synced: 12 Feb 2026

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 12 Feb 2026

https://github.com/ankit21111/carpredict

This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!

data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn

Last synced: 16 Apr 2026

https://github.com/mehedi-hassan81/mastercourse

Data analysis project analysing renewable energy production across 212 countries, visualizing trends with Tableau. Highlights China's dominance (2,894 TWh) and Paraguay's 100% renewable share.

data-analysis pandas python renewable-energy selenium tableau-dashboards tableau-public web-scraping

Last synced: 08 May 2026

https://github.com/dcs-training/data-wrangling-and-vis-pandas

Introduction to analyzing structured data with the Python libraries pandas, for CSV and TSV data, and ElementTree, for XML data. Go to the readme file

data-analysis data-visualisation data-wrangling python

Last synced: 16 Jun 2026

https://github.com/mananabbasi/dashboard-power-bi

This repository showcases **Power BI projects** focused on data visualization and business intelligence. Each project transforms raw data into interactive dashboards and reports, providing actionable insights for decision-making. The repository includes Power BI files, datasets, and documentation for each project.

data-analysis data-science data-visualization powerbi

Last synced: 13 Feb 2026

https://github.com/llnl/cap

HPC workflow that automates the tedious actions of compiling, analyzing, and parsing with bincfg

data-analysis hpc python workflows

Last synced: 17 Jun 2026

https://github.com/lopes51789/salaryanalysis

This salary dataset is a good candidate for descriptive analysis, and we can identify which demographics experience reduced or increased salaries. For example, we could explore the salary variations by gender, age, industry, and even years of prior work.

data-analysis json mysql python3 sql tableau

Last synced: 13 Apr 2026

https://github.com/m-ah07/text-sentiment-analysis-api

A lightweight Python project for analyzing the sentiment of textual data using the TextBlob library. This project provides a simple and effective way to measure the polarity and subjectivity of any given text.

data-analysis machine-learning python python-project sentiment-analysis text-analysis text-mining

Last synced: 14 Feb 2026

https://github.com/malakaburamila/power-bi-dashboards

A portfolio of interactive Power BI dashboards I developed, showcasing data visualization, analytics, and data-driven insights.

amazonsalesanalysis analytics dashboards data-analysis data-visualization datasets hranalytics power-bi

Last synced: 14 Feb 2026

https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project

【Sprinkle some star dust on this repo! ⭐️ It's good karma!】A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.

cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow

Last synced: 02 Apr 2026

https://github.com/juanse0330/registro-pacientes-terapia-python

Proyecto en Python para automatizar el registro y análisis de pacientes en terapia ocupacional domiciliaria. Herramienta orientada al sector salud.

automatizacion data-analysis python salud terapia-ocupacional

Last synced: 17 Jun 2026

https://github.com/fhdsl/seattlestatsummer_r

A 4-day introduction to R programming, focused on Fred Hutch Research Interns

beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse

Last synced: 19 Mar 2026

https://github.com/hlexnc/project-arepo

Data-driven stroke risk assessment & personalized recommendations, powered by machine-learning and an NLU-driven chatbot.

chatbot data-analysis docker docker-compose machine-learning nlu-chatbot python rasa scikit-learn sklearn streamlit

Last synced: 15 Feb 2026

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/siddhant2105s/bring-your-own-device-boyd-system

This repository contains the design and implementation of the Bring Your Own Device (BYOD) System for managing personal devices at Life Insurance Company. It includes an ERD diagram, MySQL scripts for database creation, data insertion, and queries, as well as detailed data definitions and system requirements documentation.

data-analysis database-design database-normalization entity-relationship-diagram entity-relationship-models my-sql relational-databases relational-model sql-queries

Last synced: 15 Feb 2026

https://github.com/shellynagar27/candy-market-share-analysis

Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.

critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis

Last synced: 03 Feb 2026

https://github.com/tj2904/lfb-callout-analysis

An investigation into London Fire Brigade's callout data.

data-analysis decsion-tree kmeans lfb-incidents london-fire-brigade pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/r-xue/xlib

Rui's IDL code library for Astrophysics

astrophysics data-analysis idl

Last synced: 16 Feb 2026

https://github.com/kheriberto/bedu_dc

Ejercicios del curso de "python desde 0" de la plataforma BEDU

data-analysis python

Last synced: 18 Jun 2026

https://github.com/tejas-130704/dataanalysis-hr-manager

Presence Insights of Employees This project provides insightful data analysis on employee attendance and presence, including work-from-home (WFH) data, sick leave records, and presence excluding holidays. The analysis spans a three-month period and is visualized using Power BI to help HR managers understand trends and optimize workflow.

dashboard data-analysis data-visualization hr-manager power-bi

Last synced: 01 Mar 2026

https://github.com/ibttf/bayborhood

Interactive map to find the ideal neighborhood in San Francisco based on data.

data data-analysis data-visualization gis mapbox react

Last synced: 18 Jun 2026

https://github.com/nagar2nd/airbnb-property-management-optimization

This project aims to analyze Airbnb’s dataset to optimize rental strategies, enhance customer satisfaction, and maximize revenue for property owners. Using Tableau, the insights generated will help improve decision-making for both Airbnb and its hosts.

data-analysis data-visualization tableau

Last synced: 01 Mar 2026

https://github.com/hawmex/aut_data_and_information_analysis_project

This repository contains the files of my project for the "Data & Information Analysis" course at AUT (Tehran Polytechnic).

data-analysis data-science k-means outlier-detection python

Last synced: 19 May 2026

https://github.com/devexpress-examples/wpf-pivotgrid-how-to-display-underlying-data

This example demonstrates how to obtain the records from the control's underlying data source for a selected cell or multiple selected cells.

data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf

Last synced: 19 May 2026

https://github.com/samir-atra/share-lm_dataset_analysis

Analysis, studies and optimizations on the ShareLM extension dataset

data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas

Last synced: 19 May 2026

https://github.com/prakshal0809/sql-data-analysis-project

This project involves analyzing pizza sales data using SQL to address various data analysis questions, providing essential foundational to advanced SQL knowledge.

data-analysis sql

Last synced: 26 Jun 2025

https://github.com/borjamome/radiografia-madrid

Análisis de Población, Economía y Sociedad de Madrid con R.

data-analysis data-visualization madrid r

Last synced: 17 Jun 2025

https://github.com/singingsandhill/data_analysis

데이터 분석_개인 프로젝트 정리

data-analysis python

Last synced: 19 May 2026

https://github.com/ansh-info/literaturesurvey

Literature Survey Engine, leverages the powerful Semantic Scholar's Recommendation API to provide you with highly relevant research article recommendations based on your curated lists of articles.

api api-integration automation data-analysis data-visualization docker docker-compose literature-survey machine-learning mysql paper-recommendations python recommendation-system research-tools semantic-scholar streamlit zotero

Last synced: 10 Apr 2026

https://github.com/rohithay/titanic-data-analysis

Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.

data-analysis machine-learning matplotlib pandas scipy-stats statistical-models

Last synced: 15 May 2026

https://github.com/danpoynor/python-number-guessing-game-with-stats

A number guessing game written in Python 3 that presents median, mode, and mean statistics

console-game data-analysis number-guessing-game python3 statistics

Last synced: 26 May 2026

https://github.com/sukhitashvili/pca_tutorial

PCA algorithm from scrach, using only matrix-vector multiplications

data-analysis data-science data-visualization machine-learning-algorithms pca

Last synced: 29 Mar 2025

https://github.com/samukiszhsd/alteryx-analytics

Você está trabalhando com dados de transações bancárias do Itaú e precisa fazer algumas análises para ajudar o time de auditoria a detectar padrões incomuns e possíveis transações suspeitas.

alteryx data-analysis data-structures data-visualization etl workflow

Last synced: 18 Feb 2026

https://github.com/prady2309/stock-analysis

Analysis on the stock prices of Apple, Google, Microsoft and Amazon

data-analysis data-science data-visualization python stock-market

Last synced: 19 May 2026

https://github.com/diliprk/smartcityvisualization

Data Wrangling and Data Visualization Works done for Smart City Project at HBK Saar

bokeh data-analysis data-visualization python3

Last synced: 15 May 2026

https://github.com/eve-ning/ppshift

Analyzes maps and scores from 2015

data-analysis data-mining osu osugame

Last synced: 13 Feb 2026