An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/lulloooo/article-fromfitto55tofittoeveryone

Analysis behind an article published in the EcoSprinter 2024 Annual edition, examining the EU "Fit for 55" packages from a different perspective 🔎

analysis data environment european-union

Last synced: 12 Jun 2026

https://github.com/wiseql/wiseql

The wise data browser — run SQL recipes as small, observable, debuggable steps

data debugging duckdb oracle quality sql tui

Last synced: 13 Jun 2026

https://github.com/lut-ful/ibm-capstone-project-stack-overflow-job-survey

Final project for the IBM Data Analyst Professional Certificate program.

cognos data data-analytics looker power-bi python sql statics

Last synced: 01 May 2026

https://github.com/dnut/associations

Python 3 library to identify high-dimensional statistical relationships in any data set.

analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules

Last synced: 01 May 2026

https://github.com/jigyasag18/sql-music-store-analysis

This repository contains an analysis of sales and customer data from a fictional music store. Using SQL, we explore trends in sales, popularity of artists and genres, and customer purchasing behavior. The project aims to derive actionable insights that can guide marketing strategies and inventory management decisions.

data dataanalysis dataanalytics database database-management dataset sql sqlqueries sqlquery

Last synced: 08 Jun 2026

https://github.com/chompfoods/sdk-kotlin

Kotlin SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food foods grocery ingredients kotlin nutrition raw recipe-api recipes sdk sdk-kotlin

Last synced: 01 May 2026

https://github.com/word2vect/beijing-new-house-data-visualization

Beijing New House Data Visualization for Python Programming 2024 Fall Data Visualization Lab

data python visualization

Last synced: 13 Jun 2026

https://github.com/dhruvil-26/tableau-projects

This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, team, and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns.

analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public

Last synced: 19 Jan 2026

https://github.com/bastianolea/plebiscitos_chile

Electoral results data from the 2022 and 2023 constitutional plebiscites

chile comunas data elecciones politica social

Last synced: 15 Jun 2026

https://github.com/sandygcabanes/etl-earthquake-data-from-usgs-google-cloud-composer-airflow

Airflow, Google Cloud Composer, GCS, BigQuery, Python. This automated pipeline pulls daily earthquake data from a trusted public source, stores it securely in the cloud, and organizes it into clean, searchable tables for analysis.

cloud composer dag data engineering etl etl-pipeline google json python

Last synced: 01 May 2026

https://github.com/sebastianbrzustowicz/github-data

Java + Spring Boot. Application for sending requests to GitHub API and collecting received data.

api ci data github json junit mapping parallel repository rest-api stream

Last synced: 01 May 2026

https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard

This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.

charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi

Last synced: 23 Feb 2026

https://github.com/mightymetrika/holi

holi: Higher Order Likelihood Inference Web Applications

data data-science r statistics

Last synced: 10 Feb 2026

https://github.com/rbreeze/dashboard

My personal health dashboard, with daily stats on food and sleep. It has undergone several redesigns since 2015.

css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization

Last synced: 02 May 2026

https://github.com/loggdme/kyro

Collection of utilities and examples for creating efficient data pipelines in Go, with parallel queues, rate limiters, and much more.

data package

Last synced: 14 Jan 2026

https://github.com/mubashirsidiki/olympics-data-enigeering

Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.

analytics azure big-data data dataengineering devops pipeline

Last synced: 02 May 2026

https://github.com/jesuscc1993/data-cleaner-extension

Clears browser data in a single click.

application-data chrome chrome-extension data

Last synced: 02 May 2026

https://github.com/viniddev/active_finance

In this project I set out to solve a common problem: the difficulty of keeping up with changes in the stock and real estate fund markets. I used Selenium WebDriver to gather information and a Telegram API to send reports to the user.

automation data data-analisis rpa selenium-webdriver telegram-bot

Last synced: 03 May 2026

https://github.com/smaug6739/sidonie

📦 Sidonie is a prototype module for manipulating JSON data.

data database javascript json module typescript

Last synced: 03 May 2026

https://github.com/bastianolea/mineduc_desvinculacion

Incidence rates of student dropout (desvinculación) in primary and secondary education, by year, commune, and gender.

chile comunas data educacion social tiempo

Last synced: 10 Oct 2025

https://github.com/redgoose-dev/baguni

A web program for storing and browsing images

data explorer file management upload

Last synced: 14 Apr 2026

https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning

Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page

data data-visualization dataanalytics excel postgresql powerbi python3

Last synced: 03 May 2026

https://github.com/joelgombin/intro_r_iau

Introduction to R #WeData

data data-science dataviz gis r

Last synced: 04 May 2026

https://github.com/soham7998/data-analysis-projects

Data analysis projects I completed to gain hands-on experience. The projects showcase different concepts, visualizations, and more.

data data-analysis data-science machine-learning nlp python soham visualization

Last synced: 04 May 2026

https://github.com/cburmeister/disc-golf-courses

All the disc golf courses I've played at. Maintained with http://geojson.io/.

data geojson

Last synced: 21 Jan 2026

https://github.com/dimitryzub/russo-ukraine-war-prediction-losses

Highlights Russian losses with predictions based on historical data from the Ministry of Defence of Ukraine 🐱‍👤

data dataanalysis dataanalytics matplotlib pandas prophet python

Last synced: 04 May 2026

https://github.com/ayushman0511/data-analytics-project1

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.

analytics busine data data-anal data-enginee data-sci data-scien database datascien query reporting sql sql-query sql-server window-func

Last synced: 17 Jun 2026

https://github.com/djdhairya/whatsapp-chat-analysis

WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.

data data-science dataanalytics datapreprocessing machine-learning ml

Last synced: 08 Oct 2025

https://github.com/srevenant/data-science-alpine

A docker container for data science, using alpine linux and python3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/a-poor/datatransform.jl

A package for defining (and performing) tabular-data transformations with JSON.

data data-science data-transformation etl feature-engineering json julia julia-package tabular-data

Last synced: 05 May 2026

https://github.com/rodgeraraujo/open-dataverse

OpenDataverse: ETL application that filters and imports open data from https://dados.ifpb.edu.br/, saves it to a database, and exports it via a REST API.

data dataset dataverse flask ifpb pandas python

Last synced: 05 May 2026

https://github.com/munas-git/codm-review-analysis-and-predictions

Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.

data flask machine-learning python sentiment-analysis

Last synced: 05 May 2026

https://github.com/ournet/weather-data

Ournet weather data module

data forecast ournet storage weather

Last synced: 07 Oct 2025

https://github.com/dushansenadheera/web_scraper

Web scraper built with Python, BeautifulSoup, and Selenium

beautifulsoup data python selenium web-scraping

Last synced: 19 Jun 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-run EDA toolkit with 30 commands covering text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/rylan12/apscores

A quick way to visualize how the AP score distributions have changed from year to year.

advanced-placement analysis ap-exam data scores

Last synced: 19 Jun 2026

https://github.com/openwashdata/ugabore

Borehole repair data from central Uganda associated with a project report completed by Joseph Lwere for the “data science for openwashdata” course

analysis borehole data open-data r uganda wash water

Last synced: 17 Jan 2026

https://github.com/jbn/vaquero

A Python library for iterative and interactive data wrangling at laptop-scale.

data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework

Last synced: 10 Jun 2026

https://github.com/xljones/bugsnag-exporter

Easily export Bugsnag project, error, and event data with a command-line call that automatically handles pagination and API backoffs.

bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh

Last synced: 06 May 2026

https://git.sheetjs.com/sheetjs/sheetjs

📗 SheetJS Community Edition -- Spreadsheet Data Toolkit

angular bun csv data database deno excel grid html html5 ios javascript json nodejs react spreadsheet table vue xlsx xml

Last synced: 06 Oct 2025

https://github.com/darrendavy12/azure-databricks-setup-guide-with-formula1-csv

Azure Databricks Setup Guide with Formula1 CSV - Azure Databricks, PySpark, Python, Data Lake Storage

apache azure cloud data databricks lake notebooks pyspark python spark storage

Last synced: 06 May 2026

https://github.com/artcc/coredatademo

Demo for CoreDataGenericModule implementation

core coredata coredata-model data encrypted encrypted-data encryption persist

Last synced: 19 Jun 2026

https://github.com/hackersandslackers/hackers-jupyter-posts

🔴 📕 Our repository of Jupyter Notebooks that serve as blog posts.

blog data data-engineering gatsbyjs jupyter jupyter-notebook python python3

Last synced: 07 May 2026

https://github.com/fehmitahsindemirkan/web-scrapper

Professional and high performance web scraping project.

data ecommerce emailsender fileexplorer logging python web webscraping

Last synced: 10 Jan 2026

https://github.com/chardos/get-git-data

Access git repository data in node.

data git javascript node

Last synced: 07 May 2026

https://github.com/anand-sony/mttr-dashboard

Streamlit dashboard for MTTR analysis with shift-wise loss insights and machine-level downtime tracking.

analytics business-analytics dashboard data python statistical-analysis

Last synced: 30 May 2026

https://github.com/jigyasag18/iit-guhawati

Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via a Power BI dashboard.

aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp

Last synced: 08 May 2026

https://github.com/flyconnectome/hnf

Documentation for the hierarchical neuron format

annotations data dotprops hdf5 mesh neurons skeleton storage

Last synced: 17 Jan 2026

https://github.com/ppabam/eda-bam

Navigating data from one thing to another.

cli data eda python

Last synced: 11 Feb 2026

https://github.com/praveendecode/retail-revenue-forecasting

Designed an end-to-end ML model pipeline, forecasting department-wide sales by accounting for holiday markdown effects, spanning data collection to inferencing.

azure collection data datapreprocessing docker exploratory-data-analysis feature-engineering featureimportance model modelbuilding modeldeployment modelselction python report tableau

Last synced: 16 Apr 2026

https://github.com/randomfractals/unfolded-map-snippets

HTML, CSS, JavaScript, and Python 🐍 vscode snippets ✂️ extension for Unfolded Map 🗺️ and Data SDKs

code data extension map sdk snippets template unfolded vscode

Last synced: 08 May 2026

https://github.com/affan005-ai/tesla-stock-prediction

This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models

data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn

Last synced: 05 Oct 2025

https://github.com/project-renard/test-data

Files for testing

data

Last synced: 27 Feb 2026

https://github.com/afeiship/next-object-operator

Object set/get/sets/gets and other operators.

data get gets next operator set sets store

Last synced: 27 Feb 2026

https://github.com/beastbytes/postal-code-data-php

Implementation of PostalCodeDataInterface using PHP file storage

data php postal-code yii3

Last synced: 27 Feb 2026

https://github.com/abhinavrobinson/mc-community-world

Minecraft community world data.

data minecraft server world

Last synced: 27 Feb 2026

https://github.com/os-climate/data-requests

This repo is used to track issues related to new Data Requests

data data-engineering dataset

Last synced: 27 Feb 2026

https://github.com/haroontrailblazer/machine_learning

A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics

Last synced: 16 Apr 2026

https://github.com/bishtrishu/super_store_sales_dashboard

This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by using data analysis techniques, especially time series analysis, to provide valuable insights and accurate sales forecasting.

analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql

Last synced: 28 Feb 2026

https://github.com/neurazum-ai-department/tumor-stages-dataset---v1

Synthetic MRI data generated by the 'HF' and 'Vbai' models based on real data.

brain data dataset datasets image mri neuroscience tumor tumor-segmentation

Last synced: 18 Mar 2026

https://github.com/samaalharbi2/project-recommendation-system

This project focuses on building a Recommendation System using real interaction data from IBM's Watson Studio platform.

clustering data ibm-watson kmeans nlp python rec svd udacity-nanodegree

Last synced: 09 Feb 2026

https://github.com/sumaiyyaf/british-airline-dashboard

This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.

analysis dashboard data tableau visualization

Last synced: 13 Feb 2026

https://github.com/j0a0m4/olympics

Final Project for Data Engineering Accelerated LATAM

data olympics spark

Last synced: 13 Feb 2026

https://github.com/krishkumar/scrobbles

all the music 🎸

data music scrobble

Last synced: 13 Feb 2026

https://github.com/danicaalana/wine-dataset-decision-tree

This project was developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. It uses the Wine Recognition Dataset from scikit-learn, which contains the results of a chemical analysis of wines grown in the same region of Italy by three different cultivators.

data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset

Last synced: 18 Apr 2026

https://github.com/ms140569/loki-example-store

Test data for the loki password manager

data

Last synced: 26 Feb 2026

https://github.com/sanand0/iss-location

Tracks the International Space Station position. A demo of how to use GitHub Actions to schedule commits weekly.

data

Last synced: 14 Feb 2026

https://github.com/nits2612/data-science-projects

Portfolio of data science projects I completed for PGP AI/ML, self-learning, and hobby purposes.

data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning

Last synced: 01 Feb 2026

https://github.com/molinsagustin/cinedata

CineData: group practical assignment for the Data Engineering I course at Universidad Argentina de la Empresa. It consisted of developing a relational database in Microsoft SQL Server Management Studio using the Agile Scrum methodology, which was applied from requirements gathering through final implementation.

agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver

Last synced: 28 Feb 2026

https://github.com/sunnahboy/checkfake_true_news

Builds data structures using linked lists and arrays and identifies the best algorithms for implementing a fake news detection system.

algorithms data level low programming structure

Last synced: 28 Feb 2026

https://github.com/davorg/towerbridge

When is Tower Bridge lifting?

data hacktoberfest london perl web-scraping

Last synced: 29 Jun 2026

https://github.com/natarizkie2/neurochain-airdrop-bot

🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.

airdrop automated bot data multi-account natarizkie neurochain nodejs web3

Last synced: 10 Jun 2026

https://github.com/chompfoods/sdk-typescript-angular

Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript

Last synced: 09 May 2026

https://github.com/krakozaure/pyzzy

Set of packages to simplify development in Python

configuration data formats json library logging logs python3 toml utils yaml

Last synced: 14 Jan 2026

https://github.com/opdev1004/totjs

A not entirely new file format for managing human-readable data in a file. JS version.

data data-storage data-store database database-management hacktoberfest hactoberfest-accepted nodejs

Last synced: 31 Jan 2026

https://github.com/lut-ful/pizza-sales-report

This Pizza Sales Report provides valuable insights into sales performance through detailed analysis and visualizations. By leveraging Power BI and SQL Server

data data-wrangling microsoft-sql-server power-bi power-bi-dax python

Last synced: 30 Jan 2026

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/arnocan/yapydata

yapydata provides miscellaneous low-level Python data access APIs.

data datastructures ini json properties python python2 python3 xml yaml

Last synced: 16 Feb 2026

https://github.com/themost-framework/cache

MOST Web Framework Caching Module

cache caching data

Last synced: 12 Feb 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/aimin-nur/data-analyst-model-predictive

A data analyst project that aims to identify the characteristics of customers likely to accept marketing campaign offers.

analyst data mechine-learning visualization

Last synced: 29 Jan 2026

https://github.com/soenneker/soenneker.attributes.mapto

A C# attribute for generic data mapping translation

attributes columns csharp data datatables dotnet mapping mapto maptoattribute object

Last synced: 02 Mar 2026

https://github.com/spatialcurrent/go-counter

Simple library and command line program for generating frequency distributions.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/soenneker/soenneker.data.zipcode

US ZIP code data from USPS, updated daily

code csharp data dotnet usps zip

Last synced: 02 Mar 2026

https://github.com/pawlo77/nos_snowflake

Network Operating Systems course for DS studies in Winter 2024/25

azure data data-science snowflake snowpark streamlit

Last synced: 09 May 2026

https://github.com/spatialcurrent/go-pipe

go-pipe is a simple library for piping objects from iterators to writers.

big-data bigdata concurrency data

Last synced: 29 Jan 2026

https://github.com/eugenedakin/des-encryption-decryption

Encrypt and Decrypt text in Xojo using DES - Written in Native Xojo Language - Cross Platform

data data-encryption-standard decryption des encryption standard xojo

Last synced: 24 Feb 2026

https://github.com/romaintailhurat/dagster-playground

Playing with Dagster 🐙

data pipelines python3

Last synced: 14 Jun 2025

https://github.com/shubhamsoni98/excel-practice

Excel-Practice-Questions

analysis data excel formula raw-data xlsx

Last synced: 03 Mar 2026