An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/rahult18/atmo-flow

AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes

airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark

Last synced: 23 Jun 2026

https://github.com/sebhoss/countries-and-cities

dolt database for countries and their cities

cities countries data database dolt

Last synced: 11 Oct 2025

https://github.com/faster-games/dynamic-components

Dynamic Runtime Components for Unity3D

data framework unity3d

Last synced: 11 Apr 2026

https://github.com/sanand0/marvel-powers

Scrapes Marvel Fandom for character powers

data

Last synced: 12 Oct 2025

https://github.com/equinor/sumo-wrapper-python

Thin python wrapper to interact with Sumo API

analytics data fmu python subsurface sumo

Last synced: 19 Jan 2026

https://github.com/flowsynx/plugin-base64

FlowSynx plugin to provides encoding and decoding of Base64 strings, allowing workflows to handle Base64 content transformations efficiently.

base64 base64-decoding base64-encoding data data-platform decoding encoding flowsynx flowsynx-plugins

Last synced: 10 Mar 2026

https://github.com/jameshenderson12/chatbot-utils

Generic data and elements that can be reused or repurposed for chatbot development.

boilerplate chatbot data development elements intents template utterances

Last synced: 04 Mar 2026

https://github.com/pbinkley/mfmcollections

Project to distill data about published collections of microfilms from library lists

data research retro

Last synced: 28 May 2026

https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python

This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.

analysis csv data insights matplotlib numpy pandas project python seaborn

Last synced: 06 Apr 2026

https://github.com/amethyst-php/activity

Someone just did something, should we save who did this and when?

activity amethyst amethyst-package api data laravel

Last synced: 17 May 2026

https://github.com/scjoaoantonio/trab_datascience

Este projeto tem como objetivo analisar os posts da rede social Bluesky. A aplicação interativa foi desenvolvida utilizando Streamlit e permite a coleta e visualização de dados, além de oferecer análises avançadas como previsão de engajamento, modelagem de tópicos e análise de sentimentos.

bluesky data data-science streamlit

Last synced: 09 May 2026

https://github.com/bdr-pro/graphyml

A powerful, interactive Streamlit application to explore, edit, visualize, and query a graph-based database of YAML nodes — ideal for movie metadata, research articles, or structured knowledge graphs.

data database yaml yml

Last synced: 23 Jul 2025

https://github.com/drzax/light-up-brisbane

Where, what and why various public places in Brisbane are lit up.

brisbane data git-scraping

Last synced: 19 Jan 2026

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 19 Jan 2026

https://github.com/thanh-wutan/chess-opening-comparator

Interactive web app using R to visualize and compare chess opening performance and popularity.

chess-openings data databases datavisualisation r

Last synced: 09 May 2026

https://github.com/heyimsteve/solnftdatadash

This a React-based web application that provides detailed information about NFT collections on the Solana blockchain. It uses the HelloMoon API to fetch and display data about NFT collections, including statistics, loan summaries, ownership information, and floor prices.

dashboard data hellomoon nft react solana solana-nft

Last synced: 30 Jan 2026

https://github.com/jhpoelen/bees

Content-based iDigBio prototype

biodiversity data ecololgical informatics provenance

Last synced: 18 Mar 2026

https://github.com/moeabbas6/bq_data_loader

A Python script for executing and logging batch SQL commands in Google BigQuery. Includes tracking of execution times, unique job and statement IDs, and automated logging to a specified BigQuery table.

bigquery data python

Last synced: 24 Mar 2025

https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas

SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.

data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase

Last synced: 18 May 2026

https://github.com/bertrand31/one-billion-rows-challenge

🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds

competitive-programming competitive-programming-contests data data-engineering data-processing performance scala

Last synced: 05 Sep 2025

https://github.com/nolanbconaway/rollercoaster-tycoon-data

Every roller coaster I have built in RCT2 for iPad

data roller-coaster-tycoon

Last synced: 24 Mar 2025

https://github.com/donghquinn/gopandas

gopandas

data go golang

Last synced: 14 Oct 2025

https://github.com/rubyonworld/ldpath

This is a ruby implementation of LDPath, a language for selecting values linked data resources.

data ldpath resource ruby

Last synced: 12 Nov 2025

https://github.com/flowsta/ods-educacion-aporta

ODS para educación, iniciativa APORTA 2021

data data-visualization ods sdg

Last synced: 27 Jan 2026

https://github.com/deepanshkhurana/facebook-birthdays

Python script to create a .csv from Facebook's Event Data to list Birthdays.

data facebook python

Last synced: 14 Oct 2025

https://github.com/plateformeio/docs

The official documentation of the Plateforme framework

api app asgi async data db docs fastapi plateforme pydantic python restx services sqlalchemy

Last synced: 11 Apr 2026

https://github.com/pranjaldhamane/social-media-sentiment-analysis

This project aims to analyze sentiment in Twitter data to understand attitudes towards specific topics or entities. It seeks to uncover positive and negative sentiment patterns, detect potential cyberbullying or hate speech, and provide insights into Twitter's overall sentiment landscape.

data dataanalysis logistic-regression nlp-machine-learning python sentiment-analysis twitter

Last synced: 18 Apr 2026

https://github.com/meizuflux/cion

Python minimal data validation library

data minimal python validation

Last synced: 28 May 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://gitlab.com/hailstorm75/Common

A collection of extension libraries for various use-cases

common core cpp csharp data extensions libraries library math matrix

Last synced: 07 May 2025

https://github.com/mohammad-malik/covid-visualizations-d3

This project provides a dashboard with five different perspectives on the pandemic, from patient-infection relationships to regional trends and hierarchical distributions. This was developed as part of a project for the course Data Analysis and Visualization (DS3001).

covid-19 d3 d3-visualization d3js data data-analysis data-analytics data-science visualization

Last synced: 28 May 2026

https://github.com/brandonzylstra/essence

🧘🏼‍♂️ Relaxed Rails Modeling & Migrations

active-record data database gem hcl modeling rails ruby ruby-on-rails yaml

Last synced: 14 Apr 2026

https://github.com/mominurr/fire-gas-leak-detection-system

A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.

ai computer-vision data datascience machine-learning ml python yolo

Last synced: 27 Jan 2026

https://github.com/rationalprabal/book-management-app

A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation and documentation using swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.

data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles

Last synced: 10 Apr 2026

https://github.com/nivasharmaa/genetrack

A Java program for analyzing DNA sequences and identifying individuals based on Short Tandem Repeats (STRs). Features profile database creation, STR analysis, individual identification, and relationship detection.

data data-processing dna-analysis file-io-in-java genetic-analysis java-oop

Last synced: 25 Aug 2025

https://github.com/jamiew/void-runners-analysis

basic data analysis for the Void Runners Genesis Fleet spaceships

analysis data nfts

Last synced: 29 Mar 2025

https://github.com/notthestallion/pca__3d-and-from-scratch__principal-component-analysis

In this project, I will be implementing Principal Component Analysis (PCA) from scratch on an ecological footprint consummation database for countries and a three-dimensional scale using a movie database. The goal of this project is to gain a deeper understanding of PCA and to demonstrate its capabilities in exploring complex datasets.

data data-science database pca pca-analysis principal-component-analysis principal-component-analysis-pca principle-component-analysis

Last synced: 10 May 2026

https://github.com/st-universe/data

The STU data assets

assets data stu

Last synced: 14 Mar 2026

https://github.com/repirate/asset-recovery-tool

A simple tool for recovering undrained tokens and NFTs from a compromised wallet on the Ethereum network.

bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask-desktop metamask-plugin phrase recovery seed token wallet

Last synced: 10 May 2026

https://github.com/j-sephb-lt-n/personal-projects

A history of my personal projects and professional development

ai api auth cloud data llms personal-development web

Last synced: 24 Jan 2026

https://github.com/lablnet/alibaba_scraper

This is a robust web scraper that extracts data from the Alibaba website. It's multi-threaded and utilizes Playwright to efficiently scrape data from the website. This script is capable of scraping the entire Alibaba site, which would take approximately 4-6 months to complete.

alibaba data ecom mit-license open-source products scraper

Last synced: 15 Mar 2025

https://github.com/bdr-pro/streamlint

ltra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files

data streamlit

Last synced: 14 Apr 2026

https://github.com/vanduc1102/parse-stackoverflow-data

Parse stackoverflow data

data parser stackoverflow

Last synced: 16 Oct 2025

https://github.com/omarcodex/data_analysis

My repository of past and present research and data-driven projects.

data ecodev ecology science sustainability yale

Last synced: 18 Jan 2026

https://github.com/bhemen/aave-data

Borrowing and lending data sets from the Aave protocol on Ethereum

aave borrow data ethereum lend python

Last synced: 05 Feb 2026

https://github.com/gsinghjay/ywcc-307-003

Group Presentations

cloud data government

Last synced: 04 Feb 2026

https://github.com/sakan811/show-leaving-soon-tracker-website

This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.

data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website

Last synced: 18 Mar 2025

https://github.com/enoch208/eventmaster

A user-friendly application that helps you easily record and play back your keyboard and mouse actions. With its modern design using `tkinter` and `ttkthemes`, it provides a smooth and easy-to-use interface. The app combines reliable technical features to give you a great experience.

automation data key keylogging-python replay spy tools

Last synced: 01 Jun 2026

https://github.com/psgebeline/harvard-data-science

My work for the nine courses in Harvard's data science program, each with notes/assignments. Work in progress.

data linear-regression machine-learning modeling probability-theory r visualization wrangling

Last synced: 19 Oct 2025

https://github.com/rezapace/newbash

This project involves managing various application shortcuts and configurations primarily for a Linux environment. It includes scripts for creating .desktop entries for applications, managing system configurations, and handling application processes.

automation backup bash data dekstop linux newbash ohmyzsh script testing zsh

Last synced: 11 Apr 2026

https://github.com/parvezk/d3-fundamentals

D3 library API fundamentals

charts d3 data graphs visualization

Last synced: 19 Oct 2025

https://github.com/psyteachr/psyteachrdata

Datasets for psyTeachR Books

data

Last synced: 23 Mar 2025

https://github.com/dansalahi/query-builder-experiment

Customized Query Builder for creating Rules and Groups

data data-structures jsonlogic query-builder reactjs typescript validation

Last synced: 11 Apr 2026

https://github.com/nel-zi/nuga_bank

Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.

data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline

Last synced: 16 May 2025

https://github.com/r-mahesh45/india-news-headlines-analysis

Excited to share my latest project: India News Headlines Analysis (2001–2023). This Power BI report dives deep into 21 years of Indian headlines, uncovering: Trends that defined the nation, Key themes that shaped public discourse, Insights into the evolution of media coverage.

data data-science powerbi visualization

Last synced: 05 Jan 2026

https://github.com/infinitode/crsd

A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.

ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data

Last synced: 10 Jun 2026

https://github.com/cemc-oper/nmc-typhoon-db-client

A CLI client for NMC Typhoon Database.

data database-client nmc

Last synced: 01 Jun 2026

https://github.com/dhimmel/adeptus

ADEPTUS -- differential gene expression signatures of disease

adeptus data differential-expression disease gene-expression genes rephetio

Last synced: 05 Jan 2026

https://github.com/zanysoft/virtualcolumn

Laravel virtual column

data laravel virtual-column

Last synced: 12 Apr 2026

https://github.com/avestura/shell-dads

❓ Show a random tip from NIST DADS (https://xlinux.nist.gov/dads) every time you open your terminal

algorithms dads data data-structures ds nist

Last synced: 23 Oct 2025

https://github.com/mohibmirza-py/email-verifier-script

Streamlit app to verify emails in bulk

ai analysis data streamlit

Last synced: 29 Apr 2026

https://github.com/wittyicon29/zeotap-ds-assignment

Internship application assignment

data data-science

Last synced: 19 Aug 2025

https://github.com/halyusa16/mysql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 20 Jan 2026

https://github.com/suryadev99/stream_processing_website_click_data

Stream Processing of website click data using Kafka and monitored and visualised using Prometheus and Grafana

clickdata data dataengineering docker flink-kafka flink-metrics flink-stream-processing git grafana kafka kafka-streams kafka-topic prometheus psql python

Last synced: 10 Mar 2026

https://github.com/smeltier/data-structures-c

This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.

algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c

Last synced: 16 May 2025

https://github.com/tjpalanca/pins

Data Pins

data pins

Last synced: 05 Jan 2026

https://github.com/ssanthosh010303/collection-data-training

A collection of challenges exercised during data training program.

airflow apache azure azure-data-factory azure-databricks azure-logic-apps bigdata data hadoop spark

Last synced: 27 Jan 2026

https://github.com/mecha-cms/x.route

Custom route files.

custom data extension file folder path route url

Last synced: 23 Mar 2025

https://github.com/bastianolea/cut_comunas

Versión actualizada de los códigos únicos territoriales (CUT) de las comunas y regiones del país.

chile comunas data estado

Last synced: 24 Jun 2026

https://github.com/andrewl/danelaw

Geopackage containing the boundary of the Danelaw

data geospatial medieval viking

Last synced: 23 Jan 2026

https://github.com/gunjanmimo/d3-visualization

D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework

d3js data data-science data-visualization reactjs

Last synced: 29 Apr 2026

https://github.com/cracko298/planet-life-save-converter

Convert your Planet-Life Saves To and From Base64 & *.planet files.

base64 base64-decoding base64-encoding data python python-script python3 save-converter save-data save-files

Last synced: 15 Mar 2025

https://github.com/kenjyco/libs

Easily install kenjyco libs

api cli command-line data helper kenjyco libs python

Last synced: 16 May 2026

https://github.com/cognitixe/metamask-wallet-recovery-funds-phrase-data-seed-token

This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.

bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security

Last synced: 13 May 2026

https://github.com/sankooc/validatez

object validation for node

data validate

Last synced: 13 May 2026

https://github.com/afeiship/data-arary

Data array with some new methods.

array data data-structure js list

Last synced: 11 May 2026

https://github.com/ybelenko/openapi-data-mocker-interfaces

Package with OpenApiDataMocker interfaces.

data fake faker interface mock mocker oas oas3 openapi swagger

Last synced: 05 Jan 2026

https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard

Showcase results of possible fetched data from the Github and Vercel API built in all vanilla js.

api-rest da data express-js github-api nodejs vercel-api

Last synced: 07 Mar 2026

https://github.com/dhanish03/reliance-sales-report-dashboard

This project, Reliance Sales Report Dashboard, showcases a dynamic and interactive Power BI dashboard designed to analyze sales performance. The dashboard provides key insights into various aspects of sales data, including product-wise performance, region-based revenue, and profitability trends.

data datavisualization-project powerbi visualization

Last synced: 23 Jan 2026

https://github.com/charon25/weatherdata

17 000 weather measurements collected by a weather station created for a college project.

csv data dataset datasets json measurements strasbourg weather weather-data

Last synced: 16 Jan 2026