An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/devtin/duckfficer

Zero-dependencies light-weight library for modeling, validating and sanitizing data 🦆 🐵 👁

coercion data duck-typing json parsing schema validation

Last synced: 01 Mar 2025

https://github.com/punch-mission/simpunch

Simulate PUNCH Data

data nasa-data punch simulation

Last synced: 27 Dec 2025

https://github.com/smolsoftboi/php-faker-providers

Faker providers that generate fake data for you.

data faker faker-generator faker-provider generator php

Last synced: 22 Apr 2025

https://github.com/EmirhanServeren/NFT-CollaBot

NFT CollaBot is a data-oriented project designed by the requirements of NFT ecosystem and aims to strengthen community.

data data-analysis data-analytics nft streamlit streamlit-webapp tezos tezos-api tezos-blockchain tezoswallet

Last synced: 17 Apr 2025

https://github.com/lewagon/matplotlib

Matplotlib examples for Le Wagon's Data Science bootcamp

data

Last synced: 13 Jul 2025

https://github.com/mews-labs/crep

This simple module aims at providing some function to tackle tabular data that have a continuous axis. In situations, this index can represent time, but this tool was originally developed to tackle rail way description.

data pandas pandas-dataframe python python3 rails-application time-series

Last synced: 23 Feb 2026

https://github.com/elianhugh/streams

Flexible data streaming for R

data package r r-package streaming

Last synced: 26 May 2026

https://github.com/anders617/cscareerquestions-salaries

Python script for retrieving offer information from /r/cscareerquestions salary sharing threads.

career data reddit salary

Last synced: 24 Aug 2025

https://github.com/yezz123/awsflowutils

Improve your data workflow with enhanced simplicity and robustness in handling common data tasks ✨

aws data redshift s3 s3-bucket workflow

Last synced: 07 Jan 2026

https://github.com/rousan/bytevault

A command line application that stores sensitive data as key-value pair securely in local machine

application byte c command-line data encrypts key-value sensitive vault

Last synced: 16 Mar 2025

https://github.com/ntia/5g_aerial_rf_radiation_data

Measured airborne radiation patterns around 5G MIMO gNodeB base station transmitters.

5g 5g-nr airborne-data altimeter data emc ji-frai mimo radalt radar

Last synced: 25 Jan 2026

https://github.com/rufat/ikinci-qarabag-muharibesi-shehidleri-json

İkinci Qarabağ müharibəsi şəhidlərinin şəkillərlə birgə JSON formatında məlumat bazası.

azerbaijan data json karabakh martyrs war

Last synced: 01 Jul 2025

https://github.com/quetz-al/quetzal

Quetzal API (short for Quetzalcoatl): a data and metadata management application

api data data-science flask-application openapi3 python quetzal

Last synced: 09 Mar 2026

https://github.com/bitartisan1/netdigger

A .NET 8.0 C# WPF desktop application for web scraping data into structured databases with a modern UI, comprehensive logging and optimized high performance.

csharp data data-scraper data-scraping database desktop dotnet internet logging scraper ui url web-scraper web-scrapers web-scraping web-scrapping

Last synced: 13 Apr 2025

https://github.com/siongui/gopaliwordvfs

Serve JSON data of Pali words, embedded in Go code

data go golang pali vfs virtual-file-system virtualfilesystem

Last synced: 04 Apr 2025

https://github.com/nalgeon/nalgeon.github.io

Everything about SQLite, Python, open data and awesome software

data python sqlite

Last synced: 14 Jul 2025

https://github.com/ferhatgec/kedi

Fegeya Kedi, Experimental Data Interface.

cpp cpp17 data data-interchange data-interface fegeya gnu json library linux xml

Last synced: 14 Apr 2025

https://github.com/romelperez/empanada

Simple data mock generator.

data generator javascript mock typescript

Last synced: 11 Apr 2025

https://github.com/hanwentao/china-regions

Data of China's Regions

china data geography

Last synced: 13 Apr 2025

https://github.com/stefanbohacek/fediverse-explorations

Exploring the fediverse through data, studies, and polls.

data data-visualization fediverse mastodon social-media

Last synced: 12 Apr 2025

https://github.com/priyanka7411/dataspark-electronics-retail-analytics

DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.

analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization

Last synced: 10 Jul 2025

https://github.com/gappeah/apocalypse-food-prep-report

This PowerBI project focuses on visualising data for Apocalypse Food Prep, a company specialising in emergency food supplies. The dataset consists of various CSV files containing information on customers, locations, products, sales, sales teams, and state regions.

data data-visualization powerbi powerbi-report powerbi-visuals

Last synced: 25 Feb 2025

https://github.com/charconstpointer/markovbot

PoC markov chain sentence generator, powered by discord for data gathering

bot chain collection data discord markov parsing

Last synced: 16 May 2026

https://github.com/logikal-io/mindlab

Data science toolbox

data jupyterlab python spark

Last synced: 29 Oct 2025

https://github.com/astrid-project/lcp

In each local agent, the control plane is responsible for programmability, i.e., changing the behaviour of the data plane at run-time.

agent beats control data ebpf elasticsearch log logstash management programmability security

Last synced: 06 Apr 2025

https://github.com/thamerh/web-scraper-with-node.js-and-cheerio

used simple exemple how Scraper data from Build a Web Scraper with Node.js and Cheerio

cheer data expressjs nodejs scarper webscraping

Last synced: 08 Apr 2026

https://github.com/intercloud/gotsgen

Golang Time Series Data Generator

data generator golang library timeseries

Last synced: 20 Jun 2025

https://github.com/nix1707/webscrapper-browserextension

Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.

chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping

Last synced: 21 Jun 2025

https://github.com/rclement/romain-clement.net

Freelance Software Engineer & Trainer

data freelancer machine-learning mkdocs mkdocs-material python

Last synced: 21 Mar 2025

https://github.com/biglocalnews/upload-files

Upload comma-delimited files to biglocalnews.org in your GitHub Action

action actions archiving csv data data-journalism github-actions journalism news

Last synced: 27 Apr 2026

https://github.com/vijishmadhavan/parse-clip

A simple CLIP based project for combining images from multiple datasets.

clip data datacleaning dataexploration dataset fastai image python

Last synced: 14 May 2026

https://github.com/urunov/algorithms

algorithm, data structure, dynamic array, dynamic programming

algorithms algorithms-and-data-structures data dynamic-programming

Last synced: 20 Mar 2025

https://github.com/nickmcintyre/processing-netcdf

Simple access to scientific datasets with Processing

data netcdf processing

Last synced: 11 Apr 2025

https://github.com/monfireboose/monfireboose

A lightweight JavaScript library that provides a high level and model based API for interacting with Firebase.

api data database firebase firestore high-level-api interact javascript library model storage

Last synced: 18 Feb 2026

https://github.com/owsas/open-categories

Open Categorization system, available as a node module

categories categorization categorize data data-structures node open-source typescript yaml

Last synced: 30 Apr 2025

https://github.com/strmprivacy/docs

With STRM Privacy you can easily build privacy-by-design data pipelines and define data contracts to encode privacy inside your data. Data streams are pseudonymised or anonymised in real-time or batch. These are our docs.

data documentation docusaurus privacy privacy-enhancing-technologies

Last synced: 12 Jul 2025

https://github.com/codenoid/lazy-mongo

Insert data to mongo from text plain or file

crystal crystal-language data database mongoclient mongodb

Last synced: 13 Apr 2026

https://github.com/emrecpp/datapacket-csharp

Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C#.

compress data deserialization deserialize deserializer encrypt packet send serialization serialize serializer socket

Last synced: 15 Sep 2025

https://github.com/schluppeck/ng-data-club

Nottingham Psychology data club resources

analysis data julialang maths matlab python r

Last synced: 10 Sep 2025

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 08 Apr 2026

https://github.com/ebsco/builde

Open source Bibframe vocabulary files

bibframe data libraries linked

Last synced: 08 Mar 2026

https://github.com/themitosan/grpp

GRPP is a simple tool written in TS that helps preserving git repositories.

cli data git grpp linux preservation project repo repository

Last synced: 15 Jul 2025

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 03 Apr 2025

https://github.com/abdussattar-70/oop-school-library

The OOP-School-Library project demonstrates the principles of data abstraction, inheritance, encapsulation, and polymorphism, which are fundamental concepts in object-oriented programming(OOP).

abstraction data encapsulation inheritance polymorphism rubocop-configuration ruby

Last synced: 29 Mar 2025

https://github.com/alexgustafsson/systembolaget-api-data

An up to date data mirror of Systembolaget's APIs

data data-science sweden systembolaget

Last synced: 28 Oct 2025

https://github.com/v4ss3ur/hierarchicaldatagrid.wpf

A WPF control that mix DataGrid and TreeView functionalities, allowing for hierarchical, recursive data display with expandable nested rows. Ideal for complex data structures in an easy-to-use, MVVM-friendly tabular format.

controls data datagrid hierarchical hierarchical-data mvvm nested nested-objects nested-structures treeview wpf xaml

Last synced: 13 May 2025

https://github.com/oliver021/ecmalinq

The linq runtime and support to typescript/javascript ecosystem

collection data iterable iteration javascript library linq linq-expressions nodejs query stream stream-data structure typescript

Last synced: 13 May 2025

https://github.com/csengupta1101/dig-student-files

This Repository will contain all student submissions at one place.

data datascience education machine-learning python students visualization

Last synced: 17 Jul 2025

https://github.com/techiaith/brawddegau-tagiedig

Corpws o frawddegau CC0 mewn fformat jsonl, gyda rhannau ymadrodd y tocynnau (geiriau etc.) wedi'u tagio â thagiau Universal Dependencies. // A Corpus of CC0 sentences in the jsonl format, tagged with Universal Dependency part-of-speech tags.

annotated cc0 commonvoice data nlp welsh

Last synced: 17 Jan 2026

https://github.com/muradisazade777/vaultedge

**VaultEdge** is a secure, modular, and scalable backend system built with C#. It provides robust user authentication, encrypted vault storage, and a clean RESTful API architecture.

api backend backend-api backend-server backend-service core csharp data json json-server testing token

Last synced: 29 Oct 2025

https://github.com/yashika-malhotra/data-exploration-and-visualization-for-streaming-platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 18 Apr 2026

https://github.com/rcorrero/light-pipe

A high-level syntax for data pipelines, designed to make pipeline development quick and painless.

data data-pipelines data-processing geospatial-analysis geospatial-processing pipeline

Last synced: 14 Dec 2025

https://github.com/opendatablend/opendatablend-py

The fastest way to get data from the Open Data Blend Dataset API

data data-engineering data-science dataset frictionless-data frictionlessdata koalas pandas python

Last synced: 14 Dec 2025

https://github.com/mbrn/dbmixer

A project that mask database columns by several algorithms

data database mask security

Last synced: 19 Jul 2025

https://github.com/justunsix/debezium-tests

Testing different Debezium development environment set ups

azure capture cdc change data debezium kafka mssql openshift sql streaming

Last synced: 19 May 2026

https://github.com/aymericzip/api-refetch

Alternative to SWC or react-query. Hook that store your API calls and provide states as isLoading, isFetched, data, error. Allow to instantly fetch the API when the hook is mounted. Provide retry and revalidation options.

api async autofetch cache data fetch loading react-query retry revalidate session-storage state store swr zustand

Last synced: 11 Apr 2025

https://github.com/ultreon/ubo

NBT inspired data I/O. Made for games.

api binary-data data data-storage file-type game-data io library ubo

Last synced: 16 Jun 2025

https://github.com/noi-techpark/it.bz.opendatahub.sparql

The Virtual Knowledge Graph of the Open Data Hub

data graph hub knowledge open sparql

Last synced: 12 Jan 2026

https://github.com/deveripon/assignment-6-assets

This assets is only for Reactive Accelarator Batch 2 - Assignment 6

data images recipe

Last synced: 30 Apr 2025

https://github.com/vasturiano/data-bind-mapper

Bind data arrays with any type of JS objects

bind data digest joins mapper performance

Last synced: 26 Jul 2025

https://github.com/mbanq/dupe

Fake banking data for your front- or backend

backend data datagenerator fake faker frontend javascript nodejs npm npm-package

Last synced: 13 May 2025

https://github.com/hoanganhngo610/introduction-r-packages

This repository is an introduction to the most essential packages in R programming, for the sake of satisfying any demand and customised work flow

data packages r tidyverse

Last synced: 28 Jun 2025

https://github.com/edgardleal/thanos-for-data

A Thanos implementation to restore the balance of your data

data tests

Last synced: 15 Jun 2025

https://github.com/randomgamingdev/mc_block_color_mapper

Python scripts & libraries for generating and mapping the average colors for each of the Minecraft blocks

average average-calculator cli data data-generator documented-api extract extract-data extractor fast minecraft python3 simple small texture texture-pack textures

Last synced: 22 May 2026

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 08 May 2025

https://github.com/FCC/contours-api-node

Enterprise Contours Node API

api contours data data-visualization geospatial gis map

Last synced: 27 Jul 2025

https://github.com/0xopenbytes/cache

📦 Simple in-memory key-value store

cache data json swift

Last synced: 29 Apr 2026

https://github.com/rpidanny/streamline.js

A JavaScript class that reads and processes a stream line-by-line in order.

big-data data data-processing file-stream javascript stream streams typescript

Last synced: 08 Sep 2025

https://github.com/njraladdin/newspapers-com-scraper

A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.

archive data newspapers scraper scraper-api scraping

Last synced: 06 Apr 2025

https://github.com/gianlucatruda/project_sleep

A Quantified Self project in which I use ±40 nights of data to determine what helps and hinders my sleep.

data experiment matplotlib python quantified science self sleep visualization

Last synced: 03 Apr 2025

https://github.com/equinor/data-marketplace

Easily find and check out data products

data product search

Last synced: 01 May 2025

https://github.com/kawai-senpai/potatodb

PotatoDB is a lightweight, file-based NoSQL database for Python projects, designed for easy setup and use in small-scale applications. Ideal for developers seeking simple data persistence without the complexity of traditional databases.

data database easy-to-use file-based json key-value lightweight nosql nosql-database persistence python simple

Last synced: 23 Oct 2025

https://github.com/guiferviz/tuberia

Data engineering meets software engineering

data data-engineering expectations pipeline python spark

Last synced: 08 Mar 2026

https://github.com/pseudomuto/iceberg-rest-go

A Go client library for working with Iceberg Rest catalogs

client data go iceberg

Last synced: 25 Jan 2026

https://github.com/weecology/ratdat

R package version of Portal Project Teaching Database

data database ecology teaching teaching-data

Last synced: 17 Feb 2026

https://github.com/olajideolagunju/gcp_mage_data_pipeline

An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.

automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders

Last synced: 07 Mar 2025

https://github.com/klevu/feed

Klevu Feed Format (Feed V2) to generate data

data feed xml

Last synced: 06 Apr 2026

https://github.com/justintime50/dad

Dummy Address Data (DAD) - Real addresses from all around the world.

address addresses country dad data dummy dummy-data json real world

Last synced: 18 Feb 2026

https://github.com/divinemonk/dataentrywebapp

Data Entry Web App is a lightweight web application built with Flask, a Python web framework, designed to streamline data entry and management processes. It provides a user-friendly interface for efficient data entry, viewing, editing, and deletion.

data data-entry flask flask-application production production-server web webapp

Last synced: 13 Apr 2025

https://github.com/xtlsoft/xdo

[DEPRECATED] XDO is a fast,light PHP Data Object. Includes DB,Cache,Upload.

cache data database php upload web

Last synced: 05 Apr 2025

https://github.com/fabriquebeweb/dao

Le 'Data Access Object' pour les nuls !

dao data npm package

Last synced: 18 Feb 2026

https://github.com/bastianolea/economia_chile

Indicadores económicos de Chile, actualizados automáticamente cada día, incluyendo PIB, IPSA, IMACEC, IPC, UF, precio del cobre, inversión extranjera, y más

app chile data economia estado laboral meses social tiempo

Last synced: 04 Jul 2025