An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/thearyadev/top500-aggregator

A suite of tools and a web service to collect and provide data on the Overwatch 2 Top 500 leaderboards.

data overwatch

Last synced: 16 Jan 2026

https://github.com/michalporeba/odis

Search in decentralised systems. Search federation, result moderation, aggregation and feedback with hypermedia in a ReSTful API to round it all off.

data data-discovery discoverability federated information-discovery mesh-networks search

Last synced: 18 Jan 2026

https://github.com/turbot/steampipe-export

Steampipe Export is a zero-ETL CLI to fetch data from cloud services and APIs. Hundreds of plugins with thousands of documented examples.

aws azure backup data devsecops etl gcp golang kubernetes security steampipe steampipe-engine zero-etl

Last synced: 31 Jul 2025

https://github.com/devscast/cd-data

important background data for the creation of a solution for the DRC

congo congo-kinshasa data data-science json rdata rdc rdc-data

Last synced: 06 Apr 2025

https://github.com/pondpilot/pondpilot

A lightweight local-first SQL analytics tool. Get your data 🦆 in a row

analytics data duckdb sql wasm

Last synced: 08 Apr 2025

https://github.com/mara/mara-mondrian

A Python integration for the Saiku ad hoc analysis tool

adhoc-analysis data mara mondrian mondrian-olap-engine python reporting saiku

Last synced: 30 Apr 2025

https://github.com/casbin/confita

An open-source version of Kaggle written in Go and React

casbin casdoor conference data go javascript kaggle react

Last synced: 09 Aug 2025

https://github.com/nrennie/national-highways

R package for accessing the National Highways WebTRIS API via R.

data r r-package

Last synced: 14 Aug 2025

https://github.com/mewmix/gh_llm_loader

clone GitHub repositories and prepare their data for ingestion for LLMs.

context data data-structures github llm llm-training python

Last synced: 19 Sep 2025

https://github.com/wahyudesu/predicting-hotel-booking-cancellations

This project will help hotel managers optimize their booking policies, reduce cancellations, and improve revenue.

data data-analysis data-science python

Last synced: 07 Jul 2025

https://github.com/katahiromz/ads

ADS (alternate data stream) manipulation

ads alternate cxx data files ntfs stream win32 win32api

Last synced: 05 May 2025

https://github.com/globalgov/manydata

The portal for global governance data

data dataset r r-package

Last synced: 16 Apr 2025

https://github.com/flor91/data-structures-and-algorithms

Theory and Implementation of Data Structures and Algorithms using Python

algorith data data-structures python

Last synced: 19 Apr 2025

https://github.com/lukasmosser/oklahomaproductiondata

A repository of Oklahoma oil and gas (O&G) production data, formatted for machine learning.

data data-mining energy machine-learning

Last synced: 03 Aug 2025

https://github.com/edoardottt/postgressql-db

Simple implementations of some PostgreSQL databases for practicing conceptual analysis of requirements, relational database design, and SQL queries.

data database pgplsql pgsql plsql postgres postgresql postgresql-database rdbm rdbms relational-databases sql

Last synced: 27 Oct 2025

https://github.com/chalk-ai/chalk-ts

TypeScript client for working with Chalk

chalk data feature-engineering pipelines typescript

Last synced: 17 Jan 2026

https://github.com/lens-vm/spec

LensVM specifications and ABI definition

abi data interoperability lenses schema transformations web-assembly

Last synced: 12 Jan 2026

https://github.com/datadesk/calfire-wildfires

Download wildfires data from CalFire

cli data data-journalism geojson journalism news python wildfires

Last synced: 05 Jan 2026

https://github.com/AurelienAubry/Spotlight

Spotlight is a Spotify dashboard that lets users visualize their listening habits.

backend bootstrap chartjs data data-analysis data-science data-visualization flask frontend javascript js pandas python python3 react react-bootstrap spotify

Last synced: 15 Apr 2025

https://github.com/mittwald/react-use-promise

Simple and declarative use of Promises in your React components. Observe their state and refresh them in various advanced ways.

api async data fetch hooks promise react suspense

Last synced: 01 Jul 2025

https://github.com/bloomberggraphics/2017-trump-llc-documents

Dataset of Trump LLC documents

data oge opendata trump

Last synced: 16 Oct 2025

https://github.com/joeyism/jsonl-to-conll

A simple tool to convert JSONL to CONLL

conll convert converter data etl jsonl learning machine train training

Last synced: 11 Oct 2025

https://github.com/danieljdufour/xdim

Multi-Dimensional Functions. Create, Query, and Transform Multi-Dimensional Data.

array binary data dimensions format formatter functions image javascript js layout math multidimensional ndarray rearrange reorganize reshape shape theory

Last synced: 13 Jun 2025

https://github.com/emilyriederer/dbt-convo-covid

Demo repo with full code described in blog post

controlled-vocabulary data dbt sql variable-names

Last synced: 26 Oct 2025

https://github.com/glassflow/glassflow-python-sdk

GlassFlow Python SDK to publish and consume data to your pipelines at Glassflow.dev

data data-processing datastreaming python real-time sdk stream-processing

Last synced: 12 Oct 2025

https://github.com/yussufbiyik/how-much-time-you-spent-listening-music-on-spotify

A Python script to calculate your listening time with your data from Spotify

data listening music python spotfiy

Last synced: 22 Apr 2025

https://github.com/nejdetkadir/turkish-taboo-words

Includes 500+ Turkish taboo words in JSON, XML, and YAML formats.

data json taboo taboo-word turkish words xml yaml yml

Last synced: 07 Sep 2025

https://github.com/jordan-iralde/probestojarvisai

JarvisIA is an autonomous artificial intelligence system designed to learn, optimize, and execute complex tasks without human intervention. It combines automation, voice cloning, facial recognition, and interaction with operating systems to create an adaptable and efficient AI capable of continuous improvement.

data ia-automator python react tensorflow

Last synced: 23 Jan 2026

https://github.com/dsdanielpark/arxiv2text

Converting PDF files to text, mainly with a focus on arXiv papers.

crawling data datamining translation

Last synced: 05 Sep 2025

https://github.com/klaudiosinani/binoheap

Binomial heaps for ES6

binomial data es6 heap structure typescript

Last synced: 12 Jun 2025

https://github.com/nrc-cnrc/nrc-gamma

Large labelled dataset of real-life gas meter images.

ai computer-vision creative-commons crowdsourcing data data-science datanalytics dataset iot machine-learning machinelearning opendata

Last synced: 01 Dec 2025

https://github.com/thomwright/balamb

🌱 Concurrently run a set of dependent, asynchronous tasks with type-safe dependencies

concurrent dag dags data data-seeding dependency-injection di seed seeding tasks

Last synced: 07 May 2025

https://github.com/cipherstash/protectjs

Encrypt and protect data using industry standard algorithms, field level encryption, a unique data key per record, bulk encryption operations, and decryption level identity verification. Powered by CipherStash Encryption.

data data-security encryption javascript postgres postgresql security typescript

Last synced: 29 Oct 2025

https://github.com/lifyzer/data-parser-system

:apple: Simple script that parses data from open source databases to the standard Lifyzer database structure :green_apple:

data data-parser databases food food-data health ingredients lifyzer nutrition parsed-data parser parses-data

Last synced: 09 Apr 2025

https://github.com/apache/airavata-data-lake

Apache Airavata Data Lake

airavata apache data lake mft

Last synced: 22 Apr 2025

https://github.com/psyteachr/reprores-v2

Data Skills for Reproducible Research

data education r rstudio tidyverse

Last synced: 19 Oct 2025

https://github.com/evidence-dev/evidence-vscode

The official VS Code extension for Evidence projects.

analytics data evidence markdown sql vscode

Last synced: 23 Apr 2025

https://github.com/qzcool/cpef

Data query API for private equity fund managers. Chinese Private Equity Funds APIs.

china crawler data finance fund funds hedge-funds private-equity python python3 scraper scraping-websites spider

Last synced: 11 Jul 2025

https://github.com/dkv204p/c-programming

Welcome to the C-Programming repository! This repository is a comprehensive collection of resources, examples, and exercises for learning and mastering the C programming language.

algorithm and c c-enums c-file-handling c-functions c-programming c-programming-language c-structures c-tutorial data dsa dsa-in-c structure

Last synced: 28 Aug 2025

https://github.com/lukasmosser/geolink_dataset

Analysis notebooks for the geolink well log dataset

data datasets eda facies geology lithology machine-learning petrophysics wireline

Last synced: 15 Apr 2025

https://github.com/klaudiosinani/shtack

LIFO Stacks for ES6

data es6 lifo stack structure typescript

Last synced: 24 Apr 2025

https://github.com/zdavatz/oddb2xml

oddb2xml creates XML files from refdata, swissmedic and bag XML files

bag data drug open refdata ruby source swissmedic switzerland xml

Last synced: 19 Oct 2025

https://github.com/coryleach/unityinfotables

Unity package of ScriptableObjects for building static data info tables. Generates enum source code for easy access of table entry ids.

data enum scriptableobject scriptableobjects unity unity-plugin unity-scripts unity3d unity3d-plugin unitypackage

Last synced: 24 Oct 2025

https://github.com/soasis/encoding_tables

Shared tables between C and C++ for encoding infrastructure

c cpp data encoding

Last synced: 09 Apr 2025

https://github.com/bastianolea/delincuencia_chile

Web viewer for crime statistics in Chile

app chile comunas data politica social tiempo

Last synced: 17 Oct 2025

https://github.com/remi-gau/bids_cookbook

A simple recipe to convert your neuroimaging data into a BIDS dataset.

bids data eeg fmri matlab meg spm12 tutorial

Last synced: 23 Apr 2025

https://github.com/educationaltestingservice/ies-writing-achievement-study-data

Data from an IES research study that explores the relationship between writing achievement and success at 4-year postsecondary institutions.

data features ies nlp writing

Last synced: 24 Jan 2026

https://github.com/miku/workshops

A level of indirection.

data golang python talks workshop

Last synced: 11 Apr 2025

https://github.com/wdataorg/wdata

A database with multiple data sets that support drawing. The data sets include: world population, world carbon dioxide concentration, world number of cities, China population, China number of space vehicles, ...

chinese data database pip pypi python3

Last synced: 25 Mar 2025

https://github.com/realign/localstorage

Encapsulate a simple LocalStorage Class

data json localstorage quick

Last synced: 07 Oct 2025

https://github.com/anuraganalog/try-every-ml-algorithm

Trying every Machine learning algorithm on a given dataset and measuring the efficiency.

accuracy algorithms analysis classification data deep-learning efficiency learning machine metrics neural-networks regression streamlit

Last synced: 04 Jul 2025

https://github.com/rtmigo/tabular_dart

Dart library for displaying tabular data in a visually appealing ASCII table format.

ascii dart data flutter formatting github markdown prettytable pubdev readme spreadsheet table tabulate

Last synced: 10 Jul 2025

https://github.com/andredarcie/best-games-of-all-time-data-based

🏆 Definitive best games of all time, based on data from multiple sources

best critics data dataset game rank video-game video-games web-crawling web-scraping

Last synced: 28 Apr 2025

https://github.com/rririanto/unstructured-demo-streamlit

Extract your docs (CSV, PDF, JSON, HTML, DOCS, Sheets and more) for your own GPT and LLM projects using Unstructured.io via streamlit

ai data data-extraction gpt unstructured unstructured-data

Last synced: 09 Apr 2025

https://github.com/orico/flexeegile

Extending Agile For AI & Data Teams

agile ai data data-science flexeegile methodology

Last synced: 08 Jan 2026

https://github.com/finos/legend-community-delta

Combining best of open data standards with open source technologies

data database delta-lake spark

Last synced: 15 Oct 2025

https://github.com/dagshub/open-source-ml-datasets

This repository holds open source datasets for various machine learning domains with a link to download and use them

ai dagshub data data-engine dataset hacktoberfest hacktoberfest2023 machine-learning mlops open-source

Last synced: 19 Oct 2025

https://github.com/samedwardes/pydatafaker

A Python package to create fake data with relationships between tables.

data data-science fake-data python

Last synced: 23 Apr 2025

https://github.com/r3li4nt/datafaker

Fake data generator.

data datafaker hacking kali-linux pentesting python

Last synced: 15 Oct 2025

https://github.com/disruptek/skiplists

generic skip list implementations💃

collection data linked list nim search skip structure

Last synced: 09 Apr 2025

https://github.com/secnot/leaky_diode

Leaky diode is a data exfiltration test tool for data diodes.

cybersecurity data diodes exfiltration pentesting

Last synced: 17 Jan 2026

https://github.com/n1ghtf1re/map-of-emergency-incidents

Emergency Map lets you effectively visualize multi-dimensional information through an intuitive interface. The code is easily modified for use in a variety of areas. Color mixing enhances the perception and analysis of information.

big-data big-data-analytics big-data-visualization bigdata color-mixing colors data data-analytics data-science data-visualization data-visualization-challenges data-visualization-simpler mysql open-source-project php student-project

Last synced: 18 Mar 2025

https://github.com/instafluff/coronavirus

COVID-19 Coronavirus Data Tracker

2019-ncov coronavirus covid-19 data ncov ncov-2019 sars-cov-2 wuhan

Last synced: 29 Oct 2025

https://github.com/cloudposse/terraform-aws-dms

Terraform modules for provisioning and managing AWS DMS resources

data dms dynamodb migration mysql oracle postgres postgresql s3 sql sql-server

Last synced: 09 Sep 2025

https://github.com/nikitavoloboev/data

All kinds of data

data

Last synced: 26 Mar 2025

https://github.com/akeneo/transporteo

Migration Tool for Akeneo PIM from 1.7 to 2.0

akeneo akeneo-pim data migration php symfony

Last synced: 29 Jul 2025

https://github.com/killovsky/trendings

Repository for the scraper module that retrieves Twitter trends.

api countries data hashtag information module network nodejs scraper scraping social social-network tendencia topics trending trends trends24 twitter world

Last synced: 07 Jul 2025

https://github.com/motasimfoad/emr

“EMR” is a platform built with leading-edge web technologies and APIs to help doctors, patients, hospitals, and pharmacies deal better with medical documentation.

apollo data doctor emr graphcool graphql hospital medical patients pharmacy reactjs record yarn

Last synced: 10 Apr 2025

https://github.com/arverma/data_diode

A unidirectional network (also referred to as a unidirectional security gateway or data diode) is a network appliance or device that allows data to travel in only one direction. It is used to guarantee information security. Data diodes are most commonly found in high-security environments such as defense, where they serve as connections between two or more networks of differing security classification – also known as a "cross domain solution." This technology is also found at the industrial control level for such facilities as nuclear power plants, electric power generation/distribution, oil and gas production, water/wastewater, airplanes (between flight control units and in-flight entertainment systems), and manufacturing.

c client client-server client-server-architecture data data-diode diode networking server socket-programming

Last synced: 23 Aug 2025

https://github.com/purarue/hpi-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 18 Mar 2025

https://github.com/modmuss50/cursemapper

A tool to make graphs and stuff from downloads on curse

curseforge data google-charts gradle graph javafx js kotlin php

Last synced: 18 Jul 2025

https://github.com/pysat/pysatmodels

Interface for model analysis and model-data comparisons within the pysat ecosystem

comparison data dineof model pysat python sami2 tie-gcm validation

Last synced: 22 Jul 2025

https://github.com/stas00/reddit-to-threads

Convert arctic_shift Reddit data dumps into thread-view documents

conversion data dump reddit

Last synced: 20 Mar 2025

https://github.com/texora/ssol-da

Data Availability for Solana Layer 2 Blockchain - SuperSol

data data-availability docs layer2 solana

Last synced: 10 Apr 2025

https://github.com/cecoeco/htmltables.jl

Julia package for reading and writing HTML tables using the Tables.jl interface

css data html http julia table tables

Last synced: 11 Apr 2025

https://github.com/purarue/HPI-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 01 May 2025

https://github.com/chifisource/parsenoteval.jl

Expands the usage of Base.parse to work with more Base structures.

data data-structures evaluator julia parse parsing

Last synced: 13 Apr 2025

https://github.com/SamEdwardes/pydatafaker

A Python package to create fake data with relationships between tables.

data data-science fake-data python

Last synced: 09 Jul 2025