An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/devscast/cd-data

important background data for the creation of a solution for the DRC

congo congo-kinshasa data data-science json rdata rdc rdc-data

Last synced: 06 Apr 2025

https://github.com/ybelenko/openapi-data-mocker

Library that generates fake data from OpenAPI 3.0 Spec

data fake faker mock mocker oas oas3 openapi swagger

Last synced: 11 Apr 2025

https://github.com/enkidevs/driveql

1. Sync your files from Google Drive. 2. access them with an automatically generated API

api data google-docs graphql

Last synced: 12 Apr 2025

https://github.com/nobrainr/axios-morphism

Axios plugin to transform data requests/responses based on a schema.

api axios client data http interceptor javascript network nodejs request response transform typescript

Last synced: 10 Jul 2025

https://github.com/moumen-soliman/hashed-device-fingerprint-js

A lightweight JavaScript/TypeScript package that generates device-specific hashed fingerprints for devices in both browser and server environments.

data device expressjs fingerprint fingerprinting javascript nodejs sha256 sha256-hash typescript

Last synced: 13 Apr 2025

https://github.com/chuongmep/ifc-to-excel

Convert Metadata From IFC To Excel

autodesk big-data data ifc ifc-excel ifc-viewer

Last synced: 30 Apr 2025

https://github.com/iamhosseindhv/lstm-classification

Comment toxicity classification using Karas/TensorFlow

classification cnn data data-mining keras lstm machine-learning python rnn tensorflow

Last synced: 08 May 2025

https://github.com/niklashenning/pyqtcountup

A simple numerical data animation library for PyQt and PySide labels

animation countup data label numerical pyqt pyqt5 pyqt6 pyside pyside2 pyside6 python

Last synced: 29 Oct 2025

https://github.com/castelao/directip

Iridium SBD Direct-IP (satellite communication)

communication-systems data remote-sensors satellite

Last synced: 11 Jul 2025

https://github.com/mewmix/gh_llm_loader

clone GitHub repositories and prepare their data for ingestion for LLMs.

context data data-structures github llm llm-training python

Last synced: 19 Sep 2025

https://github.com/jordan-iralde/probestojarvisai

JarvisIA es un sistema de Inteligencia Artificial autónomo diseñado para aprender, optimizar y ejecutar tareas complejas sin intervención humana. Combina automatización, clonación de voz, reconocimiento facial e interacción con sistemas operativos para crear una IA adaptable y eficiente, capaz de mejorar continuamente.

data ia-automator python react tensorflow

Last synced: 23 Jan 2026

https://github.com/nejdetkadir/turkish-taboo-words

It is includes turkish taboo words with different formats as JSON, XML and YAML (500+ words)

data json taboo taboo-word turkish words xml yaml yml

Last synced: 07 Sep 2025

https://github.com/emilyriederer/dbt-convo-covid

Demo repo with full code described in blog post

controlled-vocabulary data dbt sql variable-names

Last synced: 26 Oct 2025

https://github.com/lukasmosser/geolink_dataset

Analysis notebooks for the geolink well log dataset

data datasets eda facies geology lithology machine-learning petrophysics wireline

Last synced: 12 Mar 2026

https://github.com/psyteachr/reprores-v2

Data Skills for Reproducible Research

data education r rstudio tidyverse

Last synced: 19 Oct 2025

https://github.com/jvaleroliet/datosgobes

Manage data from Spain's open data portal (https://datos.gob.es/es/) easily. The library simplifies API access and streamlines workflows for Python users.

data datosgobes opendata python spain

Last synced: 07 Feb 2026

https://github.com/defgsus/parking-data

historic archive of free parking places across germany

archive csv data free historic parking spaces

Last synced: 02 Mar 2026

https://github.com/zdavatz/oddb2xml

oddb2xml, create xml files using refdata, swissmedic and bag xml files

bag data drug open refdata ruby source swissmedic switzerland xml

Last synced: 06 May 2026

https://github.com/zeljkovranjes/brute

Brute an application that monitors authentication attempts on your server that uses OpenSSH or any protocol. May not be practical, but it is cool.

application bruteforce-logger data data-tracking honeypot ipinfo rest-api rust

Last synced: 13 Apr 2026

https://github.com/joeyism/jsonl-to-conll

A simple tool to convert JSONL to CONLL

conll convert converter data etl jsonl learning machine train training

Last synced: 11 Oct 2025

https://github.com/dkv204p/c-programming

Welcome to the C-Programming repository! This repository is a comprehensive collection of resources, examples, and exercises for learning and mastering the C programming language.

algorithm and c c-enums c-file-handling c-functions c-programming c-programming-language c-structures c-tutorial data dsa dsa-in-c structure

Last synced: 02 Apr 2026

https://github.com/nrc-cnrc/nrc-gamma

Large labelled dataset of real-life gas meter images — Vaste ensemble d'images réelles et étiquetées de compteurs de gaz.

ai computer-vision creative-commons crowdsourcing data data-science datanalytics dataset iot machine-learning machinelearning opendata

Last synced: 05 Mar 2026

https://github.com/bloomberggraphics/2017-trump-llc-documents

Dataset of Trump LLC documents

data oge opendata trump

Last synced: 16 Oct 2025

https://github.com/realign/localstorage

Encapsulate a simple LocalStorage Class

data json localstorage quick

Last synced: 07 Oct 2025

https://github.com/polyfrost/datastorage

Static data storage for Polyfrost projects.

data data-storage hypixel hypixel-api minecraft minecraft-mod

Last synced: 01 Mar 2026

https://github.com/chalk-ai/chalk-go

Go client for Chalk

data data-pipeline feature-engineering

Last synced: 28 Apr 2026

https://github.com/r3li4nt/datafaker

Generador de datos falsos.

data datafaker hacking kali-linux pentesting python

Last synced: 15 Oct 2025

https://github.com/finos/legend-community-delta

Combining best of open data standards with open source technologies

data database delta-lake spark

Last synced: 15 Oct 2025

https://github.com/bastianolea/delincuencia_chile

Visualizador web de estadísticas de delincuencia en Chile

app chile comunas data politica social tiempo

Last synced: 17 Oct 2025

https://github.com/educationaltestingservice/ies-writing-achievement-study-data

Data from an IES research study that explores the relationship between writing achievement and success at 4-year postsecondary institutions.

data features ies nlp writing

Last synced: 24 Jan 2026

https://github.com/dagshub/open-source-ml-datasets

This repository holds open source datasets for various machine learning domains with a link to download and use them

ai dagshub data data-engine dataset hacktoberfest hacktoberfest2023 machine-learning mlops open-source

Last synced: 19 Oct 2025

https://github.com/coryleach/unityinfotables

Unity package of ScriptableObjects for building static data info tables. Generates enum source code for easy access of table entry ids.

data enum scriptableobject scriptableobjects unity unity-plugin unity-scripts unity3d unity3d-plugin unitypackage

Last synced: 24 Oct 2025

https://github.com/ikstream/dalec

Dalec is a project that aims to provide a privacy preserving data collection method. It utilizes DNS for client/server seperation while transmiting data encrypted

collection data data-collection dns exfiltration shell

Last synced: 11 Aug 2025

https://github.com/upsonic/server

Self-Driven Autonomous Python Libraries

data data-science gpt-4o library-management ml mlops python

Last synced: 22 Aug 2025

https://github.com/anandchowdhary/everything

⏳ Everything Everywhere All at Once

api data github-actions json

Last synced: 30 Jul 2025

https://github.com/samedwardes/pydatafaker

A python package to create fake data with relationships between tables.

data data-science fake-data python

Last synced: 23 Apr 2025

https://github.com/pysat/pysatmodels

Interface for model analysis and model-data comparisons within the pysat ecosystem

comparison data dineof model pysat python sami2 tie-gcm validation

Last synced: 12 Mar 2026

https://github.com/anuraganalog/try-every-ml-algorithm

Trying every Machine learning algorithm on a given dataset and measuring the efficiency.

accuracy algorithms analysis classification data deep-learning efficiency learning machine metrics neural-networks regression streamlit

Last synced: 04 Jul 2025

https://github.com/soasis/encoding_tables

Shared tables between C and C++ for encoding infrastructure

c cpp data encoding

Last synced: 09 Apr 2025

https://github.com/HowTheyVote/data

Weekly updated data on roll-call-votes in the European Parliament, as collected by https://howtheyvote.eu/.

data ep eu european parliament plenary votes

Last synced: 12 Aug 2025

https://github.com/pravj/github-dynamics

Source code for "Information Dynamics on the GitHub Network"

data open-source visualization

Last synced: 30 Jul 2025

https://github.com/half0wl/datagovsg_api

Unofficial Python API wrapper for public APIs at developers.data.gov.sg

api-wrapper data government-data python singapore

Last synced: 13 Aug 2025

https://github.com/ttimbers/canlang

R Data package for Canadian language data collected via the Canadian Census.

census data language package r

Last synced: 04 Apr 2025

https://github.com/seanbreckenridge/HPI-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 29 Jul 2025

https://github.com/remi-gau/bids_cookbook

A simple recipe to convert your neuroimaging data into a BIDS dataset.

bids data eeg fmri matlab meg spm12 tutorial

Last synced: 23 Apr 2025

https://github.com/cmpadden/dagster.nvim

Control the Dagster orchestrator directly from Neovim

dagster data neovim plugins

Last synced: 15 Mar 2025

https://github.com/p0dalirius/windowsbuilds

This repository contains the list of windows builds as parsable JSON files.

builds data json windows

Last synced: 03 Sep 2025

https://github.com/0x8b/datamatrix

Library that enables programs to write Data Matrix barcodes of the modern ECC200 variety.

cli code data data-matrix datamatrix ecc ecc200 elixir generator matrix

Last synced: 26 Sep 2025

https://github.com/andredarcie/best-games-of-all-time-data-based

🏆 Definite Best Games Of All Time Data Based by multiple sources

best critics data dataset game rank video-game video-games web-crawling web-scraping

Last synced: 28 Apr 2025

https://github.com/effect-deprecated/monocle

Optics for your data (port of monocle-ts)

data functional optics

Last synced: 23 Apr 2025

https://github.com/rtmigo/tabular_dart

Dart library for displaying tabular data in a visually appealing ASCII table format.

ascii dart data flutter formatting github markdown prettytable pubdev readme spreadsheet table tabulate

Last synced: 10 Jul 2025

https://github.com/oobianom/quickcode

An R package made out of mine and Brice's scrapbook of much needed functions.

colors cran data distributions images r

Last synced: 26 May 2026

https://github.com/colinianking/sluice

Sluice is a program that reads input on stdin and outputs on stdout at a specified data rate.

data rate-limiting transfer

Last synced: 13 Feb 2026

https://github.com/markpflug/sylvan.tools.etl

Extract transform load command line tools.

csv data etl sqlite sqlserver

Last synced: 24 Apr 2025

https://github.com/ghodsizadeh/household-survey-data-iran

Household Survey Data Iran but in sqlite3(to work with data easily and everywhere)

data iran iran-data

Last synced: 04 Oct 2025

https://github.com/rririanto/unstructured-demo-streamlit

Extract your docs (CSV, PDF, JSON, HTML, DOCS, Sheets and more) for your own GPT and LLM projects using Unstructured.io via streamlit

ai data data-extraction gpt unstructured unstructured-data

Last synced: 09 Apr 2025

https://github.com/node/circos

Mirror of cricos. "Circos is a software package for visualizing data and information. It visualizes data in a circular layout — this makes Circos ideal for exploring relationships between objects or positions. "

circos data visualization

Last synced: 19 Apr 2025

https://github.com/secnot/leaky_diode

Leaky diode is a data exfiltration test tool for data diodes.

cybersecurity data diodes exfiltration pentesting

Last synced: 17 Jan 2026

https://github.com/gagolews/clustering-data-v1

A framework for benchmarking clustering algorithms – Benchmark suite, version 1

benchmark benchmark-datasets clustering data dataset datasets machine-learning

Last synced: 19 Apr 2025

https://github.com/miku/workshops

A level of indirection.

data golang python talks workshop

Last synced: 11 Apr 2025

https://github.com/disruptek/skiplists

generic skip list implementations💃

collection data linked list nim search skip structure

Last synced: 09 Apr 2025

https://github.com/orico/flexeegile

Extending Agile For AI & Data Teams

agile ai data data-science flexeegile methodology

Last synced: 08 Jan 2026

https://github.com/marcoradocchia/microxdg

An XDG Base Directory Specification Rust library that aims to be conservative on memory allocation and overall memory footprint.

appdir basedir cache config data directories directory executable file home path runtime rust rust-lang state xdg xdg-basedir xdg-compliance xdg-user-dirs

Last synced: 23 Mar 2025

https://github.com/modmuss50/cursemapper

A tool to make graphs and stuff from downloads on curse

curseforge data google-charts gradle graph javafx js kotlin php

Last synced: 18 Jul 2025

https://github.com/openscm/scmdata

Handling of Simple Climate Model data

climate data data-visualization

Last synced: 27 Jul 2025

https://github.com/deiu/solidproxy

Proxy server with authentication (using WebID-TLS delegation)

data delegation linked proxy-server solid webid webid-tls-delegation

Last synced: 11 Jan 2026

https://github.com/texora/ssol-da

Data Availability for Solana Layer 2 Blockchain - SuperSol

data data-availability docs layer2 solana

Last synced: 10 Apr 2025

https://github.com/n1ghtf1re/map-of-emergency-incidents

Emergency Map allows you to effectively visualize multi-dimensional information, has an intuitive interface. The developed code is easily modified for use in a variety of areas. The use of color mixing technology enhances the perception and analysis of information

big-data big-data-analytics big-data-visualization bigdata color-mixing colors data data-analytics data-science data-visualization data-visualization-challenges data-visualization-simpler mysql open-source-project php student-project

Last synced: 18 Mar 2025

https://github.com/smac-group/imudata

:battery: This package is meant to serve as a data collection tool for IMU data. This data can be used as a means to assess and test methods designed to analyse IMU error signals (i.e. long and complex autocorrelated signals). An example method used for this kind of data is implemented in the GMWM R package which can also model the latent models that often characterize this data.

accelerometer data gyroscope imu mems-imu-dataset r

Last synced: 22 Apr 2025

https://github.com/hyparam/hightable

A dynamic windowed scrolling table component for react

component data dataset grid javascript react scrolling table virtualized windowed

Last synced: 14 May 2025

https://github.com/dhhruv/fleck-pro

Prototype of Data Hiding Software for your personal files in your Laptop/PC. (Windows OS)

batch batch-script batchfile cli data data-hiding fleck-pro framework hacktoberfest hacktoberfest2021 security shell software terminal windows

Last synced: 03 May 2025

https://github.com/arverma/data_diode

A unidirectional network (also referred to as a unidirectional security gateway or data diode ) is a network appliance or device allowing data to travel only in one direction. It is used in guaranteeing information security. They are most commonly found in high security environments such as defense, where they serve as connections between two or more networks of differing security classification – also known as a "cross domain solution." This technology is also found at the industrial control level for such facilit ies as nuclear power plants, electric power generation/distribution, oil and gas production, water/wastewater, airplanes (between flight control units and in - flight entertainment systems), and manufacturing.

c client client-server client-server-architecture data data-diode diode networking server socket-programming

Last synced: 23 Aug 2025

https://github.com/malloydata/malloy-cli

A command-line interface for executing Malloy and SQL

data data-modeling transformation

Last synced: 13 Jul 2025

https://github.com/endermanbugzjfc/configstruct

Type and shape system for arrays. Help write clearer code when implementing configs for your PocketMine-MP plugin or composer project.

array config data pm pmmp pocketmine shape struct structure virion

Last synced: 23 Oct 2025

https://github.com/chifisource/parsenoteval.jl

Expands the usage of Base.parse to work with more Base structures.

data data-structures evaluator julia parse parsing

Last synced: 13 Apr 2025

https://github.com/nikitavoloboev/data

All kinds of data

data

Last synced: 26 Mar 2025

https://github.com/datapreprocessing/datacleaning

Data Cleaning is a python package for data preprocessing. This cleans the CSV file and returns the cleaned data frame. It does the work of imputation, removing duplicates, replacing special characters, and many more.

data data-cleaning data-cleansing data-preprocessing data-wrangling imputation python threshold

Last synced: 14 Dec 2025