An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/moumen-soliman/hashed-device-fingerprint-js

A lightweight JavaScript/TypeScript package that generates device-specific hashed fingerprints for devices in both browser and server environments.

data device expressjs fingerprint fingerprinting javascript nodejs sha256 sha256-hash typescript

Last synced: 13 Apr 2025

https://github.com/chuongmep/ifc-to-excel

Convert Metadata From IFC To Excel

autodesk big-data data ifc ifc-excel ifc-viewer

Last synced: 30 Apr 2025

https://github.com/enkidevs/driveql

1. Sync your files from Google Drive. 2. Access them with an automatically generated API.

api data google-docs graphql

Last synced: 12 Apr 2025

https://github.com/reycn/data-analytics-in-julia

Notebooks for data analysis in social science using Julia, replicating frequent analytical steps in Python & R.

data data-analysis data-science data-visualization julia

Last synced: 07 May 2025

https://github.com/gagniuc/prototype-software-for-photon-pixel-coupling

Photon-pixel coupling is a novel method that allows a parallel sampling of an unlimited number of sensors. In the case shown here, 200 sensors are sampled in parallel at video rate frequency. This implementation is done in Visual Basic 6.0 (VB6).

biosensors coupling curent data electronics led photon-pixel sampling sensors skin vb6 voltage webcam

Last synced: 08 Mar 2026

https://github.com/usgs/warc-iridium-sbd-decoder

Java library for decoding Iridium Short Burst Data packets.

burst data iridium java sbd short

Last synced: 21 Jun 2025

https://github.com/OpenCourseAPI/OwlAPI

An open source REST API written in Python to scrape and serve Foothill / De Anza course data :ledger:

api course data de-anza foothill myportal owl-api webscraping

Last synced: 13 Jul 2025

https://github.com/castelao/directip

Iridium SBD Direct-IP (satellite communication)

communication-systems data remote-sensors satellite

Last synced: 11 Jul 2025

https://github.com/mewmix/gh_llm_loader

Clone GitHub repositories and prepare their data for ingestion by LLMs.

context data data-structures github llm llm-training python

Last synced: 19 Sep 2025

https://github.com/iamhosseindhv/lstm-classification

Comment toxicity classification using Keras/TensorFlow

classification cnn data data-mining keras lstm machine-learning python rnn tensorflow

Last synced: 08 May 2025

https://github.com/zeljkovranjes/brute

Brute is an application that monitors authentication attempts on a server that uses OpenSSH or any other protocol. May not be practical, but it is cool.

application bruteforce-logger data data-tracking honeypot ipinfo rest-api rust

Last synced: 13 Apr 2026

https://github.com/joeyism/jsonl-to-conll

A simple tool to convert JSONL to CONLL

conll convert converter data etl jsonl learning machine train training

Last synced: 11 Oct 2025

https://github.com/jvaleroliet/datosgobes

Manage data from Spain's open data portal (https://datos.gob.es/es/) easily. The library simplifies API access and streamlines workflows for Python users.

data datosgobes opendata python spain

Last synced: 07 Feb 2026

https://github.com/defgsus/parking-data

Historic archive of free parking places across Germany

archive csv data free historic parking spaces

Last synced: 02 Mar 2026

https://github.com/nrc-cnrc/nrc-gamma

Large labelled dataset of real-life gas meter images.

ai computer-vision creative-commons crowdsourcing data data-science datanalytics dataset iot machine-learning machinelearning opendata

Last synced: 05 Mar 2026

https://github.com/lukasmosser/geolink_dataset

Analysis notebooks for the geolink well log dataset

data datasets eda facies geology lithology machine-learning petrophysics wireline

Last synced: 12 Mar 2026

https://github.com/dkv204p/c-programming

Welcome to the C-Programming repository! This repository is a comprehensive collection of resources, examples, and exercises for learning and mastering the C programming language.

algorithm and c c-enums c-file-handling c-functions c-programming c-programming-language c-structures c-tutorial data dsa dsa-in-c structure

Last synced: 02 Apr 2026

https://github.com/bloomberggraphics/2017-trump-llc-documents

Dataset of Trump LLC documents

data oge opendata trump

Last synced: 16 Oct 2025

https://github.com/emilyriederer/dbt-convo-covid

Demo repo with full code described in blog post

controlled-vocabulary data dbt sql variable-names

Last synced: 26 Oct 2025

https://github.com/nejdetkadir/turkish-taboo-words

Includes 500+ Turkish taboo words in several formats: JSON, XML, and YAML

data json taboo taboo-word turkish words xml yaml yml

Last synced: 07 Sep 2025

https://github.com/zdavatz/oddb2xml

oddb2xml creates XML files using Refdata, Swissmedic and BAG XML files

bag data drug open refdata ruby source swissmedic switzerland xml

Last synced: 06 May 2026

https://github.com/psyteachr/reprores-v2

Data Skills for Reproducible Research

data education r rstudio tidyverse

Last synced: 19 Oct 2025

https://github.com/jordan-iralde/probestojarvisai

JarvisIA is an autonomous artificial intelligence system designed to learn, optimize, and execute complex tasks without human intervention. It combines automation, voice cloning, facial recognition, and operating system interaction to create an adaptable, efficient AI capable of continuous improvement.

data ia-automator python react tensorflow

Last synced: 23 Jan 2026

https://github.com/finos/legend-community-delta

Combining best of open data standards with open source technologies

data database delta-lake spark

Last synced: 15 Oct 2025

https://github.com/realign/localstorage

Encapsulate a simple LocalStorage Class

data json localstorage quick

Last synced: 07 Oct 2025

https://github.com/coryleach/unityinfotables

Unity package of ScriptableObjects for building static data info tables. Generates enum source code for easy access of table entry ids.

data enum scriptableobject scriptableobjects unity unity-plugin unity-scripts unity3d unity3d-plugin unitypackage

Last synced: 24 Oct 2025

https://github.com/bastianolea/delincuencia_chile

Web viewer for crime statistics in Chile

app chile comunas data politica social tiempo

Last synced: 17 Oct 2025

https://github.com/polyfrost/datastorage

Static data storage for Polyfrost projects.

data data-storage hypixel hypixel-api minecraft minecraft-mod

Last synced: 01 Mar 2026

https://github.com/educationaltestingservice/ies-writing-achievement-study-data

Data from an IES research study that explores the relationship between writing achievement and success at 4-year postsecondary institutions.

data features ies nlp writing

Last synced: 24 Jan 2026

https://github.com/r3li4nt/datafaker

Fake data generator.

data datafaker hacking kali-linux pentesting python

Last synced: 15 Oct 2025

https://github.com/dagshub/open-source-ml-datasets

This repository holds open source datasets for various machine learning domains with a link to download and use them

ai dagshub data data-engine dataset hacktoberfest hacktoberfest2023 machine-learning mlops open-source

Last synced: 19 Oct 2025

https://github.com/chalk-ai/chalk-go

Go client for Chalk

data data-pipeline feature-engineering

Last synced: 28 Apr 2026

https://github.com/miku/workshops

A level of indirection.

data golang python talks workshop

Last synced: 11 Apr 2025

https://github.com/anandchowdhary/everything

⏳ Everything Everywhere All at Once

api data github-actions json

Last synced: 30 Jul 2025

https://github.com/secnot/leaky_diode

Leaky diode is a data exfiltration test tool for data diodes.

cybersecurity data diodes exfiltration pentesting

Last synced: 17 Jan 2026

https://github.com/markpflug/sylvan.tools.etl

Extract, transform, load (ETL) command-line tools.

csv data etl sqlite sqlserver

Last synced: 24 Apr 2025

https://github.com/HowTheyVote/data

Weekly updated data on roll-call-votes in the European Parliament, as collected by https://howtheyvote.eu/.

data ep eu european parliament plenary votes

Last synced: 12 Aug 2025

https://github.com/p0dalirius/windowsbuilds

This repository contains the list of windows builds as parsable JSON files.

builds data json windows

Last synced: 03 Sep 2025

https://github.com/pravj/github-dynamics

Source code for "Information Dynamics on the GitHub Network"

data open-source visualization

Last synced: 30 Jul 2025

https://github.com/ttimbers/canlang

R Data package for Canadian language data collected via the Canadian Census.

census data language package r

Last synced: 04 Apr 2025

https://github.com/anuraganalog/try-every-ml-algorithm

Trying every Machine learning algorithm on a given dataset and measuring the efficiency.

accuracy algorithms analysis classification data deep-learning efficiency learning machine metrics neural-networks regression streamlit

Last synced: 04 Jul 2025

https://github.com/marcoradocchia/microxdg

An XDG Base Directory Specification Rust library that aims to be conservative on memory allocation and overall memory footprint.

appdir basedir cache config data directories directory executable file home path runtime rust rust-lang state xdg xdg-basedir xdg-compliance xdg-user-dirs

Last synced: 23 Mar 2025

https://github.com/gagolews/clustering-data-v1

A framework for benchmarking clustering algorithms – Benchmark suite, version 1

benchmark benchmark-datasets clustering data dataset datasets machine-learning

Last synced: 19 Apr 2025

https://github.com/disruptek/skiplists

Generic skip list implementations 💃

collection data linked list nim search skip structure

Last synced: 09 Apr 2025

https://github.com/andredarcie/best-games-of-all-time-data-based

🏆 Definitive best games of all time, based on data from multiple sources

best critics data dataset game rank video-game video-games web-crawling web-scraping

Last synced: 28 Apr 2025

https://github.com/seanbreckenridge/HPI-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 29 Jul 2025

https://github.com/samedwardes/pydatafaker

A Python package to create fake data with relationships between tables.

data data-science fake-data python

Last synced: 23 Apr 2025

https://github.com/rtmigo/tabular_dart

Dart library for displaying tabular data in a visually appealing ASCII table format.

ascii dart data flutter formatting github markdown prettytable pubdev readme spreadsheet table tabulate

Last synced: 10 Jul 2025

https://github.com/colinianking/sluice

Sluice is a program that reads input on stdin and outputs on stdout at a specified data rate.

data rate-limiting transfer

Last synced: 13 Feb 2026

https://github.com/cmpadden/dagster.nvim

Control the Dagster orchestrator directly from Neovim

dagster data neovim plugins

Last synced: 15 Mar 2025

https://github.com/oobianom/quickcode

An R package built from my and Brice's scrapbook of much-needed functions.

colors cran data distributions images r

Last synced: 26 May 2026

https://github.com/pysat/pysatmodels

Interface for model analysis and model-data comparisons within the pysat ecosystem

comparison data dineof model pysat python sami2 tie-gcm validation

Last synced: 12 Mar 2026

https://github.com/ikstream/dalec

Dalec is a project that aims to provide a privacy-preserving data collection method. It uses DNS for client/server separation and transmits the data encrypted.

collection data data-collection dns exfiltration shell

Last synced: 11 Aug 2025

https://github.com/node/circos

Mirror of Circos. "Circos is a software package for visualizing data and information. It visualizes data in a circular layout; this makes Circos ideal for exploring relationships between objects or positions."

circos data visualization

Last synced: 19 Apr 2025

https://github.com/effect-deprecated/monocle

Optics for your data (port of monocle-ts)

data functional optics

Last synced: 23 Apr 2025

https://github.com/orico/flexeegile

Extending Agile For AI & Data Teams

agile ai data data-science flexeegile methodology

Last synced: 08 Jan 2026

https://github.com/0x8b/datamatrix

Library that enables programs to write Data Matrix barcodes of the modern ECC200 variety.

cli code data data-matrix datamatrix ecc ecc200 elixir generator matrix

Last synced: 26 Sep 2025

https://github.com/soasis/encoding_tables

Shared tables between C and C++ for encoding infrastructure

c cpp data encoding

Last synced: 09 Apr 2025

https://github.com/ghodsizadeh/household-survey-data-iran

Household survey data for Iran, packaged as SQLite3 (to work with the data easily and everywhere)

data iran iran-data

Last synced: 04 Oct 2025

https://github.com/rririanto/unstructured-demo-streamlit

Extract your docs (CSV, PDF, JSON, HTML, DOCS, Sheets and more) for your own GPT and LLM projects using Unstructured.io via Streamlit

ai data data-extraction gpt unstructured unstructured-data

Last synced: 09 Apr 2025

https://github.com/upsonic/server

Self-Driven Autonomous Python Libraries

data data-science gpt-4o library-management ml mlops python

Last synced: 22 Aug 2025

https://github.com/remi-gau/bids_cookbook

A simple recipe to convert your neuroimaging data into a BIDS dataset.

bids data eeg fmri matlab meg spm12 tutorial

Last synced: 23 Apr 2025

https://github.com/half0wl/datagovsg_api

Unofficial Python API wrapper for public APIs at developers.data.gov.sg

api-wrapper data government-data python singapore

Last synced: 13 Aug 2025

https://github.com/caltechlibrary/caltechdata

The CaltechDATA InvenioRDM source code

data inveniordm repository

Last synced: 13 Apr 2025

https://github.com/chifisource/parsenoteval.jl

Expands the usage of Base.parse to work with more Base structures.

data data-structures evaluator julia parse parsing

Last synced: 13 Apr 2025

https://github.com/instafluff/coronavirus

COVID-19 Coronavirus Data Tracker

2019-ncov coronavirus covid-19 data ncov ncov-2019 sars-cov-2 wuhan

Last synced: 25 Feb 2026

https://github.com/hyparam/hightable

A dynamic windowed scrolling table component for React

component data dataset grid javascript react scrolling table virtualized windowed

Last synced: 14 May 2025

https://github.com/pnnl/constrain

ConStrain is a data-driven knowledge-integrated framework that automatically verifies that building system controls function as intended.

bms building commissioning data hvac simulation verification

Last synced: 12 Apr 2025

https://github.com/purarue/HPI-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 01 May 2025

https://github.com/killovsky/trendings

Repository for the scraper module that retrieves Twitter trending topics.

api countries data hashtag information module network nodejs scraper scraping social social-network tendencia topics trending trends trends24 twitter world

Last synced: 07 Jul 2025

https://github.com/cloudposse/terraform-aws-dms

Terraform modules for provisioning and managing AWS DMS resources

data dms dynamodb migration mysql oracle postgres postgresql s3 sql sql-server

Last synced: 09 Sep 2025

https://github.com/SamEdwardes/pydatafaker

A Python package to create fake data with relationships between tables.

data data-science fake-data python

Last synced: 09 Jul 2025

https://github.com/n1ghtf1re/map-of-emergency-incidents

Emergency Map lets you visualize multi-dimensional information effectively through an intuitive interface. The code is easy to modify for use in a variety of areas. Color mixing enhances the perception and analysis of information.

big-data big-data-analytics big-data-visualization bigdata color-mixing colors data data-analytics data-science data-visualization data-visualization-challenges data-visualization-simpler mysql open-source-project php student-project

Last synced: 18 Mar 2025

https://github.com/malloydata/malloy-cli

A command-line interface for executing Malloy and SQL

data data-modeling transformation

Last synced: 13 Jul 2025

https://github.com/datapreprocessing/datacleaning

Data Cleaning is a Python package for data preprocessing. It cleans a CSV file and returns the cleaned data frame, handling imputation, duplicate removal, special-character replacement, and more.

data data-cleaning data-cleansing data-preprocessing data-wrangling imputation python threshold

Last synced: 14 Dec 2025

https://github.com/midaef/custom-memory-cache

C.M.C is a simple in-memory [key:value] cache implementation in Go with time to live

cache data go golang memory midaef

Last synced: 14 Jan 2026

https://github.com/purarue/hpi-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 18 Mar 2025

https://github.com/Evan-Kim2028/subgraph-query-portal

A collection of reusable public goods subgraph queries.

data messari polars python query subgraphs

Last synced: 09 May 2025