An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/niklashenning/pyqtcountup

A simple numerical data animation library for PyQt and PySide labels

animation countup data label numerical pyqt pyqt5 pyqt6 pyside pyside2 pyside6 python

Last synced: 29 Oct 2025

https://github.com/nobrainr/axios-morphism

Axios plugin to transform data requests/responses based on a schema.

api axios client data http interceptor javascript network nodejs request response transform typescript

Last synced: 10 Jul 2025

https://github.com/castelao/directip

Iridium SBD Direct-IP (satellite communication)

communication-systems data remote-sensors satellite

Last synced: 11 Jul 2025

https://github.com/mewmix/gh_llm_loader

clone GitHub repositories and prepare their data for ingestion for LLMs.

context data data-structures github llm llm-training python

Last synced: 19 Sep 2025

https://github.com/OpenCourseAPI/OwlAPI

An open source REST API written in Python to scrape and serve Foothill / De Anza course data :ledger:

api course data de-anza foothill myportal owl-api webscraping

Last synced: 13 Jul 2025

https://github.com/waylonwalker/steel-toes

a kedro hook to protect against breaking changes to data

cli data kedro kedro-hook kedro-plugin python

Last synced: 05 May 2025

https://github.com/mohammadreza-mohammadi94/data-analysis-projects-with-pandas

A repository featuring practical data analysis projects using Pandas, demonstrating data manipulation, visualization, and real-world problem-solving techniques. Ideal for learning and applying Pandas for data analysis.

data data-science jupyter-notebook pandas

Last synced: 05 May 2025

https://github.com/lukasmosser/geolink_dataset

Analysis notebooks for the geolink well log dataset

data datasets eda facies geology lithology machine-learning petrophysics wireline

Last synced: 12 Mar 2026

https://github.com/defgsus/parking-data

historic archive of free parking places across germany

archive csv data free historic parking spaces

Last synced: 02 Mar 2026

https://github.com/psyteachr/reprores-v2

Data Skills for Reproducible Research

data education r rstudio tidyverse

Last synced: 19 Oct 2025

https://github.com/jordan-iralde/probestojarvisai

JarvisIA es un sistema de Inteligencia Artificial autónomo diseñado para aprender, optimizar y ejecutar tareas complejas sin intervención humana. Combina automatización, clonación de voz, reconocimiento facial e interacción con sistemas operativos para crear una IA adaptable y eficiente, capaz de mejorar continuamente.

data ia-automator python react tensorflow

Last synced: 23 Jan 2026

https://github.com/nrc-cnrc/nrc-gamma

Large labelled dataset of real-life gas meter images — Vaste ensemble d'images réelles et étiquetées de compteurs de gaz.

ai computer-vision creative-commons crowdsourcing data data-science datanalytics dataset iot machine-learning machinelearning opendata

Last synced: 05 Mar 2026

https://github.com/bloomberggraphics/2017-trump-llc-documents

Dataset of Trump LLC documents

data oge opendata trump

Last synced: 16 Oct 2025

https://github.com/joeyism/jsonl-to-conll

A simple tool to convert JSONL to CONLL

conll convert converter data etl jsonl learning machine train training

Last synced: 11 Oct 2025

https://github.com/dkv204p/c-programming

Welcome to the C-Programming repository! This repository is a comprehensive collection of resources, examples, and exercises for learning and mastering the C programming language.

algorithm and c c-enums c-file-handling c-functions c-programming c-programming-language c-structures c-tutorial data dsa dsa-in-c structure

Last synced: 02 Apr 2026

https://github.com/jvaleroliet/datosgobes

Manage data from Spain's open data portal (https://datos.gob.es/es/) easily. The library simplifies API access and streamlines workflows for Python users.

data datosgobes opendata python spain

Last synced: 07 Feb 2026

https://github.com/zdavatz/oddb2xml

oddb2xml, create xml files using refdata, swissmedic and bag xml files

bag data drug open refdata ruby source swissmedic switzerland xml

Last synced: 06 May 2026

https://github.com/nejdetkadir/turkish-taboo-words

It is includes turkish taboo words with different formats as JSON, XML and YAML (500+ words)

data json taboo taboo-word turkish words xml yaml yml

Last synced: 07 Sep 2025

https://github.com/zeljkovranjes/brute

Brute an application that monitors authentication attempts on your server that uses OpenSSH or any protocol. May not be practical, but it is cool.

application bruteforce-logger data data-tracking honeypot ipinfo rest-api rust

Last synced: 13 Apr 2026

https://github.com/emilyriederer/dbt-convo-covid

Demo repo with full code described in blog post

controlled-vocabulary data dbt sql variable-names

Last synced: 26 Oct 2025

https://github.com/realign/localstorage

Encapsulate a simple LocalStorage Class

data json localstorage quick

Last synced: 07 Oct 2025

https://github.com/coryleach/unityinfotables

Unity package of ScriptableObjects for building static data info tables. Generates enum source code for easy access of table entry ids.

data enum scriptableobject scriptableobjects unity unity-plugin unity-scripts unity3d unity3d-plugin unitypackage

Last synced: 24 Oct 2025

https://github.com/educationaltestingservice/ies-writing-achievement-study-data

Data from an IES research study that explores the relationship between writing achievement and success at 4-year postsecondary institutions.

data features ies nlp writing

Last synced: 24 Jan 2026

https://github.com/bastianolea/delincuencia_chile

Visualizador web de estadísticas de delincuencia en Chile

app chile comunas data politica social tiempo

Last synced: 17 Oct 2025

https://github.com/dagshub/open-source-ml-datasets

This repository holds open source datasets for various machine learning domains with a link to download and use them

ai dagshub data data-engine dataset hacktoberfest hacktoberfest2023 machine-learning mlops open-source

Last synced: 19 Oct 2025

https://github.com/polyfrost/datastorage

Static data storage for Polyfrost projects.

data data-storage hypixel hypixel-api minecraft minecraft-mod

Last synced: 01 Mar 2026

https://github.com/finos/legend-community-delta

Combining best of open data standards with open source technologies

data database delta-lake spark

Last synced: 15 Oct 2025

https://github.com/r3li4nt/datafaker

Generador de datos falsos.

data datafaker hacking kali-linux pentesting python

Last synced: 15 Oct 2025

https://github.com/chalk-ai/chalk-go

Go client for Chalk

data data-pipeline feature-engineering

Last synced: 28 Apr 2026

https://github.com/rtmigo/tabular_dart

Dart library for displaying tabular data in a visually appealing ASCII table format.

ascii dart data flutter formatting github markdown prettytable pubdev readme spreadsheet table tabulate

Last synced: 10 Jul 2025

https://github.com/anandchowdhary/everything

⏳ Everything Everywhere All at Once

api data github-actions json

Last synced: 30 Jul 2025

https://github.com/ghodsizadeh/household-survey-data-iran

Household Survey Data Iran but in sqlite3(to work with data easily and everywhere)

data iran iran-data

Last synced: 04 Oct 2025

https://github.com/node/circos

Mirror of cricos. "Circos is a software package for visualizing data and information. It visualizes data in a circular layout — this makes Circos ideal for exploring relationships between objects or positions. "

circos data visualization

Last synced: 19 Apr 2025

https://github.com/p0dalirius/windowsbuilds

This repository contains the list of windows builds as parsable JSON files.

builds data json windows

Last synced: 03 Sep 2025

https://github.com/gagolews/clustering-data-v1

A framework for benchmarking clustering algorithms – Benchmark suite, version 1

benchmark benchmark-datasets clustering data dataset datasets machine-learning

Last synced: 19 Apr 2025

https://github.com/half0wl/datagovsg_api

Unofficial Python API wrapper for public APIs at developers.data.gov.sg

api-wrapper data government-data python singapore

Last synced: 13 Aug 2025

https://github.com/samedwardes/pydatafaker

A python package to create fake data with relationships between tables.

data data-science fake-data python

Last synced: 23 Apr 2025

https://github.com/miku/workshops

A level of indirection.

data golang python talks workshop

Last synced: 11 Apr 2025

https://github.com/remi-gau/bids_cookbook

A simple recipe to convert your neuroimaging data into a BIDS dataset.

bids data eeg fmri matlab meg spm12 tutorial

Last synced: 23 Apr 2025

https://github.com/upsonic/server

Self-Driven Autonomous Python Libraries

data data-science gpt-4o library-management ml mlops python

Last synced: 22 Aug 2025

https://github.com/rririanto/unstructured-demo-streamlit

Extract your docs (CSV, PDF, JSON, HTML, DOCS, Sheets and more) for your own GPT and LLM projects using Unstructured.io via streamlit

ai data data-extraction gpt unstructured unstructured-data

Last synced: 09 Apr 2025

https://github.com/secnot/leaky_diode

Leaky diode is a data exfiltration test tool for data diodes.

cybersecurity data diodes exfiltration pentesting

Last synced: 17 Jan 2026

https://github.com/oobianom/quickcode

An R package made out of mine and Brice's scrapbook of much needed functions.

colors cran data distributions images r

Last synced: 26 May 2026

https://github.com/ttimbers/canlang

R Data package for Canadian language data collected via the Canadian Census.

census data language package r

Last synced: 04 Apr 2025

https://github.com/markpflug/sylvan.tools.etl

Extract transform load command line tools.

csv data etl sqlite sqlserver

Last synced: 24 Apr 2025

https://github.com/0x8b/datamatrix

Library that enables programs to write Data Matrix barcodes of the modern ECC200 variety.

cli code data data-matrix datamatrix ecc ecc200 elixir generator matrix

Last synced: 26 Sep 2025

https://github.com/disruptek/skiplists

generic skip list implementations💃

collection data linked list nim search skip structure

Last synced: 09 Apr 2025

https://github.com/HowTheyVote/data

Weekly updated data on roll-call-votes in the European Parliament, as collected by https://howtheyvote.eu/.

data ep eu european parliament plenary votes

Last synced: 12 Aug 2025

https://github.com/pravj/github-dynamics

Source code for "Information Dynamics on the GitHub Network"

data open-source visualization

Last synced: 30 Jul 2025

https://github.com/orico/flexeegile

Extending Agile For AI & Data Teams

agile ai data data-science flexeegile methodology

Last synced: 08 Jan 2026

https://github.com/anuraganalog/try-every-ml-algorithm

Trying every Machine learning algorithm on a given dataset and measuring the efficiency.

accuracy algorithms analysis classification data deep-learning efficiency learning machine metrics neural-networks regression streamlit

Last synced: 04 Jul 2025

https://github.com/seanbreckenridge/HPI-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 29 Jul 2025

https://github.com/ikstream/dalec

Dalec is a project that aims to provide a privacy preserving data collection method. It utilizes DNS for client/server seperation while transmiting data encrypted

collection data data-collection dns exfiltration shell

Last synced: 11 Aug 2025

https://github.com/marcoradocchia/microxdg

An XDG Base Directory Specification Rust library that aims to be conservative on memory allocation and overall memory footprint.

appdir basedir cache config data directories directory executable file home path runtime rust rust-lang state xdg xdg-basedir xdg-compliance xdg-user-dirs

Last synced: 23 Mar 2025

https://github.com/pysat/pysatmodels

Interface for model analysis and model-data comparisons within the pysat ecosystem

comparison data dineof model pysat python sami2 tie-gcm validation

Last synced: 12 Mar 2026

https://github.com/colinianking/sluice

Sluice is a program that reads input on stdin and outputs on stdout at a specified data rate.

data rate-limiting transfer

Last synced: 13 Feb 2026

https://github.com/effect-deprecated/monocle

Optics for your data (port of monocle-ts)

data functional optics

Last synced: 23 Apr 2025

https://github.com/andredarcie/best-games-of-all-time-data-based

🏆 Definite Best Games Of All Time Data Based by multiple sources

best critics data dataset game rank video-game video-games web-crawling web-scraping

Last synced: 28 Apr 2025

https://github.com/soasis/encoding_tables

Shared tables between C and C++ for encoding infrastructure

c cpp data encoding

Last synced: 09 Apr 2025

https://github.com/cmpadden/dagster.nvim

Control the Dagster orchestrator directly from Neovim

dagster data neovim plugins

Last synced: 15 Mar 2025

https://github.com/hyparam/hightable

A dynamic windowed scrolling table component for react

component data dataset grid javascript react scrolling table virtualized windowed

Last synced: 14 May 2025

https://github.com/arverma/data_diode

A unidirectional network (also referred to as a unidirectional security gateway or data diode ) is a network appliance or device allowing data to travel only in one direction. It is used in guaranteeing information security. They are most commonly found in high security environments such as defense, where they serve as connections between two or more networks of differing security classification – also known as a "cross domain solution." This technology is also found at the industrial control level for such facilit ies as nuclear power plants, electric power generation/distribution, oil and gas production, water/wastewater, airplanes (between flight control units and in - flight entertainment systems), and manufacturing.

c client client-server client-server-architecture data data-diode diode networking server socket-programming

Last synced: 23 Aug 2025

https://github.com/openscm/scmdata

Handling of Simple Climate Model data

climate data data-visualization

Last synced: 27 Jul 2025

https://github.com/endermanbugzjfc/configstruct

Type and shape system for arrays. Help write clearer code when implementing configs for your PocketMine-MP plugin or composer project.

array config data pm pmmp pocketmine shape struct structure virion

Last synced: 23 Oct 2025

https://github.com/stas00/reddit-to-threads

Convert arctic_shift Reddit data dumps into thread-view documents

conversion data dump reddit

Last synced: 20 Mar 2025

https://github.com/dhhruv/fleck-pro

Prototype of Data Hiding Software for your personal files in your Laptop/PC. (Windows OS)

batch batch-script batchfile cli data data-hiding fleck-pro framework hacktoberfest hacktoberfest2021 security shell software terminal windows

Last synced: 03 May 2025

https://github.com/wdataorg/wdata

A database with multiple data sets that support drawing, These data sets are: World population data set, World Carbon dioxide Concentration data set, World Number of Cities data set, China number of population data set, China number of space vehicles data set......

chinese data database pip pypi python3

Last synced: 07 Mar 2026

https://github.com/texora/ssol-da

Data Availability for Solana Layer 2 Blockchain - SuperSol

data data-availability docs layer2 solana

Last synced: 10 Apr 2025

https://github.com/rob-med/backupsetlist.fm

Backup your setlist.fm attended gigs data.

backup data export heroku python setlist setlist-fm

Last synced: 10 Apr 2025

https://github.com/killovsky/trendings

Repositório do módulo scrapper para obtenção de trendings do Twitter.

api countries data hashtag information module network nodejs scraper scraping social social-network tendencia topics trending trends trends24 twitter world

Last synced: 07 Jul 2025

https://github.com/cloudposse/terraform-aws-dms

Terraform modules for provisioning and managing AWS DMS resources

data dms dynamodb migration mysql oracle postgres postgresql s3 sql sql-server

Last synced: 09 Sep 2025

https://github.com/midaef/custom-memory-cache

C.M.C it's simple implementation memory cache [key:value] in GO with time to live

cache data go golang memory midaef

Last synced: 14 Jan 2026

https://github.com/chifisource/parsenoteval.jl

Expands the usage of Base.parse to work with more Base structures.

data data-structures evaluator julia parse parsing

Last synced: 13 Apr 2025

https://github.com/nikitavoloboev/data

All kinds of data

data

Last synced: 26 Mar 2025

https://github.com/pnnl/constrain

ConStrain is a data-driven knowledge-integrated framework that automatically verifies that building system controls function as intended.

bms building commissioning data hvac simulation verification

Last synced: 12 Apr 2025

https://github.com/purarue/hpi-template

A cookiecutter template for creating a HPI repository

data lifelogging quantified-self

Last synced: 18 Mar 2025