An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/emrecpp/datapacket-csharp

Send, receive, encrypt, decrypt, and compress data as a Packet and transmit it over a socket in C#.

compress data deserialization deserialize deserializer encrypt packet send serialization serialize serializer socket

Last synced: 15 Sep 2025

https://github.com/aa-sikkkk/twitterdatamining

A Simple Script to mine data from X/Twitter

data mining

Last synced: 24 Jan 2026

https://github.com/nickmcintyre/processing-netcdf

Simple access to scientific datasets with Processing

data netcdf processing

Last synced: 11 Apr 2025

https://github.com/biglocalnews/upload-files

Upload comma-delimited files to biglocalnews.org in your GitHub Action

action actions archiving csv data data-journalism github-actions journalism news

Last synced: 27 Apr 2026

https://github.com/themitosan/grpp

GRPP is a simple tool written in TS that helps preserve Git repositories.

cli data git grpp linux preservation project repo repository

Last synced: 15 Jul 2025

https://github.com/cttynul/elsoftware

⚽ Win at Fantacalcio using pandas libraries, making everyone believe you are using machine learning

data data-science fantacalcio machine-learning pandas

Last synced: 30 Jun 2026

https://github.com/schluppeck/ng-data-club

Nottingham Psychology data club resources

analysis data julialang maths matlab python r

Last synced: 10 Sep 2025

https://github.com/gianlucatruda/project_sleep

A Quantified Self project in which I use ±40 nights of data to determine what helps and hinders my sleep.

data experiment matplotlib python quantified science self sleep visualization

Last synced: 03 Apr 2025

https://github.com/intercloud/gotsgen

Golang Time Series Data Generator

data generator golang library timeseries

Last synced: 20 Jun 2025

https://github.com/codenoid/lazy-mongo

Insert data into MongoDB from plain text or a file

crystal crystal-language data database mongoclient mongodb

Last synced: 13 Apr 2026

https://github.com/noi-techpark/it.bz.opendatahub.sparql

The Virtual Knowledge Graph of the Open Data Hub

data graph hub knowledge open sparql

Last synced: 12 Jan 2026

https://github.com/rcorrero/light-pipe

A high-level syntax for data pipelines, designed to make pipeline development quick and painless.

data data-pipelines data-processing geospatial-analysis geospatial-processing pipeline

Last synced: 14 Dec 2025

https://github.com/ikstream/dns-handler

Data collection server for the dalec user collection system

collection dalec data data-collection dns dns-server python python3

Last synced: 13 Mar 2025

https://github.com/csengupta1101/dig-student-files

This repository will contain all student submissions in one place.

data datascience education machine-learning python students visualization

Last synced: 17 Jul 2025

https://github.com/ljharb/define-data-property

Define a data property on an object. Will fall back to assignment in an engine without descriptors.

accessor configurable data define ecmascript enumerable javascript object property writable

Last synced: 13 Apr 2025

https://github.com/techiaith/brawddegau-tagiedig

A corpus of CC0 sentences in the jsonl format, with the tokens (words etc.) tagged with Universal Dependencies part-of-speech tags.

annotated cc0 commonvoice data nlp welsh

Last synced: 17 Jan 2026

https://github.com/owsas/open-categories

Open Categorization system, available as a node module

categories categorization categorize data data-structures node open-source typescript yaml

Last synced: 30 Apr 2025

https://github.com/nitzano/datazar

CLI Database & File Anonymizer

anonymizer cli data fake

Last synced: 22 Oct 2025

https://github.com/vijishmadhavan/parse-clip

A simple CLIP based project for combining images from multiple datasets.

clip data datacleaning dataexploration dataset fastai image python

Last synced: 14 May 2026

https://github.com/monfireboose/monfireboose

A lightweight JavaScript library that provides a high level and model based API for interacting with Firebase.

api data database firebase firestore high-level-api interact javascript library model storage

Last synced: 18 Feb 2026

https://github.com/rpidanny/streamline.js

A JavaScript class that reads and processes a stream line-by-line in order.

big-data data data-processing file-stream javascript stream streams typescript

Last synced: 08 Sep 2025

https://github.com/equinor/data-marketplace

Easily find and check out data products

data product search

Last synced: 01 May 2025

https://github.com/strmprivacy/docs

With STRM Privacy you can easily build privacy-by-design data pipelines and define data contracts to encode privacy inside your data. Data streams are pseudonymised or anonymised in real-time or batch. These are our docs.

data documentation docusaurus privacy privacy-enhancing-technologies

Last synced: 12 Jul 2025

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 08 Apr 2026

https://github.com/imagodata/filter_mate

FilterMate is a QGIS plugin, an everyday companion that lets you easily filter your vector layers

data exploratory-data-analysis filter geospatial ogr postgis qgis qgis-plugin qgis3 qgis3-plugin spatialite sql vector-database

Last synced: 29 Apr 2026

https://github.com/dkxce/osm2shp

Flexible OSM to SHP Converter (convert .osm & .pbf files to ESRI Shape .shp files). OSM to Shape.

converter data dbf dkxce earth esri map maps openseamap openstreetmap osm pbf routes shape shapes shp

Last synced: 26 Apr 2026

https://github.com/cmudig/mosaic-profiler

A data profiler built with Mosaic

data jupyter visualization

Last synced: 25 Oct 2025

https://github.com/sneels/parkds

Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)

cross-domain data database datasource datasources javascript source

Last synced: 24 Feb 2026

https://github.com/0xdir/htcds_dart

Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.

data humanitarian schema standards

Last synced: 24 Oct 2025

https://github.com/ryanmorr/fastmap

Accelerated hash maps

data hashmap javascript map performance

Last synced: 10 Oct 2025

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 06 Feb 2026

https://github.com/suh1z/rakkauttify_fullstack

CS2 data and statistics dashboard (full-stack project)

analytics data expressjs gaming mongo nodejs react redux

Last synced: 24 Oct 2025

https://github.com/legopitstop/addons

All legopitstop's Bedrock add-ons in one place.

add-on assets behaviorpack data hacktoberfest minecraft mods modtoberfest resroucepack vanilla

Last synced: 06 Feb 2026

https://github.com/binarybardakshat/suryanayan

Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.

data drone nlp python solar

Last synced: 10 Oct 2025

https://github.com/geopython/pygeoapi-examples

Example pygeoapi deployment patterns and configurations

api data geospatial ogc ogc-api osgeo pygeoapi

Last synced: 11 Oct 2025

https://github.com/akuzko/use-stash

React hooks for app-wide data access and manipulation

action actions data hook hooks react store

Last synced: 09 May 2026

https://github.com/cicerops/monitoring-check-grafana

Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.

data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale

Last synced: 08 May 2026

https://github.com/chompfoods/sdk-csharp

C# SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp csharp csharp-sdk data database dll food grocery ingredients nuget nutrition raw recipes recipes-api restsharp sdk swagger

Last synced: 06 May 2026

https://github.com/pranavpandey/dynamic-backup

Backup and restore app data on Android.

android app backup data library restore storage

Last synced: 07 Sep 2025

https://github.com/jvirtanen/dada

Generate tabular text data

csv data dsv generator tsv

Last synced: 13 Jun 2026

https://github.com/jderstd/spec

A standard for JSON responses

data error jder json response specification structure

Last synced: 13 May 2026

https://github.com/justfairdev/Web-Stack-Query

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

async cache data fetch graphql hooks query react resources rest

Last synced: 14 Oct 2025

https://github.com/potch/whizzy

A prototype rich data editor for GitHub

csv csvconf data github

Last synced: 01 May 2026

https://github.com/paladique/azuresample-guestbook

Guestbook using MySQL and Cosmos DB on Azure

cosmosdb data mysql spa websockets

Last synced: 30 Apr 2026

https://github.com/geocollections/emaapou

eMaapõu: database of Estonia's subsurface

data database estonia geology portal

Last synced: 05 Feb 2026

https://github.com/anicolaspp/mapr-data-gen

Data generator for MapR Data Platform

data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark

Last synced: 29 Apr 2026

https://github.com/sparkpost/event-data

self-hosted message events

api aws data email webhooks

Last synced: 29 Apr 2026

https://github.com/dlr-pa/pydabu

python data bubble

data json python

Last synced: 28 Apr 2026

https://github.com/yazaabed/at-who-angular

Wrapper for At.js that adds mention autocomplete to your application, with an Angular component for use in any AngularJS project

angular-components angularjs autocomplete components data modules webpack wrapper

Last synced: 28 Apr 2026

https://github.com/luminovrym/pbo-biodata

Simulation of how to input data using OOP

data oop-in-php php-native

Last synced: 18 Jun 2026

https://github.com/mongodb-developer/rocket-analytics

Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.

data federation lucene lucenesearch mongodb s3 search sql

Last synced: 28 Apr 2026

https://github.com/justjavac/deno_data_dir

Returns the path to the user's data directory.

data deno deno-module deno-modules directory

Last synced: 27 Apr 2026

https://github.com/kanugurajesh/firebase-data

Adding data to firebase store

data firebase firebase-database python

Last synced: 27 Apr 2026

https://github.com/anthonykrivonos/ts-algo-masterclass

👾 Giant TypeScript algorithm and data structure masterclass to be constantly updated with important CS concepts.

algorithm class-project computer concepts data data-structures fundamentals giant library masterclass science structures typescript

Last synced: 11 May 2026

https://github.com/joamag/pandas

Loads of pandas data from China with awesome data

data data-analysis jupyter notebook pandas

Last synced: 25 Apr 2026

https://github.com/corentinb/txtoredis

🔥 Push each line of a text file to a Redis set

data datascience dataset go golang redis set

Last synced: 24 Apr 2026

https://github.com/d2hydro/fewspy

A Python API for the Deltares FEWS PI REST Web Service

data geopandas hydrology hydrometrics pandas python

Last synced: 23 Apr 2026

https://github.com/healthyregions/oeps

Opioid Environment Policy Scan - data explorer and backend management

data data-visualization public-health

Last synced: 21 Apr 2026

https://github.com/adanos-software/free-ticker-database

Free global stock & ETF ticker reference database - 50k+ tickers, 66 exchanges, 81 countries

data etf finance stocks

Last synced: 10 May 2026

https://github.com/lilingxi01/bloark

Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.

architecture bloark data revision-based

Last synced: 05 Apr 2026

https://github.com/metapsy-project/data-gambling-psyctr

Database of psychological interventions for problem gambling and gambling disorder.

data

Last synced: 02 Apr 2026

https://github.com/d8a-tech/d8a

A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.

bigquery clickhouse data ga4 tracker

Last synced: 10 Apr 2026

https://github.com/robertmyles/riscobrasil

An R package to download 'Brazil Risk' data 📈

brazil data finance r

Last synced: 08 Apr 2025

https://github.com/muhammadibrahim313/datavue

"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.

analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit

Last synced: 10 Apr 2025

https://github.com/louisbrulenaudet/legalkit-pipeline

Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic README.md.

data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python

Last synced: 17 Mar 2025

https://github.com/tommasoazz/collaborative-location-activity-recommendations

Project for the course Scalable and Cloud Programming

data map mapreduce scala spark

Last synced: 16 Apr 2026

https://github.com/abrudz/parsing

Dyalog APL expressions to parse common and unusual data formats from text files

apl csv data data-format dyalog-apl dyalogapl parsing

Last synced: 20 Mar 2026

https://github.com/woctezuma/steam-reviews-data

Data available to compute statistics of Steam reviews.

data steam steam-reviews

Last synced: 19 Mar 2026

https://github.com/huangcongqing/ranking-list

Data !important | A collection of all kinds of rankings and chart data, for the era in which data is king

data rank ranking

Last synced: 15 Feb 2026

https://github.com/gadenbuie/crantrack

Hourly snapshots of CRAN's incoming packages folder

cran data r-packages

Last synced: 12 Mar 2026

https://github.com/qeeqbox/data-compliance

Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse

compliance data data-compliance infosecsimplified qeeqbox

Last synced: 19 Mar 2026

https://github.com/fforres/webpack-plugin-dx-metrics

Webpack plugin to track webpack behaviour in datadog

data datadog developer-experience typescript visualization webpack

Last synced: 13 Feb 2026

https://github.com/baaziznasser/qurani

A Quran application with a simple interface and fantastic features, with large databases of the Holy Quran and its interpretation (tafsir)

base data i3rab json quran qurani sql tafsir

Last synced: 12 Feb 2026

https://github.com/mabel-dev/opteryx-catalog

📚 Opteryx Cloud Catalog

catalog data python sql

Last synced: 27 Feb 2026

https://github.com/onaio/gisida-react

React Dashboard library for Gisida.

dashboard data gisida map react visualization

Last synced: 28 Apr 2025

https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and visualization of a multinational retail corporation's data to provide insights and recommendations for growing its user base.

colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats

Last synced: 11 Feb 2026

https://github.com/bluegreen-labs/oneflux_containers

Containerized (Docker) versions of the ONEFlux processing pipeline

data ecosystem fluxes micrometeorology processing

Last synced: 07 Oct 2025