An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/codiepp/elykseer-base

cryptographic data archive; written in F#; envisaged to stay another 10 years

archive cli cryptography data distributed-storage dotnet fsharp longterm-storage

Last synced: 19 May 2026

https://github.com/strmprivacy/docs

With STRM Privacy you can easily build privacy-by-design data pipelines and define data contracts to encode privacy inside your data. Data streams are pseudonymised or anonymised in real-time or batch. These are our docs.

data documentation docusaurus privacy privacy-enhancing-technologies

Last synced: 12 Jul 2025

https://github.com/owsas/open-categories

Open Categorization system, available as a node module

categories categorization categorize data data-structures node open-source typescript yaml

Last synced: 30 Apr 2025

https://github.com/jujuadams/ini-to-json

JSON+buffer replacement for native GameMaker INI functions.

data gamemaker gamemaker-studio-2 gms2 ini json save

Last synced: 21 Jul 2025

https://github.com/intercloud/gotsgen

Golang Time Series Data Generator

data generator golang library timeseries

Last synced: 20 Jun 2025

https://github.com/logikal-io/mindlab

Data science toolbox

data jupyterlab python spark

Last synced: 29 Oct 2025

https://github.com/dsietz/daas

Overview of the Data as a Service (DaaS) architecture

archconf architecture daas data message-broker microservice nfjs patterns rust-lang uberconf

Last synced: 21 May 2026

https://github.com/bohnacker/data-manipulation

Some JavaScript and Python scripts to manipulate (large) CSV files and JSON data.

data data-mining data-structures javascript python

Last synced: 18 May 2026

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and an NER model to classify and extract relevant entities from resumes, such as person name, college name, academic information, relevant experience, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 03 Apr 2025

https://github.com/rpidanny/streamline.js

A JavaScript class that reads and processes a stream line-by-line in order.

big-data data data-processing file-stream javascript stream streams typescript

Last synced: 08 Sep 2025

https://github.com/rclement/romain-clement.net

Freelance Software Engineer & Trainer

data freelancer machine-learning mkdocs mkdocs-material python

Last synced: 21 Mar 2025

https://github.com/fabriquebeweb/dao

The 'Data Access Object' for dummies!

dao data npm package

Last synced: 18 Feb 2026

https://github.com/codenoid/lazy-mongo

Insert data into MongoDB from plain text or a file

crystal crystal-language data database mongoclient mongodb

Last synced: 13 Apr 2026

https://github.com/nickmcintyre/processing-netcdf

Simple access to scientific datasets with Processing

data netcdf processing

Last synced: 11 Apr 2025

https://github.com/edgardleal/thanos-for-data

A Thanos implementation to restore the balance of your data

data tests

Last synced: 15 Jun 2025

https://github.com/aidinhamedi/pneumonia-detection-ai

This project uses a deep learning model built with the TensorFlow Library to detect pneumonia in X-ray images. The model architecture is based on the EfficientNetB7 model, which has achieved an accuracy of approximately 97.12% (97.11538%) on our test data. This high accuracy rate is one of the strengths of our AI model.

ai artificial-intelligence artificial-neural-networks cli data data-science database gui interface kaggle keras pneumonia pneumonia-classification pneumonia-detection pneumonia-detector pneumoniac-xray python tensorflow tensorflow2 training

Last synced: 06 Apr 2025

https://github.com/schluppeck/ng-data-club

Nottingham Psychology data club resources

analysis data julialang maths matlab python r

Last synced: 10 Sep 2025

https://github.com/thamerh/web-scraper-with-node.js-and-cheerio

A simple example of how to scrape data, based on "Build a Web Scraper with Node.js and Cheerio"

cheer data expressjs nodejs scarper webscraping

Last synced: 08 Apr 2026

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 08 May 2025

https://github.com/rn0x/app.altaqwaa.org

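A comprehensive Islamic website featuring adhkar (remembrances) and the Holy Quran recited by a large number of reciters, along with tafsir (exegesis) and Hisn al-Muslim (Fortress of the Muslim). The site also includes an electronic tasbih counter, Islamic radio broadcasts, and prayer times.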

app broadcasts data islam muslim prayer quran tafsir website website-design

Last synced: 07 Aug 2025

https://github.com/themitosan/grpp

GRPP is a simple tool written in TS that helps preserve git repositories.

cli data git grpp linux preservation project repo repository

Last synced: 15 Jul 2025

https://github.com/datawookie/data-diaspora

Various datasets used in tutorials and workshops.

data datasets

Last synced: 20 Mar 2025

https://github.com/ikstream/dns-handler

Data collection server for the dalec user collection system

collection dalec data data-collection dns dns-server python python3

Last synced: 13 Mar 2025

https://github.com/ferhatgec/kedi

Fegeya Kedi, Experimental Data Interface.

cpp cpp17 data data-interchange data-interface fegeya gnu json library linux xml

Last synced: 14 Apr 2025

https://github.com/justintime50/dad

Dummy Address Data (DAD) - Real addresses from all around the world.

address addresses country dad data dummy dummy-data json real world

Last synced: 18 Feb 2026

https://github.com/gianlucatruda/project_sleep

A Quantified Self project in which I use ±40 nights of data to determine what helps and hinders my sleep.

data experiment matplotlib python quantified science self sleep visualization

Last synced: 03 Apr 2025

https://github.com/d8a-tech/d8a

A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.

bigquery clickhouse data ga4 tracker

Last synced: 10 Apr 2026

https://github.com/fforres/webpack-plugin-dx-metrics

Webpack plugin to track webpack behaviour in datadog

data datadog developer-experience typescript visualization webpack

Last synced: 13 Feb 2026

https://github.com/binarybardakshat/suryanayan

Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.

data drone nlp python solar

Last synced: 10 Oct 2025

https://github.com/dkxce/osm2shp

Flexible OSM to SHP Converter (convert .osm & .pbf files to ESRI Shape .shp files). OSM to Shape.

converter data dbf dkxce earth esri map maps openseamap openstreetmap osm pbf routes shape shapes shp

Last synced: 26 Apr 2026

https://github.com/healthyregions/oeps

Opioid Environment Policy Scan - data explorer and backend management

data data-visualization public-health

Last synced: 21 Apr 2026

https://github.com/hadro/brewery-guides

The data for guides to breweries across the United States from 1896 to 1918

brewers brewery-guides brewing brewing-history data dataset digital-collections digital-humanities hocr nypl open-data

Last synced: 16 Mar 2026

https://github.com/cmudig/mosaic-profiler

A data profiler built with Mosaic

data jupyter visualization

Last synced: 25 Oct 2025

https://github.com/baaziznasser/qurani

A Quran application with a simple interface and fantastic features, with large databases of the Holy Quran and its tafsir (exegesis)

base data i3rab json quran qurani sql tafsir

Last synced: 12 Feb 2026

https://github.com/datalayer/desktop

Ξ 🖥️ Datalayer Desktop.

ai data data-analysis data-science datalayer desktop electron

Last synced: 25 Oct 2025

https://github.com/mabel-dev/opteryx-catalog

📚 Opteryx Cloud Catalog

catalog data python sql

Last synced: 27 Feb 2026

https://github.com/jderstd/spec

A standard for JSON responses

data error jder json response specification structure

Last synced: 13 May 2026

https://github.com/carpentries-incubator/indigenous-data-sovereignty

Introduces the concepts and framework of Indigenous Data Sovereignty and Governance.

data english lesson pre-alpha

Last synced: 24 Jan 2026

https://github.com/critocrito/data-scores-in-the-uk

Investigate the uses of data analytics and algorithms in public services in the UK.

clojure data data-investigation data-preservation javascript social-sciences sugarcube uk

Last synced: 18 Oct 2025

https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021

This project analyzes the reviews and satisfaction of customers who used Airbnb services. It also examines whether there are relationships between other variables.

data data-analysis data-visualization powerbi sql-server

Last synced: 25 Feb 2026

https://github.com/open-i18n/data-unicode-cldr

Git mirror for Unicode Common Locale Data Repository (CLDR) data

cldr data open-i18n unicode unicode-consortium

Last synced: 07 Feb 2026

https://github.com/anthonykrivonos/ts-algo-masterclass

👾 Giant TypeScript algorithm and data structure masterclass to be constantly updated with important CS concepts.

algorithm class-project computer concepts data data-structures fundamentals giant library masterclass science structures typescript

Last synced: 11 May 2026

https://github.com/as/worm

Worm provides write-once read-many log-structured storage semantics

data log record storage worm

Last synced: 31 Jan 2026

https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and visualization of a multinational retail corporation's data to provide insights and recommendations for improving its user base.

colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats

Last synced: 11 Feb 2026

https://github.com/gadenbuie/crantrack

Hourly snapshots of CRAN's incoming packages folder

cran data r-packages

Last synced: 12 Mar 2026

https://github.com/jsdhami/python-for-research

"Python-For-Research" Event Organized By Tri-Chandra Research Group, Ghantaghar, Kathmandu

analysis colab data jupyter matplotlib numpy panda physics python research visualization

Last synced: 27 Oct 2025

https://github.com/geopython/pygeoapi-examples

Example pygeoapi deployment patterns and configurations

api data geospatial ogc ogc-api osgeo pygeoapi

Last synced: 11 Oct 2025

https://github.com/mutasim77/dbt-analytics

🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.

big-query data data-analysis dbt warehouse

Last synced: 25 Feb 2026

https://github.com/d2hydro/fewspy

A Python API for the Deltares FEWS PI REST Web Service

data geopandas hydrology hydrometrics pandas python

Last synced: 23 Apr 2026

https://github.com/pranavpandey/dynamic-backup

Backup and restore app data on Android.

android app backup data library restore storage

Last synced: 07 Sep 2025

https://github.com/potch/whizzy

A prototype rich data editor for GitHub

csv csvconf data github

Last synced: 01 May 2026

https://github.com/bluegreen-labs/oneflux_containers

Containerized (docker) versions of the ONEFlux processing pipeline

data ecosystem fluxes micrometeorology processing

Last synced: 07 Oct 2025

https://github.com/desultory/pycpio

Python library for CPIO manipulation

cpio cpio-archives data initramfs pypi-package python python-3 python3

Last synced: 04 Feb 2026

https://github.com/corentinb/txtoredis

🔥 Push each line of a text file to a Redis set

data datascience dataset go golang redis set

Last synced: 24 Apr 2026

https://github.com/metapsy-project/data-gambling-psyctr

Database of psychological interventions for problem gambling and gambling disorder.

data

Last synced: 02 Apr 2026

https://github.com/adanos-software/free-ticker-database

Free global stock & ETF ticker reference database - 50k+ tickers, 66 exchanges, 81 countries

data etf finance stocks

Last synced: 10 May 2026

https://github.com/iondv/metrics

IONDV. Framework application: Metrics collects and displays metrics data.

collecting data data-analysis iondv iondv-app metrics

Last synced: 10 Feb 2026

https://github.com/geocollections/emaapou

eMaapõu: database of Estonia's subsurface

data database estonia geology portal

Last synced: 05 Feb 2026

https://github.com/ryanmorr/fastmap

Accelerated hash maps

data hashmap javascript map performance

Last synced: 10 Oct 2025

https://github.com/reubano/ckanutils

A Python library for interacting with CKAN instances

ckan data library open-data

Last synced: 10 Feb 2026

https://github.com/tommasoazz/collaborative-location-activity-recommendations

Project for the course Scalable and Cloud Programming

data map mapreduce scala spark

Last synced: 16 Apr 2026

https://github.com/sneels/parkds

Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)

cross-domain data database datasource datasources javascript source

Last synced: 24 Feb 2026

https://github.com/paladique/azuresample-guestbook

Guestbook using MySQL and Cosmos DB on Azure

cosmosdb data mysql spa websockets

Last synced: 30 Apr 2026

https://github.com/0xdir/htcds_dart

Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.

data humanitarian schema standards

Last synced: 24 Oct 2025

https://github.com/joamag/pandas

Loads of pandas data from China with awesome data

data data-analysis jupyter notebook pandas

Last synced: 25 Apr 2026

https://github.com/justfairdev/web-stack-query

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

async cache data fetch graphql hooks query react resources rest

Last synced: 09 May 2026

https://github.com/andyfratello/dbd

🗄️ Database Design (Disseny de Bases de Dades, DBD) exercises, Q2 - UPC FIB

data database dbd dbd-fib dbeaver fib-upc nosql nosql-database oracle sql sql-database

Last synced: 10 Feb 2026

https://github.com/thitlwincoder/browser_data

Dart package to retrieve browser's data.

bookmark browser dart data flutter history package

Last synced: 23 Feb 2026

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 06 Feb 2026

https://github.com/cnayan/q-server

Provides an API for back-end server connectivity; an MS SQL Server connector is included.

data database provider q-server query query-engine

Last synced: 09 Oct 2025

https://github.com/anicolaspp/mapr-data-gen

Data generator for MapR Data Platform

data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark

Last synced: 29 Apr 2026

https://github.com/suh1z/rakkauttify_fullstack

CS2 Data and Statistics Dashboard - full-stack project

analytics data expressjs gaming mongo nodejs react redux

Last synced: 24 Oct 2025

https://github.com/tosun-si/world-cup-qatar-team-stats-kotlin-midgard

This application shows a full Apache Beam pipeline with Kotlin and the Midgard library. The use case works on data from the 2022 FIFA World Cup in Qatar and calculates player statistics per team. This application will be presented at Beam Summit 2023 in New York.

apache-beam beam-summit data kotlin midgard world-cup-2022

Last synced: 01 Feb 2026

https://github.com/sparkpost/event-data

self-hosted message events

api aws data email webhooks

Last synced: 29 Apr 2026

https://github.com/dlr-pa/pydabu

python data bubble

data json python

Last synced: 28 Apr 2026

https://github.com/legopitstop/addons

All legopitstop's Bedrock add-ons in one place.

add-on assets behaviorpack data hacktoberfest minecraft mods modtoberfest resroucepack vanilla

Last synced: 06 Feb 2026

https://github.com/onaio/gisida-react

React Dashboard library for Gisida.

dashboard data gisida map react visualization

Last synced: 28 Apr 2025

https://github.com/floriancassayre/nicknames-datasets

Open source nickname sets with information about the data origin(s).

data data-mining dataset

Last synced: 08 Feb 2026

https://github.com/vrm-piyush/python-projects

Open source Python projects. Feel free to contribute!

data dataanalysis games open-source pygame-games python python-app

Last synced: 26 Feb 2026