An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/dsietz/daas

Overview of the Data as a Service (DaaS) architecture

archconf architecture daas data message-broker microservice nfjs patterns rust-lang uberconf

Last synced: 21 May 2026

https://github.com/justunsix/debezium-tests

Testing different Debezium development environment set ups

azure capture cdc change data debezium kafka mssql openshift sql streaming

Last synced: 19 May 2026

https://github.com/thamerh/web-scraper-with-node.js-and-cheerio

used simple exemple how Scraper data from Build a Web Scraper with Node.js and Cheerio

cheer data expressjs nodejs scarper webscraping

Last synced: 08 Apr 2026

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 08 Apr 2026

https://github.com/intercloud/gotsgen

Golang Time Series Data Generator

data generator golang library timeseries

Last synced: 20 Jun 2025

https://github.com/bohnacker/data-manipulation

Some Javascript and Python scripts to manipulate (large) CSV files and JSON data.

data data-mining data-structures javascript python

Last synced: 18 May 2026

https://github.com/nix1707/webscrapper-browserextension

Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.

chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping

Last synced: 21 Jun 2025

https://github.com/darcyclarke/darcy

👤 all my info, centralized in one place

bio darcy data graph info information links nodejs npm open profile

Last synced: 08 Jan 2026

https://github.com/hanwentao/china-regions

Data of China's Regions

china data geography

Last synced: 13 Apr 2025

https://github.com/deveripon/assignment-6-assets

This assets is only for Reactive Accelarator Batch 2 - Assignment 6

data images recipe

Last synced: 30 Apr 2025

https://github.com/ultreon/ubo

NBT inspired data I/O. Made for games.

api binary-data data data-storage file-type game-data io library ubo

Last synced: 16 Jun 2025

https://github.com/priyanka7411/dataspark-electronics-retail-analytics

DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.

analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization

Last synced: 10 Jul 2025

https://github.com/siongui/gopaliwordvfs

Serve JSON data of Pali words, embedded in Go code

data go golang pali vfs virtual-file-system virtualfilesystem

Last synced: 04 Apr 2025

https://github.com/0xopenbytes/cache

📦 Simple in-memory key-value store

cache data json swift

Last synced: 29 Apr 2026

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 08 May 2025

https://github.com/oliver021/ecmalinq

The linq runtime and support to typescript/javascript ecosystem

collection data iterable iteration javascript library linq linq-expressions nodejs query stream stream-data structure typescript

Last synced: 13 May 2025

https://github.com/charconstpointer/markovbot

PoC markov chain sentence generator, powered by discord for data gathering

bot chain collection data discord markov parsing

Last synced: 16 May 2026

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 03 Apr 2025

https://github.com/justintime50/dad

Dummy Address Data (DAD) - Real addresses from all around the world.

address addresses country dad data dummy dummy-data json real world

Last synced: 18 Feb 2026

https://github.com/pseudomuto/iceberg-rest-go

A Go client library for working with Iceberg Rest catalogs

client data go iceberg

Last synced: 25 Jan 2026

https://github.com/ljharb/define-data-property

Define a data property on an object. Will fall back to assignment in an engine without descriptors.

accessor configurable data define ecmascript enumerable javascript object property writable

Last synced: 13 Apr 2025

https://github.com/vasturiano/data-bind-mapper

Bind data arrays with any type of JS objects

bind data digest joins mapper performance

Last synced: 26 Jul 2025

https://github.com/rpidanny/streamline.js

A JavaScript class that reads and processes a stream line-by-line in order.

big-data data data-processing file-stream javascript stream streams typescript

Last synced: 08 Sep 2025

https://github.com/edgardleal/thanos-for-data

A Thanos implementation to restore the balance of your data

data tests

Last synced: 15 Jun 2025

https://github.com/suh1z/rakkauttify_fullstack

CS2 Data and Statistics Dashboard -fullstackproject

analytics data expressjs gaming mongo nodejs react redux

Last synced: 24 Oct 2025

https://github.com/desultory/pycpio

Python library for CPIO manipulation

cpio cpio-archives data initramfs pypi-package python python-3 python3

Last synced: 04 Feb 2026

https://github.com/reubano/ckanutils

A Python library for interacting with CKAN instances

ckan data library open-data

Last synced: 10 Feb 2026

https://github.com/corentinb/txtoredis

:fire: Push each line of a text file, to a Redis set

data datascience dataset go golang redis set

Last synced: 24 Apr 2026

https://github.com/tommasoazz/collaborative-location-activity-recommendations

Project for the course Scalable and Cloud Programming

data map mapreduce scala spark

Last synced: 16 Apr 2026

https://github.com/mutasim77/dbt-analytics

🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.

big-query data data-analysis dbt warehouse

Last synced: 25 Feb 2026

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 06 Feb 2026

https://github.com/geopython/pygeoapi-examples

Example pygeoapi deployment patterns and configurations

api data geospatial ogc ogc-api osgeo pygeoapi

Last synced: 11 Oct 2025

https://github.com/iondv/metrics

IONDV. Framework application: Metrics is to collect and show the metrics data.

collecting data data-analysis iondv iondv-app metrics

Last synced: 10 Feb 2026

https://github.com/d2hydro/fewspy

A Python API for the Deltares FEWS PI REST Web Service

data geopandas hydrology hydrometrics pandas python

Last synced: 23 Apr 2026

https://github.com/as/worm

Worm provides write-once read-many log-structured storage semantics

data log record storage worm

Last synced: 31 Jan 2026

https://github.com/open-i18n/data-unicode-cldr

Git mirror for Unicode Common Locale Data Repository (CLDR) data

cldr data open-i18n unicode unicode-consortium

Last synced: 07 Feb 2026

https://github.com/0xdir/htcds_dart

Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.

data humanitarian schema standards

Last synced: 24 Oct 2025

https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021

This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.

data data-analysis data-visualization powerbi sql-server

Last synced: 25 Feb 2026

https://github.com/muhammadibrahim313/datavue

"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.

analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit

Last synced: 10 Apr 2025

https://github.com/jderstd/spec

A standard for JSON responses

data error jder json response specification structure

Last synced: 13 May 2026

https://github.com/sneels/parkds

Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)

cross-domain data database datasource datasources javascript source

Last synced: 24 Feb 2026

https://github.com/louisbrulenaudet/legalkit-pipeline

Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.

data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python

Last synced: 17 Mar 2025

https://github.com/healthyregions/oeps

Opioid Environment Policy Scan - data explorer and backend management

data data-visualization public-health

Last synced: 21 Apr 2026

https://github.com/dkxce/osm2shp

Flexible OSM to SHP Converter (convert .osm & .pbf files to ESRI Shape .shp files). OSM to Shape.

converter data dbf dkxce earth esri map maps openseamap openstreetmap osm pbf routes shape shapes shp

Last synced: 26 Apr 2026

https://github.com/kanugurajesh/firebase-data

Adding data to firebase store

data firebase firebase-database python

Last synced: 27 Apr 2026

https://github.com/onaio/gisida-react

React Dashboard library for Gisida.

dashboard data gisida map react visualization

Last synced: 28 Apr 2025

https://github.com/abrudz/parsing

Dyalog APL expressions to parse common and unusual data formats from text files

apl csv data data-format dyalog-apl dyalogapl parsing

Last synced: 20 Mar 2026

https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.

colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats

Last synced: 11 Feb 2026

https://github.com/metapsy-project/data-gambling-psyctr

Database of psychological interventions for problem gambling and gambling disorder.

data

Last synced: 02 Apr 2026

https://github.com/jmsallan/esdata

A R package to bring Spanish economic databases into the R environment

data datasets ine inflation spain unemployment-data

Last synced: 18 Jan 2026

https://github.com/justjavac/deno_data_dir

Returns the path to the user's data directory.

data deno deno-module deno-modules directory

Last synced: 27 Apr 2026

https://github.com/woctezuma/steam-reviews-data

Data available to compute statistics of Steam reviews.

data steam steam-reviews

Last synced: 19 Mar 2026

https://github.com/binarybardakshat/suryanayan

Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.

data drone nlp python solar

Last synced: 10 Oct 2025

https://github.com/mongodb-developer/rocket-analytics

Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.

data federation lucene lucenesearch mongodb s3 search sql

Last synced: 28 Apr 2026

https://github.com/hadro/brewery-guides

The data for guides to breweries across the United States from 1896 to 1918

brewers brewery-guides brewing brewing-history data dataset digital-collections digital-humanities hocr nypl open-data

Last synced: 16 Mar 2026

https://github.com/mabel-dev/opteryx-catalog

📚 Opteryx Cloud Catalog

catalog data python sql

Last synced: 27 Feb 2026

https://github.com/yazaabed/at-who-angular

wrapper for At.js that add mentions autocomplete to your application with angular component for using it on any AngularJS projects

angular-components angularjs autocomplete components data modules webpack wrapper

Last synced: 28 Apr 2026

https://github.com/baaziznasser/qurani

برنامج قرآني بواجهة بسيطة وبميزات خرافية مع قواعد بيانات كبيرة للقرآن الكريم وتفسيره

base data i3rab json quran qurani sql tafsir

Last synced: 12 Feb 2026

https://github.com/thitlwincoder/browser_data

Dart package to retrieve browser's data.

bookmark browser dart data flutter history package

Last synced: 23 Feb 2026

https://github.com/cmudig/mosaic-profiler

A data profiler built with Mosaic

data jupyter visualization

Last synced: 25 Oct 2025

https://github.com/huangcongqing/ranking-list

数据!important | 各种排行,榜单数据汇总 数据为王的时代 Data

data rank ranking

Last synced: 15 Feb 2026

https://github.com/cnayan/q-server

Gives API for back-end server connectivity; MS SQL Server connector provided.

data database provider q-server query query-engine

Last synced: 09 Oct 2025

https://github.com/justfairdev/web-stack-query

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

async cache data fetch graphql hooks query react resources rest

Last synced: 09 May 2026

https://github.com/paladique/azuresample-guestbook

Guestbook using MySQL and Cosmos DB on Azure

cosmosdb data mysql spa websockets

Last synced: 30 Apr 2026

https://github.com/datalayer/desktop

Ξ 🖥️ Datalayer Destkop.

ai data data-analysis data-science datalayer desktop electron

Last synced: 25 Oct 2025

https://github.com/luminovrym/pbo-biodata

Simulasi Cara Input Data dengan OOP

data oop-in-php php-native

Last synced: 18 Jun 2026

https://github.com/jsdhami/python-for-research

"Python-For-Research" Event Organized By Tri-Chandra Research Group, Ghantaghar, Kathmandu

analysis colab data jupyter matplotlib numpy panda physics python research visualization

Last synced: 27 Oct 2025

https://github.com/imagodata/filter_mate

FilterMate is a Qgis plugin, an everyday companion that allows you to easily filter your vector layers

data exploratory-data-analysis filter geospatial ogr postgis qgis qgis-plugin qgis3 qgis3-plugin spatialite sql vector-database

Last synced: 29 Apr 2026

https://github.com/bluegreen-labs/oneflux_containers

Containerized (docker) versions of the ONEFlux processing pipeline

data ecosystem fluxes micrometeorology processing

Last synced: 07 Oct 2025

https://github.com/d8a-tech/d8a

A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.

bigquery clickhouse data ga4 tracker

Last synced: 10 Apr 2026

https://github.com/qeeqbox/data-compliance

Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse

compliance data data-compliance infosecsimplified qeeqbox

Last synced: 19 Mar 2026

https://github.com/carpentries-incubator/indigenous-data-sovereignty

Introduces the concepts and framework of Indigenous Data Sovereignty and Governance.

data english lesson pre-alpha

Last synced: 24 Jan 2026

https://github.com/ryanmorr/fastmap

Accelerated hash maps

data hashmap javascript map performance

Last synced: 10 Oct 2025

https://github.com/potch/whizzy

A prototype rich data editor for GitHub

csv csvconf data github

Last synced: 01 May 2026

https://github.com/cicerops/monitoring-check-grafana

Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.

data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale

Last synced: 08 May 2026

https://github.com/dlr-pa/pydabu

python data bubble

data json python

Last synced: 28 Apr 2026

https://github.com/gadenbuie/crantrack

Hourly snapshots of CRAN's incoming packages folder

cran data r-packages

Last synced: 12 Mar 2026

https://github.com/akuzko/use-stash

React hooks for app-wide data access and manipulation

action actions data hook hooks react store

Last synced: 09 May 2026