An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/metapsy-project/data-gambling-psyctr

Database of psychological interventions for problem gambling and gambling disorder.

data

Last synced: 02 Apr 2026

https://github.com/d8a-tech/d8a

A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.

bigquery clickhouse data ga4 tracker

Last synced: 10 Apr 2026

https://github.com/d2hydro/fewspy

A Python API for the Deltares FEWS PI REST Web Service

data geopandas hydrology hydrometrics pandas python

Last synced: 23 Apr 2026

https://github.com/mrlynn/30-min-data-web-form

30 Minutes to a Data Enabled Web Form with MongoDB

beginner data html html-form javascript mongodb mongodb-atlas mongodb-database web webforms

Last synced: 15 Apr 2026

https://github.com/hsyntes/data-modeling

A Backend application that provides Advanced Data Modeling and Schema Design with MongoDB, mongoose in Node.js & Express

data database datamodeling express modeling mongodb mongoose nodejs schema

Last synced: 10 Apr 2026

https://github.com/openintrostat/airports

📦 R package for data on airports 🛫

data openintro rstats rstats-package

Last synced: 22 Feb 2026

https://github.com/mabel-dev/opteryx-catalog

📚 Opteryx Cloud Catalog

catalog data python sql

Last synced: 27 Feb 2026

https://github.com/alexfreska/elixir_skynet

Elixir client for uploading and downloading files from Sia Skynet

cdn data elixir sia sia-skynet skynet storage

Last synced: 15 Apr 2025

https://github.com/onaio/gisida-react

React Dashboard library for Gisida.

dashboard data gisida map react visualization

Last synced: 28 Apr 2025

https://github.com/skhg/weather-station

🌥 ESP8266-based personal weather station project

air-quality arduino cpp data electronics environment esp8266 weather

Last synced: 16 Jan 2026

https://github.com/potch/whizzy

A prototype rich data editor for GitHub

csv csvconf data github

Last synced: 01 May 2026

https://github.com/geocollections/emaapou

eMaapõu: Eesti maapõue andmebaas

data database estonia geology portal

Last synced: 05 Feb 2026

https://github.com/smolsoftboi/php-faker-providers

Faker providers that generate fake data for you.

data faker faker-generator faker-provider generator php

Last synced: 22 Apr 2025

https://github.com/cicerops/monitoring-check-grafana

Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.

data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale

Last synced: 08 May 2026

https://github.com/haideralipunjabi/harrypotter-analysis

Repository with code to generate visualisations of Harry Potter Fanfiction and Books

analysis data harry-potter python visualization wordcloud

Last synced: 25 Mar 2025

https://github.com/robertmyles/riscobrasil

An R package to download 'Brazil Risk' data :chart_with_upwards_trend:

brazil data finance r

Last synced: 08 Apr 2025

https://github.com/elianhugh/streams

Flexible data streaming for R

data package r r-package streaming

Last synced: 26 May 2026

https://github.com/juliapsychometrics/psychometrictests.jl

Efficient data structures for psychometric modeling in Julia

data julia psychometrics

Last synced: 11 Jun 2025

https://github.com/anicolaspp/mapr-data-gen

Data generator for MapR Data Platform

data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark

Last synced: 29 Apr 2026

https://github.com/datafold/vhol-demo

Get hands-on examples of dbt + Datafold CI/CD workflows

data data-engineering datafold dbt diff

Last synced: 28 Dec 2025

https://github.com/ondata/opensdmx

Python CLI and library for any SDMX 2.1 REST API — Eurostat, ISTAT, OECD, ECB, World Bank and more. AI-ready.

cli data eurostat istat oecd open-data python rest-api sdmx statistics

Last synced: 01 May 2026

https://github.com/joaocarmo/react-very-simple-data-table

When all you want is a table

data react simple table

Last synced: 06 Mar 2025

https://github.com/rikvdh/zabuffer

Zero-Allocation buffer handling in C

buffer c clib data embedded memory string zero-allocation

Last synced: 03 Mar 2025

https://github.com/purarue/listenbrainz_export

Export your scrobbling history from ListenBrainz

data data-export music scrobbling

Last synced: 24 Jan 2026

https://github.com/squareslab/probabilisticmodel_saner2018

Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018

code data mausotog published replication

Last synced: 26 Oct 2025

https://github.com/slipke/eurlex-model-go

This projects implements the EUR-Lex XML data model in Golang. For more information see README.md

data datamodel eur-lex eurlex webservice

Last synced: 09 Mar 2026

https://github.com/codewithmide/solexplorer

Next-generation AI powered Solana data explorer

dashboard data explorer solana svm

Last synced: 14 Feb 2026

https://github.com/lxcoding06/e-gereja

Website CRUD untuk Gereja, untuk mengatur data jemaat, data kematian, data pernikahan dan data baptis

data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan

Last synced: 15 May 2025

https://github.com/mo-karbalaee/introduction-to-data-science-sbu

Reports and full documentation of the introduction to data science course held at SBU

data data-science python shahid-beheshti-university

Last synced: 02 Aug 2025

https://github.com/purarue/bleanser

my bleanser modules

data

Last synced: 22 Feb 2026

https://github.com/lovethebomb/data-tiles

🍜 Data Tiles is a small website that shows data.

data express javascript nextjs typescript

Last synced: 10 Apr 2026

https://github.com/banbord/data-vis-tornados

This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.

d3-visualization d3js data javascript json visualization

Last synced: 24 Feb 2026

https://github.com/mollybeach/cherryether

CherryEther: Typescript Staking Deposits Ethereum Transactions

blockchain data data-science ethereum typescripts

Last synced: 21 May 2026

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

This Python project is made by me, Python project for improving python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn

These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python

analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/relintai/ess_data

Godot plugin that helps to create/manage resource files.

addon data data-management godot

Last synced: 18 Aug 2025

https://github.com/bastianolea/siedu_indicadores_urbanos

Datos del Sistema de Indicadores y Estándares de Desarrollo Urbano, con datos comunales sobre temas como transporte, urbanismo, servicios básicos, calidad de vida y más.

ambiental app chile ciudad comunas data estado social

Last synced: 19 Feb 2026

https://github.com/mahmoud-saeed-mahmoud/loading_state_handler

The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.

dart data error flutter flutter-package flutter-widget loading state

Last synced: 10 Mar 2026

https://github.com/vutran/yahoo-stocks-cli

Fetch stock data from the CLI

cli data finance stocks yahoo

Last synced: 08 Jun 2026

https://github.com/oliver021/entity-dock

A superset with libraries, components, tools and more to work with entity on .Net

api asp-net-core controller data database dotnet entity entity-framework-core library model mvc netstandard orm support webapi

Last synced: 09 May 2026

https://github.com/mmaithani/loan-approvel-ml-model-with-insights

This project will approved or reject the loan applications. Public api, data insights and predictive models for loan prediction project are also provided

data data-science loan-prediction-analysis machine-learning visualization

Last synced: 16 Aug 2025

https://github.com/ayoub-amzil/offline-globe

Offline country data for PHP Laravel framework. Over 200 countries, capitals, flags, languages, currencies. No internet needed.

composer data internet laravel offline php

Last synced: 09 May 2026

https://github.com/erwan-simon/aws-data-platform-framework

A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.

aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module

Last synced: 23 May 2026

https://github.com/physio/flatten-ts

Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.

array conversion data flatten javascript json object typescript

Last synced: 06 May 2026

https://github.com/mujadded/facebook_scrapper

The fcebook scrapper gem that dont need the api

data data-mining facebook ruby-gem scrapper selenium-webdriver

Last synced: 28 Oct 2025

https://github.com/djthorpe/data

Data extraction, transformation, processing and visualisation

canvas csv data data-extraction data-transformation dom golang svg visualization

Last synced: 07 Sep 2025

https://github.com/yorkulibraries/vendorpol

URLs for vendor privacy policies and terms of use.

data libraries privacy-policy

Last synced: 15 Oct 2025

https://github.com/caelean/twittermap

Map of twitter user's influence as defined on by influencetracker

data google-maps maps sparql twitter visualization

Last synced: 14 Jun 2025

https://github.com/ctechhindi/auto-fill-form-data

AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME

autocomplete chrome-extension data extension

Last synced: 17 Apr 2026

https://github.com/stefen-taime/open-source-data

This repository contains structured datasets in various categories

csv data json python3 xml

Last synced: 19 Feb 2026

https://github.com/kshitij1235/boxdb

This a database managment lib made for python, which works like any Libraries and is very lite no aditional setup require but there is some procedure to create a project is very easy.

boxdb data database library python

Last synced: 14 Jan 2026

https://github.com/tayeva/eia-client-python

EIA Open Data API Client - Python

data open-source python python-3 python3

Last synced: 14 Oct 2025

https://github.com/nrennie/data

A collection of random datasets, either from web-scraping or processing more complex data.

data

Last synced: 30 May 2026

https://github.com/anandchowdhary/health

🫀 @AnandChowdhary's body measurements

csv data fitness github-actions health

Last synced: 29 Apr 2026

https://github.com/ssiarhei115/customer-classification

Developing ML model predicting bank' customer inclination to open a deposit

big-data big-data-analytics data data-science data-visualization mashine-learning

Last synced: 09 Apr 2025

https://github.com/luminati-io/Pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 09 Apr 2025

https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag

This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.

artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation

Last synced: 12 Feb 2026

https://github.com/peterdavehello/nrd-list-archive

🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨

archive data nrd

Last synced: 17 Mar 2026

https://github.com/mrsaeeddev/data-science-roadmap-for-beginners

📈 A minimal and easy road map for beginners who want to dive into the field of Data Science

data data-science datascience python

Last synced: 29 Jun 2025

https://github.com/financejs/discord-bot

A Discord Bot Used In Financejs Discord Server

data discord discord-bot discordjs-bot finance financejs financial

Last synced: 13 Apr 2026

https://github.com/doughtnerd/pod

Read and write Excel data with Java

data excel extract poi-library

Last synced: 08 Apr 2025

https://github.com/divithraju/divith-raju-searchengine-wikipedia

search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.

algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia

Last synced: 16 May 2026

https://github.com/bernard-ng/drc-news-corpus

DRC News Corpus : Towards a scalable and efficient system for Congolese news dataset curation

aggregator data news nlp politics

Last synced: 06 Sep 2025

https://github.com/secret-guest/file_organizer

Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.

c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory

Last synced: 10 Jun 2026

https://github.com/felipesousa/usa-cities-api

A JSON with all USA cities/states.

cities data json nodejs usa

Last synced: 03 May 2026

https://github.com/wonderium/browser-releases

This repository contains release dates for browser versions.

browsers data json releases wonderium

Last synced: 31 Jan 2026

https://github.com/leeper/mcode

Functions to merge and recode across multiple variables

data data-transformation r recode recoding

Last synced: 16 May 2025

https://github.com/georgetdn/syscppcplinux

Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)

class data framework linux object persistence serialize sql

Last synced: 12 Feb 2026

https://github.com/yanpitangui/iteminfoconverter

Application that converts ragnarok legacy data files to iteminfo.lua

data itemdbconf iteminfo luafiles ragnarok

Last synced: 12 Oct 2025

https://github.com/stdlib-js/ndarray-base-char2dtype

Return the data type string associated with a provided single letter abbreviation.

abbr abbreviation array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 12 Mar 2026

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/espoirmur/balobi_nini

An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling technics to discover which topics Congolese are talking about on Twitter.

data nlp nlp-machine-learning

Last synced: 24 Aug 2025

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/subnwa/sql

A starter code for creating a SQL database.

base data database master microsoft sql

Last synced: 06 Mar 2025

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003

US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 12 Oct 2025