An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kawai-senpai/potatodb

PotatoDB is a lightweight, file-based NoSQL database for Python projects, designed for easy setup and use in small-scale applications. Ideal for developers seeking simple data persistence without the complexity of traditional databases.

data database easy-to-use file-based json key-value lightweight nosql nosql-database persistence python simple

Last synced: 23 Oct 2025

https://github.com/equinor/data-marketplace

Easily find and check out data products

data product search

Last synced: 01 May 2025

https://github.com/schluppeck/ng-data-club

Nottingham Psychology data club resources

analysis data julialang maths matlab python r

Last synced: 10 Sep 2025

https://github.com/farhadrezvani/warframe-drops-pwa

A Warframe app that finds the best place to farm any in-game item by searching the official drop tables published by Digital Extremes.

data drop-data game preact pwa vite warframe

Last synced: 11 Oct 2025

https://github.com/gianlucatruda/project_sleep

A Quantified Self project in which I use about 40 nights of data to determine what helps and hinders my sleep.

data experiment matplotlib python quantified science self sleep visualization

Last synced: 03 Apr 2025

https://github.com/muneeb1030/finetune-tiny-llama

Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.

data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping

Last synced: 08 Apr 2026

https://github.com/bastianolea/economia_chile

Economic indicators for Chile, updated automatically every day, including GDP, IPSA, IMACEC, CPI, UF, copper price, foreign investment, and more

app chile data economia estado laboral meses social tiempo

Last synced: 04 Jul 2025

https://github.com/andygol/yamap

Yamap Ain't Map – deployment of OSM infrastructure project inspired by osm-seed

api data extract geo-data map openstreetmap osm

Last synced: 24 Jun 2025

https://github.com/themitosan/grpp

GRPP is a simple tool written in TypeScript that helps preserve Git repositories.

cli data git grpp linux preservation project repo repository

Last synced: 15 Jul 2025

https://github.com/nix1707/webscrapper-browserextension

Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.

chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping

Last synced: 21 Jun 2025

https://github.com/v4ss3ur/hierarchicaldatagrid.wpf

A WPF control that mixes DataGrid and TreeView functionalities, allowing for hierarchical, recursive data display with expandable nested rows. Ideal for complex data structures in an easy-to-use, MVVM-friendly tabular format.

controls data datagrid hierarchical hierarchical-data mvvm nested nested-objects nested-structures treeview wpf xaml

Last synced: 13 May 2025

https://github.com/rpidanny/streamline.js

A JavaScript class that reads and processes a stream line-by-line in order.

big-data data data-processing file-stream javascript stream streams typescript

Last synced: 08 Sep 2025
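
As a rough illustration of what "line-by-line in order" processing of a stream involves, here is a minimal Python sketch. It is not the streamline.js API: the function name `process_lines` and its `handler` parameter are hypothetical, chosen only for this example. It assumes the stream arrives as text chunks that may split a line anywhere.

```python
from typing import Callable, Iterable, List


def process_lines(chunks: Iterable[str], handler: Callable[[str], str]) -> List[str]:
    """Reassemble lines from arbitrary text chunks and handle them in arrival order.

    Hypothetical helper for illustration; not part of any listed library.
    """
    results: List[str] = []
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        # Split off every complete line; the trailing partial line stays buffered.
        *complete, buffer = buffer.split("\n")
        for line in complete:
            results.append(handler(line))
    if buffer:
        # Flush a final line that had no trailing newline.
        results.append(handler(buffer))
    return results


# Chunk boundaries fall in the middle of lines on purpose.
output = process_lines(["alpha\nbe", "ta\ngam", "ma"], str.upper)
print(output)
```

Buffering the incomplete tail of each chunk is what keeps lines whole and in order regardless of where the chunk boundaries fall.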

https://github.com/intercloud/gotsgen

Golang Time Series Data Generator

data generator golang library timeseries

Last synced: 20 Jun 2025

https://github.com/hanwentao/china-regions

Data of China's Regions

china data geography

Last synced: 13 Apr 2025

https://github.com/urunov/algorithms

algorithm, data structure, dynamic array, dynamic programming

algorithms algorithms-and-data-structures data dynamic-programming

Last synced: 20 Mar 2025

https://github.com/mbanq/dupe

Fake banking data for your front- or backend

backend data datagenerator fake faker frontend javascript nodejs npm npm-package

Last synced: 13 May 2025

https://github.com/rn0x/app.altaqwaa.org

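A comprehensive Islamic website containing adhkar (remembrances) and the Holy Quran recited by a large number of reciters, along with tafsir (exegesis) and Hisn al-Muslim (Fortress of the Muslim). The site also includes an electronic tasbih counter, Islamic radio stations, and prayer times.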

app broadcasts data islam muslim prayer quran tafsir website website-design

Last synced: 07 Aug 2025

https://github.com/olajideolagunju/gcp_mage_data_pipeline

An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.

automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders

Last synced: 07 Mar 2025

https://github.com/nickmcintyre/processing-netcdf

Simple access to scientific datasets with Processing

data netcdf processing

Last synced: 11 Apr 2025

https://github.com/justintime50/dad

Dummy Address Data (DAD) - Real addresses from all around the world.

address addresses country dad data dummy dummy-data json real world

Last synced: 18 Feb 2026

https://github.com/njraladdin/newspapers-com-scraper

A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.

archive data newspapers scraper scraper-api scraping

Last synced: 06 Apr 2025

https://github.com/codiepp/elykseer-base

cryptographic data archive; written in F#; envisaged to stay another 10 years

archive cli cryptography data distributed-storage dotnet fsharp longterm-storage

Last synced: 19 May 2026

https://github.com/ferhatgec/kedi

Fegeya Kedi, Experimental Data Interface.

cpp cpp17 data data-interchange data-interface fegeya gnu json library linux xml

Last synced: 14 Apr 2025

https://github.com/csengupta1101/dig-student-files

This repository will contain all student submissions in one place.

data datascience education machine-learning python students visualization

Last synced: 17 Jul 2025

https://github.com/suh1z/rakkauttify_fullstack

CS2 data and statistics dashboard (full-stack project)

analytics data expressjs gaming mongo nodejs react redux

Last synced: 24 Oct 2025

https://github.com/cicerops/monitoring-check-grafana

Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.

data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale

Last synced: 08 May 2026

https://github.com/cdcgov/nchsdata

NCHS data: public use files (PUFs) from the National Center for Health Statistics (NCHS)

data public-health r survey survey-data

Last synced: 02 Jul 2026

https://github.com/definetlynotai/llm_data

The Python source code of a number of very famous repos, collected in this repo as pure localdocs for training code AI

c code-examples cpp cuda data data-dum jupyter-notebook llm llm-code llm-datasets programming-data programming-data-sets python3

Last synced: 08 Oct 2025

https://github.com/baaziznasser/qurani

A Quran application with a simple interface and fantastic features, with large databases of the Holy Quran and its tafsir (exegesis)

base data i3rab json quran qurani sql tafsir

Last synced: 12 Feb 2026

https://github.com/bluegreen-labs/oneflux_containers

Containerized (docker) versions of the ONEFlux processing pipeline

data ecosystem fluxes micrometeorology processing

Last synced: 07 Oct 2025

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 06 Feb 2026

https://github.com/tosun-si/world-cup-qatar-team-stats-kotlin-midgard

This application shows a full Apache Beam pipeline with Kotlin and the Midgard library. The use case works on data from the 2022 FIFA World Cup in Qatar and calculates player statistics per team. This application will be presented at Beam Summit 2023 in New York

apache-beam beam-summit data kotlin midgard world-cup-2022

Last synced: 01 Feb 2026

https://github.com/corentinb/txtoredis

🔥 Push each line of a text file to a Redis set

data datascience dataset go golang redis set

Last synced: 24 Apr 2026

https://github.com/thitlwincoder/browser_data

Dart package to retrieve browser data.

bookmark browser dart data flutter history package

Last synced: 23 Feb 2026

https://github.com/abrudz/parsing

Dyalog APL expressions to parse common and unusual data formats from text files

apl csv data data-format dyalog-apl dyalogapl parsing

Last synced: 20 Mar 2026

https://github.com/chompfoods/sdk-csharp

C# SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp csharp csharp-sdk data database dll food grocery ingredients nuget nutrition raw recipes recipes-api restsharp sdk swagger

Last synced: 06 May 2026

https://github.com/0xdir/htcds_dart

Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.

data humanitarian schema standards

Last synced: 24 Oct 2025

https://github.com/vrm-piyush/python-projects

Open source Python Projects. Feel Free to contribute!

data dataanalysis games open-source pygame-games python python-app

Last synced: 26 Feb 2026

https://github.com/sneels/parkds

Connect all your data sources via one process (cross-domain and single-domain)

cross-domain data database datasource datasources javascript source

Last synced: 24 Feb 2026

https://github.com/sparkpost/event-data

self-hosted message events

api aws data email webhooks

Last synced: 29 Apr 2026

https://github.com/reala10n/simplejsondb

Create a simple JSON database with just one line of code!

data database db easy json python simple

Last synced: 27 Oct 2025

https://github.com/binarybardakshat/suryanayan

Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.

data drone nlp python solar

Last synced: 10 Oct 2025

https://github.com/geocollections/emaapou

eMaapõu: database of Estonia's subsurface

data database estonia geology portal

Last synced: 05 Feb 2026

https://github.com/muhammadibrahim313/datavue

"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.

analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit

Last synced: 10 Apr 2025

https://github.com/louisbrulenaudet/legalkit-pipeline

Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic README.md.

data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python

Last synced: 17 Mar 2025

https://github.com/paladique/azuresample-guestbook

Guestbook using MySQL and Cosmos DB on Azure

cosmosdb data mysql spa websockets

Last synced: 30 Apr 2026

https://github.com/d8a-tech/d8a

A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.

bigquery clickhouse data ga4 tracker

Last synced: 10 Apr 2026

https://github.com/woctezuma/steam-reviews-data

Data available to compute statistics of Steam reviews.

data steam steam-reviews

Last synced: 19 Mar 2026

https://github.com/d2hydro/fewspy

A Python API for the Deltares FEWS PI REST Web Service

data geopandas hydrology hydrometrics pandas python

Last synced: 23 Apr 2026

https://github.com/mongodb-developer/rocket-analytics

Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.

data federation lucene lucenesearch mongodb s3 search sql

Last synced: 28 Apr 2026

https://github.com/kanugurajesh/firebase-data

Adding data to firebase store

data firebase firebase-database python

Last synced: 27 Apr 2026

https://github.com/luminovrym/pbo-biodata

A simulation of how to input data with OOP

data oop-in-php php-native

Last synced: 18 Jun 2026

https://github.com/anicolaspp/mapr-data-gen

Data generator for MapR Data Platform

data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark

Last synced: 29 Apr 2026

https://github.com/huangcongqing/ranking-list

Data !important | A compilation of data from all kinds of rankings and charts, for the era in which data is king

data rank ranking

Last synced: 15 Feb 2026

https://github.com/justjavac/deno_data_dir

Returns the path to the user's data directory.

data deno deno-module deno-modules directory

Last synced: 27 Apr 2026

https://github.com/lilingxi01/bloark

Blocks Architecture (BloArk) project package for building the Blocks-0 dataset and way beyond.

architecture bloark data revision-based

Last synced: 05 Apr 2026

https://github.com/justfairdev/web-stack-query

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

async cache data fetch graphql hooks query react resources rest

Last synced: 09 May 2026

https://github.com/floriancassayre/nicknames-datasets

Open source nickname sets with information about the origin(s) of the data.

data data-mining dataset

Last synced: 08 Feb 2026

https://github.com/cmudig/mosaic-profiler

A data profiler built with Mosaic

data jupyter visualization

Last synced: 25 Oct 2025

https://github.com/metapsy-project/data-gambling-psyctr

Database of psychological interventions for problem gambling and gambling disorder.

data

Last synced: 02 Apr 2026

https://github.com/cnayan/q-server

Provides an API for back-end server connectivity; an MS SQL Server connector is provided.

data database provider q-server query query-engine

Last synced: 09 Oct 2025

https://github.com/qeeqbox/data-compliance

Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse

compliance data data-compliance infosecsimplified qeeqbox

Last synced: 19 Mar 2026

https://github.com/akuzko/use-stash

React hooks for app-wide data access and manipulation

action actions data hook hooks react store

Last synced: 09 May 2026

https://github.com/jderstd/spec

A standard for JSON responses

data error jder json response specification structure

Last synced: 13 May 2026

https://github.com/fforres/webpack-plugin-dx-metrics

Webpack plugin to track webpack behaviour in Datadog

data datadog developer-experience typescript visualization webpack

Last synced: 13 Feb 2026

https://github.com/joamag/pandas

Loads of pandas data from China with awesome data

data data-analysis jupyter notebook pandas

Last synced: 25 Apr 2026

https://github.com/robertmyles/riscobrasil

An R package to download 'Brazil Risk' data 📈

brazil data finance r

Last synced: 08 Apr 2025

https://github.com/imagodata/filter_mate

FilterMate is a QGIS plugin, an everyday companion that allows you to easily filter your vector layers

data exploratory-data-analysis filter geospatial ogr postgis qgis qgis-plugin qgis3 qgis3-plugin spatialite sql vector-database

Last synced: 29 Apr 2026

https://github.com/reubano/ckanutils

A Python library for interacting with CKAN instances

ckan data library open-data

Last synced: 10 Feb 2026

https://github.com/mutasim77/dbt-analytics

🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.

big-query data data-analysis dbt warehouse

Last synced: 25 Feb 2026

https://github.com/iondv/metrics

IONDV. Framework application: Metrics collects and displays metrics data.

collecting data data-analysis iondv iondv-app metrics

Last synced: 10 Feb 2026

https://github.com/hadro/brewery-guides

The data for guides to breweries across the United States from 1896 to 1918

brewers brewery-guides brewing brewing-history data dataset digital-collections digital-humanities hocr nypl open-data

Last synced: 16 Mar 2026

https://github.com/potch/whizzy

A prototype rich data editor for GitHub

csv csvconf data github

Last synced: 01 May 2026

https://github.com/critocrito/data-scores-in-the-uk

Investigate the uses of data analytics and algorithms in public services in the UK.

clojure data data-investigation data-preservation javascript social-sciences sugarcube uk

Last synced: 18 Oct 2025

https://github.com/jmsallan/esdata

An R package to bring Spanish economic databases into the R environment

data datasets ine inflation spain unemployment-data

Last synced: 18 Jan 2026

https://github.com/as/worm

Worm provides write-once read-many log-structured storage semantics

data log record storage worm

Last synced: 31 Jan 2026
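
To make "write-once read-many log-structured storage semantics" concrete, here is a minimal in-memory Python sketch. It is not the worm package's API: the class `WormLog` and its methods are hypothetical names for this illustration, and a real implementation would presumably persist records to disk rather than hold them in a list.

```python
from typing import List


class WormLog:
    """In-memory write-once read-many log (illustrative only).

    Records can be appended and read back, but never modified or deleted.
    """

    def __init__(self) -> None:
        self._records: List[bytes] = []

    def append(self, data: bytes) -> int:
        """Append a record to the end of the log and return its offset."""
        # Copy into immutable bytes so later changes by the caller cannot alter the record.
        self._records.append(bytes(data))
        return len(self._records) - 1

    def read(self, offset: int) -> bytes:
        """Return the record stored at the given offset."""
        if not 0 <= offset < len(self._records):
            raise IndexError(f"no record at offset {offset}")
        return self._records[offset]

    def __len__(self) -> int:
        return len(self._records)


log = WormLog()
first = log.append(b"created")
second = log.append(bytearray(b"updated"))
print(first, second, log.read(second))
```

The class deliberately exposes no update or delete method: a changed value is written as a new record at the end of the log, and earlier records remain readable at their original offsets.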

https://github.com/gadenbuie/crantrack

Hourly snapshots of CRAN's incoming packages folder

cran data r-packages

Last synced: 12 Mar 2026

https://github.com/jsdhami/python-for-research

"Python-For-Research" Event Organized By Tri-Chandra Research Group, Ghantaghar, Kathmandu

analysis colab data jupyter matplotlib numpy panda physics python research visualization

Last synced: 27 Oct 2025