An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/tayeva/eia-client-python

EIA Open Data API Client - Python

data open-source python python-3 python3

Last synced: 14 Oct 2025

https://github.com/skywarth/fenrir-wolfpack-simulator

Simulates wolfpack behaviours and the future of the pack in an environment, using JavaScript and data trees.

data data-structures javascript max-heap simulation simulations wolfpack

Last synced: 14 Oct 2025

https://github.com/yorkulibraries/vendorpol

URLs for vendor privacy policies and terms of use.

data libraries privacy-policy

Last synced: 15 Oct 2025

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/audeering/emodb

Publishes Berlin Database of Emotional Speech with audb

audb data emotion

Last synced: 19 Oct 2025

https://github.com/a3r0id/lightshot-data-miner

A random idea I had a while back to make a data miner for lightshot. Never released this, but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release it. I've included some output from a run before making the repo. I am not responsible for the imagery or its contents.

brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition

Last synced: 19 Oct 2025

https://github.com/olegegoism/datagenerator

Django web application for managing database connections and generating test data.

app application big-data csv data database dataset db django fake generator schema teable work

Last synced: 26 Oct 2025

https://github.com/squareslab/probabilisticmodel_saner2018

Paper and supporting materials for the Probabilistic Model paper accepted to SANER 2018

code data mausotog published replication

Last synced: 26 Oct 2025

https://github.com/mrnazu/eth-data-library

eth-data-library is a Node.js library that provides tools for accessing and processing data on the Ethereum blockchain.

blockchain data ethereum nodejs smart-contracts web3

Last synced: 28 Jan 2026

https://github.com/d3oxy/country-state-data

A comprehensive JSON dataset containing countries, states, cities, regions, and languages with TypeScript support. Perfect for building location-based dropdowns, address forms, and geographical applications.

address cities countries currency data dropdown geographical iso json languages location regions states typescript

Last synced: 24 Jan 2026

https://github.com/flrd/standardlastprofile

R Data Package for BDEW Standard Load Profiles in Electricity

data electricity germany r

Last synced: 16 Mar 2026

https://github.com/drkenreid/introductory-data-science

Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.

cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials

Last synced: 28 Jan 2026

https://github.com/banbord/data-vis-tornados

This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.

d3-visualization d3js data javascript json visualization

Last synced: 24 Feb 2026

https://github.com/cerema/groum

Command-line utility for converting traffic order data

data traffic

Last synced: 06 Feb 2026

https://github.com/asirihewage/simplest-xpath-web-scraper

Simplest web scraper created using Python3 and MongoDB

data data-mining python3 scraper web webscrping

Last synced: 29 Jan 2026

https://github.com/peterdavehello/nrd-list-archive

🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨

archive data nrd

Last synced: 17 Mar 2026

https://github.com/wonderium/browser-releases

This repository contains release dates for browser versions.

browsers data json releases wonderium

Last synced: 31 Jan 2026

https://github.com/dhimmel/het.io-rep-data

Data from Project Rephetio for the het.io website

browser data datatables drug-repurposing rephetio

Last synced: 07 Feb 2026

https://github.com/jaldekoa/nyfedapi

A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 08 Feb 2026

https://github.com/planarnetwork/feeds.planar.network

GTFS feeds for bus, train and plane

data feeds gtfs transit transportation

Last synced: 11 Feb 2026

https://github.com/bkamapantula/india-pc-nfhs4

Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.

data government health india nfhs nfhs4

Last synced: 19 Mar 2026

https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag

This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.

artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation

Last synced: 12 Feb 2026

https://github.com/aleklukanen/chapterhousedb-v1

Allows you to create simple data streaming warehouses written in Golang using Apache Parquet and Arrow.

arrow data database event golang ingestion parquet pipeline processing stream

Last synced: 27 Feb 2026

https://github.com/dewasry/browser-base

A tool to help developers store data offline in the browser

angular borwser data indexeddb nextjs orm query react typescript vite vue

Last synced: 13 Feb 2026

https://github.com/codewithmide/solexplorer

Next-generation AI powered Solana data explorer

dashboard data explorer solana svm

Last synced: 14 Feb 2026

https://github.com/ad4ndi/lsd

Low-level data copying utility

c cli data

Last synced: 14 Feb 2026

https://github.com/pawelzny/vo

DDD Value Object implementation

data ddd-patterns object python3 value

Last synced: 15 Feb 2026

https://github.com/acaciaman/db-autotest

Database test automation. This Python package lets you create a database object structure and load data from a database.

data database test-automation

Last synced: 05 May 2026

https://github.com/axa-ch/health-insurance-data

Swiss health insurance data

axa data health insurance swiss

Last synced: 19 Mar 2026

https://github.com/bdpedigo/neuropull

A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.

connectome connectomes connectomics data dataset networks networks-biology

Last synced: 05 Oct 2025

https://github.com/dantesc03/uberpool-case-study

This project was designed to understand the statistical effects of longer wait times on Uber rides, particularly on the user and driver experience with the Uber Pool system.

analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization

Last synced: 16 Apr 2026

https://github.com/socketsupply/dynavolt

A highly opinionated DynamoDB client for aws-sdk v3 using esm.

aws data database dynamo dynamodb key-value kvstore

Last synced: 05 May 2026

https://github.com/nixhantb/data-structures-and-algorithms-in-java-

Master Java programming and data structures and algorithms in Java efficiently. Clear concepts of recursion and sorting.

algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming

Last synced: 05 Jul 2025

https://github.com/asidlo/po

Data science library for manipulating data in Go using the familiar DataFrame and Series constructs from the Python Pandas library.

data dataframe go pandas series

Last synced: 14 Jan 2026

https://github.com/yashmistry-24/ytcomment-iq

YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.

analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube

Last synced: 15 Feb 2026

https://github.com/xtrendence/comp2001-coursework

Grade: 98%. COMP2001 Coursework by Khodadad (Adrian) Nouchin. A RESTful authentication API, and a linked data application.

api asp-net csharp data dataset linked-data php restful restful-api

Last synced: 13 Apr 2026

https://github.com/gauravkoradiya/tensorflow-data-and-deployement

This repository demonstrates the use of data and deployment pipelines in TensorFlow.

data deployment machine-learning-algorithms pipline tensorflowjs

Last synced: 06 Oct 2025

https://github.com/ium101/files-and-folders-lister-z

Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.

application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows

Last synced: 09 Oct 2025

https://github.com/y0hnn/slack-file-downloader

Download files from Slack servers with an export dataset. Useful when wanting to quit Slack but keep your files with you.

channels data export gdpr privacy slack

Last synced: 27 Apr 2026

https://github.com/oliver021/entity-dock

A superset of libraries, components, tools and more for working with entities on .NET

api asp-net-core controller data database dotnet entity entity-framework-core library model mvc netstandard orm support webapi

Last synced: 09 May 2026

https://github.com/quin1sue/priceguidesph-bettergov

An economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/priyanka7411/customer-segmentation-churn-dashboard

📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.

churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization

Last synced: 14 Apr 2026

https://github.com/countervolts/apple-music-stats-calculator

How to get your most streamed songs/artists

apple apple-music applemusic calculator data

Last synced: 11 Feb 2026

https://github.com/physio/flatten-ts

Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.

array conversion data flatten javascript json object typescript

Last synced: 06 May 2026

https://github.com/thyringer/cast

CLI tool for reading strings or complex data sets from CSV files to output them in other text formats.

csv-converter data data-preprocessing python python3 sql-builder

Last synced: 02 Feb 2026

https://github.com/acdh-oeaw/histogis-data

Data created by HistoGIS

data histogis

Last synced: 24 Oct 2025

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/liamross/use-data

A React hook for async fetching of data, data manipulation, and take-latest vs. take-every functionality.

async data hook hooks react

Last synced: 22 Jan 2026

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

We aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine whether the sentiment expressed is positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/plabayo/datapoints.earth

Earth data liberation for and by its citizens.

data foss free scrape

Last synced: 15 Mar 2026

https://github.com/figuran04/big-data

📃 Big Data practicum

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/nononoexe/setariaviridis

🌾 Field-collected data of green foxtail

data data-science dataset rpackage

Last synced: 27 Feb 2026

https://github.com/ballerina-platform/module-ballerina-data.csv

The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.

ballerina ballerina-csv csv csv-data data

Last synced: 29 Jan 2026

https://github.com/lindsaygelle/emojipedia

Go application. Simple program that scrapes unicode.org for Emoji content. Parses out HTML into categorically ordered data subsets. Explored from the command line.

cli data data-mining emoji emojipedia encyclopedia go golang golang-application html-scraping unicode-characters

Last synced: 11 Mar 2026

https://github.com/pommes-public/pommesdata

A full-featured transparent data preparation routine from raw data to POMMES model inputs

data opensource power raw-data transparent

Last synced: 07 Oct 2025

https://github.com/stdlib-js/array-base-filled-by

Create a filled generic array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map node node-js nodejs stdlib structure types

Last synced: 24 Apr 2025

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003

US birth data from 1994 to 2003, as provided by the Centers for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 12 Oct 2025

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain-text, human-readable, optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/junkwaxhero/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 24 Apr 2025

https://github.com/stdlib-js/ndarray-base-dtype-str2enum

Return the enumeration constant associated with an ndarray data type string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 15 Mar 2026

https://github.com/automators-com/datamaker-js

The official Node.js / TypeScript library for the DataMaker API

data javascript nodejs typescript

Last synced: 11 Oct 2025

https://github.com/norton120/dfmock

Python Pandas DataFrame mock generator. Need mocked data in a DataFrame? This is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/ndarville/ipa

HTML hex codes for the IPA table.

data ipa linguistics

Last synced: 24 Jan 2026

https://github.com/farovictor/mongodbextractor

This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDB databases.

data go mongodb pipeline

Last synced: 14 Jan 2026

https://github.com/woo071002/parcel-management-system

A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.

cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml

Last synced: 08 Oct 2025

https://github.com/p32929/use-megamind

A simple React hook for managing asynchronous function calls with ease on the client side

async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript

Last synced: 23 Jan 2026

https://github.com/dark-art108/yonk

A CLI utility to streamline data science work by creating templates

data machine-learning python3

Last synced: 08 May 2026

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/xxczaki/parsify-plugin-covid19

Parsify plugin that adds COVID-19-related variables 🦠

confirmed coronavirus covid19 data deaths fun math parser parsify parsify-plugin plugin variable variables

Last synced: 13 Mar 2026

https://github.com/ayoub-amzil/offline-globe

Offline country data for PHP Laravel framework. Over 200 countries, capitals, flags, languages, currencies. No internet needed.

composer data internet laravel offline php

Last synced: 09 May 2026

https://github.com/StudyResearchProjects/arrbuffstr

Creates strings from ArrayBuffers and vice versa in Node.js and the browser

arraybuffer browser data node string transform

Last synced: 09 Oct 2025