An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/karashiiro/lodestone-id-time

Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.

data ffxiv ffxiv-character lodestone

Last synced: 30 Jun 2025

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/srijanshetty/amfitools

Tools to get the open NAV for any MF in India

amfi cli data funds india investing mutual nav

Last synced: 04 Oct 2025

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/oliverhennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 16 Apr 2026

https://github.com/liamross/use-data

A React hook for async fetching of data, data manipulation, and take latest vs take every functionality.

async data hook hooks react

Last synced: 22 Jan 2026

https://github.com/zig-utils/zig-faker

A high-performance, lightweight fake data generator. Generate realistic fake data for testing, prototyping, and development.

data faker library mocker zig

Last synced: 01 Apr 2026

https://github.com/deepwaterpaladin/statscanpy

Basic package for querying & downloading StatsCan data by table name.

api data

Last synced: 16 Jan 2026

https://github.com/mihasm/arso-scraper

Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.

api arso cli data historical-data meteorological python slovenia weather

Last synced: 14 Feb 2026

https://github.com/jimut123/scrapers

All Scrapers that I'll build

bs4 data python3 real-time-visualisations scrapers scrapy wget

Last synced: 16 Jan 2026

https://github.com/sadcenter/messenger

Data messaging system between servers using popular messaging brokers

data message

Last synced: 06 Aug 2025

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/steelcake/cherry-pipelines

A collection of pipelines built with cherry

blockchain clickhouse data pipeline pyhton

Last synced: 09 Mar 2026

https://github.com/bradlindblad/quotableoffice

Repo for the quotable office R Shiny app

data datascience golem-apps r shiny shiny-apps text text-mining

Last synced: 26 May 2026

https://github.com/debdutto/algorhythm

Algorithmic music driven by data and / or algorithms

algorithm data music nodejs

Last synced: 18 Apr 2026

https://github.com/jazeee/dexcom-android-wall-panel

Display data as a Graph on Android, jazeee data plotter

android data jazeee plotter

Last synced: 02 May 2026

https://github.com/uk-ipop/open-data-pipeline

A pipeline for processing, enhancing, and sharing open datasets.

actions automation data python

Last synced: 25 May 2026

https://github.com/dewasry/browser-base

A tool to help developer store data ofline on browser

angular borwser data indexeddb nextjs orm query react typescript vite vue

Last synced: 13 Feb 2026

https://github.com/weisscharlesj/data_scicompforchem

Zipped data for SciCompforChem book for easy download

chemistry chemistry-education data data-visualization python

Last synced: 07 Nov 2025

https://github.com/wamphlett/input-collection

A smarter and stricter way to capture and validate request data

data dto forms php validation

Last synced: 27 May 2026

https://github.com/mlr-org/mlr3data

Data sets used in the book, gallery, or in examples of mlr3.

data data-science data-sets machine-learning mlr3 r r-package

Last synced: 09 Apr 2025

https://github.com/y0hnn/slack-file-downloader

Download files from Slack servers with an export dataset. Useful when wanting to quit Slack but keep your files with you.

channels data export gdpr privacy slack

Last synced: 27 Apr 2026

https://github.com/quetz-al/quetzal-client

Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 28 Jul 2025

https://github.com/freight-trust/edi-onboarding

ESC Guidelines for X12/EDIFACT Messages

b2b data data-interchange edi edi-xml edifact enterprise x12

Last synced: 04 Mar 2026

https://github.com/amacd31/daily_hydromet_sample_data

This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.

data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow

Last synced: 16 Jan 2026

https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag

This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.

artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation

Last synced: 12 Feb 2026

https://github.com/stefen-taime/open-source-data

This repository contains structured datasets in various categories

csv data json python3 xml

Last synced: 19 Feb 2026

https://github.com/joelllllll/up-sync

Sync account and transaction data from up bank to your local environment

accounts bank data postgres sync transactions up upbank

Last synced: 06 Jul 2025

https://github.com/agnosticeng/agx

Query and explore local and remote data with Clickhouse

clickhouse d3 data rust svelte

Last synced: 26 Oct 2025

https://github.com/ondata/opensdmx

Python CLI and library for any SDMX 2.1 REST API — Eurostat, ISTAT, OECD, ECB, World Bank and more. AI-ready.

cli data eurostat istat oecd open-data python rest-api sdmx statistics

Last synced: 01 May 2026

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/yaoguangduan/protosync

generate go code from protobuf ,sync proto dirty data

data golang protobuf sync

Last synced: 12 Mar 2026

https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django

A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data

analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas

Last synced: 30 Apr 2026

https://github.com/openpeeps/zxc-nim

Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed

archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc

Last synced: 07 Jun 2026

https://github.com/ssiarhei115/customer-classification

Developing ML model predicting bank' customer inclination to open a deposit

big-data big-data-analytics data data-science data-visualization mashine-learning

Last synced: 09 Apr 2025

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026

https://github.com/p32929/use-megamind

A simple react hook for managing asynchronous function calls with ease on the client side

async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript

Last synced: 23 Jan 2026

https://github.com/0xdir/relief_web_dart

A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters

api dart data humanitarian jobs

Last synced: 11 Jun 2026

https://github.com/kocyigitkim/realtime.io

Real time data streaming & socket programming library

data realtime socket streaming

Last synced: 29 Jul 2025

https://github.com/hmeleiro/opencis

R package to import data from spanish Sociological Research Center (CIS)

abiertos api centro cis data datos estudios open r sociologicos

Last synced: 31 Jul 2025

https://github.com/financejs/discord-bot

A Discord Bot Used In Financejs Discord Server

data discord discord-bot discordjs-bot finance financejs financial

Last synced: 13 Apr 2026

https://github.com/divithraju/divith-raju-searchengine-wikipedia

search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.

algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia

Last synced: 16 May 2026

https://github.com/yakupzengin/data-structures-and-algortihms

This repo contains implementation of data structures and algorithms using JAVA

algorithms algorithms-and-data-structures data structure

Last synced: 03 Dec 2025

https://github.com/rastmob/wordpress-llms-output-plugin

A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).

ai data llm llms training training-data wordpress wordpress-development wordpress-plugin

Last synced: 03 May 2026

https://github.com/Duartemartins/dados

Resultados de Eleições Portuguesas por Freguesia

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/defano/chicago-oasis

A visualization of Chicago business accessibility by neighborhood or census tract.

census chicago data data-science javascript neighborhood

Last synced: 11 Mar 2026

https://github.com/anandchowdhary/health

🫀 @AnandChowdhary's body measurements

csv data fitness github-actions health

Last synced: 29 Apr 2026

https://github.com/bkamapantula/india-pc-nfhs4

Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.

data government health india nfhs nfhs4

Last synced: 19 Mar 2026

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 15 May 2025

https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection

This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.

anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning

Last synced: 20 Apr 2026

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003

US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 12 Oct 2025

https://github.com/feltex/datahora-java

Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.

brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime

Last synced: 18 Jun 2026

https://github.com/doughtnerd/pod

Read and write Excel data with Java

data excel extract poi-library

Last synced: 08 Apr 2025

https://github.com/olegegoism/datagenerator

Django web application for managing database connections and generating test data.

app application big-data csv data database dataset db django fake generator schema teable work

Last synced: 26 Oct 2025

https://github.com/gonzalezlrjesus/covid-19API

Convierte la data ofrecida por: the Johns Hopkins University Center en formato CSV al formato JSON sobre los casos confirmados, muertos y recuperados de COVID-19 por paises.

api api-rest api-server coronavirus covid-19 data go golang json

Last synced: 06 May 2025

https://github.com/wonderium/browser-releases

This repository contains release dates for browser versions.

browsers data json releases wonderium

Last synced: 31 Jan 2026

https://github.com/windwalker-io/data

[READ ONLY] A library contains data/collection objects with null-object pattern.

collection collections data data-object iterator nullobject value-object

Last synced: 12 Mar 2026

https://github.com/automators-com/datamaker-js

The official Node.js / Typescript library for the DataMaker API

data javascript nodejs typescript

Last synced: 11 Oct 2025

https://github.com/norton120/dfmock

Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/georgetdn/syscppcplinux

Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)

class data framework linux object persistence serialize sql

Last synced: 12 Feb 2026

https://github.com/mitranim/gt

Short for "Go Types". Important data types missing from the Go standard library.

data date datetime go golang interval json null-safety sql time url uuid

Last synced: 10 May 2026

https://github.com/wibosco/modelingformchanges-example

An example project to show how we can implement a model to simplify form validation

data swift unit-testing validator

Last synced: 16 Mar 2025

https://github.com/simoneas02/data-science

🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻

data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql

Last synced: 12 Apr 2026

https://github.com/sdhutchins/jxn-open-data-api

Access Jackson, MS open government data using a python API wrapper.

api data jackson jxn mississippi open-gov

Last synced: 08 Apr 2025

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 13 Jun 2025

https://github.com/utrechtuniversity/dataprivacyproject

This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.

data gdpr open-science privacy rdm research research-data-management utrecht-university

Last synced: 10 Oct 2025

https://github.com/qit-tools/unicode-emoji-json-lite

This library provides a lightweight version of the unicode-emoji-json library.

data emoji emojipedia emojis json lite unicode

Last synced: 07 Jan 2026

https://github.com/leeper/mcode

Functions to merge and recode across multiple variables

data data-transformation r recode recoding

Last synced: 16 May 2025

https://github.com/stefen-taime/real-time-data-pipeline-snake-game

Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API

chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis

Last synced: 04 May 2026

https://github.com/andreaselia/quotes-xd

A plugin for Adobe XD to insert a text element with a random quote and respective author.

adobe adobe-xd data design design-tool design-tools quote random xd

Last synced: 24 Apr 2026

https://github.com/jinsyin/datalink

⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/ymougenel/referencecollector

Helps you gather, store and share references links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/14richa/patient-readmission-analysis

This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.

analytics data jupyter-notebook numpy

Last synced: 29 Apr 2026

https://github.com/rdmpage/checklist-of-the-freshwater-snails-of-sabah

Data from A preliminary checklist of the freshwater snails of Sabah (Malaysian Borneo) deposited in the BORNEENSIS collection, Universiti Malaysia Sabah https://doi.org/10.3897/zookeys.673.12544

checklist data gbif google-earth kmz sabah

Last synced: 09 Mar 2026

https://github.com/physio/flatten-ts

Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.

array conversion data flatten javascript json object typescript

Last synced: 06 May 2026

https://github.com/squareslab/probabilisticmodel_saner2018

Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018

code data mausotog published replication

Last synced: 26 Oct 2025

https://github.com/asidlo/po

Data science library for manipulating data in Go using the familiar DataFrame and Series constructs from the Python Pandas library.

data dataframe go pandas series

Last synced: 14 Jan 2026

https://github.com/ismet55555/pdw-asym-2link

Clear and easy way of simulating a passive dynamic walker (PDW) model derived and exectured using MATLAB.

data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator

Last synced: 29 Apr 2026