An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/djthorpe/data

Data extraction, transformation, processing and visualisation

canvas csv data data-extraction data-transformation dom golang svg visualization

Last synced: 07 Sep 2025

https://github.com/defano/chicago-oasis

A visualization of Chicago business accessibility by neighborhood or census tract.

census chicago data data-science javascript neighborhood

Last synced: 11 Mar 2026

https://github.com/clinical-genomics/housekeeper

File data orchestrator

data file orchestrator

Last synced: 15 Aug 2025

https://github.com/mmaithani/loan-approvel-ml-model-with-insights

This project will approved or reject the loan applications. Public api, data insights and predictive models for loan prediction project are also provided

data data-science loan-prediction-analysis machine-learning visualization

Last synced: 16 Aug 2025

https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn

These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python

analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/debdutto/algorhythm

Algorithmic music driven by data and / or algorithms

algorithm data music nodejs

Last synced: 18 Apr 2026

https://github.com/espoirmur/balobi_nini

An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling technics to discover which topics Congolese are talking about on Twitter.

data nlp nlp-machine-learning

Last synced: 24 Aug 2025

https://github.com/sectly/suchdb

SuchDB: An simple, dependency free, require & go node.js and browser database

browser data database db json node node-js nodejs npm npm-package

Last synced: 09 Mar 2026

https://github.com/slipke/eurlex-model-go

This projects implements the EUR-Lex XML data model in Golang. For more information see README.md

data datamodel eur-lex eurlex webservice

Last synced: 09 Mar 2026

https://github.com/gonzalezlrjesus/covid-19API

Convierte la data ofrecida por: the Johns Hopkins University Center en formato CSV al formato JSON sobre los casos confirmados, muertos y recuperados de COVID-19 por paises.

api api-rest api-server coronavirus covid-19 data go golang json

Last synced: 06 May 2025

https://github.com/subnwa/sql

A starter code for creating a SQL database.

base data database master microsoft sql

Last synced: 06 Mar 2025

https://github.com/mlr-org/mlr3data

Data sets used in the book, gallery, or in examples of mlr3.

data data-science data-sets machine-learning mlr3 r r-package

Last synced: 09 Apr 2025

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

This Python project is made by me, Python project for improving python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/andrew-johnson-4/misspeller

Take correctly spelled words and return common spelling mistakes

common-mistakes data language natural nlp processing rust

Last synced: 30 Apr 2025

https://github.com/mahmoud-saeed-mahmoud/loading_state_handler

The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.

dart data error flutter flutter-package flutter-widget loading state

Last synced: 10 Mar 2026

https://github.com/karashiiro/lodestone-id-time

Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.

data ffxiv ffxiv-character lodestone

Last synced: 30 Jun 2025

https://github.com/jimut123/scrapers

All Scrapers that I'll build

bs4 data python3 real-time-visualisations scrapers scrapy wget

Last synced: 16 Jan 2026

https://github.com/wamphlett/input-collection

A smarter and stricter way to capture and validate request data

data dto forms php validation

Last synced: 27 May 2026

https://github.com/georgetdn/syscppcp

Store C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)

class data framework object persistence serialize sql windows

Last synced: 04 Apr 2025

https://github.com/amacd31/daily_hydromet_sample_data

This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.

data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow

Last synced: 16 Jan 2026

https://github.com/stefen-taime/open-source-data

This repository contains structured datasets in various categories

csv data json python3 xml

Last synced: 19 Feb 2026

https://github.com/Duartemartins/dados

Resultados de Eleições Portuguesas por Freguesia

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/mrsaeeddev/data-science-roadmap-for-beginners

📈 A minimal and easy road map for beginners who want to dive into the field of Data Science

data data-science datascience python

Last synced: 29 Jun 2025

https://github.com/doughtnerd/pod

Read and write Excel data with Java

data excel extract poi-library

Last synced: 08 Apr 2025

https://github.com/sdhutchins/jxn-open-data-api

Access Jackson, MS open government data using a python API wrapper.

api data jackson jxn mississippi open-gov

Last synced: 08 Apr 2025

https://github.com/leeper/mcode

Functions to merge and recode across multiple variables

data data-transformation r recode recoding

Last synced: 16 May 2025

https://github.com/ymougenel/referencecollector

Helps you gather, store and share references links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/praveenpuglia/css-support

The source of truth for CSS browser support of info

api browser compatibility css data properties selectors support

Last synced: 31 Mar 2025

https://github.com/zoo-js/zoo-data

🍩 The data for zoo-js.

actions data js json nodejs workflow

Last synced: 22 Apr 2025

https://github.com/qit-tools/unicode-emoji-json-lite

This library provides a lightweight version of the unicode-emoji-json library.

data emoji emojipedia emojis json lite unicode

Last synced: 07 Jan 2026

https://github.com/cosmos-loops/cosmos-efcore

Cosmos.EntityFrameworkCore is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.

cosmos-loops data efcore entityframeworkcore

Last synced: 14 Aug 2025

https://github.com/vikashpr/18cse301j_ra2011003010737

This website tells the story of a nation's GDP through data visualization, providing insights on global GDP, state-wise GDP, sector-wise GDP, and the vision for India's economy. It includes data sets and sources for further reference.

css3 d3-visualization d3js data data-vizualisation gephi-visualizations html5 indian-economy indian-gdp information-visualization js python-word-cloud python3 storytelling tableau tableau-public threejs wordcloud-visualization

Last synced: 03 May 2026

https://github.com/marcuwynu23/phaddress

Data API of Regions,Provinces, CityMunicipalities, and Barangay of the Philippines

address address-data-api api barangay city data geolocation municipalities provinces

Last synced: 14 Feb 2026

https://github.com/purarue/listenbrainz_export

Export your scrobbling history from ListenBrainz

data data-export music scrobbling

Last synced: 24 Jan 2026

https://github.com/junkwaxhero/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 24 Apr 2025

https://github.com/stdlib-js/array-base-filled-by

Create a filled generic array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map node node-js nodejs stdlib structure types

Last synced: 24 Apr 2025

https://github.com/nixhantb/data-structures-and-algorithms-in-java-

Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting

algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming

Last synced: 05 Jul 2025

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 13 Jun 2025

https://github.com/bernard-ng/drc-news-corpus

DRC News Corpus : Towards a scalable and efficient system for Congolese news dataset curation

aggregator data news nlp politics

Last synced: 06 Sep 2025

https://github.com/luminati-io/Pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 09 Apr 2025

https://github.com/yaoguangduan/protosync

generate go code from protobuf ,sync proto dirty data

data golang protobuf sync

Last synced: 12 Mar 2026

https://github.com/wibosco/modelingformchanges-example

An example project to show how we can implement a model to simplify form validation

data swift unit-testing validator

Last synced: 16 Mar 2025

https://github.com/stdlib-js/ndarray-base-char2dtype

Return the data type string associated with a provided single letter abbreviation.

abbr abbreviation array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 12 Mar 2026

https://github.com/techbureau/zaifdata

:blue_book: Data Reader for zaif Exchange

bitcoin blockchain cryptocurrency data exchange nem token trading xem zaif

Last synced: 19 Apr 2026

https://github.com/cdcgov/nchsdata

NCHS data: public use files (PUFs) from the National Center for Health Statistics (NCHS)

data public-health r survey survey-data

Last synced: 13 Apr 2026

https://github.com/nrennie/londonmarathon

R package containing data relating to London Marathon.

data r r-package

Last synced: 02 Apr 2025

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/acdh-oeaw/histogis-data

Data created by HistoGIS

data histogis

Last synced: 24 Oct 2025

https://github.com/figuran04/big-data

📃 Praktikum Big Data

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/farovictor/mongodbextractor

This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDb databases.

data go mongodb pipeline

Last synced: 14 Jan 2026

https://github.com/asidlo/po

Data science library for manipulating data in Go using the familiar DataFrame and Series constructs from the Python Pandas library.

data dataframe go pandas series

Last synced: 14 Jan 2026

https://github.com/stdlib-js/datasets-suthaharan-multi-hop-sensor-network

Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 10 Oct 2025

https://github.com/xxczaki/parsify-plugin-covid19

Parsify plugin, that adds COVID 19-related variables 🦠

confirmed coronavirus covid19 data deaths fun math parser parsify parsify-plugin plugin variable variables

Last synced: 13 Mar 2026

https://github.com/ndarville/ipa

HTML hex codes for the IPA table.

data ipa linguistics

Last synced: 24 Jan 2026

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/datahub-local/datahub-local

DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home

data data-engineering devops kubernetes raspberrypi

Last synced: 21 Jan 2026

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/stdlib-js/ndarray-base-dtype-str2enum

Return the enumeration constant associated with an ndarray data type string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 15 Mar 2026

https://github.com/p32929/use-megamind

A simple react hook for managing asynchronous function calls with ease on the client side

async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript

Last synced: 23 Jan 2026

https://github.com/asirihewage/simplest-xpath-web-scraper

Simplest web scraper created using Python3 and MongoDB

data data-mining python3 scraper web webscrping

Last synced: 29 Jan 2026

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/nononoexe/setariaviridis

🌾 Field-collected data of green foxtail

data data-science dataset rpackage

Last synced: 27 Feb 2026

https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag

This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.

artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation

Last synced: 12 Feb 2026

https://github.com/axa-ch/health-insurance-data

Swiss health insurance data

axa data health insurance swiss

Last synced: 19 Mar 2026

https://github.com/rajatt95/python_rs

Programming | Python | PyCharm | Data Types | Tuple | Dictionary | If-Else | Loops - For, While | Functions | OOPS Principles | Constructor | String - SubString, Concatenation, Split, Strip | Read & Write data into files | JSON Parsing | CSV package | Web Scrapping

constructor csv-parser data dictionary functions if-else-statements json json-parser oops parser pycharm-ide python python-programming-language read-write-file strings tuple web-scrapping

Last synced: 15 Feb 2026

https://github.com/pawelzny/vo

DDD Value Object implementation

data ddd-patterns object python3 value

Last synced: 15 Feb 2026

https://github.com/zig-utils/zig-faker

A high-performance, lightweight fake data generator. Generate realistic fake data for testing, prototyping, and development.

data faker library mocker zig

Last synced: 01 Apr 2026

https://github.com/uk-ipop/open-data-pipeline

A pipeline for processing, enhancing, and sharing open datasets.

actions automation data python

Last synced: 25 May 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/pitmonticone/covid-italy

References for COVID-19 situation in Italy.

coronavirus covid-19 covid-19-italy data data-analysis documentation testing

Last synced: 05 Apr 2026

https://github.com/0xdir/relief_web_dart

A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters

api dart data humanitarian jobs

Last synced: 11 Jun 2026

https://github.com/tomwhite/chernoff

A visual mood indicator. One of the first Java programs I ever wrote.

chernoff-faces data visualization

Last synced: 20 Apr 2026

https://github.com/openpeeps/zxc-nim

Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed

archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc

Last synced: 07 Jun 2026

https://github.com/14richa/patient-readmission-analysis

This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.

analytics data jupyter-notebook numpy

Last synced: 29 Apr 2026

https://github.com/andrey-tech/data-storage-php

Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026

https://github.com/anandchowdhary/health

🫀 @AnandChowdhary's body measurements

csv data fitness github-actions health

Last synced: 29 Apr 2026

https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django

A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data

analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas

Last synced: 30 Apr 2026