An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/nuhmanpk/Webtrench

A powerful and easy-to-use web scraper for collecting data from the web. Supports scraping of images, text, videos, metadata, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code.

audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets

Last synced: 08 Jul 2025

https://github.com/teamdigitale/padigitale2026-opendata

Open data for PNRR - PA digitale 2026

data opendata pnrr pnrr-data

Last synced: 25 Jan 2026

https://github.com/coolbutuseless/zap

Fast object serialization with high compression

compression data library package pkg r rstats serializing

Last synced: 18 Jul 2025

https://github.com/data-engineering-community/data-engineering-salaries

A Streamlit app to explore data engineering salary data.

data data-engineering data-visualization database dataset salaries

Last synced: 06 Apr 2025

https://github.com/vutran/yahoo-stocks

Fetch stock data from Yahoo! Finance

data finance scaper stocks trade yahoo

Last synced: 13 Apr 2025

https://github.com/hoangsonww/north-carolina-household-analysis

🏠 This repository contains data analysis scripts for the 2022 American Community Survey (ACS) focusing on individuals aged 25 and over in North Carolina, based on 75,340 observations. This repository offers valuable insights into demographic and economic patterns across North Carolina's urban areas.

confidence-interval confidence-score data data-analysis data-analytics data-science data-visualization ggplot2 hypothesis-testing hypothesis-tests north-carolina r r-language r-programming stata

Last synced: 11 Apr 2025

https://github.com/gabrieldim/a1on-webscraping-pandas-data-science

Learning web scraping with pandas in Python (data science).

data data-science pandas sciecne web-scraping

Last synced: 10 Jul 2025

https://github.com/chalmerlowe/machine_learning

A gentle introduction to machine learning: data handling, linear regression, naive Bayes, clustering

data data-science linear-regression machine-learning nearest-neighbors python scikit-learn

Last synced: 10 Apr 2025

https://github.com/m-ld/m-ld-spec

Platform-independent m-ld specification

crdt data graph rdf

Last synced: 01 May 2025

https://github.com/gematik/app-referencevalidator

The reference validator is a tool to perform advanced validation of FHIR resources for TI applications and interoperability standards

data fhir reference-implementation reference-validator

Last synced: 21 Jun 2025

https://github.com/vhoulbreque/dafter

📥 Command-line downloader for public datasets

brew-style command-line data database dataset download fetcher linux osx public-data unix

Last synced: 22 Mar 2025

https://github.com/rysuds/dunebuggy

Lightweight Python Client for Dune Analytics

crypto data defi dune ethereum sdk solana sql web3

Last synced: 25 Sep 2025

https://github.com/geerlingguy/drupal-8-example-pix_migrate

Drupal 6 to Drupal 8 media gallery example migration.

data drupal examples migrate migration pictures

Last synced: 18 Mar 2025

https://github.com/areal-team/bx-data

Convenient classes for 1C-Bitrix.

1c-bitrix bitrix bitrix-cms bitrix-module data highload

Last synced: 13 Jan 2026

https://github.com/khive-ai/pydapter

adapt data to and from every format

ai data database pydantic schema vector

Last synced: 12 Oct 2025

https://github.com/PiotrZakrzewski/merge-chance

Source code of https://merge-chance.info

analysis data data-analysis open-source

Last synced: 26 Mar 2025

https://github.com/praveingk/dptp

Data-Plane Time synchronization Protocol (P4-Tofino & DPDK)

barefoot data data-plane dpdk moongen p4 p4language sdn synchronization time-synchronization timesync tofino

Last synced: 01 Feb 2026

https://github.com/tomasonjo/bitcoin-to-neo4jdash

Project that listens to the Bitcoin WebSocket API for new transactions and stores them in Neo4j for analysis

bitcoin dashboard data data-science graph graphdatabase neo4j python websocket

Last synced: 03 Jul 2025

https://github.com/nshki/naisho

Send personal data deletion request emails to hundreds of data brokers at once.

data privacy rails

Last synced: 14 Jun 2025

https://github.com/anish-agnihotri/tubby-cats-data

Statistics from Tubby Cats NFT minting

data minting nft statistics

Last synced: 25 Oct 2025

https://github.com/hexastack/eazychart

EazyChart is a reactive chart library 📈, it allows you to easily add SVG charts in your React and Vue web applications.

chart charts d3 data dataanalysis dataviz graphs hacktoberfest hacktoberfest2022 javascript library react typescript visualization vue web

Last synced: 20 Oct 2025

https://github.com/prestashop/prestashop-shop-creator

Generate random demo data to test your PrestaShop shop.

data fixture-generator generator hacktoberfest prestashop random shop

Last synced: 30 Oct 2025

https://github.com/pravahio/go-mesh

Realtime data exchange platform for Smart Cities

data realtime streaming-data

Last synced: 14 Jan 2026

https://github.com/catdevnull/preciazo

Price analysis for retail supermarkets. Constantly evolving. https://preciazo.nulo.lol

data data-science price-tracker scraper supermarket

Last synced: 17 Mar 2025

https://github.com/chuhousen/amerifluxr

An R programmatic interface for AmeriFlux data and metadata

ameriflux api carbon-flux data time-series

Last synced: 22 Oct 2025

https://github.com/mitranim/emerge

Use plain JS types as immutable data, with efficient merging and memory sharing

data functional-data-structure functional-programming immutable structural-sharing

Last synced: 30 Apr 2025

https://github.com/Matdata-eu/yasgui

SPARQL development web application

data graph knowledge linked rdf sparql typescript webapp yasgui

Last synced: 01 Feb 2026

https://github.com/torvaney/footballdatr

A minimal R package to fetch data from football-data.co.uk

data football-data soccer

Last synced: 01 May 2025

https://github.com/ryapric/readit

[RETIRED] Effortlessly Read Any Rectangular Data Into R

data data-import r read read-data

Last synced: 11 Jul 2025

https://github.com/smathot/eeg_eyetracking_parser

Python routines for parsing of combined EEG and eye-tracking data

data data-science eeg eye eye-tracking mne pupillometry python

Last synced: 10 Apr 2025

https://github.com/zdavatz/covid19_ch

covid19 map for Switzerland using Esri Maps.

cases covid2019 data ddrobotec esri switzerland visualization ywesee

Last synced: 03 May 2025

https://github.com/arakelian/faker

A Java library for generating fake data such as names, addresses, and phone numbers.

data fake fake-content fake-data faker java java-8 mocking mocks test

Last synced: 01 Aug 2025

https://github.com/marioparaschiv/dm-opener

Small script to extract private message channels from a discord data package & open them on a certain account.

data datapackage discord dms messages open package private

Last synced: 14 Apr 2025

https://github.com/kejawenlab/nusantara

Nusantara is a script for fetching the latest regional data for Indonesia, from the province (Propinsi) level down to the village (Desa/Kelurahan) level

data data-propinsi indonesia kabupaten propinsi wilayah-indonesia

Last synced: 01 Sep 2025

https://github.com/gagolews/teaching-data

Dr Marek's Data for Teaching/Training

data data-science data-wrangling datasets machine-learning

Last synced: 03 Jan 2026

https://github.com/johnymontana/flat-graph

GitHub Action for scraping data and importing into Neo4j using Cypher

data github-actions neo4j

Last synced: 02 Sep 2025

https://github.com/datreks/codetime-web

Statistical analysis and presentation of programming time.

code data data-visualization time

Last synced: 03 Jan 2026

https://github.com/scikit-hep/pyhepmc

Easy-to-use Python bindings for HepMC3

data generator hep hepmc monte-carlo particle physics python simulation vertex

Last synced: 07 Apr 2025

https://github.com/lucasconstantino/react-data-loader

Dead simple data loader helper for React

data higher-order-component loader react

Last synced: 30 Jul 2025

https://github.com/txn2/irsync

rsync on interval, via command line binary or docker container. Server and IOT builds for pull or push based device content management.

backup configuration-management content-management data devops docker docker-container docker-image go golang homebrew interval iot iot-application linux macos rsync rsync-commandline system-administration

Last synced: 16 Aug 2025

https://github.com/revolist/angular-datagrid

Powerful virtual data grid smartsheet with advanced customization in Angular. The best features of Excel plus incredible performance 🔋

angular angular2 data data-grid-for-angular excel filters grid spreadsheet virtual-scroll

Last synced: 12 Oct 2025

https://github.com/bids-standard/pybv

A lightweight I/O utility for the BrainVision data format, written in Python.

brain brainproducts brainvision data eeg ieeg products vhdr vision vmrk

Last synced: 04 Oct 2025

https://github.com/jsonstat/jsonstat

JSON-stat GitHub Repository

cube data government jsonstat nsi stats tools

Last synced: 07 Apr 2025

https://github.com/feddelegrand7/baris

Use the French Open Data Portal API features from R

api data datagouvfr dataportal france french r rstats

Last synced: 25 Jul 2025

https://github.com/jonschlinkert/section-matter

Like front-matter, but allows multiple sections in a single document.

assemble blog data front-matter gray-matter jekyll markdown matter parse templates

Last synced: 14 Apr 2025

https://github.com/dataforgoodfr/batch4_data_manifesto

Hippocratic oath for data scientists

data dataforgood oath scientists

Last synced: 28 Jan 2026

https://github.com/ikatyang-collab/linguist-languages

Linguist languages.yml in JSON format

data json language linguist

Last synced: 12 Apr 2025

https://github.com/hoangsonww/global-covid19-analysis

🌍 This repository hosts an in-depth analysis of COVID-19's impact across five key countries from Jan 2020 to Dec 2021. Through advanced data analysis and visualization, we aim to provide insights into how the pandemic evolved differently across these nations, shedding light on the effectiveness of various health measures and vaccination campaigns.

covid covid-19 covid19-tracker data data-analysis data-analytics data-science data-visualization ggplot2 julia julia-language python r r-language r-markdown r-programming sas sas-programming stata vaccination

Last synced: 10 Apr 2025

https://github.com/azure/azure-data-labs

Terraform templates to deploy Azure Data resources

analytics azure blueprints data data-science github github-actions labs terraform

Last synced: 20 Oct 2025

https://github.com/pratapvardhan/fifaworldcup

FIFA World Cup data, including team data, squad formations, and club dominance

data fifa wikipedia worldcup

Last synced: 25 Jan 2026

https://github.com/octoenergy/tentaclio

Single repository regrouping IO connectors used in the data world.

data data-science database-connections protocols python3

Last synced: 24 Jun 2025

https://github.com/usa-npn/rnpn

R client for the National Phenology Network database API

data national-phenology-network phenology r r-package rstats species web-api

Last synced: 06 Apr 2025

https://github.com/naturalsolutions/ecoSecrets

ecoSecrets is a web application which enables users to manage their camera traps data

biodiversity camera-traps data opensource picture python react wildlife

Last synced: 18 Mar 2025

https://github.com/qascade/dcr

A PoC framework to orchestrate interoperable Differentially Private Data Clean Room Services using Intel SGX hardware as root of trust.

confidential-computing data data-security datacleanroom differential-privacy golang gssoc gssoc23 intel-sgx intel-sgx-sdk

Last synced: 15 Jan 2026

https://github.com/pacemaker82/sections-gauge-card

A simple gauge card for Home Assistant with multiple styles and support for multiple entities, targets, and other customisations.

card custom-card data gauge home-assistant stats

Last synced: 20 Jan 2026

https://github.com/datopian/datapub

📝 React-based framework for building data publishing workflows (esp for CKAN)

ckan data datahub reactjs

Last synced: 21 Jul 2025

https://github.com/mrchypark/sejongfindata

Crawler and data for Sejong Corporate Data (세종기업데이터)

data financial-data korea r

Last synced: 07 Apr 2025

https://github.com/aradfarahani/geoelectricspy

This project presents an interactive 3D visualization of subsurface resistivity using geoelectric data. It utilizes Python and its scientific libraries to process and visualize the data across multiple depths, ranging from 1 meter to 464 meters.

data geophysics goelectrics resistivity visualization

Last synced: 07 Apr 2025

https://github.com/johncoene/sacred

📖 Sacred texts in R

bible data r rstats text-mining

Last synced: 10 Jun 2025

https://github.com/capturr/scraper

All-in-one API to easily scrape data from any website, without worrying about captchas and bot detection mechanisms.

captcha cheerio crawler crawling data declarative extract growth-hacking html javascript json jsonld nodejs recaptcha scraper scraping spider typescript web web-scraping

Last synced: 02 Aug 2025

https://github.com/toddbirchard/pythonmyadmin

🐍💾 Web GUI for connecting to and modifying databases & data.

cms data database flask gui plotly plotly-dash python3

Last synced: 14 Apr 2025

https://github.com/fatiando/ensaio

Practice datasets to probe your code

data earth-science fatiando-a-terra geophysics geoscience

Last synced: 28 Apr 2025

https://github.com/erictleung/pixarfilms

🎥 R data package to explore Pixar films, the people, and reception data

data data-science datapackage disney imdb imdb-dataset pixar pixar-films rstats web-scraping wikipedia

Last synced: 29 Oct 2025

https://github.com/nullvoxpopuli/ember-contextual-services

Services in Ember are scoped to the app as a whole and are singletons. Sometimes you don't want that. :) This addon provides ephemeral route-based services.

addon consumer contexts contextual data ember loading provider react route service

Last synced: 23 Apr 2025

https://github.com/hodur-org/hodur-visualizer-schema

Hodur is a domain modeling approach and collection of libraries for Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.

clojure data modeling schema visualization

Last synced: 12 Dec 2025

https://github.com/sanand0/fifadata

FIFA data

data

Last synced: 29 Nov 2025

https://github.com/dalekube/hr

Methods for better worker data engineering in the human resources (HR) corporate domain. Designed for HR analytics practitioners to get value from common workforce-oriented data sets.

analytics data data-engineering data-science human-resources r

Last synced: 22 Oct 2025

https://github.com/realtimeweb/datasets

The primary repository for all of the CORGIS Datasets

csv data datasets java jinja2 json python racket

Last synced: 30 Jan 2026

https://github.com/qarmin/system-info-collector

App to collect RAM/CPU usage from the OS and show it in pretty graphs

data data-collection system

Last synced: 06 Jul 2025

https://github.com/prestodb/prestorials

Tutorials and examples of how to deploy Presto and connect it to different data sources

aws awsglue data datalake docker example glue lakehouse mongodb presto presto-connector prestodb prestosql sql tutorial walkthrough

Last synced: 24 Oct 2025

https://github.com/shuffle/singul

Singul: Connect to your favorite services with a Singul line of code.

ai api api-client automation data llm normalization open security source standardization standards

Last synced: 05 Sep 2025

https://github.com/epranka/airports-db

Public airports database API service based on GitHub

airport airports api aviation csv data database json node node-js service

Last synced: 13 Sep 2025

https://github.com/sdebruyn/inzight

Analyse your electricity usage data from Belgian smart meters with dbt, DuckDB and Evidence

data dbt duckdb electricity evidence

Last synced: 27 Feb 2025

https://github.com/hasnocool/tradingview-pine-scripts

This project simulates trading strategies using Pine Editor scripts on TradingView and visualizes performance metrics.

analysis btcusdt cryptocurrency data pandas python simulation trading tradingview

Last synced: 09 Apr 2025

https://github.com/tushar2704/sql-portfolio

Collection of personal SQL projects and queries I've worked on, showcasing my skills and expertise in database management, data analysis, and data manipulation using SQL.

data data-analytics data-science dataanalysis datamanipulation machine-learning mysql postgresql sql streamlit-tushar2704 tushar2704

Last synced: 07 May 2025

https://github.com/streamr-dev/chat

A proof-of-concept chat application built on the Streamr network.

chat dapp data messaging real-time streamr web3

Last synced: 13 Jul 2025