An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/automators-com/datamaker-js

The official Node.js / Typescript library for the DataMaker API

data javascript nodejs typescript

Last synced: 11 Oct 2025

https://github.com/johntocci/nullaxe

Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.

data data-analysis data-science datacleaning pandas polars python

Last synced: 06 Apr 2026

https://github.com/joelllllll/up-sync

Sync account and transaction data from up bank to your local environment

accounts bank data postgres sync transactions up upbank

Last synced: 06 Jul 2025

https://github.com/arcticsnow/climatepy

Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)

climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray

Last synced: 05 Feb 2026

https://github.com/writetome51/big-dataset-paginator

A TypeScript/JavaScript class for pagination in a real-world web app.

app data javascript pagination paginator typescript

Last synced: 17 May 2026

https://github.com/cdcgov/nchsdata

NCHS data: public use files (PUFs) from the National Center for Health Statistics (NCHS)

data public-health r survey survey-data

Last synced: 13 Apr 2026

https://github.com/dewasry/browser-base

A tool to help developer store data ofline on browser

angular borwser data indexeddb nextjs orm query react typescript vite vue

Last synced: 13 Feb 2026

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 06 Nov 2025

https://github.com/masesgroup/datadistributionmanager

A reliable subsystem to distribute data across multiple datacenters using multiple languages (C/C++, .NET, JVM enabled languages) over multiple technologies (e.g. Apache Kafka, OpenDDS, etc)

apachekafka availability business-solutions businesscontinuity data durability opendds reliability softwareneverlockdown

Last synced: 14 Apr 2025

https://github.com/manuelblancovalentin/dynamictable

An easy and user-friendly way to display dynamic data in Python using command-line tables

data deep display dynamic friendly gui interface io keras learning loop machine python python3 pytorch table tensorflow user

Last synced: 20 Jan 2026

https://github.com/aaronmeder/social-history

A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.

archives data facebook instagram statistics stats

Last synced: 27 Mar 2025

https://github.com/hyper63/copy

hyper copy tool

copy data hyper

Last synced: 20 Jul 2025

https://github.com/wfamous/fiv_update-data

This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Service (AWS), Terraform (Tofu), Python, Bash, Ansible and GitHub Actions to manage data pipelines efficiently.

ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu

Last synced: 17 Feb 2026

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026

https://github.com/petermeissner/statsgrokse

R-API-binding to stats.grok.se server providing Wikipedia page view statistics for 2008 up to 2015

api binding data pageviews r wikipedia

Last synced: 17 May 2026

https://github.com/bkamapantula/india-pc-nfhs4

Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.

data government health india nfhs nfhs4

Last synced: 19 Mar 2026

https://github.com/joisino/twinpaper

Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)

causal-inference data research science-of-science

Last synced: 21 Mar 2025

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/eliot-akira/png-compressor

Compress and encode data as PNG image

cartridge data gzip png

Last synced: 17 Mar 2025

https://github.com/georgetdn/syscppcp

Store C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)

class data framework object persistence serialize sql windows

Last synced: 04 Apr 2025

https://github.com/plabayo/datapoints.earth

Earth data liberation for and by its citizens.

data foss free scrape

Last synced: 15 Mar 2026

https://github.com/yanpitangui/iteminfoconverter

Application that converts ragnarok legacy data files to iteminfo.lua

data itemdbconf iteminfo luafiles ragnarok

Last synced: 12 Oct 2025

https://github.com/jasondrawdy/compendio

Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.

compendium converters cryptography data extensions generators hashing library security utilities validation windows

Last synced: 18 May 2026

https://github.com/stdlib-js/ndarray-base-from-scalar

Convert a scalar value to a zero-dimensional ndarray.

base convert data javascript ndarray node node-js nodejs scalar stdlib structure types wrap

Last synced: 03 Jul 2025

https://github.com/ingmarboeschen/jatsdecoderevaluation

Evaluation data and code

data evaluation jatsdecoder

Last synced: 04 Feb 2026

https://github.com/nowosad/cllc

Country-level Land Cover - categories and transitions

data dataset land-cover land-cover-transitions r

Last synced: 04 Apr 2025

https://github.com/vutran/yahoo-stocks-cli

Fetch stock data from the CLI

cli data finance stocks yahoo

Last synced: 08 Jun 2026

https://github.com/14richa/patient-readmission-analysis

This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.

analytics data jupyter-notebook numpy

Last synced: 29 Apr 2026

https://github.com/andrey-tech/data-storage-php

Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026

https://github.com/zoo-js/zoo-data

🍩 The data for zoo-js.

actions data js json nodejs workflow

Last synced: 22 Apr 2025

https://github.com/datahub-local/datahub-local

DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home

data data-engineering devops kubernetes raspberrypi

Last synced: 21 Jan 2026

https://github.com/hariprashad-ravikumar/ai-datascience-lab

AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.

ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn

Last synced: 02 Aug 2025

https://github.com/camara94/data-visualization-with-python

Data visualization and some of the best practices when creating plots and visuals. The history and architecture of Matplotlib, and how to do basic plotting with Matplotlib. Generating different visualization tools using Matplotlib such as line plots, area plots, histograms, bar charts, box plots, and pie charts. Seaborn, another data visualization library in Python, and how to use it to create attractive statistical graphics. Folium, and how to use to create maps and visualize geospatial data.

data data-science data-structures data-visualization python3

Last synced: 16 May 2026

https://github.com/stanford-oval/medxchange

Medical Data Exchange (MedXchange) platform

data ethereum exchange medical medxchange

Last synced: 16 May 2026

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/ange007/jquery.mydata

jQuery.myData - Small jQuery&Zepto plugin for two-ways data binding.

data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs

Last synced: 19 May 2026

https://github.com/randomfractals/unfolded-map-renderer

Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.

cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode

Last synced: 21 Mar 2025

https://github.com/sanand0/imdbscrape

A weekly archive of the IMDB Top 250 results. Automatically scraped via GitHub Actions. Useful to see trends on IMDb Top 250

data

Last synced: 30 May 2026

https://github.com/elggem/shapeset-generator

Generates varying shapes as training data for neural nets: ShapeSet.

data machine-learning numpy opencv python svg training

Last synced: 11 Apr 2026

https://github.com/ondata/opensdmx

Python CLI and library for any SDMX 2.1 REST API — Eurostat, ISTAT, OECD, ECB, World Bank and more. AI-ready.

cli data eurostat istat oecd open-data python rest-api sdmx statistics

Last synced: 01 May 2026

https://github.com/blakedrumm/scvmm-scripts-and-sql

The Scripts provided here are compatible with System Center Virtual Machine Manager

collector data powershell scripts scvmm sql

Last synced: 11 May 2025

https://github.com/meetyildiz/pandazip

Reduce RAM footprint of Pandas DataFrame without losing information.

compression data data-mining data-science machine-learning pandas utilities

Last synced: 20 Jan 2026

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/gabrielu3/ori-inverted-list

Inverted List made for a college discipline named Organization and Retrieval of Information

c data data-structures index inverted-index list

Last synced: 24 Feb 2026

https://github.com/gavinr/minor-league-baseball

Data set of minor league baseball teams

baseball data hacktoberfest maps open-data open-datasets sports

Last synced: 06 Apr 2025

https://github.com/datafold/vhol-demo

Get hands-on examples of dbt + Datafold CI/CD workflows

data data-engineering datafold dbt diff

Last synced: 28 Dec 2025

https://github.com/flrd/standardlastprofile

R Data Package for BDEW Standard Load Profiles in Electricity

data electricity germany r

Last synced: 16 Mar 2026

https://github.com/skywarth/fenrir-wolfpack-simulator

Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.

data data-structures javascript max-heap simulation simulations wolfpack

Last synced: 14 Oct 2025

https://github.com/inspect-js/is-data-view

Is this value a JS DataView? This module works cross-realm/iframe, does not depend on instanceof or mutable properties, and despite ES6 Symbol.toStringTag.

data dataview ecmascript javascript typedarray typedarrays view

Last synced: 05 Apr 2025

https://github.com/subnwa/sql

A starter code for creating a SQL database.

base data database master microsoft sql

Last synced: 06 Mar 2025

https://github.com/dhruvldrp9/simpledht

A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.

cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication

Last synced: 28 Feb 2026

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 15 May 2025

https://github.com/mlr-org/mlr3data

Data sets used in the book, gallery, or in examples of mlr3.

data data-science data-sets machine-learning mlr3 r r-package

Last synced: 09 Apr 2025

https://github.com/gonzalezlrjesus/covid-19API

Convierte la data ofrecida por: the Johns Hopkins University Center en formato CSV al formato JSON sobre los casos confirmados, muertos y recuperados de COVID-19 por paises.

api api-rest api-server coronavirus covid-19 data go golang json

Last synced: 06 May 2025

https://github.com/nixhantb/data-structures-and-algorithms-in-java-

Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting

algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming

Last synced: 05 Jul 2025

https://github.com/machu-gwu/constant2-project

provide extensive way of managing your constant variable.

configuration constants data developer-tools python

Last synced: 26 May 2026

https://github.com/xtrendence/comp2001-coursework

Grade: 98%. COMP2001 Coursework by Khodadad (Adrian) Nouchin. A RESTful authentication API, and a linked data application.

api asp-net csharp data dataset linked-data php restful restful-api

Last synced: 13 Apr 2026

https://github.com/ymougenel/referencecollector

Helps you gather, store and share references links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/itzshoaib/hashtegrity

A library for generating hash, validating data integrity, monitoring file/directory integrity, offchain data integrity

crypto-hash data data-integrity hacktoberfest hash integrity

Last synced: 07 Mar 2026

https://github.com/gauravkoradiya/tensorflow-data-and-deployement

This repository contains usage of data and deployment pipline in tensorflow.

data deployment machine-learning-algorithms pipline tensorflowjs

Last synced: 06 Oct 2025

https://github.com/nikoshet/exploratory-data-analysis-using-r

Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA

data data-analysis data-science eda exploratory-data-analysis ggplot2 r

Last synced: 14 May 2026

https://github.com/jesusgraterol/bitcoin-lightning-network-stats-dataset-builder

The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data data-science dataset dataset-generation lightning-network machine-learning

Last synced: 16 May 2026

https://github.com/acdh-oeaw/histogis-data

Data created by HistoGIS

data histogis

Last synced: 24 Oct 2025

https://github.com/d3oxy/country-state-data

A comprehensive JSON dataset containing countries, states, cities, regions, and languages with TypeScript support. Perfect for building location-based dropdowns, address forms, and geographical applications.

address cities countries currency data dropdown geographical iso json languages location regions states typescript

Last synced: 24 Jan 2026

https://github.com/yashika-malhotra/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Description Analysis and Visualization on CardioFlex Treadmill data to provide insights and recommendations to improve their userbase.

colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/hmeleiro/alquilermad

Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid

data data-science data-visualization datascience housing-location-visualization rent renting

Last synced: 13 Sep 2025

https://github.com/priyanka7411/customer-segmentation-churn-dashboard

📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.

churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization

Last synced: 14 Apr 2026

https://github.com/Duartemartins/dados

Resultados de Eleições Portuguesas por Freguesia

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/StudyResearchProjects/arrbuffstr

Creates Strings from ArrayBuffers and viceversa in NodeJS and the Browser

arraybuffer browser data node string transform

Last synced: 09 Oct 2025

https://github.com/wonderium/browser-feature-compatibility

This repository contains browser support details for HTML, CSS, JS and SVG features.

browsers compatability css data html js json releases support svg wonderium

Last synced: 27 Jan 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/stdlib-js/ndarray-base-dtype2c

Return the C data type associated with a provided data type string.

array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 18 Jun 2025

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/Nazaniiin/EDA_QualityofRedWine

:wine_glass: :chart_with_upwards_trend: (EDA) R - Vizualization / Performed exploratory analysis and visualization on Red Wine Quality dataset; Mainly answering which chemical properties influence the quality of red wines.

charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization

Last synced: 30 Jul 2025

https://github.com/secret-guest/file_organizer

Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.

c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory

Last synced: 10 Jun 2026

https://github.com/freight-trust/edi-onboarding

ESC Guidelines for X12/EDIFACT Messages

b2b data data-interchange edi edi-xml edifact enterprise x12

Last synced: 04 Mar 2026

https://github.com/pommes-public/pommesdata

A full-featured transparent data preparation routine from raw data to POMMES model inputs

data opensource power raw-data transparent

Last synced: 07 Oct 2025

https://github.com/nop-dev/learning-js

Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰

data data-structures functions javascript js

Last synced: 17 Apr 2026

https://github.com/uk-ipop/open-data-pipeline

A pipeline for processing, enhancing, and sharing open datasets.

actions automation data python

Last synced: 25 May 2026

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/junkwaxhero/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 24 Apr 2025