An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/coasterfreakde/ork

Object Relational Mapping for Kotlin

data database kotlin mariadb mysql orm sql sqlite

Last synced: 29 Jul 2025

https://github.com/defano/chicago-oasis

A visualization of Chicago business accessibility by neighborhood or census tract.

census chicago data data-science javascript neighborhood

Last synced: 11 Mar 2026

https://github.com/dewasry/browser-base

A tool to help developer store data ofline on browser

angular borwser data indexeddb nextjs orm query react typescript vite vue

Last synced: 13 Feb 2026

https://github.com/axa-ch/health-insurance-data

Swiss health insurance data

axa data health insurance swiss

Last synced: 19 Mar 2026

https://github.com/mihasm/arso-scraper

Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.

api arso cli data historical-data meteorological python slovenia weather

Last synced: 14 Feb 2026

https://github.com/amethyst-php/customer

A person or an organization that pays for goods or services

amethyst amethyst-package api customer data laravel

Last synced: 11 May 2026

https://github.com/ad4ndi/lsd

Low-level data copying utility

c cli data

Last synced: 14 Feb 2026

https://github.com/erwan-simon/aws-data-platform-framework

A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.

aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module

Last synced: 23 May 2026

https://github.com/windwalker-io/data

[READ ONLY] A library contains data/collection objects with null-object pattern.

collection collections data data-object iterator nullobject value-object

Last synced: 12 Mar 2026

https://github.com/rajatt95/python_rs

Programming | Python | PyCharm | Data Types | Tuple | Dictionary | If-Else | Loops - For, While | Functions | OOPS Principles | Constructor | String - SubString, Concatenation, Split, Strip | Read & Write data into files | JSON Parsing | CSV package | Web Scrapping

constructor csv-parser data dictionary functions if-else-statements json json-parser oops parser pycharm-ide python python-programming-language read-write-file strings tuple web-scrapping

Last synced: 15 Feb 2026

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/datahub-local/datahub-local

DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home

data data-engineering devops kubernetes raspberrypi

Last synced: 21 Jan 2026

https://github.com/srijanshetty/amfitools

Tools to get the open NAV for any MF in India

amfi cli data funds india investing mutual nav

Last synced: 04 Oct 2025

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/wamphlett/input-collection

A smarter and stricter way to capture and validate request data

data dto forms php validation

Last synced: 27 May 2026

https://github.com/rn0x/aliexpress_product_data

استخراج بيانات المنتج من موقع علي إكسبريس

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/djthorpe/data

Data extraction, transformation, processing and visualisation

canvas csv data data-extraction data-transformation dom golang svg visualization

Last synced: 07 Sep 2025

https://github.com/physio/flatten-ts

Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.

array conversion data flatten javascript json object typescript

Last synced: 06 May 2026

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026

https://github.com/mystpi/crossings

🌉 A tiny library focused on easily connecting JS to HTML.

connect data frontend html javascript reactive simple small tiny

Last synced: 10 Jun 2026

https://github.com/amacd31/daily_hydromet_sample_data

This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.

data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow

Last synced: 16 Jan 2026

https://github.com/lukekim/demo

Luke's Spice.ai demo app

ai data web3

Last synced: 18 Jan 2026

https://github.com/kocyigitkim/realtime.io

Real time data streaming & socket programming library

data realtime socket streaming

Last synced: 29 Jul 2025

https://github.com/hmeleiro/opencis

R package to import data from spanish Sociological Research Center (CIS)

abiertos api centro cis data datos estudios open r sociologicos

Last synced: 31 Jul 2025

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003

US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 12 Oct 2025

https://github.com/frnt-end/weather-app-react

:atom_symbol: React project - Fetch and Toggle display of current weather in Berlin, Paris, New York & London (tabs) - using axios for API fetch. Watch DEMO 🌞 https://Frnt-End.github.io/Weather-App-React 👈

api axios axios-react background card current-weather data fetch gh-pages react reactjs tabs toggle ui usestate usestate-hook weather weather-app weather-information weatherapp

Last synced: 18 Feb 2026

https://github.com/norton120/dfmock

Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/yash22222/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3

Last synced: 09 Apr 2026

https://github.com/leapfrogtechnology/datamegh

Datamegh - Data Engineering for the cloud.

cloud cloud-native data datamegh docker megha python serverless

Last synced: 14 May 2026

https://github.com/xxczaki/parsify-plugin-covid19

Parsify plugin, that adds COVID 19-related variables 🦠

confirmed coronavirus covid19 data deaths fun math parser parsify parsify-plugin plugin variable variables

Last synced: 13 Mar 2026

https://github.com/utrechtuniversity/dataprivacyproject

This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.

data gdpr open-science privacy rdm research research-data-management utrecht-university

Last synced: 10 Oct 2025

https://github.com/quetz-al/quetzal-client

Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 28 Jul 2025

https://github.com/sadcenter/messenger

Data messaging system between servers using popular messaging brokers

data message

Last synced: 06 Aug 2025

https://github.com/steelcake/cherry-pipelines

A collection of pipelines built with cherry

blockchain clickhouse data pipeline pyhton

Last synced: 09 Mar 2026

https://github.com/quin1sue/priceguidesph-bettergov

an economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/simoneas02/data-science

🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻

data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql

Last synced: 12 Apr 2026

https://github.com/rulox/faker

A Go library to create Fake Data for your projects

data dummy dummy-data fake fake-data faker go golang

Last synced: 28 May 2026

https://github.com/mrsaeeddev/data-science-roadmap-for-beginners

📈 A minimal and easy road map for beginners who want to dive into the field of Data Science

data data-science datascience python

Last synced: 29 Jun 2025

https://github.com/richardschoen/ibmixmlservicestd

IBM i XMLSERVICE C# and VB.Net Data Access Service Wrapper for .Net 4.6.1 and above and .Net Core 2.0 and above

as400 cl cobol command data database db2 ddm drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg service xmlservice

Last synced: 18 Apr 2025

https://github.com/0xdir/relief_web_dart

A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters

api dart data humanitarian jobs

Last synced: 11 Jun 2026

https://github.com/tomwhite/chernoff

A visual mood indicator. One of the first Java programs I ever wrote.

chernoff-faces data visualization

Last synced: 20 Apr 2026

https://github.com/ondata/opensdmx

Python CLI and library for any SDMX 2.1 REST API — Eurostat, ISTAT, OECD, ECB, World Bank and more. AI-ready.

cli data eurostat istat oecd open-data python rest-api sdmx statistics

Last synced: 01 May 2026

https://github.com/mohasarc/treeviz

The best tree data-structures visualization tool

data structures visualization visualization-tools

Last synced: 25 Apr 2026

https://github.com/antononcube/raku-data-importers

Various data importing routines with a unified interface (data-import, slurp).

data data-ingestion raku rakulang slurp

Last synced: 23 Feb 2026

https://github.com/dantesc03/uberpool-case-study

This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.

analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization

Last synced: 16 Apr 2026

https://github.com/ssiarhei115/customer-classification

Developing ML model predicting bank' customer inclination to open a deposit

big-data big-data-analytics data data-science data-visualization mashine-learning

Last synced: 09 Apr 2025

https://github.com/luminati-io/Pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 09 Apr 2025

https://github.com/tusharnankani/analysis-2.0

An Exhaustive WhatsApp Chat Data Analysis 2.0

analysis data data-science plots trends visualization

Last synced: 31 Mar 2025

https://github.com/financejs/discord-bot

A Discord Bot Used In Financejs Discord Server

data discord discord-bot discordjs-bot finance financejs financial

Last synced: 13 Apr 2026

https://github.com/jinsyin/datalink

⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/ymougenel/referencecollector

Helps you gather, store and share references links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/yakupzengin/data-structures-and-algortihms

This repo contains implementation of data structures and algorithms using JAVA

algorithms algorithms-and-data-structures data structure

Last synced: 03 Dec 2025

https://github.com/agnosticeng/agx

Query and explore local and remote data with Clickhouse

clickhouse d3 data rust svelte

Last synced: 26 Oct 2025

https://github.com/bernard-ng/drc-news-corpus

DRC News Corpus : Towards a scalable and efficient system for Congolese news dataset curation

aggregator data news nlp politics

Last synced: 06 Sep 2025

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/blakedrumm/scvmm-scripts-and-sql

The Scripts provided here are compatible with System Center Virtual Machine Manager

collector data powershell scripts scvmm sql

Last synced: 11 May 2025

https://github.com/infinitode/pwlds

A public dataset of over 10 million passwords, with assigned strength levels.

ai classes classification cyber-security data dataset ml open-source password passwords synthetic-data

Last synced: 22 Feb 2026

https://github.com/yaoguangduan/protosync

generate go code from protobuf ,sync proto dirty data

data golang protobuf sync

Last synced: 12 Mar 2026

https://github.com/jackallabs/canine-oracle

The Oracle Daemon for the Jackal Blockchain

blockchain cosmos data feed jackal oracle stream

Last synced: 06 Feb 2026

https://github.com/mskian/tamil-words

Tamil words Collections with English Meaning - API and SQL Data.

api data javascript json json-api mysql pdo php sql tamil tamil-language tamil-sms tamilwords translate translator

Last synced: 14 Apr 2026

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/zoo-js/zoo-data

🍩 The data for zoo-js.

actions data js json nodejs workflow

Last synced: 22 Apr 2025

https://github.com/alexandregazagnes/unilasalle-public-resources

UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python

data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization

Last synced: 28 Apr 2026

https://github.com/cosmos-loops/cosmos-efcore

Cosmos.EntityFrameworkCore is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.

cosmos-loops data efcore entityframeworkcore

Last synced: 14 Aug 2025

https://github.com/ingmarboeschen/jatsdecoderevaluation

Evaluation data and code

data evaluation jatsdecoder

Last synced: 04 Feb 2026

https://github.com/stdlib-js/ndarray-base-from-scalar

Convert a scalar value to a zero-dimensional ndarray.

base convert data javascript ndarray node node-js nodejs scalar stdlib structure types wrap

Last synced: 03 Jul 2025

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/14richa/patient-readmission-analysis

This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.

analytics data jupyter-notebook numpy

Last synced: 29 Apr 2026

https://github.com/nixhantb/data-structures-and-algorithms-in-java-

Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting

algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming

Last synced: 05 Jul 2025

https://github.com/yashmistry-24/ytcomment-iq

YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.

analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube

Last synced: 15 Feb 2026

https://github.com/xtrendence/comp2001-coursework

Grade: 98%. COMP2001 Coursework by Khodadad (Adrian) Nouchin. A RESTful authentication API, and a linked data application.

api asp-net csharp data dataset linked-data php restful restful-api

Last synced: 13 Apr 2026

https://github.com/ctechhindi/auto-fill-form-data

AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME

autocomplete chrome-extension data extension

Last synced: 17 Apr 2026

https://github.com/caelean/twittermap

Map of twitter user's influence as defined on by influencetracker

data google-maps maps sparql twitter visualization

Last synced: 14 Jun 2025

https://github.com/lindsaygelle/emojipedia

Go application. Simple program that scrapes unicode.org for Emoji content. Parses out HTML into categorically ordered data subsets. Explored from the command line.

cli data data-mining emoji emojipedia encyclopedia go golang golang-application html-scraping unicode-characters

Last synced: 11 Mar 2026

https://github.com/stefen-taime/real-time-data-pipeline-snake-game

Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API

chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis

Last synced: 04 May 2026

https://github.com/Nazaniiin/EDA_QualityofRedWine

:wine_glass: :chart_with_upwards_trend: (EDA) R - Vizualization / Performed exploratory analysis and visualization on Red Wine Quality dataset; Mainly answering which chemical properties influence the quality of red wines.

charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization

Last synced: 30 Jul 2025