An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/oliverhennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 16 Apr 2026

https://github.com/zig-utils/zig-faker

A high-performance, lightweight fake data generator. Generate realistic fake data for testing, prototyping, and development.

data faker library mocker zig

Last synced: 01 Apr 2026

https://github.com/bdpedigo/neuropull

A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.

connectome connectomes connectomics data dataset networks networks-biology

Last synced: 05 Oct 2025

https://github.com/Duartemartins/dados

Portuguese election results by parish (freguesia)

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/karashiiro/lodestone-id-time

Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.

data ffxiv ffxiv-character lodestone

Last synced: 30 Jun 2025

https://github.com/azawawi/perl6-msgpack

Perl 6 Interface to libmsgpack

data messagepack msgpack perl6 wrapper

Last synced: 12 Jun 2025

https://github.com/nop-dev/learning-js

This repository contains all the notes I took while studying a JavaScript module of Rocketseat's Explorer track. 🔰

data data-structures functions javascript js

Last synced: 17 Apr 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/mahmoud-saeed-mahmoud/loading_state_handler

The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.

dart data error flutter flutter-package flutter-widget loading state

Last synced: 10 Mar 2026

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 13 Jun 2025

https://github.com/pitmonticone/covid-italy

References for COVID-19 situation in Italy.

coronavirus covid-19 covid-19-italy data data-analysis documentation testing

Last synced: 05 Apr 2026

https://github.com/acdh-oeaw/histogis-data

Data created by HistoGIS

data histogis

Last synced: 24 Oct 2025

https://github.com/datafold/vhol-demo

Get hands-on examples of dbt + Datafold CI/CD workflows

data data-engineering datafold dbt diff

Last synced: 28 Dec 2025

https://github.com/figuran04/big-data

📃 Big Data practicum

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/0xdir/relief_web_dart

A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters

api dart data humanitarian jobs

Last synced: 11 Jun 2026

https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection

This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.

anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning

Last synced: 20 Apr 2026

https://github.com/pommes-public/pommesdata

A full-featured transparent data preparation routine from raw data to POMMES model inputs

data opensource power raw-data transparent

Last synced: 07 Oct 2025

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/mskian/tamil-words

Tamil words Collections with English Meaning - API and SQL Data.

api data javascript json json-api mysql pdo php sql tamil tamil-language tamil-sms tamilwords translate translator

Last synced: 14 Apr 2026

https://github.com/mohasarc/treeviz

The best tree data-structures visualization tool

data structures visualization visualization-tools

Last synced: 25 Apr 2026

https://github.com/kefniark/kaaya

JS Library for State management and Data synchronization between Applications

data game kaaya mutation network serialization state-management

Last synced: 06 Jun 2026

https://github.com/ciscorn/tinygrib2

(experimental) A tiny toolkit for parsing JMA's GRIB2 files.

data grib grib2 meteorology rust weather

Last synced: 26 Apr 2026

https://github.com/infinitode/pwlds

A public dataset of over 10 million passwords, with assigned strength levels.

ai classes classification cyber-security data dataset ml open-source password passwords synthetic-data

Last synced: 22 Feb 2026

https://github.com/kylekirkby/cardatasnatch

CarDataSnatch allows you to quickly find information about a car in the UK using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.

beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering

Last synced: 15 Apr 2025

https://github.com/dantesc03/uberpool-case-study

This project was designed to understand the statistical effects of longer wait times on Uber rides, particularly on the user and driver experience with the Uber Pool system.

analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization

Last synced: 16 Apr 2026

https://github.com/andrewrporter/my-analytics

Analyzes Firefox browsing history with modern Python 3 features and libraries

analytics data firefox matplotlib python python3 sqlite3

Last synced: 28 Apr 2026

https://github.com/alexandregazagnes/unilasalle-public-resources

UniLaSalle-Public-Ressources: this public repository contains the notebooks and the data used for both 2nd Year - Practical Statistical Tests and 4th Year - Data Analysis with Python.

data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization

Last synced: 28 Apr 2026

https://github.com/ismet55555/pdw-asym-2link

Clear and easy way of simulating a passive dynamic walker (PDW) model derived and executed using MATLAB.

data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator

Last synced: 29 Apr 2026

https://github.com/14richa/patient-readmission-analysis

This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.

analytics data jupyter-notebook numpy

Last synced: 29 Apr 2026

https://github.com/bernard-ng/drc-news-corpus

DRC News Corpus : Towards a scalable and efficient system for Congolese news dataset curation

aggregator data news nlp politics

Last synced: 06 Sep 2025

https://github.com/anandchowdhary/health

🫀 @AnandChowdhary's body measurements

csv data fitness github-actions health

Last synced: 29 Apr 2026

https://github.com/norton120/dfmock

Python pandas DataFrame mock generator. Need mocked data in a DataFrame? This is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django

A data scraper for a Texas government site and a companion web app for managing, reviewing, and editing the data

analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas

Last synced: 30 Apr 2026

https://github.com/yakupzengin/data-structures-and-algortihms

This repo contains implementations of data structures and algorithms in Java

algorithms algorithms-and-data-structures data structure

Last synced: 03 Dec 2025

https://github.com/automators-com/datamaker-js

The official Node.js / TypeScript library for the DataMaker API

data javascript nodejs typescript

Last synced: 11 Oct 2025

https://github.com/lovethebomb/data-tiles

🍜 Data Tiles is a small website that shows data.

data express javascript nextjs typescript

Last synced: 10 Apr 2026

https://github.com/nikolaydubina/aws-s3-reader

Efficient Go Reader for large AWS S3 Objects

aws data golang reader s3 streaming

Last synced: 30 Apr 2026

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

A Python project I made to improve my Python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003

US birth data from 1994 to 2003, as provided by the Centers for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 12 Oct 2025

https://github.com/hasnocool/war_thunder_camouflage_scraper

A concurrent web scraper designed to collect camouflage information for War Thunder aircraft.

asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web

Last synced: 04 Jan 2026

https://github.com/divithraju/divith-raju-searchengine-wikipedia

Search engine optimization. A complete search engine experience built on top of a 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF-IDF relevance based on the given search word(s). From optimized code to the k-way merge sort algorithm, this project addresses latency, indexing, and big data challenges.

algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia

Last synced: 16 May 2026

https://github.com/ondata/opensdmx

Python CLI and library for any SDMX 2.1 REST API — Eurostat, ISTAT, OECD, ECB, World Bank and more. AI-ready.

cli data eurostat istat oecd open-data python rest-api sdmx statistics

Last synced: 01 May 2026

https://github.com/banyan-team/banyan-julia-examples

Adventures in massively parallel cloud computing with Banyan Julia!

banyan data data-analytics data-processing data-science julia

Last synced: 02 May 2026

https://github.com/ozanarkancan/sailx

This repo contains the code for generating artificial navigational instruction following data.

data grounded-language-learning

Last synced: 08 Jan 2026

https://github.com/yanpitangui/iteminfoconverter

Application that converts ragnarok legacy data files to iteminfo.lua

data itemdbconf iteminfo luafiles ragnarok

Last synced: 12 Oct 2025

https://github.com/rastmob/wordpress-llms-output-plugin

A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).

ai data llm llms training training-data wordpress wordpress-development wordpress-plugin

Last synced: 03 May 2026

https://github.com/vyahello/fake-employee-api

👨‍🔧 Simple mock employees data parser (responder + heroku + pytest + github/travis CI)

data employee employer mock responder rest-api

Last synced: 09 Jun 2026

https://github.com/luminati-io/Pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 09 Apr 2025

https://github.com/acaciaman/db-autotest

Database test automation. This Python package lets you create database object structures and load data from a database.

data database test-automation

Last synced: 05 May 2026

https://github.com/askaniy/celestialocationsmaker

Tool for making Celestia location files

celestia data geology locations mapping planetary-science space

Last synced: 14 Mar 2025

https://github.com/quin1sue/priceguidesph-bettergov

An economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026

https://github.com/mawburn/across-a-thousand-dead-worlds-data

Across a Thousand Dead Worlds Data

data json ttrpg

Last synced: 21 Apr 2026

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026

https://github.com/jinsyin/datalink

⚡ Data integration | DataLink is a lightweight data integration framework built on top of DataX, Spark, and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/jorgeatgu/wiki-cachitos

How much do Wikipedia visits increase when an artist appears on Cachitos?

cachitos data dataviz musi musica rtve

Last synced: 05 Jul 2025

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/stefen-taime/open-source-data

This repository contains structured datasets in various categories

csv data json python3 xml

Last synced: 19 Feb 2026

https://github.com/lxcoding06/e-gereja

CRUD website for churches, for managing congregation, death, marriage, and baptism records

data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan

Last synced: 15 May 2025

https://github.com/tayeva/eia-client-python

EIA Open Data API Client - Python

data open-source python python-3 python3

Last synced: 14 Oct 2025

https://github.com/dark-art108/yonk

A CLI utility to streamline data science work by creating templates

data machine-learning python3

Last synced: 08 May 2026

https://github.com/ayoub-amzil/offline-globe

Offline country data for PHP Laravel framework. Over 200 countries, capitals, flags, languages, currencies. No internet needed.

composer data internet laravel offline php

Last synced: 09 May 2026

https://github.com/yorkulibraries/vendorpol

URLs for vendor privacy policies and terms of use.

data libraries privacy-policy

Last synced: 15 Oct 2025

https://github.com/purarue/bleanser

My bleanser modules

data

Last synced: 22 Feb 2026

https://github.com/j1sk1ss/dateapppc.exmpl

A simple native Windows application demonstrating OOP and SQL databases, using a dating app as the example.

data oop-principles parsing pgadmin4 sql wpf

Last synced: 11 Apr 2026

https://github.com/guslovesmath/top_tech_sp_500_forecasting

Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P 500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's percent change.

arima-forecasting arima-model data data-science forecasting vector-autoregression

Last synced: 14 Mar 2025

https://github.com/synthead/timex-datalink-assembler

Toebes' Timex Datalink WristApp assembler wrapped in a Docker image with Wine

150 150s 6800 6805 assembler compiler data data-link datalink docker link timex toebes wine wristapp

Last synced: 10 May 2026

https://github.com/synthead/timex-datalink-toebes-tutorials

Toebes' WristApp tutorial sources for the Timex Datalink

6800 6805 app assembly crt data data-link datalink link timex watch wrist wristapp

Last synced: 08 Apr 2025

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/natylaza89/covid19-il

Python package that provides a "Facade" interface for clients using the official COVID-19 data from Israel's government data portal. ★19K+ Downloads★

api covid covid19 covid19-data data israel pandas python

Last synced: 13 Apr 2026

https://github.com/secret-guest/file_organizer

Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.

c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory

Last synced: 10 Jun 2026

https://github.com/email-types/data

TypeScript definitions, compatibility data, and utils that make building emails easier.

css css-in-js data email mso types

Last synced: 13 May 2026

https://github.com/leapfrogtechnology/datamegh

Datamegh - Data Engineering for the cloud.

cloud cloud-native data datamegh docker megha python serverless

Last synced: 14 May 2026

https://github.com/agnosticeng/agx

Query and explore local and remote data with ClickHouse

clickhouse d3 data rust svelte

Last synced: 26 Oct 2025

https://github.com/feltex/datahora-java

Learn to work with dates and times in Java using the new classes LocalDateTime, LocalDate, DateTimeFormatter, and other additions from the java.time package.

brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime

Last synced: 18 Jun 2026

https://github.com/nrennie/data

A collection of random datasets, either from web-scraping or processing more complex data.

data

Last synced: 30 May 2026