An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/gavinr/minor-league-baseball

Data set of minor league baseball teams

baseball data hacktoberfest maps open-data open-datasets sports

Last synced: 06 Apr 2025

https://github.com/zarr-developers/cookiecutter-zarr-store

Cookiecutter for Zarr store implementations

chunked data n-dimensional zarr

Last synced: 16 Jun 2025

https://github.com/eliot-akira/png-compressor

Compress and encode data as PNG image

cartridge data gzip png

Last synced: 17 Mar 2025

https://github.com/felixklauke/atomizer

Playing around with butter knife, android bindings and rx java.

binding butterknife data java react rx rxjava

Last synced: 15 May 2026

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/petermeissner/statsgrokse

R-API-binding to stats.grok.se server providing Wikipedia page view statistics for 2008 up to 2015

api binding data pageviews r wikipedia

Last synced: 17 May 2026

https://github.com/wpp-public/akqa-nz-tagmanager-connector

A simple javascript library to send events to a tag manager container

data google-tag-manager

Last synced: 05 Apr 2025

https://github.com/havocesp/dictexpire

Dict like with specific expire time per item.

data dict expire item items key map mapping python python3 time timeout value volatile

Last synced: 17 Jul 2025

https://github.com/johnmackintosh/simd2016_tmap

Mapping SIMD with tmap - static & interactive

data data-science data-visualization mapping r visualisation

Last synced: 20 Mar 2025

https://github.com/marlenezw/speech-to-text

Turn any video or audio recording into a written transcript using python

data data-science python speech speech-recognition speech-synthesis speech-to-text

Last synced: 27 Apr 2026

https://github.com/dhruvldrp9/simpledht

A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.

cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication

Last synced: 28 Feb 2026

https://github.com/seregpie/vuefort

The state management for Vue.

data model state store vue

Last synced: 13 Apr 2026

https://github.com/johntocci/nullaxe

Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.

data data-analysis data-science datacleaning pandas polars python

Last synced: 06 Apr 2026

https://github.com/tillahoffmann/idxhound

🐶 Track indices across one or more numpy selections.

data numpy scientific-computing

Last synced: 14 May 2026

https://github.com/public-health-scotland/waiting_times_clinical_prioritisation

This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.

data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time

Last synced: 26 Jul 2025

https://github.com/ushkinaz/cbn-data

Automated game data extraction and processing for Cataclysm: Bright Nights. Provides JSON mirrors, WebP asset conversion, and unified translation data.

cataclysm-bn data wiki

Last synced: 07 Mar 2026

https://github.com/ayush585/fireducksblog

BLOG: Unlocking AI Efficiency: How FireDucks Revolutionizes Data Preprocessing

data processing

Last synced: 28 Apr 2026

https://github.com/camilajaviera91/dbt-transformations-sql-mock-data

This repository contains the transformations and documentation for the data model generated in sql-mock-data.

data dbt postgresql sql

Last synced: 02 Feb 2026

https://github.com/coral/ddp

Distributed Display Protocol (DDP) in Go

data ddp distributed golang led pixel protocol wled

Last synced: 26 Jun 2025

https://github.com/stdlib-js/ndarray-base-dtype-resolve-str

Return the data type string associated with a supported ndarray data type value.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 06 Mar 2026

https://github.com/codenoid/storial.co-database

a Storial.co Database, collected by Hofesh Bot (Scrapper)

data database

Last synced: 28 Mar 2025

https://github.com/ibnz36/arrowpipe

Build complex pipelines easily

cargo crate data pipe rust

Last synced: 13 Apr 2025

https://github.com/uhstray-io/just-dashboards

Light and Easy Rust-Fullstack/WASM application to build dashboards from any data source

analytics data dioxus rust visualization

Last synced: 29 Mar 2025

https://github.com/stdlib-js/array-base-count-same-value

Count the number of elements that are equal to a given value in an array.

array count countif data javascript node node-js nodejs same stdlib structure sum summation total types

Last synced: 21 Apr 2026

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025

https://github.com/fairspec/fairspec-extension

Fairspec Extension is a Git repository template for rapid Fairspec extension development

ckan csv data dataset excel fair json ods polars python quality schema sqlite tabl typescript validation zenodo

Last synced: 20 Jan 2026

https://github.com/jrdnbradford/google-sheet-color-sort

Google Sheet-bound script that assists with sorting Google Sheet rows by background fill color

data excel google-apps google-apps-script google-sheet google-sheets javascript microsoft-excel sort-rows

Last synced: 14 Apr 2025

https://github.com/prdktntwcklr/weatherman

A simple web app displaying environmental data from an SQLite database.

dashboard data flask sensor sqlite

Last synced: 19 May 2026

https://github.com/desilinguist/hanukkah-of-data-2022

My solutions to Hanukkah of Data 2022

2022 data hanukkah pandas python

Last synced: 17 May 2026

https://github.com/fbraza/paris_airbnb

Analysis of Paris AirBnB data using R and Shiny

analysis data data-analysis paris-airbnb r shiny

Last synced: 21 Mar 2025

https://github.com/dbrennand/rm-content

A Python 3.7 script to remove a specific string from all files and repos (owned by the user).

content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content

Last synced: 29 Mar 2025

https://github.com/yogaprasadk/dbms_course_a_to_z

it is a repository for complete lecture of Database Management Systems taught by riti kumari

acidproperties btree data database dbms filesystem normalform normalization sql

Last synced: 20 Mar 2025

https://github.com/lmuffato/project-mysql-one-for-all-trybe

Projeto mysql one for all - Projeto avaliativo da Trybe do Bloco 21: Normalização e Modelagem de Banco de Dados

back-end data database database-modeling mysql mysqlworkbench query sql trybe-projects

Last synced: 08 May 2026

https://github.com/FAIMS/OpenDataPresentation

Brian Ballsun-Stanton's presentation

context data presentation

Last synced: 03 Apr 2025

https://github.com/d-ganchar/thedus

Thedus is a lightweight migration tool for Clickhouse

cli clickhouse data database migration migrations python

Last synced: 12 Apr 2025

https://github.com/open-i18n/data

Open i18n Data packages

data internationalization

Last synced: 07 Mar 2026

https://github.com/lagden/injection

Inject data into file

data file inject nodejs

Last synced: 24 Apr 2026

https://github.com/wklee610/de_project

[Data Engineer] Personal Toy Project For Study

data dataengineering

Last synced: 31 Mar 2025

https://github.com/jvrck/australianpayphones

Get Australian payphone data in GeoJSON format.

australia data geojson geojson-data scraper

Last synced: 04 Apr 2025

https://github.com/inzhenerka/scooters_data_uploader

Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех

data dbt postgresql

Last synced: 04 May 2026

https://github.com/aboualine/sql-formation

Library Management System Database: A MySQL project with tables, triggers, stored procedures, and views for managing books, members, and borrowings. Includes sample data for testing. Ideal for learning SQL or building a library app.

data database library-management-system mysql sql system

Last synced: 18 Apr 2026

https://github.com/marxmit7/kaggle

Kaggle competitions

data kaggle kaggle-competition

Last synced: 19 May 2026

https://github.com/lmuffato/project-mongodb-commerce-trybe

Projeto Mongodb Commerce - Projeto avaliativo da Trybe do Bloco 23: MongoDB: Updates Simples e Complexos

api back-end crud data database mongo mongodb query trybe-projects update

Last synced: 08 May 2026

https://github.com/eugenedakin/steganography-pictures

Add and remove a picture-in-a-picture with steganography

compare data steganography steganography-tools xojo

Last synced: 12 Feb 2026

https://github.com/aditya172926/blockchain_indexers

Indexers to fetch data from blockchain events and transactions data with their parameters

blockchain data indexers rust

Last synced: 02 Aug 2025

https://github.com/whatheheckisthis/pwc_project-

Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver

case-study data data-science dataanalytics databases datavisualization powerbi virtual

Last synced: 05 Apr 2025

https://github.com/frequentlymisseddeadlines/chessfessor

Command line tool to extract game data from Lichess.org and Chess.com

chess data extract lichess pgn

Last synced: 19 May 2026

https://github.com/linas/archeo

File Recovery, Integrity and Archive Management

corruption data monitoring recovery

Last synced: 29 Mar 2025

https://github.com/incubrain/awesome-maharashtra-data

A collection of datasets specific to Maharashtra, India. WIP

ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi

Last synced: 23 May 2026

https://github.com/lmuffato/project-restaurant-orders-trybe

Projeto restaurant orders - Projeto avaliativo da Trybe do Bloco 36: Estrutura de Dados I: Arrays, Hashmaps e Sets

array array-set csv data data-analysis hashmap python set trybe trybe-projects

Last synced: 13 Sep 2025

https://github.com/gbburleigh/quick-seeders

Generate realistic test data quickly with Quick-Seeders, a Python library offering a wide range of data types and schema definitions. Control data variance, probabilities, and output formats, including SQL. Simplify your data seeding process and improve testing efficiency.

data dataset faker generator python seeder sql test

Last synced: 03 Apr 2025

https://github.com/aleklukanen/chapterhousedb-example-app

An example application using the ChapterhouseDB processing engine

arrow data database event golang parquet processing stream

Last synced: 18 Apr 2026

https://github.com/agustinmusanti/sqlchallenge-2

This repository contains my solutions to a SQL challenge using MySQL, centered around a fictional retail company called TechMarket. The challenge covers various SQL tasks such as data retrieval, manipulation, and analysis, simulating real-world scenarios within a retail business environment.

challenge data mysql

Last synced: 03 Apr 2025

https://github.com/lisakey/convert-csv-to-sav

We used python 🐍 to convert a csv file into a sav file with all the modifications needed to open it in IBM spss and be able to analyse our data.

analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations

Last synced: 08 May 2026

https://github.com/jbdesbas/custom-scripts

Custom SQL functions or scripts

data database sql

Last synced: 28 Jun 2026

https://github.com/Greatwoman23/Market-Basket-Analysis

Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.

analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python

Last synced: 04 May 2025

https://github.com/m-muecke/isocountry

R package containing ISO codes for countries and currencies

country-codes currency-codes data iso-3166-1 iso-4217 r r-package

Last synced: 20 Mar 2025

https://github.com/muhammad-fiaz/ason

ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.

adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3

Last synced: 02 Feb 2026

https://github.com/am-i-groot/summer-intern-iitguwahati-spml

Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.

algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing

Last synced: 17 May 2026

https://github.com/cliffano/volothamp

Random D&D stuffs my son and I dabble with

data dungeons-and-dragons info little-godzilla

Last synced: 06 Apr 2025

https://github.com/josephbarbierdarnal/cieri-analytics.com

CIERI Analytics is the applied research department of the non-profit organization CIERI.

analysis behavior data identity research

Last synced: 12 Jan 2026

https://github.com/sottey/shon

SHON (Structured Human-Optimized Notation) is a data serialization format designed for readability, schema support, and practical use in modern systems. Version 0.6 introduces advanced types and syntax improvements.

data golang json spec specification

Last synced: 18 May 2026

https://github.com/tomasfarias/pipeline

A simple data pipeline done as a challenge project

challenge data python

Last synced: 29 Mar 2025

https://github.com/cainmi/easy-pull-from-repository

A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now

data html javascript python schema

Last synced: 04 Apr 2025

https://github.com/owengombas/genyus

🐍 Lyrics analysis with genius.com, Python and Jupyter Notebooks

api data data-science genius jupyter-notebook lyrics python statistics

Last synced: 20 May 2026

https://github.com/aikuyun/flinkx

flinkx 一些修改

data flink

Last synced: 04 Apr 2025

https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning

The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/

airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn

Last synced: 30 Dec 2025

https://github.com/stdlib-js/array-base-to-reversed

Return a new array with elements in reverse order.

array data generic javascript node node-js nodejs rev reverse stdlib structure swap types

Last synced: 11 Apr 2025

https://github.com/anuraganalog/365-data-science

A Repository which contains lecture notes, exercise, solutions

365 data exercises ipynb lecture notes pdfs python python3 science solutions sql

Last synced: 15 May 2026

https://github.com/chompfoods/sdk-typescript-fetch

Fetch TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database fetch food grocery ingredients nutrition raw recipe-api recipes sdk typescript

Last synced: 03 May 2026

https://github.com/definetlynotai/test_generator

A tool to create datasets based on configurations from a csv file, This tool can be used as a skeleton for other software.

algorithim csv data development dynamic exam generator huge nirt powerful python skeleton test tools

Last synced: 21 Jul 2025

https://github.com/elazar/pycopyql

Exports a subset of data from a relational database.

data database export relational tool utility

Last synced: 16 May 2026

https://github.com/nichtich/wikidata-taxonomy-examples

Extract classifications from Wikidata

coli-conc data knowledge-organization wikidata

Last synced: 12 Jul 2025

https://github.com/openfoodfacts/openfoodfacts-corrector

Ruby script to correct and enhance data on OpenFoodFacts

correction data food ruby

Last synced: 24 Apr 2026

https://github.com/pbinkley/tweets-libraries-covid19

A twarc harvest of tweets related to libraries during the COVID-19 outbreak, starting 2020-03-02

data social

Last synced: 06 Mar 2026