An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/clabe45/kaz

Minimalistic local storage cli

cli data minimalistic storage utility

Last synced: 17 Jul 2025

https://github.com/hughrawlinson/github-data-scripts

Scripts to grab data about repos of interest to compare

data github-graphql github-repo-organizer graphql scripts typescript

Last synced: 09 Jul 2025

https://github.com/tsiarokhin/student_bsu_by

Tool for parsing various BSU student information from the student.bsu.by website.

belarus bsu data grades python students study university

Last synced: 28 May 2026

https://github.com/lucaaszsx/spyder

A powerful schema-based web scraping library for Node.js built for fast, structured, and reliable data extraction.

cheerio crawler data dom dom-manipulation html json json-ld parser scraper web xml

Last synced: 11 Jun 2026

https://github.com/nia-cloud-official/influx

Influx is a powerful search engine application designed to provide access to personal information of individuals from anywhere in the world. With Influx, users can search for and retrieve personal details of people, enabling them to find and connect with individuals across the globe.

data find people-search search-engine

Last synced: 27 Jun 2025

https://github.com/geo-c/oct-ckan

The Open City Toolkit (more information about the project: http://geo-c.eu)

cities collaboration data open participation transparency

Last synced: 16 May 2026

https://github.com/glassflow/pipelines-push-action

This GitHub Action lets you automate GlassFlow pipeline deployments as code

data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing

Last synced: 19 May 2026

https://github.com/stdlib-js/ndarray-slice-assign

Assign element values from a broadcasted input ndarray to corresponding elements in an output ndarray view.

assign assignment copy data javascript matrix ndarray node node-js nodejs set setitem slice stdlib structure types vector view

Last synced: 11 Apr 2025

https://github.com/gappeah/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 25 Feb 2025

https://github.com/gappeah/beverage-sales-analytics

This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.

data data-visualization powerbi powerbi-report powerbi-visuals

Last synced: 25 Feb 2025

https://github.com/gappeah/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 11 Jun 2025

https://github.com/speakeasy-sdks/fivetran-python-sdk

Python SDK for accessing Fivetran API.

api connector data fivetran fivetran-connector python sdk

Last synced: 01 Jul 2025

https://github.com/hoaihuongbk/lakeops

A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.

data data-operations dataengineering datalake

Last synced: 07 Mar 2026

https://github.com/concaption/ksa-lawyers-data

Scraped data on KSA lawyers and law firms

data lawyers

Last synced: 03 Apr 2025

https://github.com/flownrecords/flightTracker

A mobile app built to record essential flight data for post-flight review and debriefing.

aviation data gps tracking

Last synced: 23 Jun 2025

https://github.com/emomaxd/flog

Header-only logging library

c-plus-plus data files formatting logging stdout

Last synced: 20 Mar 2025

https://github.com/lunastev/reflectlm

ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.

ai data language-model llm model open-source ts web

Last synced: 22 Jun 2025

https://github.com/rafalwrzeszcz-wrzasqpl/pl.wrzasq.commons

General-purpose data structures and routines.

aws data data-structures library rust

Last synced: 10 Apr 2025

https://github.com/wahyuwsslah/salary_prediction-aiml

Salary prediction using machine learning with 3 models: Linear Regression, Decision Tree, and Random Forest

ai analytics data data-science datascience machine-learning python python3

Last synced: 19 May 2026

https://github.com/exponea/exponea-python-sdk

⚠️ DEPRECATED Python SDK for Exponea Data and Tracking API

api data exponea python sdk

Last synced: 09 Apr 2026

https://github.com/amazingtest/data4test

Test data construction generator; you can get useful data here for software testing

data test-automation testdata testdatabuilder testing testing-tools

Last synced: 16 Jan 2026

https://github.com/yazeed44/reform-api

A platform that harnesses the power of multiple data streams, including satellite imagery and drone photos, to visualize multiple urban planning indices and provide descriptive analytics that will empower local Saudi authorities to make data-driven decisions that contribute to neighborhood quality of life.

data geojson python

Last synced: 18 May 2026

https://github.com/passidel/weedmap

Cannabis consumption ban

cannabis data map visual

Last synced: 14 Mar 2025

https://github.com/antoineaugusti/antennes-free

History of Free Mobile cell towers under maintenance or out of service

data free-mobile free-mobile-operator mobile-networks

Last synced: 30 Jul 2025

https://github.com/swarchal/morar

Processing phenotypic screening data

biology data data-analysis drug-discovery hts phenotypic

Last synced: 19 Jun 2025

https://github.com/evoluteur/madeleinology

Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).

baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization

Last synced: 23 Jun 2025

https://github.com/instafluff/acdb

Animal Crossing Database API

animal api crossing data database json open villagers

Last synced: 28 Apr 2026

https://github.com/m-muecke/isocountry

R package containing ISO codes for countries and currencies

country-codes currency-codes data iso-3166-1 iso-4217 r r-package

Last synced: 20 Mar 2025

https://github.com/diddypod/crop-data-comparer

A Python script to compare crop data over years

comparison crop data openpyxl python

Last synced: 28 Jun 2026

https://github.com/stdlib-js/ndarray-base-reverse-dimension

Return a view of an input ndarray in which the order of elements along a specified dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 07 Mar 2026

https://github.com/fairdataihub/fair-amd-oct-paper-code

Code associated with the paper on FAIR assessment of AMD-related datasets containing OCT data

amd biomedical data eye fair oct

Last synced: 03 Apr 2025

https://github.com/vishwagauravin/screener-scraper-pro

Effortlessly scrape comprehensive financial data from screener.in and use it in your projects. No API key required.

data finance finances market-data scraper scrapers screener screener-in screener-plugin stock stock-data stock-market stocks

Last synced: 18 Feb 2026

https://github.com/hmeleiro/r_dataviz

Data visualization projects with R

data dataviz r rmd-files social-science survey-data

Last synced: 21 Jun 2026

https://github.com/greatwoman23/market-basket-analysis

Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.

analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python

Last synced: 28 Apr 2026

https://github.com/codedotjs/indiaartfairy

Data & More - India Art Fair • 2018 - 2024

csv data python scraper

Last synced: 16 Jun 2025

https://github.com/chompfoods/sdk-go

Go SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go grocery ingredients nutrition raw recipe-api recipes sdk

Last synced: 19 May 2026

https://github.com/puzzlef/graph-openmp

Design of a high-performance parallel graph interface supporting efficient dynamic batch updates.

data digraph directed graph in mtx openmp parallel structure undirected weighted

Last synced: 06 Apr 2025

https://github.com/puzzlef/hybrid-csr

Comparing space usage of regular vs hybrid CSR.

csr data graph hybrid regular space structure usage

Last synced: 06 Apr 2025

https://github.com/cobluestars/dataherd-raika

Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset.

big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience

Last synced: 02 Jan 2026

https://github.com/realabbas/instagram-user-meta-data

This script fetches Instagram user metadata 📷 as an easy-to-use JSON object for displaying Instagram cards.

data instagram javascript metadata nodejs profile user xray

Last synced: 10 May 2026

https://github.com/definetlynotai/test_generator

A tool to create datasets based on configurations from a CSV file. This tool can be used as a skeleton for other software.

algorithim csv data development dynamic exam generator huge nirt powerful python skeleton test tools

Last synced: 21 Jul 2025

https://github.com/bluecolor/lauda

Cross database data transfer tool

data database etl extract jdbc load

Last synced: 02 May 2026

https://github.com/shuklayash02/complete_data_analysis_project

A full data analysis project in which sales data goes through the ask, prepare, process, analyze, share, and act stages of the data analysis process

data data-visualization dataanalysis database datacleaning powerbi sql

Last synced: 16 Jul 2025

https://github.com/alexandregazagnes/ghisa

ghisa - Github Import Statistic Analyzer is a free and open-source software, app and python package that helps you to analyze the import statistics of your github repositories.

analytics data dependencies git github github-api import package pypi python skills tool

Last synced: 27 Jun 2025

https://github.com/adrian-pasek-prv/data-modeling-with-cassandra

Create a data model in Apache Cassandra for music streaming app

apache-cassandra data data-engineering data-modeling python

Last synced: 02 Jan 2026

https://github.com/plurid/deserve

Own Your Data · Control The Code

data owner

Last synced: 16 Jul 2025

https://github.com/openfoodfacts/openfoodfacts-corrector

Ruby script to correct and enhance data on OpenFoodFacts

correction data food ruby

Last synced: 24 Apr 2026

https://github.com/beangreen247/osfetch-old.sh

Script that fetches system information and displays it to the user

247 bash bean beangreen247 data fetch green information neofetch neofetch-clone os script sh shell storage system tem zsh

Last synced: 02 Nov 2025

https://github.com/ibz-04/data-encryption

Encrypting and decrypting hospital patient data such as audio and image files

data decryption encryption

Last synced: 23 Jul 2025

https://github.com/benji-lewis/archivord

An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.

archive data data-mining discord discord-bot typescript

Last synced: 16 May 2026

https://github.com/panukatan/senso

An Interface to the Philippine Census of Population and Housing Data

census data philippines r rstats

Last synced: 29 Jun 2026

https://github.com/kingtous/bots_task_result

Result of the Barcelona OpenMP Tasks Suite (BOTS) using ompTG

data openmp

Last synced: 09 Jul 2025

https://github.com/mheadd/SamDotNet

A C# wrapper for the SAM.gov API.

api business client data gov-api government

Last synced: 30 Apr 2025

https://github.com/12joan/not-analytics

don't be creepy.

data metrics privacy

Last synced: 30 Apr 2025

https://github.com/avto-dev/data-migrations-laravel

Package for database data migrations

data database laravel migrations package

Last synced: 12 Jul 2025

https://github.com/gcoronelc/ucv_gdi-1_202302-b2

Data and Information Management Workshop I with Gustavo Coronel.

data data-science data-structures database databases online oracle query relational-databases security sql sql-server

Last synced: 19 May 2026

https://github.com/davorg/data-tree

Perl library for handling trees

data perl tree

Last synced: 02 Apr 2025

https://github.com/rrwen/slides-covid19-geosocial-db

Presentation titled "A Real-time Geo-social Media Database for Large-scale Coronavirus Disease 2019 (COVID-19) Research" for my second research seminar at Ryerson University

covid covid-19 covid19 data database disease geo gis index media ncov-2019 ncov19 postgres postgresql presentation research seminar slides social virus

Last synced: 18 May 2026

https://github.com/sermetpekin/perse

Perse is an experimental Python package that combines some of the most widely used functionalities from the powerhouse libraries Pandas, Polars, and DuckDB into a single, unified DataFrame object. The goal of Perse is to provide a streamlined and efficient interface, leveraging the strengths of these libraries for versatile data handling.

data data-science data-structures duckdb pandas polars

Last synced: 09 May 2026

https://github.com/katerynazakharova/common-ml

Creating this lib for ML tasks, because I'm bored of copy-pasting the same functions for different projects.

data data-processing deep-learning lib machi

Last synced: 26 Mar 2025

https://github.com/elvis-not-presley-one/lostcassowary

LostCassowary is a Minecraft data miner that searches region (.MCA) files for game data. It can search for banners, signs, biomes, and blocks.

data data-mining data-science dataminer minecraft nbt nbt-parser scraper

Last synced: 12 Apr 2025

https://github.com/jub0t/Eso

An application to manage all your Encryption & Decryption keys and other related tools.

data encryption encryption-decryption hacking hacking-tool keys pgp privacy private

Last synced: 10 May 2025

https://github.com/erictleung/2017-new-coder-survey

Code to help clean and format the 2017 New Coder Survey by freeCodeCamp

coder-survey data data-cleaning dplyr freecodecamp

Last synced: 03 Apr 2025

https://github.com/alpheustangs/jder

A standardized structure for JSON responses

api data error json response specification structure

Last synced: 26 Mar 2025

https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform

This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.

bigquery data data-analysis data-modeling live-streaming sql

Last synced: 23 Jun 2025

https://github.com/bredalis/matplotlib

📊 Library to create graphs in Python 📊

data graphics librery matplotlib matplotlib-pyplot python

Last synced: 30 Mar 2025

https://github.com/alhonaut/quant-assigment

Code for quantitative analysis of Morpho Markets and simulation of the reallocation process in MetaMorpho

analysis data defi quantitative-finance

Last synced: 16 May 2026

https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard

An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊

dashboard data data-analysis data-science data-visualization tableau tableau-public

Last synced: 17 Feb 2026

https://github.com/vulcalien/vulcdataformat

Simple data storage system for Java.

data data-storage java serialization

Last synced: 25 Feb 2025

https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing

Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.

data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates

Last synced: 13 Jul 2025

https://github.com/williamzebrowski/assistant-api

OpenAI Assistant API integrated with Elasticsearch, Logstash & Kibana

ai chatapp chatgpt conversational-ai data elasticsearch kibana llm-inference llms openai rag

Last synced: 16 Feb 2026

https://github.com/e-kotov/mapineqr

Access Mapineq inequality indicators via API

data demogrpahy r rstats socio-economic-indicators

Last synced: 06 Apr 2025

https://github.com/mierune/tinygrib2

(experimental) A tiny toolkit for parsing JMA's GRIB2 files.

data grib grib2 meteorology rust weather

Last synced: 27 Jun 2025

https://github.com/labgua/ilmeteo

Data acquisition from the SHT71 sensor and real-time transmission over the network

acquisition data humidity humidity-sensor iot raspberry-pi real-time realtime rpi sht71 temperatura temperature temperature-sensor umidita web

Last synced: 24 Apr 2026

https://github.com/junkwaxdata/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 13 Mar 2025

https://github.com/soulyma/web_crawler

A focused web crawler to extract and structure Arabic content from web pages. Designed for researchers, data analysts, and developers working on Arabic language datasets.

beautifulsoup4 crawler csv data json python structured-data

Last synced: 15 May 2026

https://github.com/ubc-library-rc/ggplot2_intro_workshop

Workshop about data visualization with ggplot2 in R

data featured workshop

Last synced: 01 Jul 2026

https://github.com/tobinchilongo/oop-school-library

This project consists of a Ruby script for the school library app. I implemented encapsulation and inheritance with Ruby by creating classes to represent students and teachers in the school.

data database gemfile input-output preserve rspec-testing rubocop unit-test

Last synced: 02 May 2026

https://github.com/bhpcv252/dda-binapprox-on-fits

Using the binapprox algorithm to efficiently estimate the median of each pixel from a set of astronomy images in FITS files.

astronomy data median python

Last synced: 22 Mar 2025

https://github.com/viveknathani/maketest

A command line tool to generate test data. 📊

command-line data golang testing-tools

Last synced: 08 Jun 2026

https://github.com/jensz12/uhc

Data pack for Minecraft 1.13+ UHC

data minecraft pack

Last synced: 21 Sep 2025

https://github.com/stonecharioteer/renfield

Synchronize and Search through Hard Drives

catalogue data search storage synchronization

Last synced: 09 Feb 2026

https://github.com/maxnowack/elastic-sync

Connector to sync MongoDB documents into an Elasticsearch index

data elasticsearch mongodb sync

Last synced: 20 Jan 2026