An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/devtin/duckfficer

Zero-dependencies light-weight library for modeling, validating and sanitizing data 🦆 🐵 👁

coercion data duck-typing json parsing schema validation

Last synced: 01 Mar 2025

https://github.com/onaio/gisida-react

React Dashboard library for Gisida.

dashboard data gisida map react visualization

Last synced: 28 Apr 2025

https://github.com/codewell/data-kale

The Simple Data Lake - Data Kale

data data-lake python

Last synced: 22 Feb 2025

https://github.com/qeeqbox/data-compliance

Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse

compliance data data-compliance infosecsimplified qeeqbox

Last synced: 05 Sep 2025

https://github.com/analyticace/budgetgenerator

This is a Budget Generator project that allows you to manage and track expenses for various events

budget budget-app budget-manager data database finance json pyhton tracker

Last synced: 08 Apr 2025

https://github.com/elianhugh/streams

Flexible data streaming for R

data package r r-package streaming

Last synced: 10 Mar 2025

https://github.com/jsdhami/python-for-research

"Python-For-Research" Event Organized By Tri-Chandra Research Group, Ghantaghar, Kathmandu

analysis colab data jupyter matplotlib numpy panda physics python research visualization

Last synced: 27 Oct 2025

https://github.com/tosun-si/world-cup-qatar-team-stats-kotlin-midgard

This application shows a full Apache Beam pipeline with Kotlin and Midgard library. The use case works on the last Qatar FIFA world cup data and calculate players statistics per team. This application will be presented at Beam Summit 2023 in New York

apache-beam beam-summit data kotlin midgard world-cup-2022

Last synced: 01 Feb 2026

https://github.com/mews-labs/crep

This simple module aims at providing some function to tackle tabular data that have a continuous axis. In situations, this index can represent time, but this tool was originally developed to tackle rail way description.

data pandas pandas-dataframe python python3 rails-application time-series

Last synced: 11 Nov 2025

https://github.com/xcanwin/wechatdat2pic

解码微信的临时文件. Decoding the WeChat temporary file.

dat data decode decrypt filestorage wechat

Last synced: 21 Jun 2025

https://github.com/robertoentringer/lod-opendata

A NPM package for get data of Lëtzebuerger Online Dictionnaire (LOD) from data.public.lu.

api data dictionary json-api lod-lu luxembourg luxemburgish open-data package parse public public-api

Last synced: 05 Sep 2025

https://github.com/haideralipunjabi/harrypotter-analysis

Repository with code to generate visualisations of Harry Potter Fanfiction and Books

analysis data harry-potter python visualization wordcloud

Last synced: 25 Mar 2025

https://github.com/muhammadibrahim313/datavue

"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.

analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit

Last synced: 10 Apr 2025

https://github.com/yezz123/awsflowutils

Improve your data workflow with enhanced simplicity and robustness in handling common data tasks ✨

aws data redshift s3 s3-bucket workflow

Last synced: 07 Jan 2026

https://github.com/rsn601kri/classifiertoidentifydogbreeds

This project aims to develop an image classification system to identify dog breeds using deep learning models. The classifier leverages pre-trained models from the PyTorch library, including ResNet18, AlexNet, and VGG16, to achieve accurate breed identification from images.

aiml cnn data model python pytorch

Last synced: 26 Sep 2025

https://github.com/0xdir/htcds_dart

Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.

data humanitarian schema standards

Last synced: 24 Oct 2025

https://github.com/ggreen/data-orchestration-with-scdf-showcase

data-orchestration-with-scdf-showcase

data orchestration scdf spring

Last synced: 14 Jan 2026

https://github.com/reala10n/simplejsondb

Create a simple JSON database with just one line of code!

data database db easy json python simple

Last synced: 27 Oct 2025

https://github.com/arda-guler/binsonograph

Encode any binary file into an audio file. Sister project of https://github.com/arda-guler/binGallery

audio converter data encoder proof-of-concept sonification sound

Last synced: 21 Jun 2025

https://github.com/monfireboose/monfireboose

A lightweight JavaScript library that provides a high level and model based API for interacting with Firebase.

api data database firebase firestore high-level-api interact javascript library model storage

Last synced: 26 Oct 2025

https://github.com/pranavpandey/dynamic-backup

Backup and restore app data on Android.

android app backup data library restore storage

Last synced: 07 Sep 2025

https://github.com/udityamerit/python-librearies-for-data-science

Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data

beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow

Last synced: 06 Feb 2026

https://github.com/patilni3/sql_practice_files

SQL Skills: Full Range from Beginner to Expert using SQL_Server

data database mssql mssql-database mysql mysql-database sql

Last synced: 14 Jun 2025

https://github.com/josecsotomorales/dbt

Repository for testing data build tool (dbt)

business-intelligence data data-engineering data-transformation dbt dbt-packages

Last synced: 06 Jan 2026

https://github.com/flexiodata/functions-covid-19-feed

Import Covid-19 data from Johns Hopkins University into Microsoft Excel and Google Sheets.

covid-19 data excel google-sheets import johns-hopkins-csse johns-hopkins-university spreadsheet

Last synced: 10 Mar 2025

https://github.com/suh1z/rakkauttify_fullstack

CS2 Data and Statistics Dashboard -fullstackproject

analytics data expressjs gaming mongo nodejs react redux

Last synced: 24 Oct 2025

https://github.com/legopitstop/addons

All legopitstop's Bedrock add-ons in one place.

add-on assets behaviorpack data hacktoberfest minecraft mods modtoberfest resroucepack vanilla

Last synced: 06 Feb 2026

https://github.com/rousan/bytevault

A command line application that stores sensitive data as key-value pair securely in local machine

application byte c command-line data encrypts key-value sensitive vault

Last synced: 16 Mar 2025

https://github.com/smolsoftboi/php-faker-providers

Faker providers that generate fake data for you.

data faker faker-generator faker-provider generator php

Last synced: 22 Apr 2025

https://github.com/jmsallan/esdata

A R package to bring Spanish economic databases into the R environment

data datasets ine inflation spain unemployment-data

Last synced: 18 Jan 2026

https://github.com/cnayan/q-server

Gives API for back-end server connectivity; MS SQL Server connector provided.

data database provider q-server query query-engine

Last synced: 09 Oct 2025

https://github.com/robertmyles/riscobrasil

An R package to download 'Brazil Risk' data :chart_with_upwards_trend:

brazil data finance r

Last synced: 08 Apr 2025

https://github.com/desultory/pycpio

Python library for CPIO manipulation

cpio cpio-archives data initramfs pypi-package python python-3 python3

Last synced: 04 Feb 2026

https://github.com/mabel-dev/opteryx-catalog

📚 Opteryx Cloud Catalog

catalog data python sql

Last synced: 05 Feb 2026

https://github.com/odd12258053/dade

dade is data definition for Rust structures.

data json json-schema parsing rust validation

Last synced: 12 Jun 2025

https://github.com/ravi-prakash1907/caterapp

A Quick & Secured Data Sharing Application!

application cater caterapp data data-sharing pip python

Last synced: 06 Sep 2025

https://github.com/lewagon/matplotlib

Matplotlib examples for Le Wagon's Data Science bootcamp

data

Last synced: 13 Jul 2025

https://github.com/cmudig/mosaic-profiler

A data profiler built with Mosaic

data jupyter visualization

Last synced: 25 Oct 2025

https://github.com/mrlynn/30-min-data-web-form

30 Minutes to a Data Enabled Web Form with MongoDB

beginner data html html-form javascript mongodb mongodb-atlas mongodb-database web webforms

Last synced: 05 Jul 2025

https://github.com/dkxce/osm2shp

Flexible OSM to SHP Converter (convert .osm & .pbf files to ESRI Shape .shp files). OSM to Shape.

converter data dbf dkxce earth esri map maps openseamap openstreetmap osm pbf routes shape shapes shp

Last synced: 19 Jan 2026

https://github.com/geocollections/emaapou

eMaapõu: Eesti maapõue andmebaas

data database estonia geology portal

Last synced: 05 Feb 2026

https://github.com/animenosekai/cain

A small yet powerful data format ✨

cain data format python

Last synced: 03 Sep 2025

https://github.com/abrudz/parsing

Dyalog APL expressions to parse common and unusual data formats from text files

apl csv data data-format dyalog-apl dyalogapl parsing

Last synced: 18 Jan 2026

https://github.com/justfairdev/Web-Stack-Query

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

async cache data fetch graphql hooks query react resources rest

Last synced: 14 Oct 2025

https://github.com/vaibhavpandeyvpz/cbse-scraper

This script scrapes information about schools affiliated with CBSE for a given state.

cbse crawler data schools scraper

Last synced: 12 Jul 2025

https://github.com/ryanmorr/fastmap

Accelerated hash maps

data hashmap javascript map performance

Last synced: 10 Oct 2025

https://github.com/faster-games/whiskey

Data and Events framework for Unity. 🥃⚡

data events framework unity3d

Last synced: 11 Oct 2025

https://github.com/as/worm

Worm provides write-once read-many log-structured storage semantics

data log record storage worm

Last synced: 31 Jan 2026

https://github.com/severo/data-grid-cartograms

A curated collection of grid cartograms

cartogram collection curated data dataviz grid

Last synced: 04 Feb 2026

https://github.com/ejfox/election-helpers

A collection of resources, tools, and patterns for election data analysis and viz

data elections helpers

Last synced: 15 Apr 2025

https://github.com/cicerops/monitoring-check-grafana

Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.

data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale

Last synced: 01 Mar 2025

https://github.com/geopython/pygeoapi-examples

Example pygeoapi deployment patterns and configurations

api data geospatial ogc ogc-api osgeo pygeoapi

Last synced: 11 Oct 2025

https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation

Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.

colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats

Last synced: 04 Sep 2025

https://github.com/baked-libs/bstats-discord-integration

A simple program which queries https://bstats.org/ and presents this data in a highly customizable discord webhook

bstats data discord discord-webhook javascript minecraft notifications paper plugin spigot statistic stats typescript webhook

Last synced: 28 Apr 2025

https://github.com/datalayer/desktop

Ξ 🖥️ Datalayer Destkop.

ai data data-analysis data-science datalayer desktop electron

Last synced: 25 Oct 2025

https://github.com/definetlynotai/llm_data

A bunch of very famous repos source code's in python as pure localdocs all in this repo to train CODE AI

c code-examples cpp cuda data data-dum jupyter-notebook llm llm-code llm-datasets programming-data programming-data-sets python3

Last synced: 08 Oct 2025

https://github.com/msrd0/gotham_formdata

Form data parsing for the gotham web framework

data form gotham html http multipart rust server urlencoded

Last synced: 16 Mar 2025

https://github.com/critocrito/data-scores-in-the-uk

Investigate the uses of data analytics and algorithms in public services in the UK.

clojure data data-investigation data-preservation javascript social-sciences sugarcube uk

Last synced: 18 Oct 2025

https://github.com/bluegreen-labs/oneflux_containers

Containerized (docker) versions of the ONEFlux processing pipeline

data ecosystem fluxes micrometeorology processing

Last synced: 07 Oct 2025

https://github.com/binarybardakshat/suryanayan

Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.

data drone nlp python solar

Last synced: 10 Oct 2025

https://github.com/open-i18n/data-unicode-cldr

Git mirror for Unicode Common Locale Data Repository (CLDR) data

cldr data open-i18n unicode unicode-consortium

Last synced: 07 Feb 2026

https://github.com/vijishmadhavan/parse-clip

A simple CLIP based project for combining images from multiple datasets.

clip data datacleaning dataexploration dataset fastai image python

Last synced: 09 Oct 2025

https://github.com/sungchun12/sqlmesh-demos

SQLMesh project for live demos - provides instructions so you can run this on your own!

data data-engineering sql sqlmesh

Last synced: 24 Oct 2025

https://github.com/bisonai/datamaxi-rust

Official Rust SDK for DataMaxi+ API

arbitrage cex cryptocurrency data dex trading

Last synced: 23 Apr 2025

https://github.com/quetz-al/quetzal

Quetzal API (short for Quetzalcoatl): a data and metadata management application

api data data-science flask-application openapi3 python quetzal

Last synced: 04 Jul 2025

https://github.com/louisbrulenaudet/legalkit-pipeline

Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.

data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python

Last synced: 17 Mar 2025

https://github.com/j1sk1ss/dateapppc.exmpl

Простое нативное приложение для Windows с демонстрацией ООП и SQL баз данных на примере приложения для знакомств.

data oop-principles parsing pgadmin4 sql wpf

Last synced: 03 Aug 2025

https://github.com/farovictor/mongodbextractor

This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDb databases.

data go mongodb pipeline

Last synced: 14 Jan 2026

https://github.com/ium101/files-and-folders-lister-z

Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.

application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows

Last synced: 09 Oct 2025

https://github.com/yakupzengin/data-structures-and-algortihms

This repo contains implementation of data structures and algorithms using JAVA

algorithms algorithms-and-data-structures data structure

Last synced: 03 Dec 2025

https://github.com/asidlo/po

Data science library for manipulating data in Go using the familiar DataFrame and Series constructs from the Python Pandas library.

data dataframe go pandas series

Last synced: 14 Jan 2026

https://github.com/mihasm/arso-scraper

Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.

api arso cli data historical-data meteorological python slovenia weather

Last synced: 12 Jun 2025

https://github.com/saleh0987/mohamed_saleh

That's my personal website where I show my skills and projects.

aos-animation axios boot data json nextjs portfolio portfolio-website projects react-icons reactjs sass swiper

Last synced: 09 Apr 2025

https://github.com/Nazaniiin/EDA_QualityofRedWine

:wine_glass: :chart_with_upwards_trend: (EDA) R - Vizualization / Performed exploratory analysis and visualization on Red Wine Quality dataset; Mainly answering which chemical properties influence the quality of red wines.

charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization

Last synced: 30 Jul 2025

https://github.com/divithraju/divith-raju-searchengine-wikipedia

search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.

algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia

Last synced: 20 Feb 2025

https://github.com/divithraju/divith-raju-openmetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction

Last synced: 20 Feb 2025