An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/m-muecke/isocountry

R package containing ISO codes for countries and currencies

country-codes currency-codes data iso-3166-1 iso-4217 r r-package

Last synced: 20 Mar 2025

https://github.com/elazar/pycopyql

Exports a subset of data from a relational database.

data database export relational tool utility

Last synced: 16 May 2026

https://github.com/jbdesbas/custom-scripts

Custom SQL functions or scripts

data database sql

Last synced: 28 Jun 2026

https://github.com/glassflow/pipelines-push-action

This Github Action lets you automate GlassFlow pipelines deployments as code

data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing

Last synced: 19 May 2026

https://github.com/uhstray-io/just-dashboards

Light and Easy Rust-Fullstack/WASM application to build dashboards from any data source

analytics data dioxus rust visualization

Last synced: 29 Mar 2025

https://github.com/hoaihuongbk/lakeops

A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.

data data-operations dataengineering datalake

Last synced: 07 Mar 2026

https://github.com/lisakey/convert-csv-to-sav

We used python 🐍 to convert a csv file into a sav file with all the modifications needed to open it in IBM spss and be able to analyse our data.

analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations

Last synced: 08 May 2026

https://github.com/agustinmusanti/sqlchallenge-2

This repository contains my solutions to a SQL challenge using MySQL, centered around a fictional retail company called TechMarket. The challenge covers various SQL tasks such as data retrieval, manipulation, and analysis, simulating real-world scenarios within a retail business environment.

challenge data mysql

Last synced: 03 Apr 2025

https://github.com/real-veersandhu/cia-country-comparison

Data analysis system on the CIA World Factbook

data

Last synced: 25 Feb 2025

https://github.com/codenoid/storial.co-database

a Storial.co Database, collected by Hofesh Bot (Scrapper)

data database

Last synced: 28 Mar 2025

https://github.com/bastianolea/fonasa_beneficiarios

Datos de beneficiarios del Fondo Nacional de Salud, por tramo del sistema, edad, tramo de edad, sexo, y comuna.

chile comunas data estado genero salud social

Last synced: 27 Feb 2026

https://github.com/benji-lewis/archivord

An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.

archive data data-mining discord discord-bot typescript

Last synced: 16 May 2026

https://github.com/aleklukanen/chapterhousedb-example-app

An example application using the ChapterhouseDB processing engine

arrow data database event golang parquet processing stream

Last synced: 18 Apr 2026

https://github.com/wahyuwsslah/salary_prediction-aiml

Salary Prediction using Machine Learning with 3 Models. Linear Regression, Decision Tree, Random Forest

ai analytics data data-science datascience machine-learning python python3

Last synced: 19 May 2026

https://github.com/gbburleigh/quick-seeders

Generate realistic test data quickly with Quick-Seeders, a Python library offering a wide range of data types and schema definitions. Control data variance, probabilities, and output formats, including SQL. Simplify your data seeding process and improve testing efficiency.

data dataset faker generator python seeder sql test

Last synced: 03 Apr 2025

https://github.com/giscience/measures-rest-sparql

A SPARQL endpoint for the Measures REST OSHDB App framework.

data osm quality semantics sparql sparql-endpoints

Last synced: 24 Jun 2025

https://github.com/lunastev/wson-rust

WSON data serialization parser

data parser serialization

Last synced: 07 Apr 2025

https://github.com/millengustavo/salarios-data-science

Aplicativo Streamlit de exploração dos dados da Pesquisa de mercado de Data Science feita pelo Data Hackers

brasil brazil ciencia-de-dados data data-science heroku salarios salary

Last synced: 07 Oct 2025

https://github.com/public-health-scotland/waiting_times_clinical_prioritisation

This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.

data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time

Last synced: 26 Jul 2025

https://github.com/lmuffato/project-restaurant-orders-trybe

Projeto restaurant orders - Projeto avaliativo da Trybe do Bloco 36: Estrutura de Dados I: Arrays, Hashmaps e Sets

array array-set csv data data-analysis hashmap python set trybe trybe-projects

Last synced: 13 Sep 2025

https://github.com/alhonaut/quant-assigment

Code for quant analyz Morpho Markets and simulation reallocation process in MetaMorpho

analysis data defi quantitative-finance

Last synced: 16 May 2026

https://github.com/yoursrijit/data-structure-with-java

A data structure is a named location that can be used to store and organize data. And, an algorithm is a collection of steps to solve a particular problem. Learning data structures and algorithms allow us to write efficient and optimized computer programs.

data datastructures dsa-algorithm java linked-list

Last synced: 13 Mar 2025

https://github.com/denisecase/nw-network-data-analytics

Network for those earning a NW Masters of Applied Data Science

analytics data

Last synced: 02 Feb 2026

https://github.com/chrisru/f1stats

🗄️ Speedy API for Formula 1 statistics

api data fast formula1

Last synced: 20 Mar 2025

https://github.com/linas/archeo

File Recovery, Integrity and Archive Management

corruption data monitoring recovery

Last synced: 29 Mar 2025

https://github.com/diddypod/crop-data-comparer

A Python script to compare crop data over years

comparison crop data openpyxl python

Last synced: 28 Jun 2026

https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian

I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively

data data-visualization tableau

Last synced: 17 Feb 2026

https://github.com/stdlib-js/ndarray-base-reverse-dimension

Return a view of an input ndarray in which the order of elements along a specified dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 07 Mar 2026

https://github.com/vishwagauravin/screener-scraper-pro

Effortlessly scrape comprehensive financial data from screener.in and use it in your projects. No API key required.

data finance finances market-data scraper scrapers screener screener-in screener-plugin stock stock-data stock-market stocks

Last synced: 18 Feb 2026

https://github.com/sandravizz/global_inequality_story

Dataviz Project about Global Inequality

data data-visualization inequality

Last synced: 03 Jul 2025

https://github.com/frequentlymisseddeadlines/chessfessor

Command line tool to extract game data from Lichess.org and Chess.com

chess data extract lichess pgn

Last synced: 19 May 2026

https://github.com/kevinsames/spark-fuse

spark-fuse is an open-source toolkit for PySpark — providing utilities, connectors, and tools to fuse your data workflows together.

data databricks fabric pyspark python spark

Last synced: 08 May 2026

https://github.com/geo-c/oct-ckan

The Open City Toolkit (more information about the project: http://geo-c.eu)

cities collaboration data open participation transparency

Last synced: 16 May 2026

https://github.com/bayer-group/cmc-ontologies

This is a submodule of cmc-knowledge-graph-setup. It contains ontologies and relevant data graph files

data ontologies owl turtle

Last synced: 16 Jun 2025

https://github.com/ayushverma135/accenture-data-analytics-and-visualization

This program provided practical experience in advising a hypothetical social media client as a Data Analyst at Accenture. The simulation involved cleaning, modeling, and analyzing multiple datasets, culminating in the creation of a PowerPoint deck and video presentation to communicate key insights.

accenture analytics data data-visualization forage presentation

Last synced: 19 Sep 2025

https://github.com/chompfoods/sdk-go

Go SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go grocery ingredients nutrition raw recipe-api recipes sdk

Last synced: 19 May 2026

https://github.com/parzibyte/cifrar-descifrar-php

Cifrar y descifrar datos con PHP usando la librería php-encryption; cifrar con clave general o con claves generadas por contraseñas de usuarios

crypto data decrypt encryption password php security

Last synced: 20 May 2026

https://github.com/incubrain/awesome-maharashtra-data

A collection of datasets specific to Maharashtra, India. WIP

ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi

Last synced: 23 May 2026

https://github.com/whatheheckisthis/pwc_project-

Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver

case-study data data-science dataanalytics databases datavisualization powerbi virtual

Last synced: 05 Apr 2025

https://github.com/ibnz36/arrowpipe

Build complex pipelines easily

cargo crate data pipe rust

Last synced: 13 Apr 2025

https://github.com/thomd/git-scrape-hacker-news

scrape hacker news metadata for data analysis

data data-science git-scraping hacker-news

Last synced: 16 Sep 2025

https://github.com/aditya172926/blockchain_indexers

Indexers to fetch data from blockchain events and transactions data with their parameters

blockchain data indexers rust

Last synced: 02 Aug 2025

https://github.com/stdlib-js/array-base-any-by-right

Test whether at least one element in an array passes a test implemented by a predicate function, while iterating from right to left.

any array data generic javascript node node-js nodejs predicate some stdlib structure test types validate

Last synced: 14 Apr 2025

https://github.com/tsiarokhin/student_bsu_by

Tool for parsing various BSU student information from student.bsu.by website.

belarus bsu data grades python students study university

Last synced: 28 May 2026

https://github.com/jonsafari/toy-data

Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks

data language-data machine-translation nlp sanity-checks toy-data

Last synced: 06 Nov 2025

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/eugenedakin/steganography-pictures

Add and remove a picture-in-a-picture with steganography

compare data steganography steganography-tools xojo

Last synced: 12 Feb 2026

https://github.com/epogrebnyak/business-conditions-digest-2017

Replicate illustration from Business Conditions Digest

data economics

Last synced: 22 Mar 2025

https://github.com/lucaaszsx/spyder

A powerful schema-based web scraping library for Node.js built for fast, structured, and reliable data extraction.

cheerio crawler data dom dom-manipulation html json json-ld parser scraper web xml

Last synced: 11 Jun 2026

https://github.com/lmuffato/project-mongodb-commerce-trybe

Projeto Mongodb Commerce - Projeto avaliativo da Trybe do Bloco 23: MongoDB: Updates Simples e Complexos

api back-end crud data database mongo mongodb query trybe-projects update

Last synced: 08 May 2026

https://github.com/stdlib-js/ndarray-base-dtype-resolve-str

Return the data type string associated with a supported ndarray data type value.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 06 Mar 2026

https://github.com/habedi/adbis-2023-paper

This repository hosts the code and data used for the experiments reported in the paper titled "Diversification of Top-k Geosocial Queries", published in ADBIS 2023

artifacts conference-paper data experiments graphs java research-paper

Last synced: 19 May 2026

https://github.com/marxmit7/kaggle

Kaggle competitions

data kaggle kaggle-competition

Last synced: 19 May 2026

https://github.com/aboualine/sql-formation

Library Management System Database: A MySQL project with tables, triggers, stored procedures, and views for managing books, members, and borrowings. Includes sample data for testing. Ideal for learning SQL or building a library app.

data database library-management-system mysql sql system

Last synced: 18 Apr 2026

https://github.com/coral/ddp

Distributed Display Protocol (DDP) in Go

data ddp distributed golang led pixel protocol wled

Last synced: 26 Jun 2025

https://github.com/ushkinaz/cbn-data

Automated game data extraction and processing for Cataclysm: Bright Nights. Provides JSON mirrors, WebP asset conversion, and unified translation data.

cataclysm-bn data wiki

Last synced: 07 Mar 2026

https://github.com/cont-limno/lagosus-reservoir

Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.

data module

Last synced: 17 Jan 2026

https://github.com/jimbrig/jimstaskviews

CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com

cran data docs rstats shiny-app submodules task-views

Last synced: 06 Mar 2026

https://github.com/d-ganchar/thedus

Thedus is a lightweight migration tool for Clickhouse

cli clickhouse data database migration migrations python

Last synced: 12 Apr 2025

https://github.com/aruneshbasak/python-dsa-problems-geeksforgeeks-160-days

I will upload my daily Python DSA problems solved on GeeksforGeeks and post it here!

algorithms-and-data-structures and data data-structures dsa python python3 structure

Last synced: 08 May 2025

https://github.com/qeeqbox/data-security

Safeguarding your personal information (How your info is protected)

data data-security infosecsimplified qeeqbox security

Last synced: 19 Mar 2026

https://github.com/qeeqbox/data-lifecycle-management

Data Lifecycle Management (DLM) is a policy-based model for managing data in an organization

data data-lifecycle-management infosecsimplified lifecycle management qeeqbox

Last synced: 07 Mar 2026

https://github.com/exponea/exponea-python-sdk

⚠️ DEPRECATED Python SDK for Exponea Data and Tracking API

api data exponea python sdk

Last synced: 09 Apr 2026

https://github.com/kerlossony/nested-formdata

Nested-FormData is a Function designed to handle nested form data structures in a simplified and efficient way. It helps in managing complex form data, making it easier to work with forms that require hierarchical data

data forms javascript nested-structures nextjs reactjs typescript

Last synced: 08 Mar 2026

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025

https://github.com/jvrck/australianpayphones

Get Australian payphone data in GeoJSON format.

australia data geojson geojson-data scraper

Last synced: 04 Apr 2025

https://github.com/wklee610/de_project

[Data Engineer] Personal Toy Project For Study

data dataengineering

Last synced: 31 Mar 2025

https://github.com/lagden/injection

Inject data into file

data file inject nodejs

Last synced: 24 Apr 2026

https://github.com/panukatan/senso

An Interface to the Philippine Census of Population and Housing Data

census data philippines r rstats

Last synced: 29 Jun 2026

https://github.com/open-i18n/data

Open i18n Data packages

data internationalization

Last synced: 07 Mar 2026

https://github.com/redodo/shipper

Hide encrypted data in files.

audio data images python steganography

Last synced: 26 Mar 2025

https://github.com/yazeed44/reform-api

A platform that harnesses the power of multiple data streams including satellite imagery and drone photos to visualize multiple urban planning indices and provide descriptive analytics that will empower local Saudi authorities to make data-driven decision that contribute to neighborhood quality of life.

data geojson python

Last synced: 18 May 2026

https://github.com/penspanic/datra

Datra is a comprehensive data management system for game development.

data game game-development gamedata unity unity-package unity3d-plugin

Last synced: 19 May 2026

https://github.com/gcoronelc/ucv_gdi-1_202302-b2

Taller de Gestión de Datos e Información I con Gustavo Coronel.

data data-science data-structures database databases online oracle query relational-databases security sql sql-server

Last synced: 19 May 2026

https://github.com/akatrevorjay/helm-nuke

Nukes all helm releases as well as tiller-owned k8s objects that may be left lying around.

all data destroy helm plugin

Last synced: 19 Sep 2025

https://github.com/uvaio/datasets

Notebooks for data processing, scraping, machine learning

data dataset jupyter jupyter-notebook learning machine ml model ontology

Last synced: 21 Mar 2025

https://github.com/inzhenerka/scooters_data_uploader

Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех

data dbt postgresql

Last synced: 04 May 2026

https://github.com/sixarm/sixarm_ruby_fab

SixArm.com → Ruby → Fab gem to fabricate sample data for testing

data fabrication factory fake gem mock ruby

Last synced: 24 Jul 2025

https://github.com/codedotjs/indiaartfairy

:beetle: Data & More - India Art Fair • 2018 - 2024

csv data python scraper

Last synced: 16 Jun 2025

https://github.com/luminati-io/pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 17 Mar 2025

https://github.com/georginapuig/graps-from-csv

📊 Data visualization with c3.js and Papaparse from CSV files.

c3 c3js chart d3 d3js data data-visualization graphs javascript javascript-library visualization

Last synced: 19 May 2026

https://github.com/nafisalawalidris/elfeenah

Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.

artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning

Last synced: 11 Sep 2025