An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/open-i18n/data-iso-15924

Git mirror for ISO 15924, Codes for the representation of names of scripts data

data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems

Last synced: 14 Mar 2026

https://github.com/akv3sic/cryptocurrency-charts

Cryptocurrency API data visualizations 📈 with Matplotlib.

cryptocurrency data data-visualization matplotlib python

Last synced: 16 Oct 2025

https://github.com/ireddragonicy/wascrub

Clean WhatsApp chat exports easily.

chat clean data meta whatsapp

Last synced: 03 May 2026

https://github.com/potreic/etl-fashion-trend-analysis

✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊

airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends

Last synced: 27 Jan 2026

https://github.com/data-forge-notebook/javascript-cheat-sheet

Cheat sheet that accompanies my book Data Wrangling with JavaScript

cheatsheet data data-wrangling javascript nodejs

Last synced: 15 Apr 2026

https://github.com/florianwendelborn/metatypes

Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)

code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript

Last synced: 27 Jan 2026

https://github.com/gematik/poc-isik-patient-merge

The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to notify about merges that have occurred within the ISIK context.

data fhir isik poc

Last synced: 19 Oct 2025

https://github.com/divithraju/divith-aju-hadoop-pyspark-pipeline

This project demonstrates the creation of a scalable data processing pipeline for handling and analyzing log data from a hypothetical e-commerce platform. Leveraging Hadoop and PySpark, the pipeline is designed to process large volumes of log files, providing meaningful insights into user behavior, system performance, and sales metrics.

apache-hadoop-framework apache-spark bigdata client data database dataengineering dataingestionframework datapreprocessing documentation ecommerce-platform hdfs pipeline project project-repository pyspark python3 software-engineering

Last synced: 27 Jan 2026

https://github.com/marcelo-earth/H5N8-Data

🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.

csv data h5n8 h5n8-cases h5n8-virus russia

Last synced: 20 Oct 2025

https://github.com/osiota10/alx-low_level_programming

C Low Level Programming - Data Structures, Linux/Unix System Programming and Algorithms with ALX Software Engineering

algorithms assembly c data data-structures linux shell unix

Last synced: 25 Jun 2025

https://github.com/jaldekoa/fiscaldataapi

A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 27 Jan 2026

https://github.com/gbv/cocoda-mappings

concordances, mappings and conversion scripts to create JSKOS mappings

coli-conc data jskos

Last synced: 28 Oct 2025

https://github.com/azrunguraya/kabyle-corpus-dataset

In the world of Natural Language Processing, access to diverse, well-annotated datasets is essential for developing high-performing models. This project aims to fill that specific gap for Taqbaylit, a Berber language spoken mainly in Kabylia.

ber berber berber-dataset corpus data dataset ia kabyle kabyle-art kb machine-learning nlp nlp-machine-learning python taqbaylit text words

Last synced: 31 Jul 2025

https://github.com/lmuffato/project-job-insights-trybe

Job Insights project - Trybe graded project for Block 32: Introduction to Python

data data-science data-transformation filter python

Last synced: 12 Jun 2025

https://github.com/datenoio/internacia-db

Public registry of intergovernmental organizations, country groups and countries. Available as JSONL, Parquet, YAML and DuckDB database datasets.

countries data datasets international international-trade reference

Last synced: 29 May 2026

https://github.com/rodekruis/510-data-catalog

A CKAN-based data catalog portal for 510.

catalog ckan data opendata

Last synced: 23 Jan 2026

https://github.com/fredhutch/gdscnsoilsites

Homepage for BioDIGS Project. Learn about the project and download data.

biodigs data metagenomics student-research

Last synced: 25 Mar 2025

https://github.com/garcane/Income-Prediction-ML

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 24 Oct 2025

https://github.com/shivam1808/data-cleaning-project

We take raw housing data and transform it in SQL Server to make it more usable for analysis.

analysis data datacleaning sql sqlserver

Last synced: 29 May 2026

https://github.com/michalwols/awesome-data-curation

🗑️ ✨ 📊 Awesome things related to data collection, annotation, cleaning and management.

active-learning annotation cleaning-data data data-science deep-learning machine-learning

Last synced: 24 Jun 2026

https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights

Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.

analytics data excel sas sasprogramming statistical-analysis

Last synced: 24 Feb 2026
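
The BMI calculation and categorization described for the SAS project above can be sketched in Python. This is a minimal illustration, not the repository's code: the function names are hypothetical, and the thresholds (18.5, 25, 30) are the commonly used WHO adult cut-offs, assumed here because the listing does not state which categories the project uses.

```python
def bmi(weight_kg: float, height_m: float) -> float:
    """Body mass index: weight in kilograms divided by height in metres squared."""
    if height_m <= 0:
        raise ValueError("height must be positive")
    return weight_kg / height_m ** 2


def bmi_category(value: float) -> str:
    """Map a BMI value to a category (assumed WHO adult cut-offs)."""
    if value < 18.5:
        return "Underweight"
    if value < 25:
        return "Normal"
    if value < 30:
        return "Overweight"
    return "Obese"
```

For example, `bmi_category(bmi(70, 1.75))` evaluates a person of 70 kg and 1.75 m, whose BMI falls in the "Normal" band under these assumed thresholds.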

https://github.com/cqllum/schema2dwh

⚡ Automatically produce a data model for your database from its information schema using GenAI.

ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design

Last synced: 13 Mar 2025

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 26 Oct 2025

https://github.com/mihaiconstantin/lavot

A `React` application that allows users to indicate how votes will be redistributed among candidates for the second round of Romanian presidential elections.

data data-visualization elections react sankey typescript

Last synced: 06 Feb 2026

https://github.com/williamwutq/mappedpages

A fixed-size page provider backed by memory mapping, intended for building higher-level allocators and storage systems

allocation allocator data data-storage database file memory-mapping mmap page rust rust-crate rust-library storage

Last synced: 25 Jun 2026

https://github.com/bijx/firestore-data-fetcher

A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.

automation data database downloader exporter fetcher firebase firestore open-source script

Last synced: 12 Apr 2026

https://github.com/maccccd/wsoa3029a_2444372

This website serves as an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js, a JavaScript library used to create interactive data visualizations. The visualizations here provide insights into two types of cybersecurity attacks: phishing and ransomware.

d3js data hacking visualization

Last synced: 24 Jan 2026

https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali

Manual for the service information repository of the municipal employment trials

data drupal drupal-9 unemployment

Last synced: 24 Jan 2026

https://github.com/zoekelepiri/ota_observatory

A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.

backend data database spring-boot

Last synced: 06 Feb 2026

https://github.com/stdlib-js/ndarray-base-output-policy-str2enum

Return the enumeration constant associated with an output ndarray data type policy string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils

Last synced: 15 Apr 2026

https://github.com/itu-helper/data-updater

Periodically scrapes data related to ITU to be used by anyone. This data powers the ITU Helper web sites.

data istanbul-technical-university scraper selenium-python

Last synced: 29 Jan 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2

Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.

data data-analysis deloitte excel forensic-analysis

Last synced: 06 Feb 2026

https://github.com/stefanbohacek/exploring-the-mapping-police-violence-dataset

Using my Gutenberg Data Visualization plugin to explore police violence against civilians.

data dataviz police police-brutality police-misconduct

Last synced: 03 Dec 2025

https://github.com/sandk21/etude_eau_potable_monde

Study of access to water around the world - dashboards built with Tableau

analysis data tableau tableau-public visualization

Last synced: 19 Mar 2026

https://github.com/GiveMePseudonyms/PiVisualisations

A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.

data data-visualization pi pygame python self-organising-criticality tkinter

Last synced: 08 Apr 2025

https://github.com/cworld1/novel-data

The data repository of novel analysis

analysis data novel

Last synced: 01 Feb 2026

https://github.com/giladbarnea/to

A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.

conversion data data-conversion json json5 parser script terminal toml yaml

Last synced: 08 Feb 2026

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026
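
The "none by" semantics described in the entry above (every element must fail the predicate) can be sketched in Python. This is an illustrative equivalent, not the stdlib-js implementation; the name `none_by` and its signature are assumptions.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def none_by(items: Sequence[T], predicate: Callable[[T], bool]) -> bool:
    """Return True when every element fails the predicate.

    An empty sequence yields True, since no element passes.
    """
    for item in items:
        if predicate(item):
            # One passing element is enough to answer False.
            return False
    return True
```

The loop stops at the first element that passes, so the predicate is not called on the rest of the sequence.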

https://github.com/garcane/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 09 Feb 2026

https://github.com/3squared/smoulder

Smoulder is a really good data pipe

composition data facade-pattern forge-framework object-oriented

Last synced: 25 Apr 2026

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 30 Mar 2025

https://github.com/danielbello7/nosql-json-database

Simple and quick database to help speed up the development process

data database json json-database models nosql nosql-database nosql-json-database schema

Last synced: 09 May 2026

https://github.com/codenoid/alodokter.com-database

An Alodokter.com database, collected by Hofesh Bot (scraper)

alodokter data extraction hofesh

Last synced: 18 Mar 2026

https://github.com/jhpoelen/bats

Self-documenting data publication on bat (Chiroptera) specimens

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/diddypod/crop-data-converter

A Python script to convert crop data from .txt to .xlsx format

converter crop data openpyxl python

Last synced: 29 Jun 2026

https://github.com/stdlib-js/ndarray-base-fliplr

Return a view of an input ndarray in which the order of elements along the last dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 11 Feb 2026
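
The left-to-right flip described in the entry above reverses element order along the last dimension. A rough Python sketch for a 2-D nested list follows; the name `fliplr` is borrowed from the entry for illustration only, and unlike the library described, this version returns a new nested list rather than a view onto the input.

```python
from typing import List, TypeVar

T = TypeVar("T")


def fliplr(matrix: List[List[T]]) -> List[List[T]]:
    """Reverse the order of elements within each row of a 2-D nested list.

    Returns a copy; the input is left unchanged (not a view).
    """
    return [row[::-1] for row in matrix]
```

Row order is preserved; only the position of elements inside each row changes.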

https://github.com/skygenesisenterprise/aether-account

Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.

account data javascript nextjs platform service sso-service typescript user-interface

Last synced: 16 Apr 2026

https://github.com/tushar2704/applied-ai-playground

This repository serves as a comprehensive collection of resources and projects for Applied Artificial Intelligence (AI). Whether you're an AI enthusiast, a data scientist, or a developer looking to explore practical applications of AI, this repository aims to provide you with valuable materials and hands-on projects to deepen your understanding.

artificial-intelligence data data-science machine-learning machine-learning-algorithms

Last synced: 12 Feb 2026

https://github.com/walidkorchi/data-analysis

📈 University data analysis project at ENCG

analysis data encg science statistics

Last synced: 29 Jun 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/m0nica/datalogues-outdated

Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll

data pelican pelican-blog pelican-theme

Last synced: 28 Feb 2026

https://github.com/ismail-mouyahada/lodscroljs-library

LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks.

data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll

Last synced: 13 Feb 2026

https://github.com/stdlib-js/array-base-every-by-right

Test whether all elements in an array pass a test implemented by a predicate function, iterating from right to left.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 13 Feb 2026
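
The right-to-left variant described in the entry above differs from a plain "every" test only in iteration order, which matters when the predicate has side effects or when a failing element is likely to sit near the end. A minimal Python sketch, with the hypothetical name `every_by_right` (not the stdlib-js implementation):

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def every_by_right(items: Sequence[T], predicate: Callable[[T], bool]) -> bool:
    """Return True when every element passes the predicate, visiting from the last index to the first."""
    for index in range(len(items) - 1, -1, -1):
        if not predicate(items[index]):
            # Stop at the first failure found from the right.
            return False
    return True
```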

https://github.com/frictionlessdata/extensiondp

Extension DP (Data Package Extension Template) is a Git repository template for rapid Data Package extension development

data datapackage exchange extension format

Last synced: 13 Feb 2026

https://github.com/wooldoughnut310/xboxgamertag

Python module to get data from www.xboxgamertag.com

data gamertag html python3 requests xbox

Last synced: 24 Mar 2025

https://github.com/stdlib-js/array-base-assert-is-complex-floating-point-data-type

Test if an input value is a supported array complex-valued floating-point data type.

array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate

Last synced: 14 Feb 2026

https://github.com/blacksujit/shikshamitra

Shiksha Mitra is an innovative MVP designed to reshape the way students learn through gamification. Our platform transforms the traditional approach to education by making learning engaging, interactive, and rewarding. As an MVP, Shiksha Mitra focuses on delivering core features that showcase the value of gamified learning,

ai data gamified-learning hackathon lms ml mlflow mlops mlops-workflow mvp pipeline platforn

Last synced: 28 Feb 2026

https://github.com/diegoperea20/own_dataset_segmentation_yolov8

Object segmentation and detection with a custom dataset using YOLOv8, built from images of a 2023 Colombian 200-peso coin.

coins colombia data opencv own python segmentation tensorflow yolov8

Last synced: 12 Apr 2026

https://github.com/neomutt/sample-data

📚 Lists of things. Useful for developing and testing.

data list sample

Last synced: 19 Mar 2026

https://github.com/mvicens/sporscor

TypeScript API to manage sports data and retrieve scoreboards and statistics

api-client data score scoreboards sport statistics typescript

Last synced: 16 Feb 2026

https://github.com/linx-software/file-import-to-rest-api

Import a CSV file and make the data available via a REST API.

csv data linx low-code

Last synced: 19 Mar 2026

https://github.com/stdlib-js/array-base-none-by-right

Test whether all elements in an array fail a test implemented by a predicate function, iterating from right to left.

all array data every generic javascript node node-js nodejs none predicate stdlib structure test types validate

Last synced: 01 Mar 2026

https://github.com/rafaelfloressouza/Covid-19-Dashboard

Python web application to display worldwide COVID-19 data using Plotly and Dash

bootstrap covid-19 css data datavisualization plotly-dash python3

Last synced: 10 Mar 2025

https://github.com/theonlybeardedbeast/exercise-data

Datasets for workout exercises

data dataset fitness health healthcare

Last synced: 20 Mar 2026

https://github.com/docusign/extension-app-data-io-reference-implementation

Extension App for Data IO Reference Implementation for the Docusign IAM Platform

apps data extension

Last synced: 02 Mar 2026

https://github.com/anthonybench/datapeek

Peek at a summary of a data file in a succinct, opinionated manner.

cli data data-analysis

Last synced: 02 Mar 2026

https://github.com/metapsy-project/data-depression-inpatients

Database of depression psychotherapy trials in inpatient settings

data

Last synced: 27 Mar 2026

https://github.com/victorowinoke/after-work-data-science-project-showcase-eda

You work for Lublu as a Data Science Consultant and you have been tasked to perform analysis on pricing, product and assortment of Adidas and Nike. Create a descriptive analysis report, making relevant observations and recommendations that will help Lublu in the launch of such similar products.

adidas analysis data deliverables nike pythonanalysis ranges

Last synced: 28 May 2026

https://github.com/insolite/react-data-frame

Table for huge data sets

data react table

Last synced: 14 May 2026

https://github.com/stdlib-js/array-base-every-by

Test whether all elements in an array pass a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 03 Mar 2026

https://github.com/ginga1402/chinook_database

Microsoft SQL Server Management Studio

business-query data sql-server

Last synced: 30 Mar 2025

https://github.com/jorgeatgu/apaga-luz

💡 How much does electricity cost? 💶

data data-visualization flat-data

Last synced: 04 Feb 2026

https://github.com/denisecase/datakit-lite

Helpful utilities for Python data projects

analysis data education kit lite utils

Last synced: 04 Mar 2026

https://github.com/inc44/raqua

Raqua 💧, a set of Python scripts and a Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures by creating an overflowing cache

cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows

Last synced: 11 Apr 2026

https://github.com/tatey/list_of_countries

A list of countries, states, and cities in Ruby

cities countries data ruby states

Last synced: 11 Nov 2025

https://github.com/palewire/nyc-hpd-bronx-lead-paint-violations

Download and process housing code lead paint violations in the Bronx from NYC Open Data

bronx data data-journalism news nyc python

Last synced: 02 Apr 2026