An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/chompfoods/stub-go-server

Go server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go-server go-swagger grocery ingredients nutrition raw recipe-api recipes

Last synced: 17 Apr 2026

https://github.com/stdlib-js/array-zero-to-like

Generate a linearly spaced numeric array whose elements increment by 1 starting from zero and having the same length and data type as a provided input array.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 07 Jan 2026

https://github.com/fiskeben/meetjescraper

HTTP proxy for Meet je stad project

api data go iot meetjestad proxy scraper weather

Last synced: 29 May 2026

https://github.com/stdlib-js/array-zero-to

Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 08 Jan 2026

https://github.com/igorwastaken/math-problems

Solve math problems easily with this utility library.

algorithm area data demography geography javascript math npm package population school typescript util utils

Last synced: 23 Feb 2026

https://github.com/kledenai/jsonweaver

A powerful and easy-to-use library for transforming JSON data into popular formats such as CSV, XML, Markdown tables, YAML, and JSONLines (NDJSON).

csv data data-transform format json jsonlines jsonweaver markdown markdown-tables xml yaml

Last synced: 24 Feb 2026

https://github.com/dev-owdenmag/dataflow-manager

A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.

data data-management data-management-platform data-visualization python

Last synced: 30 Mar 2025

https://github.com/palewire/nyc-hpd-bronx-lead-paint-violations

Download and process housing code lead paint violations in the Bronx from NYC Open Data

bronx data data-journalism news nyc python

Last synced: 02 Apr 2026

https://github.com/iwconfig/svtplay-data

Daily JSON backup of content metadata from SVTPlay

data metadata streamlink svtplay svtplay-dl youtube-dl

Last synced: 24 Oct 2025

https://github.com/rohancyberops/r-language

R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.

caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse

Last synced: 12 Oct 2025

https://github.com/bukalapak/bukadata

Data supplier plugin for populating design with real data.

data plugin sketch sketch-plugin

Last synced: 05 Jul 2025

https://github.com/rtmigo/pickledir_py

File-based key-value storage. Serializes keys and values with pickle

cache caching data directory file linux macos package pickle python windows

Last synced: 17 Apr 2026

https://github.com/jahilldev/immutable-parsejs

Parse a JS object or array/map into an Immutable collection. Makes use of ImmutableJs List, and Record primitives.

data immutablejs javascript json nodejs parse typescript

Last synced: 13 Apr 2026

https://github.com/toransahu/metoffice

Data visualisation - MetOffice

data metoffice uk visualization weather

Last synced: 25 Mar 2025

https://github.com/strata/data

Tools to help you read data from a range of different data providers.

api data data-integration

Last synced: 27 Jan 2026

https://github.com/rousan/weshare

An application that transfers files between devices

c-sharp data dot-net file lan phone share transfer-data weshare wifi

Last synced: 17 Apr 2026

https://github.com/open-i18n/data-iso-15924

Git mirror for ISO 15924, Codes for the representation of names of scripts data

data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems

Last synced: 14 Mar 2026

https://github.com/gbv/cocoda-mappings

concordances, mappings and conversion scripts to create JSKOS mappings

coli-conc data jskos

Last synced: 28 Oct 2025

https://github.com/sadmanca/uoft-pey-coop-job-postings

Code for parsing approximately 1.8k HTML pages of UofT PEY co-op job postings (from September 2023 to May 2024) to a single sqlite3 database file.

co-op data html python singlefile sqlite sqlite3 uoft uoft-pey

Last synced: 17 Apr 2026

https://github.com/tarantinoarchive/dec

Developer-Easy CMS

cms data easy ejs js json simple

Last synced: 11 Mar 2026

https://github.com/gallo13/neuralnetworks-deeplearning-stats-classification

Descriptive Statistics, Classification and Analysis Using Python & Python Libraries (Assignment 1)

analysis data datasets deep-learning jupyter-notebook matplotlib neural-networks numpy pandas plotting python seaborn

Last synced: 17 Apr 2026

https://github.com/jerryfzhang/rockets

A Node + React App that displays space launch missions around the world.

bootstrap data expressjs less momentjs nodejs react reactjs reactstrap

Last synced: 10 Apr 2026

https://github.com/sogful/archive

you will NEVER believe what this repository contains

archive crawl data scrapes sites

Last synced: 03 Jun 2026

https://github.com/snimmagadda1/stack-exchange-dump-to-mysql

Batch pipeline to import Stack Exchange XML data dumps to relational DB

batch data mysql spring-batch stackoverflow

Last synced: 30 Mar 2025

https://github.com/izam-mohammed/data-source

🌐 A source directory for the data of my projects and experiments.📂 This curated collection simplifies access to diverse data that used in various projects💡

csv-files data data-source zip-files

Last synced: 03 Jun 2026

https://github.com/eugenedakin/caesarcipher

Native Xojo code for the Caesar Cipher algorithm with an example program

caesar-cipher data decryption encryption xojo

Last synced: 07 Jan 2026

https://github.com/iamlucianojr/laravel-api-query-handler

:flashlight: This Laravel package helps to handle a query request properly

api collection data eloquent handler l5x laravel query

Last synced: 28 Apr 2026

https://github.com/nnavales/desafios-data-engineer

En este proyecto abordaremos desafíos comunes en el rol de un Data Engineer con tecnologías modernas.

data data-engineering database dataengineering docker minio scrapping spark

Last synced: 01 Jun 2026

https://github.com/fritzrehde/asciibar

A cli tool to print percentages as ascii bar charts

cli data percentage visualization

Last synced: 31 Oct 2025

https://github.com/ttitcombe/timekeep

Defensive timeseries analysis in python

data data-science sklearn time-series time-series-analysis timeseries

Last synced: 05 Jan 2026

https://github.com/jtpio/data-playground

Experiments using public APIs and data

data experiments python

Last synced: 28 Apr 2026

https://github.com/metriccoders/metriccoders_datasets

This is the Metric Coders repository containing all the datasets for machine learning.

data datasets machine-learning natural-language-processing scikit-learn

Last synced: 08 Apr 2025

https://github.com/mikeintoshsystems/dhis2heat

A Comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, optimize, calculate, clean and visualize inequality data.

analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization

Last synced: 28 Apr 2026

https://github.com/saulojoab/crato-ce-json

Nesse repositório irei armazenar todos os bairros (e mais informações, no futuro) de Crato-CE em JSON.

data database geolocation json json-api localization

Last synced: 28 Apr 2026

https://github.com/ahmetcansolak/developer-insights

New project of ClubRockers from Sarıyer Hills

bitbucket data data-science data-visualization github python3

Last synced: 28 Apr 2026

https://github.com/bijx/firestore-data-fetcher

A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.

automation data database downloader exporter fetcher firebase firestore open-source script

Last synced: 12 Apr 2026

https://github.com/gmersy/data-carbon

Repository accompanying the paper: Toward a Life Cycle Assessment for the Carbon Footprint of Data

carbon-emissions carbon-footprint climate-change data data-science sustainability sustainable-software

Last synced: 31 Mar 2025

https://github.com/ahmad-ali-rafique/comment-generation-tool

This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.

ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning

Last synced: 07 Oct 2025

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1969-1988

US birth data from 1969 to 1988, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 19 Apr 2025

https://github.com/antononcube/raku-data-cryptocurrencies

Raku package of cryptocurrency data retrieval.

crypto cryptocurrency data

Last synced: 02 Apr 2025

https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis

This project aims to explore the Walmart Sales data to understand top performing branches and products, sales trend of of different products, customer behaviour. The aims is to study how sales strategies can be improved and optimized. The dataset was obtained from the Kaggle

data database mysql sql walmart

Last synced: 24 Feb 2026

https://github.com/rayenfathallah/students_analysis

This projects contains an analysis of the different fadtors affecting students performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.

analysis d3 data education javascript python students

Last synced: 12 Apr 2026

https://github.com/nesterenko-kv/object-id

ObjectIDs are a special type of identifier mainly used in MongoDB to uniquely identify documents within a collection. They consist of a 12-byte binary value that includes a timestamp, a machine identifier, a process identifier, and a counter.

c-sharp data id net object-id unique-identifier

Last synced: 16 May 2025

https://github.com/so-cool/uobrain

My solution to the University of Bristol PURE Data Challenge

competition data modeling

Last synced: 09 Sep 2025

https://github.com/jrmedd/emojinal

An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.

data emojis sentiment user-research ux

Last synced: 19 Jan 2026

https://github.com/dataspoclab/dataspoc-lens

Virtual warehouse — SQL + Jupyter + AI over cloud Parquet via DuckDB

cli data data-engineering data-lake duckdb etl parquet python singer sql

Last synced: 20 Apr 2026

https://github.com/gniquyij/tuqiu

Dark side of OS X

data notes osx safari

Last synced: 12 Oct 2025

https://github.com/trstringer/pywave2

:ocean: Get swell buoy data

data ocean python

Last synced: 31 Mar 2025

https://github.com/ktbarrett/scdil

simple configuration and data interchange language

configuration data json python yaml

Last synced: 20 Apr 2026

https://github.com/sinedied/htf-data

CLI tool to process Hadra Trance Festival database export into valid data for the app

cleaner cli data database hadra tool

Last synced: 20 Apr 2026

https://github.com/tushar2704/insurance-cross-sell

This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.

data datascience h20automl machine-learning mlflow python streamlit-tushar2704

Last synced: 08 Oct 2025

https://github.com/tatey/list_of_countries

A list of countries, states, and cities in Ruby

cities countries data ruby states

Last synced: 11 Nov 2025

https://github.com/gkapfham/ast2016-paper

Source Code of and Supporting Files for a Paper Published at AST 2016

data latex-document paper research

Last synced: 19 Oct 2025

https://github.com/mishra-krishna/analysis-and-optimization-of-supply-chain-operations

Analyzed supply chain data to identify trends and key factors. Visualized sales, defect rates, lead times, and costs. Used Decision Tree Regressor to find top features impacting product costs and lead times.

data dataanalytics datavisualization supplychain supplychainanalytics

Last synced: 20 Apr 2026

https://github.com/nikoshet/rust-dms-cdc-operator

The rust-dms-cdc-operator is a Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios.

aws cdc data dms parquet pgdatadiff polars postgres rds rust s3 validation

Last synced: 18 Jan 2026

https://github.com/vapourismo/binary-io

Read and write values of types that implement Binary from and to Handles

data haskell haskell-library io parsing

Last synced: 28 Mar 2025

https://github.com/cicerotcv/br-gen

A browser extension for generating Brazilian placeholder data.

chrome data extension generation hacktoberfest

Last synced: 21 Apr 2026

https://github.com/benmaier/boarding_school_sir

Fit SIR dynamics to the prevalence curve of an H1N1 outbreak of a British boarding school in 1978.

boarding data disease epidemiology modeling school spreading

Last synced: 31 Mar 2025

https://github.com/CheeseWithSauce/HadithsJSONFormat

Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.

api arabic data dev free hadith islam islamic muslim open-source quran sunnah

Last synced: 24 Feb 2026

https://github.com/athul64/powerbi

Financial Reports Dashboard This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.

data data-an data-visualization dax dax-expression powerbi

Last synced: 23 Feb 2026

https://github.com/ariqf1/learn_data

Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.

data data-engineering mysql

Last synced: 15 Apr 2026

https://github.com/desktopcleaner/naturemagazinescraper

Scrapes open-access Nature magazine articles and store as txt files.

data nature-magazine python scrapper word-frequency

Last synced: 06 Feb 2026

https://github.com/diegoperea20/own_dataset_segmentation_yolov8

Segmentacion y detection de objetos con propio dataset usando YOLOV8 , en el que se utiliza un dataset propio de una moneda de 200 pesos colombianos del año 2023.

coins colombia data opencv own python segmentation tensorflow yolov8

Last synced: 12 Apr 2026

https://github.com/aymane-maghouti/mobile-data-hive-insights

This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,

apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi

Last synced: 09 Mar 2026

https://github.com/iguptashubham/walmart-eda

Imagine diving into the fascinating world of Walmart with just a few lines of code! This project lets you do that using MySQL, a powerful tool for data analysts. You can clean up messy data like a detective, uncovering hidden patterns and trends. Data scientists can take it further,.

analysis data dataset eda mysql portfolio-project python sql

Last synced: 10 Apr 2026

https://github.com/jayantur13/data-bharat

Get states their capital and districts,UTS and other useful information

data js node npmjs package yarn

Last synced: 28 Jan 2026

https://github.com/jessielw/parse-fel-master-data

Simple CLI to parse Dolby Vision master data via the RPU/MediaInfo and output data needed for x265

data dolby fel master mediainfo mi parse rpu vision

Last synced: 26 Aug 2025

https://github.com/stdlib-js/ndarray-base-output-policy-str2enum

Return the enumeration constant associated with an output ndarray data type policy string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils

Last synced: 15 Apr 2026

https://github.com/spiceai/datasets

Spice AI curated dataset definitions for Spice.ai

ai bitcoin blockchain data ethereum polygon

Last synced: 20 Apr 2026

https://github.com/stdlib-js/array-base-to-deduped

Copy elements to a new generic array after removing consecutive duplicated values.

array compress copy data dedupe deduplicate deduplication duplicate generic javascript node node-js nodejs stdlib structure types uniq unique

Last synced: 14 Jun 2025

https://github.com/rafaelfloressouza/Covid-19-Dashboard

Python web application to display COVID19 data from the world using Plotly and Dash

bootstrap covid-19 css data datavisualization plotly-dash python3

Last synced: 10 Mar 2025

https://github.com/ahmadjamil888/facial-recognition-ai-model

A facial recognition AI model powered by CNN , and trained by thousands of images.

ai cnn data data-science facial facial-recognition recognition

Last synced: 30 Jun 2025

https://github.com/stdlib-js/ndarray-base-empty

Create an uninitialized ndarray having a specified shape and data type.

base data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 19 Feb 2026

https://github.com/SAP-archive/signavio-qualtrics-di

Setup an SAP Data Intelligence data pipeline to connect Qualtrics surveys data to SAP Signavio Process Intelligence via Ingestion API.

data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio

Last synced: 09 May 2025