An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/eosdis-nasa/earthdata-pub-dashboard

Front-end Dashboard for Earthdata Pub

data earthdata edpub publication

Last synced: 15 Jan 2026

https://github.com/nikoshet/exploratory-data-analysis-using-r

Exploratory Data Analysis using R: course project for the M.Sc. 'Data Science and Machine Learning' at NTUA

data data-analysis data-science eda exploratory-data-analysis ggplot2 r

Last synced: 14 May 2026

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/kom-senapati/ghw-data-hacks

🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...

data ghw

Last synced: 12 Mar 2025

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.🔥

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/dmnsgn/airports-data

Airports data: static, dynamic and custom dump.

airport airports cli data database json

Last synced: 30 Apr 2025

https://github.com/longzheng/southeastwater-usage-scraper

Extract hourly water usage data from the South East Water portal website for digital water meters

australia data iot playwright southeastwater victoria water

Last synced: 06 Feb 2026

https://github.com/alexandregazagnes/scikit-res

Very basic package to store results of ML models. Grid search results are hard to exploit; this package aims to store them in a more convenient way.

data machine-learning mlops mlops-workflow results scikit-learn

Last synced: 20 Jan 2026

https://github.com/pirhoo/spytula

A Python library that provides a simple and convenient way to build JSON and YAML data structures using a builder pattern.

data dsl json python yaml

Last synced: 13 Apr 2026

https://github.com/tuananh/opentravel

✈ A collection of travel related data

data travel

Last synced: 09 Oct 2025

https://github.com/unaygney/js-challenges-data-structures-and-algorithms

Repo of the challenges I'm trying to solve to understand data structures and algorithms.

algorithms-and-data-structures data javascript structure

Last synced: 29 Oct 2025

https://github.com/imadsaddik/bodmaghdataset

BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language

arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft

Last synced: 03 Apr 2025

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/kodie/migrate-acf-field-data-to-repeater

A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater

acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin

Last synced: 19 May 2026

https://github.com/joisino/twinpaper

Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)

causal-inference data research science-of-science

Last synced: 21 Mar 2025

https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network

Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 03 Mar 2025

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 06 Nov 2025

https://github.com/philhawksworth/netlify-plugin-trello-lists

A plugin to fetch the JSON data of a public Trello board and stash the data for each list in a JSON file before your build runs, making the data available to your static site generator at build time.

api data eleventy netlify plugin trello

Last synced: 20 Jan 2026

https://github.com/synthead/timex-datalink-toebes-tutorials

Toebes' WristApp tutorial sources for the Timex Datalink

6800 6805 app assembly crt data data-link datalink link timex watch wrist wristapp

Last synced: 08 Apr 2025

https://github.com/quetz-al/quetzal-client

Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 28 Jul 2025

https://github.com/kocyigitkim/realtime.io

Real time data streaming & socket programming library

data realtime socket streaming

Last synced: 29 Jul 2025

https://github.com/windwalker-io/data

[READ ONLY] A library containing data/collection objects with the null-object pattern.

collection collections data data-object iterator nullobject value-object

Last synced: 12 Mar 2026

https://github.com/coasterfreakde/ork

Object Relational Mapping for Kotlin

data database kotlin mariadb mysql orm sql sqlite

Last synced: 29 Jul 2025

https://github.com/hmeleiro/opencis

R package to import data from the Spanish Sociological Research Center (CIS)

abiertos api centro cis data datos estudios open r sociologicos

Last synced: 31 Jul 2025

https://github.com/simoneas02/data-science

🐍 A study plan to become a data scientist and to improve my current skills. 🤘🏼🌻

data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql

Last synced: 12 Apr 2026

https://github.com/mo-karbalaee/introduction-to-data-science-sbu

Reports and full documentation of the introduction to data science course held at SBU

data data-science python shahid-beheshti-university

Last synced: 02 Aug 2025

https://github.com/ctechhindi/auto-fill-form-data

Auto-fill and autocomplete user data with key name

autocomplete chrome-extension data extension

Last synced: 17 Apr 2026

https://github.com/steelcake/cherry-pipelines

A collection of pipelines built with cherry

blockchain clickhouse data pipeline pyhton

Last synced: 09 Mar 2026

https://github.com/yash22222/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3

Last synced: 09 Apr 2026

https://github.com/rn0x/aliexpress_product_data

Extract product data from the AliExpress website

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/divithraju/divith-raju-openmetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction

Last synced: 20 Feb 2026

https://github.com/lovethebomb/data-tiles

🍜 Data Tiles is a small website that shows data.

data express javascript nextjs typescript

Last synced: 10 Apr 2026

https://github.com/debdutto/algorhythm

Algorithmic music driven by data and/or algorithms

algorithm data music nodejs

Last synced: 18 Apr 2026

https://github.com/purarue/bleanser

my bleanser modules

data

Last synced: 22 Feb 2026

https://github.com/espoirmur/balobi_nini

An end-to-end data science project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling techniques to discover which topics Congolese people are talking about on Twitter.

data nlp nlp-machine-learning

Last synced: 24 Aug 2025

https://github.com/sectly/suchdb

SuchDB: A simple, dependency-free, require & go Node.js and browser database

browser data database db json node node-js nodejs npm npm-package

Last synced: 09 Mar 2026

https://github.com/macsual/dotgov-jamaica-domains

A listing of .gov.jm domains.

data domains

Last synced: 03 Jan 2026

https://github.com/fabriziosalmi/phpapi

A public, free and simple interface to get data.

api data php public service

Last synced: 07 Apr 2025

https://github.com/felipesousa/usa-cities-api

A JSON with all USA cities/states.

cities data json nodejs usa

Last synced: 03 May 2026

https://github.com/muhammadibrahim313/start-your-data-science-journey

In this repo I will be sharing all the resources that we will be learning during the December Data Science Workshops on iCode Guru

btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python

Last synced: 03 Feb 2026

https://github.com/slipke/eurlex-model-go

This project implements the EUR-Lex XML data model in Golang. For more information see README.md

data datamodel eur-lex eurlex webservice

Last synced: 09 Mar 2026

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 15 May 2025

https://github.com/subnwa/sql

Starter code for creating a SQL database.

base data database master microsoft sql

Last synced: 06 Mar 2025

https://github.com/mlr-org/mlr3data

Data sets used in the book, gallery, or in examples of mlr3.

data data-science data-sets machine-learning mlr3 r r-package

Last synced: 09 Apr 2025

https://github.com/bradlindblad/quotableoffice

Repo for the quotable office R Shiny app

data datascience golem-apps r shiny shiny-apps text text-mining

Last synced: 26 May 2026

https://github.com/rikvdh/zabuffer

Zero-Allocation buffer handling in C

buffer c clib data embedded memory string zero-allocation

Last synced: 03 Mar 2025

https://github.com/guslovesmath/top_tech_sp_500_forecasting

Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P 500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's percent change.

arima-forecasting arima-model data data-science forecasting vector-autoregression

Last synced: 14 Mar 2025

https://github.com/hasnocool/war_thunder_camouflage_scraper

A concurrent web scraper designed to collect camouflage information from War Thunder aircraft.

asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web

Last synced: 04 Jan 2026

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

A Python project I made to improve my Python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/andrew-johnson-4/misspeller

Take correctly spelled words and return common spelling mistakes

common-mistakes data language natural nlp processing rust

Last synced: 30 Apr 2025

https://github.com/kylekirkby/cardatasnatch

CarDataSnatch allows you to quickly find information about a car in the UK using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.

beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering

Last synced: 15 Apr 2025

https://github.com/karashiiro/lodestone-id-time

Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.

data ffxiv ffxiv-character lodestone

Last synced: 30 Jun 2025

https://github.com/imtiaz-emu/exploratory-data-analysis-with-r

Data transformation, descriptive statistics, data visualization, and linear regression using R

data dplyr ggplot2 r rstudio visualization

Last synced: 15 Mar 2025

https://github.com/jimut123/scrapers

All Scrapers that I'll build

bs4 data python3 real-time-visualisations scrapers scrapy wget

Last synced: 16 Jan 2026

https://github.com/keosariel/nairagazer-clustered-news

Provides clustered news data, specifically Nigerian news. In hindsight, this repo contains Nigerian news and its coverage. Data is from Nairagazer.

ai data data-science news nigeria nigerian-data python

Last synced: 30 Aug 2025

https://github.com/ryanmorr/typed

Statically typed properties for object literals

data javascript object properties statically-typed

Last synced: 12 Jun 2026

https://github.com/erwan-simon/aws-data-platform-framework

A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.

aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module

Last synced: 23 May 2026

https://github.com/wamphlett/input-collection

A smarter and stricter way to capture and validate request data

data dto forms php validation

Last synced: 27 May 2026

https://github.com/georgetdn/syscppcp

Store C++ class data in a file (persistence) and manipulate it programmatically or using Small SQL (included)

class data framework object persistence serialize sql windows

Last synced: 04 Apr 2025

https://github.com/amacd31/daily_hydromet_sample_data

This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.

data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow

Last synced: 16 Jan 2026

https://github.com/lukekim/demo

Luke's Spice.ai demo app

ai data web3

Last synced: 18 Jan 2026

https://github.com/Duartemartins/dados

Portuguese election results by parish (freguesia)

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/lmantw/binarion

A simple binary format for storing JavaScript objects.

binary data decoding encoding format javascript

Last synced: 02 Sep 2025

https://github.com/sdhutchins/jxn-open-data-api

Access Jackson, MS open government data using a Python API wrapper.

api data jackson jxn mississippi open-gov

Last synced: 08 Apr 2025

https://github.com/tusharnankani/analysis-2.0

An Exhaustive WhatsApp Chat Data Analysis 2.0

analysis data data-science plots trends visualization

Last synced: 31 Mar 2025

https://github.com/jinsyin/datalink

⚡ Data integration | DataLink is a lightweight data integration framework built on top of DataX, Spark and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/ymougenel/referencecollector

Helps you gather, store and share reference links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/datafold/vhol-demo

Get hands-on examples of dbt + Datafold CI/CD workflows

data data-engineering datafold dbt diff

Last synced: 28 Dec 2025

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/jackallabs/canine-oracle

The Oracle Daemon for the Jackal Blockchain

blockchain cosmos data feed jackal oracle stream

Last synced: 06 Feb 2026

https://github.com/praveenpuglia/css-support

The source of truth for CSS browser support info

api browser compatibility css data properties selectors support

Last synced: 31 Mar 2025

https://github.com/zoo-js/zoo-data

🍩 The data for zoo-js.

actions data js json nodejs workflow

Last synced: 22 Apr 2025

https://github.com/stdlib-js/ndarray-base-dtype-resolve-enum

Return the enumeration constant associated with a supported ndarray data type value.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 13 Apr 2025

https://github.com/ingmarboeschen/jatsdecoderevaluation

Evaluation data and code

data evaluation jatsdecoder

Last synced: 04 Feb 2026

https://github.com/qit-tools/unicode-emoji-json-lite

This library provides a lightweight version of the unicode-emoji-json library.

data emoji emojipedia emojis json lite unicode

Last synced: 07 Jan 2026

https://github.com/cosmos-loops/cosmos-efcore

Cosmos.EntityFrameworkCore is a part of Cosmos.Data, an inline project of the COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.

cosmos-loops data efcore entityframeworkcore

Last synced: 14 Aug 2025

https://github.com/sanand0/texas-deathrow

Texas death row inmate data

data

Last synced: 04 Sep 2025

https://github.com/OliverHennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 30 Jul 2025

https://github.com/Nazaniiin/EDA_QualityofRedWine

🍷 📈 (EDA) R - Visualization / Performed exploratory analysis and visualization on the Red Wine Quality dataset, mainly answering which chemical properties influence the quality of red wines.

charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization

Last synced: 30 Jul 2025