An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/minightdev/paperclip

Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.

beaches data database pwn pwned search-engine

Last synced: 22 Mar 2025

https://github.com/itzshoaib/hashtegrity

A library for generating hash, validating data integrity, monitoring file/directory integrity, offchain data integrity

crypto-hash data data-integrity hacktoberfest hash integrity

Last synced: 07 Mar 2026

https://github.com/ournet/news-sources

A repository of news sources for every country

data news news-sources sources

Last synced: 11 Jul 2025

https://github.com/gabrielu3/ori-inverted-list

Inverted List made for a college discipline named Organization and Retrieval of Information

c data data-structures index inverted-index list

Last synced: 24 Feb 2026

https://github.com/alexandregazagnes/scikit-res

Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.

data machine-learning mlops mlops-workflow results scikit-learn

Last synced: 20 Jan 2026

https://github.com/heliomarpm/keyvalues-storage

A simple and robust KeyValues Storage's library

app-library config data db json json-db keyvalues localdb settings storage store

Last synced: 12 Apr 2025

https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu

Reports and full documentation of the introduction to data science course held at SBU

data data-science python shahid-beheshti-university

Last synced: 27 Mar 2025

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/rrighart/rrighart.github.io

A webpage about data science, programming, statistics and related topics

analyses data data-mining programming statistics

Last synced: 20 Jan 2026

https://github.com/aaronmeder/social-history

A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.

archives data facebook instagram statistics stats

Last synced: 27 Mar 2025

https://github.com/kodie/migrate-acf-field-data-to-repeater

A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater

acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin

Last synced: 19 May 2026

https://github.com/djthorpe/data

Data extraction, transformation, processing and visualisation

canvas csv data data-extraction data-transformation dom golang svg visualization

Last synced: 07 Sep 2025

https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml

Generate synthetic drilling data that can be used for testing machine learning (ML) models.

classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training

Last synced: 08 Apr 2026

https://github.com/relintai/ess_data

Godot plugin that helps to create/manage resource files.

addon data data-management godot

Last synced: 18 Aug 2025

https://github.com/reppon97/cryptosnake

Simple, unofficial python wrapper for Binance API. You'll find this easy-to-use package helpful if you're interested in general market data and cryptocurrency values. You don't need to have a Binance account or API Key since you can't purchase/trade cryptocurrencies using this package.

api binance bitcoin crypto cryptocurrencies cryptocurrency cryptocurrency-exchanges cryptography data ethereum json litecoin python python3 statistics

Last synced: 05 Mar 2025

https://github.com/defano/chicago-oasis

A visualization of Chicago business accessibility by neighborhood or census tract.

census chicago data data-science javascript neighborhood

Last synced: 11 Mar 2026

https://github.com/hariprashad-ravikumar/ai-datascience-lab

AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.

ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn

Last synced: 02 Aug 2025

https://github.com/meetyildiz/pandazip

Reduce RAM footprint of Pandas DataFrame without losing information.

compression data data-mining data-science machine-learning pandas utilities

Last synced: 20 Jan 2026

https://github.com/uttamo/messenger-data-analysis

Analyse Messenger data exports from Facebook

analysis data facebook json messenger pandas parse python

Last synced: 18 Jul 2025

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 06 Nov 2025

https://github.com/pirhoo/spytula

A Python library that provides a simple and convenient way to build JSON and YAML data structures using a builder pattern.

data dsl json python yaml

Last synced: 13 Apr 2026

https://github.com/rn0x/aliexpress_product_data

استخراج بيانات المنتج من موقع علي إكسبريس

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/philhawksworth/netlify-plugin-trello-lists

A plugin to fetch the JSON data of a public Trello board, and stash the data for each list in a JSON file before your build runs making the data available to your static site generator at build time.

api data eleventy netlify plugin trello

Last synced: 20 Jan 2026

https://github.com/smithsonian/massdigi-tools

Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO

data data-validation digital-files digitization digitization-workflows museums

Last synced: 13 Apr 2025

https://github.com/dataopstix/modelt

Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.

airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization

Last synced: 28 Mar 2025

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.🔥

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/ubc-library-rc/data-manipulation-dplyr

Workshop about data manipulation using the dplyr R package

data featured workshop

Last synced: 01 Jul 2026

https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn

These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python

analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/kabeech/real-dice

Random number generation based on physical media touched by humans

data dice haskell human-computer-interaction physical random random-generation random-number-generators rng

Last synced: 10 Apr 2025

https://github.com/divithraju/divith-raju-openmetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction

Last synced: 20 Feb 2026

https://github.com/havocesp/dictexpire

Dict like with specific expire time per item.

data dict expire item items key map mapping python python3 time timeout value volatile

Last synced: 17 Jul 2025

https://github.com/null-none/jwlibrary2json

Handler for jwlibrary to json

data json jwlibrary library parser

Last synced: 14 May 2026

https://github.com/lemmotresto/migrational

A data migration library

data java migration versioning

Last synced: 30 Oct 2025

https://github.com/chickennungets/ifsc-data-analysis

An implementation of Elo-MMR ranking in the boulder and lead disciplines.

d3 data elo elo-rating ifsc visualization webscraping

Last synced: 20 Jan 2026

https://github.com/johnmackintosh/simd2016_tmap

Mapping SIMD with tmap - static & interactive

data data-science data-visualization mapping r visualisation

Last synced: 20 Mar 2025

https://github.com/d2hydro/hydrodashboards

Open Source Dashboards for hydro data

bokeh data geopandas hydrology hydrometrics pandas sheetjs

Last synced: 26 Jul 2025

https://github.com/miguelgargallo/next13-fetch-data-turbo

Next13 Fetch Data Request

ai data fetch pylar request

Last synced: 27 Jul 2025

https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network

Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 03 Mar 2025

https://github.com/instaclustr/cassandra-parquet-transformer

Transform SSTables from Apache Cassandra to Parquet or Avro files, locally or remotely via Apache Cassandra Sidecar

analytics apache apache-cassandra avro big cassandra data parquet spark sstable transformation

Last synced: 29 Aug 2025

https://github.com/randomfractals/unfolded-map-renderer

Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.

cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode

Last synced: 21 Mar 2025

https://github.com/weisscharlesj/data_scicompforchem

Zipped data for SciCompforChem book for easy download

chemistry chemistry-education data data-visualization python

Last synced: 07 Nov 2025

https://github.com/mmaithani/loan-approvel-ml-model-with-insights

This project will approved or reject the loan applications. Public api, data insights and predictive models for loan prediction project are also provided

data data-science loan-prediction-analysis machine-learning visualization

Last synced: 16 Aug 2025

https://github.com/amethyst-php/address

The place where a person or organization can be found or communicated with. Contains fields such as: street, postal code, city, country etc... Can be used for example as a shipment address or as an invoice address.

address amethyst amethyst-package api data laravel

Last synced: 13 Aug 2025

https://github.com/quetz-al/quetzal-client

Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 28 Jul 2025

https://github.com/unaygney/js-challenges-data-structures-and-algorithms

Repo of the challenges I'm trying to solve to understand data structures and algorithms..

algorithms-and-data-structures data javascript structure

Last synced: 29 Oct 2025

https://github.com/hyper63/copy

hyper copy tool

copy data hyper

Last synced: 20 Jul 2025

https://github.com/antoineaugusti/purchasing-power

Archive daily data about purchasing power parity: how much goods should cost in various countries

archive data purchasing-power-parity

Last synced: 28 Oct 2025

https://github.com/jeroenvdmeer/feyod

Open data set of matches played by Feyenoord.

data dataset feyenoord open-data sql

Last synced: 25 Jan 2026

https://github.com/kocyigitkim/realtime.io

Real time data streaming & socket programming library

data realtime socket streaming

Last synced: 29 Jul 2025

https://github.com/kom-senapati/ghw-data-hacks

🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...

data ghw

Last synced: 12 Mar 2025

https://github.com/shahiakhilesh1304/dsa

This repository is my personal space for practicing Data Structures and Algorithms. I'll be adding questions, solutions, and explanations as I progress, covering topics from basics to advanced.

algorithms code contribution contributions-and-collaboration contributions-welcome data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice hacktober hacktoberfest hacktoberfest-accepted hacktoberfest-starter practice programming python-3

Last synced: 13 Apr 2025

https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms

Contains codes related to data structures

algorithms cplusplus data data-structures

Last synced: 10 Jul 2025

https://github.com/robjg/dido

Data In/Data Out in many formats

csv-parser data etl java json-parser

Last synced: 11 Jan 2026

https://github.com/nikoshet/exploratory-data-analysis-using-r

Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA

data data-analysis data-science eda exploratory-data-analysis ggplot2 r

Last synced: 14 May 2026

https://github.com/coasterfreakde/ork

Object Relational Mapping for Kotlin

data database kotlin mariadb mysql orm sql sqlite

Last synced: 29 Jul 2025

https://github.com/seregpie/vuefort

The state management for Vue.

data model state store vue

Last synced: 13 Apr 2026

https://github.com/kale-ko/ejcl

An advanced configuration library for Java with support for local files, mysql databases, and more.

config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml

Last synced: 02 Feb 2026

https://github.com/stdlib-js/ndarray-base-dtype2c

Return the C data type associated with a provided data type string.

array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 18 Jun 2025

https://github.com/eosdis-nasa/earthdata-pub-dashboard

Front-end Dashboard for Earthdata Pub

data earthdata edpub publication

Last synced: 15 Jan 2026

https://github.com/lovethebomb/data-tiles

🍜 Data Tiles is a small website that shows data.

data express javascript nextjs typescript

Last synced: 10 Apr 2026

https://github.com/petermeissner/statsgrokse

R-API-binding to stats.grok.se server providing Wikipedia page view statistics for 2008 up to 2015

api binding data pageviews r wikipedia

Last synced: 17 May 2026

https://github.com/ahmedkhalf/arabic-keyword-scraper

Stop wasting your time! And obtain Arabic definitions without having to look it up.

arabic data definitions scraper sentences wordsearch

Last synced: 12 Mar 2025

https://github.com/hmeleiro/alquilermad

Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid

data data-science data-visualization datascience housing-location-visualization rent renting

Last synced: 13 Sep 2025

https://github.com/purarue/bleanser

my bleanser modules

data

Last synced: 22 Feb 2026

https://github.com/schbenedikt/datamining

Heise (https://heise.de) News Crawler

data data-science heise postgresql web-crawler

Last synced: 10 Apr 2025

https://github.com/anonympins/data-primals-engine

Manage and automate your data at scale 🚀 With data-primals-engine you get workflows, dashboards, alerts, i18n, client integration & AI assistant — all open-source, all MongoDB powered.

api automation data data-analysis data-engineer data-visualization database expressjs low-code mongodb nodejs rest-api

Last synced: 07 Mar 2026

https://github.com/sheweny/discord-resolve

This module groups together functions to retrieve data from different types of arguments.

data discord discord-js mentions resolver sheweny utility

Last synced: 29 Oct 2025

https://github.com/zarr-developers/cookiecutter-zarr-store

Cookiecutter for Zarr store implementations

chunked data n-dimensional zarr

Last synced: 16 Jun 2025

https://github.com/stanford-oval/medxchange

Medical Data Exchange (MedXchange) platform

data ethereum exchange medical medxchange

Last synced: 16 May 2026

https://github.com/royruddle/vizdataquality

Python package for visualizing data quality

data data-science data-visualization jupyter-notebook missing-data python

Last synced: 05 May 2025

https://github.com/marlenezw/speech-to-text

Turn any video or audio recording into a written transcript using python

data data-science python speech speech-recognition speech-synthesis speech-to-text

Last synced: 27 Apr 2026

https://github.com/dhruvldrp9/simpledht

A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.

cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication

Last synced: 28 Feb 2026

https://github.com/johntocci/nullaxe

Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.

data data-analysis data-science datacleaning pandas polars python

Last synced: 06 Apr 2026

https://github.com/ndarville/ipa

HTML hex codes for the IPA table.

data ipa linguistics

Last synced: 24 Jan 2026

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026

https://github.com/bdpedigo/neuropull

A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.

connectome connectomes connectomics data dataset networks networks-biology

Last synced: 05 Oct 2025

https://github.com/zgbjgg/quetzal-examples

Examples using Quetzal :rocket: :bird:

analytics dashboard data data-visualization elixir erlang plotly web-app

Last synced: 24 Apr 2026

https://github.com/automators-com/datamaker-js

The official Node.js / Typescript library for the DataMaker API

data javascript nodejs typescript

Last synced: 11 Oct 2025

https://github.com/luminati-io/Pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 09 Apr 2025

https://github.com/andrey-tech/data-storage-php

Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026

https://github.com/synthead/timex-datalink-toebes-tutorials

Toebes' WristApp tutorial sources for the Timex Datalink

6800 6805 app assembly crt data data-link datalink link timex watch wrist wristapp

Last synced: 08 Apr 2025

https://github.com/sanand0/imdbscrape

A weekly archive of the IMDB Top 250 results. Automatically scraped via GitHub Actions. Useful to see trends on IMDb Top 250

data

Last synced: 30 May 2026

https://github.com/feltex/datahora-java

Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.

brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime

Last synced: 18 Jun 2026