An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/minightdev/paperclip

Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.

beaches data database pwn pwned search-engine

Last synced: 22 Mar 2025

https://github.com/fiedsch/datamanagement

Data management helpers (PHP-CLI)

csv-data data datamanagement helper php

Last synced: 05 Apr 2025

https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset

BBM467*SDSP - Small Data Science Project - Things to consider in cross-validation and resampling when dealing with imbalanced data: what is the right way?

bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote

Last synced: 21 Jun 2025

https://github.com/hariprashad-ravikumar/ai-datascience-lab

AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.

ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn

Last synced: 02 Aug 2025

https://github.com/moumouls/data-toulouse-grapql

Partial GraphQL support for the Toulouse Métropole Open Data service

api data graphql open service toulouse

Last synced: 16 May 2026

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.🔥

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/polina-prokofieva/viewjson

A class for convenient visualization of JSON, with some configurable settings.

data data-visualization es5 es6 javascript json

Last synced: 15 May 2026

https://github.com/woctezuma/download-steam-banners-data

Data consisting of Steam banners.

data steam steam-api

Last synced: 06 Jan 2026

https://github.com/longzheng/southeastwater-usage-scraper

Extract hourly water usage data for digital water meters from the South East Water portal website

australia data iot playwright southeastwater victoria water

Last synced: 06 Feb 2026

https://github.com/sharmadhiraj/free-json-datasets

Collection of free JSON data that are scraped and parsed from different websites.

collection crawler data data-scraping datasets json sports statistics web-scraping

Last synced: 28 Mar 2025

https://github.com/alexandregazagnes/scikit-res

A very basic package to store the results of ML models. Grid search results are hard to exploit; this package aims to store them in a more convenient way.

data machine-learning mlops mlops-workflow results scikit-learn

Last synced: 20 Jan 2026

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/kodie/migrate-acf-field-data-to-repeater

A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater

acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin

Last synced: 19 May 2026

https://github.com/heikomuller/histore

Library for maintaining snapshots of evolving tabular data sets

data version-control

Last synced: 10 Apr 2025

https://github.com/philhawksworth/netlify-plugin-trello-lists

A plugin that fetches the JSON data of a public Trello board and stashes the data for each list in a JSON file before your build runs, making the data available to your static site generator at build time.

api data eleventy netlify plugin trello

Last synced: 20 Jan 2026

https://github.com/lemmotresto/migrational

A data migration library

data java migration versioning

Last synced: 30 Oct 2025

https://github.com/amethyst-php/address

The place where a person or organization can be found or communicated with. Contains fields such as street, postal code, city, and country. Can be used, for example, as a shipment address or an invoice address.

address amethyst amethyst-package api data laravel

Last synced: 13 Aug 2025

https://github.com/robjg/dido

Data In/Data Out in many formats

csv-parser data etl java json-parser

Last synced: 11 Jan 2026

https://github.com/ubc-library-rc/data-manipulation-dplyr

Workshop about data manipulation using the dplyr R package

data featured workshop

Last synced: 01 Jul 2026

https://github.com/writetome51/big-dataset-paginator

A TypeScript/JavaScript class for pagination in a real-world web app.

app data javascript pagination paginator typescript

Last synced: 17 May 2026

https://github.com/max-tonny8/android_web3

An Android library for calling data from a node on the Ethereum or Solana chain

android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j

Last synced: 27 Mar 2025

https://github.com/exaluc/webhookcatcher

Catch your webhooks like a dream

api catcher data webhook webhook-callbacks webhooks-catcher

Last synced: 14 Apr 2025

https://github.com/ange007/jquery.mydata

jQuery.myData - Small jQuery & Zepto plugin for two-way data binding.

data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs

Last synced: 19 May 2026

https://github.com/felixklauke/atomizer

Playing around with Butter Knife, Android bindings, and RxJava.

binding butterknife data java react rx rxjava

Last synced: 15 May 2026

https://github.com/Ekey/ER.DATA.Tool

Tool for extracting data archives from the mobile game Earth Revival (Project Arrival)

data earth-revival idx project-arrival

Last synced: 19 May 2026

https://github.com/danlsn/causality

A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.

data data-engineering datawarehousing personal-data quantified-self

Last synced: 06 Mar 2026

https://github.com/satyam4229/college-predictor-system

The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.

data data-science jupyter-notebook kaggle prediction python

Last synced: 06 Apr 2026

https://github.com/shysolocup/aepl

A Node.js multi-layered class creation package with built-in parenting systems that let you get info from the classes above, better function and property makers for development that is easier to read and understand, and modding support inspired by Roblox's Studio API.

aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package

Last synced: 28 Oct 2025

https://github.com/asirihewage/simplest-xpath-web-scraper

Simplest web scraper created using Python3 and MongoDB

data data-mining python3 scraper web webscrping

Last synced: 29 Jan 2026

https://github.com/espoirmur/balobi_nini

An end-to-end data science project in which I used Tweepy and Airflow to collect tweets related to the DRC, and topic modeling techniques to discover which topics Congolese people are talking about on Twitter.

data nlp nlp-machine-learning

Last synced: 24 Aug 2025

https://github.com/quin1sue/priceguidesph-bettergov

An economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026

https://github.com/blakedrumm/scvmm-scripts-and-sql

The scripts provided here are compatible with System Center Virtual Machine Manager

collector data powershell scripts scvmm sql

Last synced: 11 May 2025

https://github.com/priyanka7411/customer-segmentation-churn-dashboard

📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.

churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization

Last synced: 14 Apr 2026

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/purarue/bleanser

my bleanser modules

data

Last synced: 22 Feb 2026

https://github.com/debdutto/algorhythm

Algorithmic music driven by data and/or algorithms

algorithm data music nodejs

Last synced: 18 Apr 2026

https://github.com/srijanshetty/amfitools

Tools to get the open NAV for any MF in India

amfi cli data funds india investing mutual nav

Last synced: 04 Oct 2025

https://github.com/woo071002/parcel-management-system

A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.

cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml

Last synced: 08 Oct 2025

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/datafold/vhol-demo

Get hands-on examples of dbt + Datafold CI/CD workflows

data data-engineering datafold dbt diff

Last synced: 28 Dec 2025

https://github.com/georgetdn/syscppcp

Store C++ class data in a file (persistence) and manipulate it programmatically or using Small SQL (included)

class data framework object persistence serialize sql windows

Last synced: 04 Apr 2025

https://github.com/lovethebomb/data-tiles

🍜 Data Tiles is a small website that shows data.

data express javascript nextjs typescript

Last synced: 10 Apr 2026

https://github.com/yash22222/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3

Last synced: 09 Apr 2026

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/pommes-public/pommesdata

A full-featured transparent data preparation routine from raw data to POMMES model inputs

data opensource power raw-data transparent

Last synced: 07 Oct 2025

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026

https://github.com/amacd31/daily_hydromet_sample_data

This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.

data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow

Last synced: 16 Jan 2026

https://github.com/djthorpe/data

Data extraction, transformation, processing and visualisation

canvas csv data data-extraction data-transformation dom golang svg visualization

Last synced: 07 Sep 2025

https://github.com/frnt-end/weather-app-react

⚛️ React project - Fetch and toggle the display of current weather in Berlin, Paris, New York & London (tabs), using axios for the API fetch. Watch the demo 🌞 https://Frnt-End.github.io/Weather-App-React 👈

api axios axios-react background card current-weather data fetch gh-pages react reactjs tabs toggle ui usestate usestate-hook weather weather-app weather-information weatherapp

Last synced: 18 Feb 2026

https://github.com/stefen-taime/open-source-data

This repository contains structured datasets in various categories

csv data json python3 xml

Last synced: 19 Feb 2026

https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn

These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python

analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/jaldekoa/nyfedapi

A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 08 Feb 2026

https://github.com/ymougenel/referencecollector

Helps you gather, store, and share reference links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/relintai/ess_data

Godot plugin that helps to create/manage resource files.

addon data data-management godot

Last synced: 18 Aug 2025

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/jinsyin/datalink

⚡ Data integration | DataLink is a lightweight data integration framework built on top of DataX, Spark, and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/divithraju/divith-raju-openmetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction

Last synced: 20 Feb 2026

https://github.com/nononoexe/setariaviridis

🌾 Field-collected data of green foxtail

data data-science dataset rpackage

Last synced: 27 Feb 2026

https://github.com/kaos599/apollo-synthetic-data-generator

Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.

data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui

Last synced: 22 Jul 2025

https://github.com/mo-karbalaee/introduction-to-data-science-sbu

Reports and full documentation of the introduction to data science course held at SBU

data data-science python shahid-beheshti-university

Last synced: 02 Aug 2025

https://github.com/coasterfreakde/ork

Object Relational Mapping for Kotlin

data database kotlin mariadb mysql orm sql sqlite

Last synced: 29 Jul 2025

https://github.com/mmaithani/loan-approvel-ml-model-with-insights

This project approves or rejects loan applications. A public API, data insights, and predictive models for the loan prediction project are also provided.

data data-science loan-prediction-analysis machine-learning visualization

Last synced: 16 Aug 2025

https://github.com/tusharnankani/analysis-2.0

An Exhaustive WhatsApp Chat Data Analysis 2.0

analysis data data-science plots trends visualization

Last synced: 31 Mar 2025

https://github.com/leeper/mcode

Functions to merge and recode across multiple variables

data data-transformation r recode recoding

Last synced: 16 May 2025

https://github.com/clinical-genomics/housekeeper

File data orchestrator

data file orchestrator

Last synced: 15 Aug 2025

https://github.com/y0hnn/slack-file-downloader

Download files from Slack servers using an export dataset. Useful when you want to quit Slack but keep your files.

channels data export gdpr privacy slack

Last synced: 27 Apr 2026

https://github.com/Duartemartins/dados

Portuguese election results by parish (freguesia)

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/rdmpage/checklist-of-the-freshwater-snails-of-sabah

Data from "A preliminary checklist of the freshwater snails of Sabah (Malaysian Borneo) deposited in the BORNEENSIS collection, Universiti Malaysia Sabah" https://doi.org/10.3897/zookeys.673.12544

checklist data gbif google-earth kmz sabah

Last synced: 09 Mar 2026

https://github.com/dewasry/browser-base

A tool to help developers store data offline in the browser

angular borwser data indexeddb nextjs orm query react typescript vite vue

Last synced: 13 Feb 2026

https://github.com/lmantw/binarion

A simple binary format for storing JavaScript objects.

binary data decoding encoding format javascript

Last synced: 02 Sep 2025

https://github.com/yashmistry-24/ytcomment-iq

YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.

analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube

Last synced: 15 Feb 2026

https://github.com/rulox/faker

A Go library to create Fake Data for your projects

data dummy dummy-data fake fake-data faker go golang

Last synced: 28 May 2026

https://github.com/rn0x/aliexpress_product_data

Extract product data from the AliExpress website

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/windwalker-io/data

[READ ONLY] A library containing data/collection objects that follow the null-object pattern.

collection collections data data-object iterator nullobject value-object

Last synced: 12 Mar 2026

https://github.com/vutran/yahoo-stocks-cli

Fetch stock data from the CLI

cli data finance stocks yahoo

Last synced: 08 Jun 2026

https://github.com/ssiarhei115/customer-classification

Developing an ML model that predicts a bank customer's inclination to open a deposit

big-data big-data-analytics data data-science data-visualization mashine-learning

Last synced: 09 Apr 2025

https://github.com/liamross/use-data

A React hook for async data fetching, data manipulation, and take-latest vs. take-every functionality.

async data hook hooks react

Last synced: 22 Jan 2026

https://github.com/deepwaterpaladin/statscanpy

Basic package for querying & downloading StatsCan data by table name.

api data

Last synced: 16 Jan 2026

https://github.com/jazeee/dexcom-android-wall-panel

Display data as a graph on Android (jazeee data plotter)

android data jazeee plotter

Last synced: 02 May 2026

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/bradlindblad/quotableoffice

Repo for the quotable office R Shiny app

data datascience golem-apps r shiny shiny-apps text text-mining

Last synced: 26 May 2026