An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/caelean/twittermap

Map of twitter user's influence as defined on by influencetracker

data google-maps maps sparql twitter visualization

Last synced: 14 Jun 2025

https://github.com/karashiiro/lodestone-id-time

Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.

data ffxiv ffxiv-character lodestone

Last synced: 30 Jun 2025

https://github.com/vyahello/fake-employee-api

👨‍🔧 Simple mock employees data parser (responder + heroku + pytest + github/travis CI)

data employee employer mock responder rest-api

Last synced: 09 Jun 2026

https://github.com/ingmarboeschen/jatsdecoderevaluation

Evaluation data and code

data evaluation jatsdecoder

Last synced: 04 Feb 2026

https://github.com/sadcenter/messenger

Data messaging system between servers using popular messaging brokers

data message

Last synced: 06 Aug 2025

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/csadorf/pydata-ann-arbor-2018

Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018

data data-management jupyter signac workflow

Last synced: 04 Jun 2026

https://github.com/yakupzengin/data-structures-and-algortihms

This repo contains implementation of data structures and algorithms using JAVA

algorithms algorithms-and-data-structures data structure

Last synced: 03 Dec 2025

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/acdh-oeaw/histogis-data

Data created by HistoGIS

data histogis

Last synced: 24 Oct 2025

https://github.com/socketsupply/dynavolt

A highly opinionated DynamoDB client for aws-sdk v3 using esm.

aws data database dynamo dynamodb key-value kvstore

Last synced: 05 May 2026

https://github.com/joaocarmo/react-very-simple-data-table

When all you want is a table

data react simple table

Last synced: 06 Mar 2025

https://github.com/purarue/listenbrainz_export

Export your scrobbling history from ListenBrainz

data data-export music scrobbling

Last synced: 24 Jan 2026

https://github.com/figuran04/big-data

📃 Praktikum Big Data

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/skywarth/fenrir-wolfpack-simulator

Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.

data data-structures javascript max-heap simulation simulations wolfpack

Last synced: 14 Oct 2025

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026

https://github.com/aleklukanen/chapterhousedb-v1

Allows you to create simple data streaming warehouses written in Golang using Apache Parquet and Arrow.

arrow data database event golang ingestion parquet pipeline processing stream

Last synced: 27 Feb 2026

https://github.com/physio/flatten-ts

Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.

array conversion data flatten javascript json object typescript

Last synced: 06 May 2026

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/muhammadibrahim313/start-your-data-science-journey

In this Repo i will be Sharing all Resources that we will be Learning during December Data Science Workhops on iCode Guru

btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python

Last synced: 03 Feb 2026

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/divithraju/divith-raju-searchengine-wikipedia

search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.

algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia

Last synced: 16 May 2026

https://github.com/stdlib-js/ndarray-base-dtype-str2enum

Return the enumeration constant associated with an ndarray data type string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 15 Mar 2026

https://github.com/simoneas02/data-science

🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻

data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql

Last synced: 12 Apr 2026

https://github.com/windwalker-io/data

[READ ONLY] A library contains data/collection objects with null-object pattern.

collection collections data data-object iterator nullobject value-object

Last synced: 12 Mar 2026

https://github.com/dewasry/browser-base

A tool to help developer store data ofline on browser

angular borwser data indexeddb nextjs orm query react typescript vite vue

Last synced: 13 Feb 2026

https://github.com/liamross/use-data

A React hook for async fetching of data, data manipulation, and take latest vs take every functionality.

async data hook hooks react

Last synced: 22 Jan 2026

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/lovethebomb/data-tiles

🍜 Data Tiles is a small website that shows data.

data express javascript nextjs typescript

Last synced: 10 Apr 2026

https://github.com/financejs/discord-bot

A Discord Bot Used In Financejs Discord Server

data discord discord-bot discordjs-bot finance financejs financial

Last synced: 13 Apr 2026

https://github.com/nrennie/data

A collection of random datasets, either from web-scraping or processing more complex data.

data

Last synced: 30 May 2026

https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection

This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.

anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning

Last synced: 20 Apr 2026

https://github.com/rn0x/aliexpress_product_data

استخراج بيانات المنتج من موقع علي إكسبريس

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/purarue/bleanser

my bleanser modules

data

Last synced: 22 Feb 2026

https://github.com/macsual/dotgov-jamaica-domains

A listing of .gov.jm domains.

data domains

Last synced: 03 Jan 2026

https://github.com/slipke/eurlex-model-go

This projects implements the EUR-Lex XML data model in Golang. For more information see README.md

data datamodel eur-lex eurlex webservice

Last synced: 09 Mar 2026

https://github.com/rikvdh/zabuffer

Zero-Allocation buffer handling in C

buffer c clib data embedded memory string zero-allocation

Last synced: 03 Mar 2025

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

This Python project is made by me, Python project for improving python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/kylekirkby/cardatasnatch

CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.

beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering

Last synced: 15 Apr 2025

https://github.com/Duartemartins/dados

Resultados de Eleições Portuguesas por Freguesia

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/zoo-js/zoo-data

🍩 The data for zoo-js.

actions data js json nodejs workflow

Last synced: 22 Apr 2025

https://github.com/stdlib-js/ndarray-base-dtype-resolve-enum

Return the enumeration constant associated with a supported ndarray data type value.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 13 Apr 2025

https://github.com/stdlib-js/ndarray-base-from-scalar

Convert a scalar value to a zero-dimensional ndarray.

base convert data javascript ndarray node node-js nodejs scalar stdlib structure types wrap

Last synced: 03 Jul 2025

https://github.com/synthead/timex-datalink-toebes-tutorials

Toebes' WristApp tutorial sources for the Timex Datalink

6800 6805 app assembly crt data data-link datalink link timex watch wrist wristapp

Last synced: 08 Apr 2025

https://github.com/marcuwynu23/phaddress

Data API of Regions,Provinces, CityMunicipalities, and Barangay of the Philippines

address address-data-api api barangay city data geolocation municipalities provinces

Last synced: 14 Feb 2026

https://github.com/sanand0/texas-deathrow

Texas deathrow inmates data

data

Last synced: 04 Sep 2025

https://github.com/j1sk1ss/dateapppc.exmpl

Простое нативное приложение для Windows с демонстрацией ООП и SQL баз данных на примере приложения для знакомств.

data oop-principles parsing pgadmin4 sql wpf

Last synced: 11 Apr 2026

https://github.com/OliverHennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 30 Jul 2025

https://github.com/thyringer/cast

CLI tool for reading strings or complex data sets from CSV files to output them in other text formats.

csv-converter data data-preprocessing python python3 sql-builder

Last synced: 02 Feb 2026

https://github.com/nixhantb/data-structures-and-algorithms-in-java-

Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting

algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming

Last synced: 05 Jul 2025

https://github.com/georgetdn/syscppcplinux

Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)

class data framework linux object persistence serialize sql

Last synced: 12 Feb 2026

https://github.com/ozanarkancan/sailx

This repo contains the code for generating artificial navigational instruction following data.

data grounded-language-learning

Last synced: 08 Jan 2026

https://github.com/luminati-io/Pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 09 Apr 2025

https://github.com/jorgeatgu/wiki-cachitos

¿Cuánto aumentan las visitas a la Wikipedia cuando un artista sale en Cachitos?

cachitos data dataviz musi musica rtve

Last synced: 05 Jul 2025

https://github.com/yaoguangduan/protosync

generate go code from protobuf ,sync proto dirty data

data golang protobuf sync

Last synced: 12 Mar 2026

https://github.com/kshitij1235/boxdb

This a database managment lib made for python, which works like any Libraries and is very lite no aditional setup require but there is some procedure to create a project is very easy.

boxdb data database library python

Last synced: 14 Jan 2026

https://github.com/techbureau/zaifdata

:blue_book: Data Reader for zaif Exchange

bitcoin blockchain cryptocurrency data exchange nem token trading xem zaif

Last synced: 19 Apr 2026

https://github.com/cdcgov/nchsdata

NCHS data: public use files (PUFs) from the National Center for Health Statistics (NCHS)

data public-health r survey survey-data

Last synced: 13 Apr 2026

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026

https://github.com/kaos599/apollo-synthetic-data-generator

Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.

data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui

Last synced: 22 Jul 2025

https://github.com/nrennie/londonmarathon

R package containing data relating to London Marathon.

data r r-package

Last synced: 02 Apr 2025

https://github.com/wonderium/browser-feature-compatibility

This repository contains browser support details for HTML, CSS, JS and SVG features.

browsers compatability css data html js json releases support svg wonderium

Last synced: 27 Jan 2026

https://github.com/priyanka7411/customer-segmentation-churn-dashboard

📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.

churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization

Last synced: 14 Apr 2026

https://github.com/ium101/files-and-folders-lister-z

Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.

application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows

Last synced: 09 Oct 2025

https://github.com/antononcube/raku-data-importers

Various data importing routines with a unified interface (data-import, slurp).

data data-ingestion raku rakulang slurp

Last synced: 23 Feb 2026

https://github.com/utrechtuniversity/dataprivacyproject

This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.

data gdpr open-science privacy rdm research research-data-management utrecht-university

Last synced: 10 Oct 2025

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003

US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 12 Oct 2025

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/efark/data-receiver

Small web server to receive data through HTTP requests.

data gin-gonic go golang http

Last synced: 23 Feb 2026

https://github.com/yorkulibraries/vendorpol

URLs for vendor privacy policies and terms of use.

data libraries privacy-policy

Last synced: 15 Oct 2025

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/audeering/emodb

Publishes Berlin Database of Emotional Speech with audb

audb data emotion

Last synced: 19 Oct 2025

https://github.com/p32929/use-megamind

A simple react hook for managing asynchronous function calls with ease on the client side

async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript

Last synced: 23 Jan 2026

https://github.com/olegegoism/datagenerator

Django web application for managing database connections and generating test data.

app application big-data csv data database dataset db django fake generator schema teable work

Last synced: 26 Oct 2025

https://github.com/mrnazu/eth-data-library

eth-data-library is a Nodejs library that provides tools for accessing and processing data on the Ethereum blockchain.

blockchain data ethereum nodejs smart-contracts web3

Last synced: 28 Jan 2026

https://github.com/banbord/data-vis-tornados

This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.

d3-visualization d3js data javascript json visualization

Last synced: 24 Feb 2026

https://github.com/wonderium/browser-releases

This repository contains release dates for browser versions.

browsers data json releases wonderium

Last synced: 31 Jan 2026

https://github.com/countervolts/apple-music-stats-calculator

how to get your most streamed songs/artists

apple apple-music applemusic calculator data

Last synced: 11 Feb 2026

https://github.com/axa-ch/health-insurance-data

Swiss health insurance data

axa data health insurance swiss

Last synced: 19 Mar 2026