An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/yanpitangui/iteminfoconverter

Application that converts Ragnarok legacy data files to iteminfo.lua

data itemdbconf iteminfo luafiles ragnarok

Last synced: 12 Oct 2025

https://github.com/p32929/use-megamind

A simple React hook for managing asynchronous function calls with ease on the client side

async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript

Last synced: 23 Jan 2026

https://github.com/stefen-taime/real-time-data-pipeline-snake-game

Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API

chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis

Last synced: 04 May 2026

https://github.com/dhimmel/het.io-rep-data

Data from Project Rephetio for the het.io website

browser data datatables drug-repurposing rephetio

Last synced: 07 Feb 2026

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/efark/data-receiver

Small web server to receive data through HTTP requests.

data gin-gonic go golang http

Last synced: 23 Feb 2026

https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection

This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.

anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning

Last synced: 20 Apr 2026

https://github.com/tomwhite/chernoff

A visual mood indicator. One of the first Java programs I ever wrote.

chernoff-faces data visualization

Last synced: 20 Apr 2026

https://github.com/kefniark/kaaya

JS Library for State management and Data synchronization between Applications

data game kaaya mutation network serialization state-management

Last synced: 06 Jun 2026

https://github.com/secret-guest/file_organizer

Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.

c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory

Last synced: 10 Jun 2026

https://github.com/stdlib-js/ndarray-base-char2dtype

Return the data type string associated with a provided single letter abbreviation.

abbr abbreviation array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 12 Mar 2026

https://github.com/andrey-tech/data-storage-php

A simple key-value data store in JSON files, with shared locking for reads and exclusive locking for writes.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026
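
The locking scheme in the description above (shared lock for reads, exclusive lock for writes) can be sketched roughly as follows. This is a minimal Python illustration, not the PHP library's API: the function names `read_store` and `write_key` are hypothetical, and it assumes a Unix-like system where `fcntl.flock` is available.

```python
import fcntl
import json
import os


def read_store(path):
    """Return the whole mapping, holding a shared lock while reading."""
    with open(path, "r", encoding="utf-8") as fh:
        fcntl.flock(fh, fcntl.LOCK_SH)  # several readers may hold this at once
        try:
            return json.load(fh)
        finally:
            fcntl.flock(fh, fcntl.LOCK_UN)


def write_key(path, key, value):
    """Set one key, holding an exclusive lock for the read-modify-write cycle."""
    # Open read/write and create the file if missing, without truncating it.
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o644)
    with os.fdopen(fd, "r+", encoding="utf-8") as fh:
        fcntl.flock(fh, fcntl.LOCK_EX)  # waits until no other lock is held
        try:
            raw = fh.read()
            data = json.loads(raw) if raw.strip() else {}
            data[key] = value
            fh.seek(0)
            fh.truncate()               # discard the old contents
            json.dump(data, fh)
            fh.flush()
        finally:
            fcntl.flock(fh, fcntl.LOCK_UN)
```

Holding the exclusive lock across both the read and the write is the point of the design: a second writer cannot slip in between them and have its update overwritten.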

https://github.com/socketsupply/dynavolt

A highly opinionated DynamoDB client for aws-sdk v3 using esm.

aws data database dynamo dynamodb key-value kvstore

Last synced: 05 May 2026

https://github.com/quin1sue/priceguidesph-bettergov

An economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026

https://github.com/figuran04/big-data

📃 Big Data Practicum

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/woo071002/parcel-management-system

A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.

cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml

Last synced: 08 Oct 2025

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/joelllllll/up-sync

Sync account and transaction data from up bank to your local environment

accounts bank data postgres sync transactions up upbank

Last synced: 06 Jul 2025

https://github.com/arcticsnow/climatepy

Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)

climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray

Last synced: 05 Feb 2026

https://github.com/saleh0987/mohamed_saleh

That's my personal website where I show my skills and projects.

aos-animation axios boot data json nextjs portfolio portfolio-website projects react-icons reactjs sass swiper

Last synced: 09 Mar 2026

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/yashmistry-24/ytcomment-iq

YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.

analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube

Last synced: 15 Feb 2026

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

We aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/zig-utils/zig-faker

A high-performance, lightweight fake data generator. Generate realistic fake data for testing, prototyping, and development.

data faker library mocker zig

Last synced: 01 Apr 2026

https://github.com/camara94/data-visualization-with-python

Data visualization and some of the best practices when creating plots and visuals. The history and architecture of Matplotlib, and how to do basic plotting with Matplotlib. Generating different visualization tools using Matplotlib such as line plots, area plots, histograms, bar charts, box plots, and pie charts. Seaborn, another data visualization library in Python, and how to use it to create attractive statistical graphics. Folium, and how to use it to create maps and visualize geospatial data.

data data-science data-structures data-visualization python3

Last synced: 16 May 2026

https://github.com/bkamapantula/india-pc-nfhs4

Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.

data government health india nfhs nfhs4

Last synced: 19 Mar 2026

https://github.com/farovictor/mongodbextractor

This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDb databases.

data go mongodb pipeline

Last synced: 14 Jan 2026

https://github.com/techbureau/zaifdata

📘 Data Reader for zaif Exchange

bitcoin blockchain cryptocurrency data exchange nem token trading xem zaif

Last synced: 19 Apr 2026

https://github.com/rastmob/wordpress-llms-output-plugin

A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).

ai data llm llms training training-data wordpress wordpress-development wordpress-plugin

Last synced: 03 May 2026

https://github.com/pitmonticone/covid-italy

References for COVID-19 situation in Italy.

coronavirus covid-19 covid-19-italy data data-analysis documentation testing

Last synced: 05 Apr 2026

https://github.com/asidlo/po

Data science library for manipulating data in Go using the familiar DataFrame and Series constructs from the Python Pandas library.

data dataframe go pandas series

Last synced: 14 Jan 2026

https://github.com/assem-elqersh/creativa-data-science-bootcamp

Jupyter notebooks from the Creativa Data Science Bootcamp, covering key data science concepts and practices across multiple sessions, from data preprocessing to model building and time series analysis.

data data-science eda exploratory-data-analysis machine-learning pandas time-series-analysis xgboost xgboost-classifier

Last synced: 03 May 2026

https://github.com/d3oxy/country-state-data

A comprehensive JSON dataset containing countries, states, cities, regions, and languages with TypeScript support. Perfect for building location-based dropdowns, address forms, and geographical applications.

address cities countries currency data dropdown geographical iso json languages location regions states typescript

Last synced: 24 Jan 2026

https://github.com/ndarville/ipa

HTML hex codes for the IPA table.

data ipa linguistics

Last synced: 24 Jan 2026

https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag

This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.

artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation

Last synced: 12 Feb 2026

https://github.com/aleklukanen/chapterhousedb-v1

Allows you to create simple data streaming warehouses written in Golang using Apache Parquet and Arrow.

arrow data database event golang ingestion parquet pipeline processing stream

Last synced: 27 Feb 2026

https://github.com/StudyResearchProjects/arrbuffstr

Creates Strings from ArrayBuffers and vice versa in NodeJS and the Browser

arraybuffer browser data node string transform

Last synced: 09 Oct 2025

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/antononcube/raku-data-importers

Various data importing routines with a unified interface (data-import, slurp).

data data-ingestion raku rakulang slurp

Last synced: 23 Feb 2026

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/mihasm/arso-scraper

Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.

api arso cli data historical-data meteorological python slovenia weather

Last synced: 14 Feb 2026

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/instaclustr/cassandra-parquet-transformer

Transform SSTables from Apache Cassandra to Parquet or Avro files, locally or remotely via Apache Cassandra Sidecar

analytics apache apache-cassandra avro big cassandra data parquet spark sstable transformation

Last synced: 29 Aug 2025

https://github.com/macsual/dotgov-jamaica-domains

A listing of .gov.jm domains.

data domains

Last synced: 03 Jan 2026

https://github.com/havocesp/dictexpire

Dict-like object with a specific expiry time per item.

data dict expire item items key map mapping python python3 time timeout value volatile

Last synced: 17 Jul 2025
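
A mapping with per-item expiry, as described in the entry above, could look roughly like the following. This is an illustrative sketch only: the class `ExpiringDict` and its `set`/`get` methods are hypothetical names and are not taken from the listed package. It evicts lazily (on access) and accepts an injectable clock, which is an assumption made here to keep the example testable without sleeping.

```python
import time


class ExpiringDict:
    """Mapping whose entries disappear after a per-item time-to-live."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock   # injectable time source (assumption, for testing)
        self._items = {}      # key -> (value, expiry timestamp)

    def set(self, key, value, ttl):
        """Store value under key; it expires ttl seconds from now."""
        self._items[key] = (value, self._clock() + ttl)

    def get(self, key, default=None):
        """Return the value, or default if the key is missing or expired."""
        entry = self._items.get(key)
        if entry is None:
            return default
        value, expires_at = entry
        if self._clock() >= expires_at:
            del self._items[key]  # lazy eviction: removed when next accessed
            return default
        return value

    def __contains__(self, key):
        sentinel = object()
        return self.get(key, sentinel) is not sentinel
```

Lazy eviction keeps the structure simple, at the cost of expired entries occupying memory until they are next looked up; a background sweep would be the usual alternative.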

https://github.com/sheweny/discord-resolve

This module groups together functions to retrieve data from different types of arguments.

data discord discord-js mentions resolver sheweny utility

Last synced: 29 Oct 2025

https://github.com/nowosad/cllc

Country-level Land Cover - categories and transitions

data dataset land-cover land-cover-transitions r

Last synced: 04 Apr 2025

https://github.com/imadsaddik/bodmaghdataset

BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language

arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft

Last synced: 03 Apr 2025

https://github.com/headless-start/data-augmentation-impact

This repository examines the effect of data augmentation of the training set during model training.

augmented-images cuda data gpu keras matplotlib mnist opencv-python python3 tensorflow training-data

Last synced: 05 Apr 2026

https://github.com/quetz-al/quetzal-client

Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 28 Jul 2025

https://github.com/stdlib-js/array-base-copy

Copy the elements of an array-like object to a new generic array.

array copy data generic javascript node node-js nodejs stdlib structure types

Last synced: 30 Oct 2025

https://github.com/rn0x/aliexpress_product_data

Extract product data from the AliExpress website

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/zenwor/table_editor

A simple table data editor, with easily scalable functions and operations & a nice GUI

data data-science formula java parser parsing preprocessing swing tokenizer

Last synced: 22 Jun 2025

https://github.com/djthorpe/data

Data extraction, transformation, processing and visualisation

canvas csv data data-extraction data-transformation dom golang svg visualization

Last synced: 07 Sep 2025

https://github.com/jesusgraterol/bitcoin-lightning-network-stats-dataset-builder

The dataset builder script extracts Bitcoin's Lightning Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data data-science dataset dataset-generation lightning-network machine-learning

Last synced: 16 May 2026

https://github.com/ournet/news-sources

A repository of news sources for every country

data news news-sources sources

Last synced: 11 Jul 2025

https://github.com/itzshoaib/hashtegrity

A library for generating hashes, validating data integrity, monitoring file/directory integrity, and checking off-chain data integrity

crypto-hash data data-integrity hacktoberfest hash integrity

Last synced: 07 Mar 2026
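
The general idea behind hash-based integrity checking mentioned in the entry above can be sketched with Python's standard library. This is not the listed library's API: `file_sha256` and `verify_file` are hypothetical helper names, and SHA-256 is chosen here only as a common default.

```python
import hashlib
import hmac


def file_sha256(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns an empty bytes object
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file(path, expected_hex):
    """Return True if the file's digest matches the expected hex string."""
    # compare_digest avoids leaking match position through timing
    return hmac.compare_digest(file_sha256(path), expected_hex.lower())
```

Reading in chunks keeps memory use constant regardless of file size, which matters when monitoring large files or whole directories.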

https://github.com/jinsyin/datalink

⚡ Data integration | DataLink is a lightweight data integration framework built on top of DataX, Spark and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/ymougenel/referencecollector

Helps you gather, store and share reference links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/dmnsgn/airports-data

Airports data: static, dynamic and custom dump.

airport airports cli data database json

Last synced: 30 Apr 2025

https://github.com/kom-senapati/ghw-data-hacks

🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...

data ghw

Last synced: 12 Mar 2025

https://github.com/subnwa/sql

Starter code for creating a SQL database.

base data database master microsoft sql

Last synced: 06 Mar 2025

https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms

Contains code related to data structures

algorithms cplusplus data data-structures

Last synced: 10 Jul 2025

https://github.com/ezra-obiwale/vue-doo

Utility library for VueJS

ajax axios data function http js method store utility vue vuejs

Last synced: 07 May 2025

https://github.com/heliomarpm/keyvalues-storage

A simple and robust KeyValues Storage library

app-library config data db json json-db keyvalues localdb settings storage store

Last synced: 12 Apr 2025

https://github.com/inspect-js/is-data-view

Is this value a JS DataView? This module works cross-realm/iframe, does not depend on instanceof or mutable properties, and despite ES6 Symbol.toStringTag.

data dataview ecmascript javascript typedarray typedarrays view

Last synced: 05 Apr 2025

https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset

BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data : What is the right way?

bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote

Last synced: 21 Jun 2025

https://github.com/wpp-public/akqa-nz-tagmanager-connector

A simple JavaScript library to send events to a tag manager container

data google-tag-manager

Last synced: 05 Apr 2025

https://github.com/bradlindblad/quotableoffice

Repo for the quotable office R Shiny app

data datascience golem-apps r shiny shiny-apps text text-mining

Last synced: 26 May 2026

https://github.com/rikvdh/zabuffer

Zero-Allocation buffer handling in C

buffer c clib data embedded memory string zero-allocation

Last synced: 03 Mar 2025

https://github.com/lukanedimovic/table_editor

A simple table data editor, with easily scalable functions and operations & a nice GUI

data data-science formula java parser parsing preprocessing swing tokenizer

Last synced: 04 Apr 2025

https://github.com/guslovesmath/top_tech_sp_500_forecasting

Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P 500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's percent change.

arima-forecasting arima-model data data-science forecasting vector-autoregression

Last synced: 14 Mar 2025

https://github.com/moumouls/data-toulouse-grapql

Partial GraphQL support for the Toulouse Métropole Open Data service

api data graphql open service toulouse

Last synced: 16 May 2026

https://github.com/mawburn/across-a-thousand-dead-worlds-data

Across a Thousand Dead Worlds Data

data json ttrpg

Last synced: 21 Apr 2026