An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/ymougenel/referencecollector

Helps you gather, store and share references links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/jaldekoa/nyfedapi

A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 08 Feb 2026

https://github.com/xsolla/data-fast-insights

Xsolla data analytics tool for fast business insights and reporting.

analytics data data-analysis data-science python reporting xsolla

Last synced: 29 Jun 2026

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/openpeeps/zxc-nim

Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed

archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc

Last synced: 07 Jun 2026

https://github.com/blakedrumm/scvmm-scripts-and-sql

The Scripts provided here are compatible with System Center Virtual Machine Manager

collector data powershell scripts scvmm sql

Last synced: 11 May 2025

https://github.com/muhammadibrahim313/start-your-data-science-journey

In this Repo i will be Sharing all Resources that we will be Learning during December Data Science Workhops on iCode Guru

btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python

Last synced: 03 Feb 2026

https://github.com/jackallabs/canine-oracle

The Oracle Daemon for the Jackal Blockchain

blockchain cosmos data feed jackal oracle stream

Last synced: 06 Feb 2026

https://github.com/mujadded/facebook_scrapper

The fcebook scrapper gem that dont need the api

data data-mining facebook ruby-gem scrapper selenium-webdriver

Last synced: 28 Oct 2025

https://github.com/rikvdh/zabuffer

Zero-Allocation buffer handling in C

buffer c clib data embedded memory string zero-allocation

Last synced: 03 Mar 2025

https://github.com/mmaithani/loan-approvel-ml-model-with-insights

This project will approved or reject the loan applications. Public api, data insights and predictive models for loan prediction project are also provided

data data-science loan-prediction-analysis machine-learning visualization

Last synced: 16 Aug 2025

https://github.com/vikashpr/18cse301j_ra2011003010737

This website tells the story of a nation's GDP through data visualization, providing insights on global GDP, state-wise GDP, sector-wise GDP, and the vision for India's economy. It includes data sets and sources for further reference.

css3 d3-visualization d3js data data-vizualisation gephi-visualizations html5 indian-economy indian-gdp information-visualization js python-word-cloud python3 storytelling tableau tableau-public threejs wordcloud-visualization

Last synced: 03 May 2026

https://github.com/rastmob/wordpress-llms-output-plugin

A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).

ai data llm llms training training-data wordpress wordpress-development wordpress-plugin

Last synced: 03 May 2026

https://github.com/stdlib-js/ndarray-base-from-scalar

Convert a scalar value to a zero-dimensional ndarray.

base convert data javascript ndarray node node-js nodejs scalar stdlib structure types wrap

Last synced: 03 Jul 2025

https://github.com/mrnazu/eth-data-library

eth-data-library is a Nodejs library that provides tools for accessing and processing data on the Ethereum blockchain.

blockchain data ethereum nodejs smart-contracts web3

Last synced: 28 Jan 2026

https://github.com/asirihewage/simplest-xpath-web-scraper

Simplest web scraper created using Python3 and MongoDB

data data-mining python3 scraper web webscrping

Last synced: 29 Jan 2026

https://github.com/stdlib-js/array-base-reject

Return a shallow copy of an array containing only those elements which fail a test implemented by a predicate function.

array copy data filter generic javascript node node-js nodejs predicate reject stdlib structure test types

Last synced: 26 Dec 2025

https://github.com/velocitatem/cellviz

Cellular Automata inspired by live-data visualization, designed to handle multidimensional and high-throughput data efficiently.

cellular-automata conways-game-of-life data economics

Last synced: 29 Jul 2025

https://github.com/vtalks/youtube_data_api3

A python3 library to interact with Youtube Data API.

api client data library python python3 youtube

Last synced: 09 Apr 2026

https://github.com/charliecm/meteorite-landings

Data visualization of meteorite landings on Earth.

astronomy d3 data data-visualization mapbox space visualization

Last synced: 18 Apr 2026

https://github.com/fairspec/fairspec-application

Fairspec Application is a visual tool for managing and validating tabular and structured data

ckan csv data dataset excel fair fairspec json ods polars python quality schema sqlite table typescript validation zenodo

Last synced: 23 May 2026

https://github.com/joeyism/py-cifar10

This library was created to allow an easy usage of CIFAR 10 DATA. This is a wrapper around the instructions givn on the CIFAR 10 site

cifar cifar-10 cifar10 data machine-learning machinelearning

Last synced: 30 Jul 2025

https://github.com/asuozzo/medicare-data-analysis

An analysis of Medicare Part D data in Vermont

data python

Last synced: 04 May 2026

https://github.com/mouneshgouda/learn_dsa

This repository explores fundamental data structures and their implementations. Learn how to organize and manipulate data efficiently for various programming tasks. (Feel free to add your specific focus areas here, e.g., algorithms, interview prep)

c data queue sorting-algorithms stack structured-data

Last synced: 30 Jul 2025

https://github.com/visenger/prada

Profiling Datasets

cleaning data dataset profiling

Last synced: 24 Aug 2025

https://github.com/gappeah/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 31 Jul 2025

https://github.com/flowsynx/plugin-postgresql

FlowSynx plugin to interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.

data database flowsynx postgresql postgresql-database sql

Last synced: 09 May 2026

https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint

A Bioconductor ExperimentHub data package for flow sorted purified whole blood cell types measured using DNA methylation on WGBS platform from BLUEPRINT

bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs

Last synced: 25 Sep 2025

https://github.com/v6ntage/sql-sales_data-analytics-project

This repository contains a SQL scripts demonstration analytical techniques.

analytics business-analytics data data-analysis database query sql sql-server

Last synced: 12 Apr 2026

https://github.com/simranjeet97/leetcode_practice

Practicing the Leet Code Codes for Competitive Programming

algorithms amazon coding competitive-programming data data-structures facebook google leetcode python

Last synced: 03 Aug 2025

https://github.com/dav009/bqt

Local unit tests for your BigQuery queries

bigquery bq data test unittest

Last synced: 11 Feb 2026

https://github.com/woctezuma/download-steam-screenshots-data

Data consisting of Steam screenshots.

data steam steam-api

Last synced: 19 Feb 2026

https://github.com/isaac-lal/english-arabic-dictionary

This is a dictionary website that implements a search feature which allows input for a word in either English or Arabic and returns the alternative translation.

data db javascript react web-development

Last synced: 09 Apr 2026

https://github.com/tpgillam/teafiles.jl

Tea file support for Julia

data julia time-series

Last synced: 03 Oct 2025

https://github.com/vikjam/ui-policy

Unemployment policy at the state level

data government government-data

Last synced: 13 Feb 2026

https://github.com/stdlib-js/array-one-to-like

Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 20 Feb 2026

https://github.com/pradeep221b/turbofan_predictive_maintenance

An R project for predicting turbofan engine RUL using {targets} and {tidymodels}.

data data-science-portfolio machine-learning nasa preditive-maintaince r rstats targets-pipeline tidymodels

Last synced: 04 Oct 2025

https://github.com/zediculz/block

Block is a data structure/collection that uses Blockchain principle in managing data.

algorithm data structure

Last synced: 05 Oct 2025

https://github.com/garcane/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 08 Apr 2026

https://github.com/dylanhogg/cloud-products

A package for getting cloud products and product descriptions from a cloud provider website.

aws cloud-products crawler data text-processing

Last synced: 05 Oct 2025

https://github.com/DefinetlyNotAI/VulnScan_Data

Logicytics VulnScan Module's Training Data and old model archive

ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data

Last synced: 17 Aug 2025

https://github.com/arif-miad/heart-attack-risk-prediction

This dataset explores key factors influencing heart attack risk, such as age, cholesterol, blood pressure, and lifestyle habits. Using machine learning models.

classification data data-science matplotlib ml pandas-python seaborn visualization

Last synced: 18 Aug 2025

https://github.com/gematik/app-fhir-snapshots-package-generator

The repository contains a library and a console application to generate snapshots for StructureDefinitions in FHIR-packages.

data fhir miscellaneous

Last synced: 05 Oct 2025

https://github.com/giorgiosavastano/process

processing-chain provides a convenient way to seamlessly set up processing chains for large amounts of data.

big-data data data-science parallel parallel-computing process processing processing-chain rust

Last synced: 05 Oct 2025

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 10 Apr 2026

https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent

Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.

agentic-ai analysis data deepseek langchain nba python streamlit visualization

Last synced: 08 May 2026

https://github.com/grkndev/twitcher

A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.

api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user

Last synced: 09 Mar 2026

https://github.com/sstendahl/giscan

Simple tool to read and analyze existing GISAXS data

cbf data diffraction diffraction-analysis gisans gisaxs physics reflectivity scattering xray

Last synced: 11 Nov 2025

https://github.com/aymane-maghouti/mobile-data-hive-insights

This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,

apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi

Last synced: 09 Mar 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/ukplab/pragtag2023

Code and data for the PragTag-2023 Shared Task

argument-mining data peer-review pragmatics shared-task

Last synced: 18 Jun 2025

https://github.com/marcelo-earth/h5n8-data

🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.

csv data h5n8 h5n8-cases h5n8-virus russia

Last synced: 19 Jan 2026

https://github.com/viisix/corecat

Core repository of DanceCats project.

data lightweight python3

Last synced: 25 May 2026

https://github.com/gorhkdwj/da_portfolio

Kim Jae Chun's DA_Portfolio

data data-analysis python sql

Last synced: 20 Feb 2026

https://github.com/desmondsanctity/abeona-kafka

A demo to show how to implement Upstash's serverless Kafka to a Node.js microservice. Presented at Berlin Buzzwords 2024

berlin-buzzwords data event-driven kafka microservice serverless streaming upstash-kafka

Last synced: 15 May 2025

https://github.com/francescodisalesgithub/data-for-developers

simple SQL database with problems and solution found on stackoverflow, documentation or chatgpt

chatgpt data database developer hacker hacking knowledge solutions sql targets

Last synced: 22 Mar 2025

https://github.com/stdlib-js/array-base-to-accessor-array

Convert an array-like object to a minimal array-like object supporting the accessor protocol.

accessor accessors array array-like convert data javascript node node-js nodejs object protocol stdlib structure types wrap wrapper

Last synced: 04 Jan 2026

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 10 May 2026

https://github.com/stdlib-js/array-one-to

Generate a linearly spaced numeric array whose elements increment by 1 starting from one.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 26 Feb 2026

https://github.com/mattqdev/koalaz

Why don't use koalas as data mock? With this npm package you can!

data koala lorem-ipsum meme mock placeholder

Last synced: 13 Jan 2026

https://github.com/castdrian/kdapi

A TypeScript library that scrapes K-pop idol and group information from online sources to create comprehensive JSON datasets.

api data kpop scraper typescript

Last synced: 15 May 2025

https://github.com/nouman6093/advanced-statistical-models

in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl

data data-analysis data-science data-visualization

Last synced: 11 Jun 2026

https://github.com/stdlib-js/strided-base-dtype-str2enum

Return the enumeration constant associated with a strided array data type string.

array data dtype dtypes enum javascript multidimensional node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 30 Apr 2025

https://github.com/rremple/intervalidus

For all your interval-based data needs.

data intervals

Last synced: 21 Feb 2026

https://github.com/bilalmehrban/data-log-monitor

A simple yet elegant desktop c# application based on 3 Tier architecture, designed to have a look at the logs stored in the database using Nlog or other logging framework's.

csharp data desktop-app logging

Last synced: 14 Mar 2025

https://github.com/themost-framework/memory

MOST Web Framework in-memory data adapter for testing environments

adapter data orm

Last synced: 06 Mar 2025

https://github.com/brianali-codes/github-searcher

A website for API experimentation that users the github Api to search for different users and some of their (public) information

api data github user

Last synced: 21 May 2026

https://github.com/coqui123/tradegpt

TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FASTAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹

analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript

Last synced: 11 Apr 2026

https://github.com/rohancyberops/rp1

This project performs an analysis of Starbucks (SBUX) stock returns using R. The analysis includes both simple returns and continuously compounded returns (CC returns) for a period of one month. It also calculates the growth of $1 invested in SBUX and provides visual insights through various plots.

analysis cc data r rlanguage sbux

Last synced: 15 Mar 2025

https://github.com/kingabzpro/makefile-actions

GitHub Actions and MakeFile tutorial and project for beginners.

actions analytics automation data data-science makefile

Last synced: 18 Apr 2026

https://github.com/antononcube/raku-data-cryptocurrencies

Raku package of cryptocurrency data retrieval.

crypto cryptocurrency data

Last synced: 02 Apr 2025

https://github.com/sbdk-dev/sbdk.dev

A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.

ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer

Last synced: 27 May 2026

https://github.com/cosmos-loops/cosmos-dapper

Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.

dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver

Last synced: 11 Apr 2026

https://github.com/andygeiss/pipeline

Build your own data pipeline to gather, organize and transform data by using protobuf as an intermediate format.

data data-pipeline data-science go golang machine-learning protobuf protobuf-compiler

Last synced: 31 Mar 2025

https://github.com/dataship/beam

Get collimate'd data into Frame, in Node or the Browser

column-store data data-science

Last synced: 27 Apr 2026

https://github.com/benmaier/boarding_school_sir

Fit SIR dynamics to the prevalence curve of an H1N1 outbreak of a British boarding school in 1978.

boarding data disease epidemiology modeling school spreading

Last synced: 31 Mar 2025

https://github.com/h2lsoft/validator

A library of validators values in multilanguage with CSRF protection

csrf csrf-protection data form php validator

Last synced: 04 Feb 2026

https://github.com/trstringer/pywave2

:ocean: Get swell buoy data

data ocean python

Last synced: 31 Mar 2025

https://github.com/inc44/raqua

Raqua 💧, a set of Python scripts and Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures, by creating an overflowing cache

cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows

Last synced: 11 Apr 2026

https://github.com/rayenfathallah/students_analysis

This projects contains an analysis of the different fadtors affecting students performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.

analysis d3 data education javascript python students

Last synced: 12 Apr 2026

https://github.com/stefanbohacek/exploring-the-mapping-police-violence-dataset

Using my Gutenberg Data Visualization plugin to explore police violence against civilians.

data dataviz police police-brutality police-misconduct

Last synced: 03 Dec 2025