An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/flowsynx/plugin-postgresql

FlowSynx plugin that interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.

data database flowsynx postgresql postgresql-database sql

Last synced: 09 May 2026

https://github.com/jayantur13/data-bharat

Get states, their capitals and districts, UTs, and other useful information

data js node npmjs package yarn

Last synced: 28 Jan 2026

https://github.com/luminati-io/Crunchbase-dataset-samples

A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.

crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping

Last synced: 09 Apr 2025

https://github.com/iamlucianojr/laravel-api-query-handler

This Laravel package helps handle query requests properly

api collection data eloquent handler l5x laravel query

Last synced: 28 Apr 2026

https://github.com/jtpio/data-playground

Experiments using public APIs and data

data experiments python

Last synced: 28 Apr 2026

https://github.com/ahmetcansolak/developer-insights

New project of ClubRockers from Sarıyer Hills

bitbucket data data-science data-visualization github python3

Last synced: 28 Apr 2026

https://github.com/CheeseWithSauce/HadithsJSONFormat

Free, authentic Hadith data from sunnah.com, organized by book, especially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.

api arabic data dev free hadith islam islamic muslim open-source quran sunnah

Last synced: 24 Feb 2026

https://github.com/player29879/sketch

AI code-writing assistant that understands data content

ai codex data dataframe dats-science df ds gpt3 pandas python sketchs

Last synced: 28 Apr 2026

https://github.com/gappeah/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 31 Jul 2025

https://github.com/kouisamine/data-uri-to-image

Convert Data URI into Image(png, jpeg, webp, gif, svg, ...) files.

conversion convert converter data datauri datauri-to-image image js online php script source-code tools uri

Last synced: 10 May 2026

https://github.com/the-aerospace-corporation/pivt

PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.

analytics dashboards data devops jenkins pipeline python splunk visualization

Last synced: 29 Apr 2026

https://github.com/alexandregazagnes/rica-analysis

This repository contains the code to download, analyse, and model the RICA dataset from the French Ministry of Agriculture.

analysis argiculture business data data-analysis data-analytics food python

Last synced: 29 Apr 2026

https://github.com/definetlynotai/vulnscan_data

Logicytics VulnScan Module's Training Data and old model archive

ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data

Last synced: 11 Oct 2025

https://github.com/yord/klp-json

A JSON plugin for klp (Kelpie), the small, fast, and magical command-line data processor.

csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv

Last synced: 29 Apr 2026

https://github.com/katiesaund/dresden_maps

Contains a data file with locations from The Dresden Files. The data file is to be used for my map tutorial in R.

data

Last synced: 05 Jan 2026

https://github.com/sgarciaddev/proyecto-poo

Software project for managing student attendance at a school, using the Java language and the object-oriented programming paradigm.

alumnos csv data java mysql poo

Last synced: 29 Apr 2026

https://github.com/sodascience/open_supply_hub

Processing supply chain data obtained from Open Supply Hub

data global-supply-chain open-supply-hub python

Last synced: 29 Apr 2026

https://github.com/v-mayya/python-sales-data-analysis

Two-person group project, run by CFG, to conduct spreadsheet analysis of fake sales data using Python

analysis data matplotlib numpy python

Last synced: 29 Apr 2026

https://github.com/frefrik/covid19norge-api

API for COVID-19 cases in Norway

api covid covid-19 covid19 data fastapi norge norway

Last synced: 10 May 2026

https://github.com/iamjuniorb/data_structures_and_algorithms

I'm taking the Data Structures and Algorithms I (C949) class at school and decided to write up all of these searching algorithms, sorting algorithms, structures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.

data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3

Last synced: 08 Jun 2026

https://github.com/spectrochempy/spectrochempy_data

Test and example data repository for SpectroChemPy

data

Last synced: 04 Apr 2025

https://github.com/CentralFloridaAttorney/ComfyUI-ZMongo

An easy-to-use database framework and parameter library for ComfyUI. Centralize node presets, capture workflow logic, manage structured image collections, and build document-driven text automation pipelines on an offline Local File Store or BusinessProcessApplications.com.

api comfy comfy-ui comfyui comfyui-custom-node comfyui-custom-nodes comfyui-manager comfyui-node comfyui-nodes comfyui-workflow data database

Last synced: 21 Jun 2026

https://github.com/rohancyberops/rp1

This project performs an analysis of Starbucks (SBUX) stock returns using R. The analysis includes both simple returns and continuously compounded returns (CC returns) for a period of one month. It also calculates the growth of $1 invested in SBUX and provides visual insights through various plots.

analysis cc data r rlanguage sbux

Last synced: 15 Mar 2025

https://github.com/stdlib-js/ndarray-empty-like

Create an uninitialized ndarray having the same shape and data type as a provided ndarray.

data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 11 Oct 2025

https://github.com/inc44/raqua

Raqua 💧, a set of Python scripts and a Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures by creating an overflowing cache

cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows

Last synced: 11 Apr 2026

https://github.com/jrmedd/emojinal

An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.

data emojis sentiment user-research ux

Last synced: 19 Jan 2026

https://github.com/kirkalyn13/portfolio-dashboard-site

Portfolio Site; Initially a Service Provider Metrics Dashboard using React.

dashboard data data-visualization react

Last synced: 15 Apr 2026

https://github.com/cintia0528/data_analytics_and_visualization-sql_tableau

Evaluate Magist as a strategic partner for Eniac's Brazilian expansion. Use SQL to analyze growth, tech accessory sales potential, delivery times, and customer satisfaction in Magist's database.

data dataanalysis datavisualization sql strategy tableau

Last synced: 31 Mar 2025

https://github.com/visenger/prada

Profiling Datasets

cleaning data dataset profiling

Last synced: 24 Aug 2025

https://github.com/francescodisalesgithub/data-for-developers

Simple SQL database with problems and solutions found on Stack Overflow, in documentation, or via ChatGPT

chatgpt data database developer hacker hacking knowledge solutions sql targets

Last synced: 22 Mar 2025

https://github.com/danish-foundation-models/dfm-processing

Toolkit for processing data in the Danish Foundation Models project.

data text-processing

Last synced: 02 Jul 2025

https://github.com/qeeqbox/data-classification

Data classification defines and categorizes data according to its type, sensitivity, and value

classification data data-classification infosecsimplified qeeqbox

Last synced: 09 Mar 2026

https://github.com/genert/metis

Asynchronous data sender library

analytics asynchronous data dependency-free typescript

Last synced: 27 Jan 2026

https://github.com/openearth/rws-viewer

This viewer is created by Deltares in cooperation with Voorhoede under OpenEarth GPL License. The viewer can be used via several RWS websites, please visit https://www.informatiehuismarien.nl/, https://waterinfo-extra.rws.nl/ and https://basismonitoringwadden.waddenzee.nl/.

data mapbox-gl-js ogc-services viewer

Last synced: 01 Feb 2026

https://github.com/harmanveer-2546/supply-chain

Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.

customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis

Last synced: 10 Apr 2026

https://github.com/cworld1/novel-data

The data repository of novel analysis

analysis data novel

Last synced: 01 Feb 2026

https://github.com/matusf/glasgow_wifi

Script that plots Wi-Fi access points on a map and labels them by their protection

data data-visualization folium python python3

Last synced: 24 Jun 2026

https://github.com/jub0t/eso

An application to manage all your Encryption & Decryption keys and other related tools.

data encryption encryption-decryption hacking hacking-tool keys pgp privacy private

Last synced: 07 Feb 2026

https://github.com/R-Mahesh45/HR---Resume-Text-Classification

Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.

data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing

Last synced: 13 Oct 2025

https://github.com/camara94/introduction-to-data-engineering

Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.

business-analytics business-intelligence data dataingestion dataintegration datascience machinelearning python statistical-analysis

Last synced: 09 Apr 2025

https://github.com/rubenhortas/python_examples

Examples of Python code and DSA (data structures and algorithms).

algorithm algorithms data dsa examples python python-3 python3 samples snippets structures

Last synced: 03 Oct 2025

https://github.com/dominhduy09/my-links

All of my links and the websites I have been creating; a place to save all of my websites' links

data database link linked-list linktree list save storage website

Last synced: 25 Jun 2026

https://github.com/Lemniscate-world/StratAI

This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.

ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading

Last synced: 13 Oct 2025

https://github.com/connectaman/deepseek-ocr-multigpu-infer

Efficient multi-GPU OCR inference framework leveraging parallel processes for accelerated token throughput and faster batch processing. Designed for scalable, high-performance optical character recognition workloads using PyTorch. Supports dynamic GPU assignment, optimized resource utilization, and easy integration for large-scale image datasets.

agentic-extraction data deepseek document-parser extraction extractor gpu image-parser llm multigpu nvidia ocr parallel-computing parser pdf-parser vlm

Last synced: 22 Jan 2026

https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer

Simple PowerShell functions to convert to and from JSON. Very lightweight, supported on every PowerShell version. No dependencies.

convert converter data data-science deserialize json lightweight powershell serializer

Last synced: 04 May 2026

https://github.com/desmondsanctity/abeona-kafka

A demo showing how to integrate Upstash's serverless Kafka into a Node.js microservice. Presented at Berlin Buzzwords 2024.

berlin-buzzwords data event-driven kafka microservice serverless streaming upstash-kafka

Last synced: 15 May 2025

https://github.com/jeanmanguy/milk-sci-fi

Census of every mention of milk in sci-fi works.

data milk sci-fi

Last synced: 26 Feb 2026

https://github.com/asuozzo/medicare-data-analysis

An analysis of Medicare Part D data in Vermont

data python

Last synced: 04 May 2026

https://github.com/raghavendranhp/credit_card_fraud_detection

This repository contains code for a credit card fraud detection model using autoencoders and logistic regression, achieving 95.3% accuracy.

anomaly-detection autoencoder-neural-network credit-card-fraud data keras logistic-regression machine-learning preprocessing tensorflow

Last synced: 04 May 2026

https://github.com/3squared/smoulder

Smoulder is a really good data pipe

composition data facade-pattern forge-framework object-oriented

Last synced: 25 Apr 2026

https://github.com/stdlib-js/strided-base-dtype-resolve-str

Return the data type string associated with a supported strided array data type value.

array data dtype dtypes enum javascript node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 13 Oct 2025

https://github.com/willdev12/rjson

Encryptable JSON file format for .NET projects!

csharp csharp-library data dotnet json json-data json-plugin variables vbdotnet vbnet

Last synced: 11 Apr 2026

https://github.com/nfaltir/dataxplorer

🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.

data data-science python streamlit

Last synced: 05 May 2026

https://github.com/joeyism/py-cifar10

This library was created to allow easy use of CIFAR-10 data. It is a wrapper around the instructions given on the CIFAR-10 site.

cifar cifar-10 cifar10 data machine-learning machinelearning

Last synced: 30 Jul 2025

https://github.com/williamwutq/bblock

Persistent checksummed blocks built on top of bstack's allocators

allocation binary block data data-structures database rust rust-crate rust-library serialization

Last synced: 25 Jun 2026

https://github.com/lakecountryhuntclub/dnr-map-data-model

Data Model for the 2023 DNR Pheasant Stocking Property Data

data data-model documentation excel gis hunting mapping powerquery vba

Last synced: 29 Jul 2025

https://github.com/athul64/powerbi

Financial Reports Dashboard: this repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.

data data-an data-visualization dax dax-expression powerbi

Last synced: 23 Feb 2026

https://github.com/ispyhumanfly/prowler

Query the web, extract data from the results, and transform that data into a format you can use.

ai analytics business cryptocurrency data extract-data machine-learning mining scraping web

Last synced: 06 Sep 2025

https://github.com/outofbedlam/tine

TINE, a data pipeline runner.

data pipeline

Last synced: 05 Oct 2025

https://github.com/cosmos-loops/cosmos-dapper

Cosmos.Dapper is a part of Cosmos.Data, an inline project of the COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.

dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver

Last synced: 11 Apr 2026

https://github.com/rrwen/twitter2mongodb

Module for extracting Twitter data to MongoDB databases

api data database geo get location mdb media mongo mongod mongodb oauth post rest sample social stream token tweet twitter

Last synced: 06 May 2026

https://github.com/h2lsoft/validator

A multilanguage library for validating values, with CSRF protection

csrf csrf-protection data form php validator

Last synced: 04 Feb 2026

https://github.com/stdlib-js/ndarray-base-fliplr

Return a view of an input ndarray in which the order of elements along the last dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 11 Feb 2026

https://github.com/sbdk-dev/sbdk.dev

A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.

ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer

Last synced: 27 May 2026

https://github.com/satur-io/estoraje

Estoraje is the simplest distributed system for key-value storage in less than 800 lines of code. It is temporary consistent, highly available, lightweight, scalable, and gives good performance.

data database distributed go golang key-value performance training

Last synced: 07 May 2026

https://github.com/augustoarraes/corais

Python app for monitoring coral reef marine life 🪸

coral data iot matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis

This project explores the Walmart Sales data to understand top-performing branches and products, sales trends of different products, and customer behaviour. The aim is to study how sales strategies can be improved and optimized. The dataset was obtained from Kaggle.

data database mysql sql walmart

Last synced: 24 Feb 2026

https://github.com/skygenesisenterprise/aether-account

Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.

account data javascript nextjs platform service sso-service typescript user-interface

Last synced: 16 Apr 2026

https://github.com/svetlanam/twitter-ads

Get data about campaigns from Twitter Ads API

api data keboola keboola-extractor twitter twitter-ads twitter-api

Last synced: 12 Jun 2026

https://github.com/connectomicslab/cmtklib-data

Datalad dataset that stores all data resources of the cmtklib module of Connectome Mapper 3 (https://github.com/connectomicslab/connectomemapper3).

brain data parcellation resources software

Last synced: 16 Jan 2026

https://github.com/fastbolt/excel-writer

Excel-Writer component

data excel excel-export

Last synced: 14 Apr 2025

https://github.com/brianali-codes/github-searcher

A website for API experimentation that uses the GitHub API to search for different users and some of their (public) information

api data github user

Last synced: 21 May 2026

https://github.com/souvik09-tech/adventure-works-kpi-dashboard

This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.

analysis data kpi powerbi visualization

Last synced: 27 Jan 2026

https://github.com/fritzrehde/asciibar

A CLI tool to print percentages as ASCII bar charts

cli data percentage visualization

Last synced: 31 Oct 2025

https://github.com/orisai/nette-data-sources

Orisai Data Sources integration for Nette

data decoder encoder file-format files json neon nette orisai parser php yaml

Last synced: 05 Feb 2026

https://github.com/bastianolea/campamentos_chile

Data from the 2024 national survey (Catastro) of campamentos, from the Ministry of Housing and Urbanism (Ministerio de Vivienda y Urbanismo)

chile comunas data pobreza social

Last synced: 24 Aug 2025

https://github.com/rrwen/twitter2pg-cli

Command line tool for extracting Twitter data to PostgreSQL databases

api cli cmd command data database geo interface line location media pg postgres postgresql rest social stream tool tweet twitter

Last synced: 12 Apr 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/lane-romuald/iot-irrigation-data-collection-system

An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.

arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi

Last synced: 12 Apr 2026

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026