An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/theryston/db-mycro

A node module with a json database that saves data in a specific directory, similar to sqlite, but in JSON

base crud data database db db-mycro javascript json jsondatabase nodejs nosql typescript

Last synced: 09 Apr 2026

https://github.com/CentralFloridaAttorney/ComfyUI-ZMongo

An Easy-to-Use database framework and parameter library for ComfyUI. Centralize node presets, capture workflow logic, manage structured image collections, and build document-driven text automation pipelines on an offline Local File Store or BusinessProcessApplications.com .

api comfy comfy-ui comfyui comfyui-custom-node comfyui-custom-nodes comfyui-manager comfyui-node comfyui-nodes comfyui-workflow data database

Last synced: 21 Jun 2026

https://github.com/ryanjoy0000/yt-notifier

Youtube Notifier (Telegram Bot) - A real time data processing pipeline

data go kafka-streams real-time telegram-api youtube-api

Last synced: 14 Jan 2026

https://github.com/yord/klp-json

A JSON plugin for klp (Kelpie), the small, fast, and magical command-line data processor.

csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv

Last synced: 29 Apr 2026

https://github.com/jerryfzhang/rockets

A Node + React App that displays space launch missions around the world.

bootstrap data expressjs less momentjs nodejs react reactjs reactstrap

Last synced: 10 Apr 2026

https://github.com/neelravi/fairtool

A CLI tool for FAIR processing of computational materials science data.

computational data data-analytics fair management materials physics python science

Last synced: 14 Jan 2026

https://github.com/timxor/bitcoind-data-ingestion

crypto payments bitcoind data ingestion

bitcoind data ingestion

Last synced: 27 Oct 2025

https://github.com/iamjuniorb/data_structures_and_algorithms

I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.

data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3

Last synced: 08 Jun 2026

https://github.com/chompfoods/stub-asp-net-core

ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server

Last synced: 30 Apr 2026

https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer

Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.

analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob

Last synced: 17 Apr 2026

https://github.com/ahmadjamil888/facial-recognition-ai-model

A facial recognition AI model powered by CNN , and trained by thousands of images.

ai cnn data data-science facial facial-recognition recognition

Last synced: 30 Jun 2025

https://github.com/alrza2003/alrza2003.github.io

This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.

data data-analysis data-visualization portfolio portfolio-website python

Last synced: 30 Apr 2026

https://github.com/782e616c6d/covid-d.a

Academic project, using Apache Spark for ETL and Data Studio for data analysis.

academic analytics automation cluster covid-19 data database etl python spark sql

Last synced: 10 May 2026

https://github.com/kamal-singh22/ai-driven-emotional-sentiments-analysis

This project leverages machine learning to analyze and classify the emotional sentiment of textual data. The goal is to accurately identify and categorize emotions, aiding applications in customer feedback analysis, social media sentiment analysis, and mental health monitoring.

analysis artificial-intelligence data emotion nlp-machine-learning python sentiment-analysis streamlit text-classification

Last synced: 14 Apr 2026

https://github.com/gkapfham/ast2016-paper

Source Code of and Supporting Files for a Paper Published at AST 2016

data latex-document paper research

Last synced: 19 Oct 2025

https://github.com/gbv/cocoda-mappings

concordances, mappings and conversion scripts to create JSKOS mappings

coli-conc data jskos

Last synced: 28 Oct 2025

https://github.com/iguptashubham/walmart-eda

Imagine diving into the fascinating world of Walmart with just a few lines of code! This project lets you do that using MySQL, a powerful tool for data analysts. You can clean up messy data like a detective, uncovering hidden patterns and trends. Data scientists can take it further,.

analysis data dataset eda mysql portfolio-project python sql

Last synced: 10 Apr 2026

https://github.com/ybelenko/openapi-data-mocker-server-middleware

PSR-15 HTTP Server Middleware to create mock responses from OpenAPI Schemas(OAS 3.0).

data fake faker middleware mock mocker oas oas3 openapi psr-15 swagger

Last synced: 15 Jun 2025

https://github.com/stdlib-js/datasets-herndon-venus-semidiameters

Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.

astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus

Last synced: 09 Oct 2025

https://github.com/izaaccoding36/dados-dinamicos

Esse repositório apresenta um site criado com API para a criação de gráficos, relatando o uso de redes sociais em uma escala global

api data redes-sociais social-media website

Last synced: 26 Mar 2025

https://github.com/fiskeben/meetjescraper

HTTP proxy for Meet je stad project

api data go iot meetjestad proxy scraper weather

Last synced: 29 May 2026

https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint

A Bioconductor ExperimentHub data package for flow sorted purified whole blood cell types measured using DNA methylation on WGBS platform from BLUEPRINT

bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs

Last synced: 25 Sep 2025

https://github.com/harmanveer-2546/supply-chain

Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.

customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis

Last synced: 10 Apr 2026

https://github.com/snimmagadda1/stack-exchange-dump-to-mysql

Batch pipeline to import Stack Exchange XML data dumps to relational DB

batch data mysql spring-batch stackoverflow

Last synced: 30 Mar 2025

https://github.com/joanjpx/excel

📄Exporting .XLS from MySQL🐬through PHP 🐘 without using libraries

data excel export ms-excel mysql php poo xls xlsx

Last synced: 14 May 2026

https://github.com/codenoid/webtoons.com-database

a Webtoons.com Database, collected by Hofesh Bot (Scrapper)

data database

Last synced: 28 Mar 2025

https://github.com/sandk21/etude_eau_potable_monde

Etude sur l'accès à l'eau dans le monde - Tableaux de bord avec Tableau

analysis data tableau tableau-public visualization

Last synced: 19 Mar 2026

https://github.com/mattqdev/koalaz

Why don't use koalas as data mock? With this npm package you can!

data koala lorem-ipsum meme mock placeholder

Last synced: 13 Jan 2026

https://github.com/ishaansathaye/data40x-1_2_3

Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab

capstone-prep data data-science ethics lab python

Last synced: 04 May 2026

https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist

This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).

ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling

Last synced: 09 Jun 2026

https://github.com/phatdev12/diem-thi-tuyen-sinh-10-da-nang

Danh sách điểm thi tuyển sinh 10 Đà Nẵng 2023-2024

data data-science dataanalytics dataset json

Last synced: 28 Jun 2025

https://github.com/kenmwaura1/nuvo-data-cleaning-functions

Collection of scripts and functions to clean and preprocess data using Nuvo SDK.

data nuvo react

Last synced: 04 May 2026

https://github.com/cqllum/schema2dwh

⚡ Automatically produce a data model on your database using its information schema using GenAI.

ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design

Last synced: 13 Mar 2025

https://github.com/east-empire-trading-company/eetc-data-client

Client library for retrieving data managed by EETC Data Hub.

client-library data data-science finance library python

Last synced: 31 May 2026

https://github.com/nfaltir/dataxplorer

🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.

data data-science python streamlit

Last synced: 05 May 2026

https://github.com/kirkalyn13/portfolio-dashboard-site

Portfolio Site; Initially a Service Provider Metrics Dashboard using React.

dashboard data data-visualization react

Last synced: 15 Apr 2026

https://github.com/definetlynotai/vulnscan_data

Logicytics VulnScan Module's Training Data and old model archive

ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data

Last synced: 11 Oct 2025

https://github.com/flowsynx/plugin-postgresql

FlowSynx plugin to interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.

data database flowsynx postgresql postgresql-database sql

Last synced: 09 May 2026

https://github.com/eradical/analytics-unibody

Ansible role that sets up a farm of analytics collectors based on nginx

analytics ansible ansible-role big-data collectors data nginx

Last synced: 06 May 2026

https://github.com/derrickbaruga7/python-data-analysis

This project analyzes ORU’s off-season sewer usage using Python, with `pandas` for data handling, histograms and line plots for exploration, and a `scipy`-based model for prediction. Pearson’s correlation and visualizations help reveal key trends and relationships.

analytics data data-science visualization

Last synced: 31 Jul 2025

https://github.com/erwan-simon/aws-serverless-notebook-platform

A self-hosted, serverless platform offering an intuitive UI to manage, schedule, and execute Jupyter notebooks on AWS.

aws data docker notebook python serverless terraform webapp

Last synced: 13 Jun 2026

https://github.com/famarks/grafarg

Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License

analytics charts data data-analysis data-science data-visualization grafana grafarg graph

Last synced: 19 Jan 2026

https://github.com/kingabzpro/makefile-actions

GitHub Actions and MakeFile tutorial and project for beginners.

actions analytics automation data data-science makefile

Last synced: 18 Apr 2026

https://github.com/montanaz0r/imdb-ratings-auto-inserter

A Python script that enables auto-inserting movie ratings into the IMDB profile.

data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping

Last synced: 07 May 2026

https://github.com/cworld1/novel-data

The data repository of novel analysis

analysis data novel

Last synced: 01 Feb 2026

https://github.com/tylerben/data-spring

Easily generate a dummy dataset based on a provided config

data data-spring datagenerator fake-data generator javascript typescript

Last synced: 27 May 2026

https://github.com/eve-ning/osumania_data

processed osu!mania data from osu!API

data osu rhythm-game vsrg

Last synced: 24 Feb 2026

https://github.com/jmcanterafonseca/leaflet-context-information

A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2

context data fiware information leaflet map open visualization web

Last synced: 21 Apr 2026

https://github.com/augustoarraes/corais

App Python de Monitoramento de vida marinha de Recife de Corais 🪸

coral data iot matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/giladbarnea/to

A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.

conversion data data-conversion json json5 parser script terminal toml yaml

Last synced: 08 Feb 2026

https://github.com/noahweasley/node-user-settings

A universal but simple node library to implement user settings, built to work with Electron.js with little or no configurations

app data electronjs json nodejs persist settings storage sync user

Last synced: 08 Feb 2026

https://github.com/raymondcm/strawberrydata

Tool suite for fast multi-camera strawberry data collection project. The standards document houses cross compatibility/purpose implementation details.

camera cpp data intel multi-camera

Last synced: 08 Feb 2026

https://github.com/themuhd/world-cup-analysis

Analysis of The FIFA World cup from its inception to the recently completed tournament in 2023

data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python

Last synced: 08 May 2026

https://github.com/gniquyij/tuqiu

Dark side of OS X

data notes osx safari

Last synced: 12 Oct 2025

https://github.com/jayantur13/kountry

Node module variant of the Country API

api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn

Last synced: 26 Jan 2026

https://github.com/flowsynx/plugin-json

FlowSynx plugin to loads and parses local JSON files. Supports transformation, extraction, and mapping of hierarchical data structures in workflows.

data data-platform flowsynx json

Last synced: 10 Mar 2026

https://github.com/tatey/list_of_countries

A list of countries, states, and cities in Ruby

cities countries data ruby states

Last synced: 11 Nov 2025

https://github.com/prpriesler/covid19-insights-and-analytics

This project delves into the realm of data analytics and programming, focusing on four pivotal datasets related to the COVID-19 pandemic: confirmed global, death global, vaccination & population data, and Twitter data.

covid19 covid19-data data data-science dataanalytics deep-neural-networks machine-learning natural-language-processing

Last synced: 31 Aug 2025

https://github.com/ajityadav2621/datadoom

Currently working on backend, and as user interaction has been done so updated also deployed for reference. will be adding up many things.

ai data

Last synced: 09 Feb 2026

https://github.com/inc44/raqua

Raqua 💧, a set of Python scripts and Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures, by creating an overflowing cache

cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows

Last synced: 11 Apr 2026

https://github.com/trstringer/pywave2

:ocean: Get swell buoy data

data ocean python

Last synced: 31 Mar 2025

https://github.com/agavitalis/sample-c-codes

A collection of small projects I carried out on audino as an electronic engineering student despite felling in love with website development.

ageteller atm binary data gpcalculator logging

Last synced: 09 Apr 2025

https://github.com/jhpoelen/bats

self-documenting data publication on Bat (Chiroptera) specimen

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/scottleechua/data

Public datasets under CC-BY-4.0 license.

data public-data

Last synced: 18 Mar 2026

https://github.com/joeyism/py-cifar10

This library was created to allow an easy usage of CIFAR 10 DATA. This is a wrapper around the instructions givn on the CIFAR 10 site

cifar cifar-10 cifar10 data machine-learning machinelearning

Last synced: 30 Jul 2025

https://github.com/genert/metis

Asynchronous data sender library

analytics asynchronous data dependency-free typescript

Last synced: 27 Jan 2026

https://github.com/stdlib-js/ndarray-base-fliplr

Return a view of an input ndarray in which the order of elements along the last dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 11 Feb 2026

https://github.com/luminati-io/Crunchbase-dataset-samples

A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.

crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping

Last synced: 09 Apr 2025

https://github.com/skygenesisenterprise/aether-account

Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.

account data javascript nextjs platform service sso-service typescript user-interface

Last synced: 16 Apr 2026

https://github.com/n4ze3m/timezone-json

JSON file with more than 1642 cities timezone in UTC format.

data json timeszone

Last synced: 19 Jul 2025

https://github.com/richardschoen/sshnetibmi

This .Net/.Net Core class library is used to interface with existing IBM i database, program calls, CL commands, service programs and data queues via the PASE based xmlservice-cli PASE command program or regular qsh/bash commands. qsh/bash commands can be used to interface with any qsh/pase based utilities such as the IBM i db2util utility

as400 cl command csharp data db2 ddm dotnet drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg xmlservice xmlservice-cli

Last synced: 04 Feb 2026

https://github.com/R-Mahesh45/HR---Resume-Text-Classification

Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.

data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing

Last synced: 13 Oct 2025

https://github.com/stdlib-js/ndarray-base-empty-like

Create an uninitialized ndarray having the same shape and data type as a provided ndarray.

base data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 09 Mar 2026

https://github.com/raynardj/r_notes

Learning notebooks of R

data docker guru99 jupyter learning r

Last synced: 09 May 2026

https://github.com/lakecountryhuntclub/dnr-map-data-model

Data Model for the 2023 DNR Pheasant Stocking Property Data

data data-model documentation excel gis hunting mapping powerquery vba

Last synced: 29 Jul 2025

https://github.com/rafaelfloressouza/Covid-19-Dashboard

Python web application to display COVID19 data from the world using Plotly and Dash

bootstrap covid-19 css data datavisualization plotly-dash python3

Last synced: 10 Mar 2025