An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/bkamapantula/india-pc-nfhs4

Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.

data government health india nfhs nfhs4

Last synced: 19 Mar 2026

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/planarnetwork/feeds.planar.network

GTFS feeds for bus, train and plane

data feeds gtfs transit transportation

Last synced: 11 Feb 2026

https://github.com/socketsupply/dynavolt

A highly opinionated DynamoDB client for aws-sdk v3 using esm.

aws data database dynamo dynamodb key-value kvstore

Last synced: 05 May 2026

https://github.com/p32929/use-megamind

A simple react hook for managing asynchronous function calls with ease on the client side

async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript

Last synced: 23 Jan 2026

https://github.com/rastmob/wordpress-llms-output-plugin

A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).

ai data llm llms training training-data wordpress wordpress-development wordpress-plugin

Last synced: 03 May 2026

https://github.com/stdlib-js/ndarray-base-dtype-str2enum

Return the enumeration constant associated with an ndarray data type string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 15 Mar 2026

https://github.com/OliverHennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 30 Jul 2025

https://github.com/jahilldev/immutable-parsejs

Parse a JS object or array/map into an Immutable collection. Makes use of the ImmutableJs List and Record primitives.

data immutablejs javascript json nodejs parse typescript

Last synced: 13 Apr 2026

https://github.com/thiagopanini/datadelivery

An open source Terraform module that provides a complete infrastructure toolkit so that users can begin exploring Analytics services on AWS.

analytics athena aws catalog crawler data datamesh glue s3 terraform

Last synced: 29 Nov 2025

https://github.com/whitehathackerpr/data-visualization-tool

This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations

data data-analysis data-visualization python python3

Last synced: 05 Sep 2025

https://github.com/cainmi/data-page-project

A repository to pull code and files from. It may also be used to store page data links, code, etc. Mainly used for Python for now.

data html javascript python schema

Last synced: 21 Oct 2025

https://github.com/desininja/data-engineer-interview-questions

This repository contains all the Data Engineer Interview Questions asked by interviewers.

data data-engineer-interview-questions

Last synced: 31 Mar 2025

https://github.com/bredalis/datastructure

📚 Data Structures in Python

algorithms data data-structure python

Last synced: 12 Apr 2026

https://github.com/eve-ning/osumania_data

Processed osu!mania data from osu!API

data osu rhythm-game vsrg

Last synced: 24 Feb 2026

https://github.com/agavitalis/sample-c-codes

A collection of small projects I carried out on Arduino as an electronic engineering student, despite falling in love with website development.

ageteller atm binary data gpcalculator logging

Last synced: 09 Apr 2025

https://github.com/shawnduong/pacman-digest

Generate a digest of package space usage for Linux systems using pacman.

arch data pacman

Last synced: 13 May 2026

https://github.com/stdlib-js/ndarray-slice-dimension-from

Return a read-only shifted view of an input ndarray along a specific dimension.

copy data javascript matrix ndarray node node-js nodejs shift slice stdlib structure truncate types vector view

Last synced: 24 Apr 2025

https://github.com/dalikewara/typego

typego provides a custom type that can be used to construct information (such as success data, error data, etc.)

custom data golang helper type typego

Last synced: 09 Apr 2025

https://github.com/yasenstar/powerbi_tutorial

Based on the "PowerBI Tutorial" book, provides step-by-step video demos for learning and mastering the Power BI tool

analytics data microsoft powerbi tutorial visualization

Last synced: 07 Jan 2026

https://github.com/bukalapak/bukadata

Data supplier plugin for populating design with real data.

data plugin sketch sketch-plugin

Last synced: 05 Jul 2025

https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning

This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics

data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn

Last synced: 23 Jul 2025

https://github.com/so-cool/uobrain

My solution to the University of Bristol PURE Data Challenge

competition data modeling

Last synced: 09 Sep 2025

https://github.com/jaldekoa/fdicapi

A Python wrapper to easily retrieve data from the BankFind Suite official API from FDIC in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 07 Jan 2026

https://github.com/alexscigalszky/palabras-aleatorias-data

This package has a set of datasets of random words, animals, colors, jokes, onomatopoeias, and types

aleatorias data palabras random words

Last synced: 04 Oct 2025

https://github.com/san089/black-friday-sales-analysis

This project gives insight into a few statistics related to Black Friday sales.

custom data dataanalysis insights sales statistics

Last synced: 13 Jul 2025

https://github.com/hardwario/cloud-fetch

HARDWARIO Cloud Fetch - Data Extraction Tool

cli cloud data excel python

Last synced: 07 Feb 2026

https://github.com/varbrad/mindb

🗄 🔍 ⚡️ Schema-less document-oriented collection model data-store for Node & Browsers.

browser data datastore db document javascript json-schema mongo mongodb nodejs nosql query schema

Last synced: 13 Apr 2026

https://github.com/tarantinoarchive/dec

Developer-Easy CMS

cms data easy ejs js json simple

Last synced: 11 Mar 2026

https://github.com/kuro337/scalamono

Scala Monorepo Tooling for Kafka, Opensearch, Spark, Redpanda, Hadoop - and Lang Reference.

data database duckdb hadoop kafka redpanda sdala spark

Last synced: 13 Apr 2026

https://github.com/gkapfham/ast2016-paper

Source Code of and Supporting Files for a Paper Published at AST 2016

data latex-document paper research

Last synced: 19 Oct 2025

https://github.com/spiceai/datasets

Spice AI curated dataset definitions for Spice.ai

ai bitcoin blockchain data ethereum polygon

Last synced: 20 Apr 2026

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/izaaccoding36/dados-dinamicos

This repository presents a website built with an API for creating charts, reporting on social media usage on a global scale

api data redes-sociais social-media website

Last synced: 26 Mar 2025

https://github.com/nafisalawalidris/buybuy-e-commerce-company

The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.

buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql

Last synced: 16 Mar 2025

https://github.com/bastianolea/palestina

Visualizer of figures on the massacre that Israel is carrying out in Palestine and the Gaza Strip

app data meses palestina politica shiny social tiempo

Last synced: 06 Jul 2025

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/basemax/buskool.com-data

This repository contains the collected product data from the Buskool website (باسکول). The data is stored in 20k+ JSON files, each containing detailed information about products available on the website.

buskool buskoolcom data farsi information ir iran json persian

Last synced: 03 Apr 2025

https://github.com/sandipbera35/blogapp.spring.boot

A proof-of-concept blog application in Java Spring Boot and Spring Data JPA, with MySQL and MinIO object storage. It integrates with a JWT authservice project (written in Go).

data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot

Last synced: 13 Apr 2026

https://github.com/vincentlaucsb/csv-data

A curated repository of real and fake CSV data for use in testing suites

csv data test testing

Last synced: 08 Mar 2026

https://github.com/s-raza/csvio

Wrapper for conveniently processing CSV files

csv data file processing wrapper

Last synced: 14 Jan 2026

https://github.com/stdlib-js/array-base-fancy-slice-assign

Assign element values from a broadcasted input array to corresponding elements in an output array.

array assign assignment copy data fancy generic javascript node node-js nodejs shallow slice stdlib structure subseq subsequence types

Last synced: 06 Oct 2025

https://github.com/outofbedlam/tine

TINE is a data pipeline runner.

data pipeline

Last synced: 05 Oct 2025

https://github.com/sycho9/populater

🐘 PHP script that populates your database tables with fake data using fzaninotto/faker

composer data database fake packagist php populate

Last synced: 13 Apr 2026

https://github.com/dixslyf/nbparts

Unpack a Jupyter notebook into its sources, outputs and metadata.

data haskell jupyter jupyter-notebook nix nix-flake

Last synced: 05 Oct 2025

https://github.com/helins/ex.clj

Java exceptions as Clojure data

clojure data exception java java-exceptions

Last synced: 12 Dec 2025

https://github.com/igorwastaken/math-problems

Solve math problems easily with this utility library.

algorithm area data demography geography javascript math npm package population school typescript util utils

Last synced: 23 Feb 2026

https://github.com/iwconfig/svtplay-data

Daily JSON backup of content metadata from SVTPlay

data metadata streamlink svtplay svtplay-dl youtube-dl

Last synced: 24 Oct 2025

https://github.com/hyperversal-blocks/averveil

Averveil is OpenSea for Data.

blockchain data golang iot privacy zero-knowledge zkp

Last synced: 14 Jan 2026

https://github.com/alexandregazagnes/rica-analysis

This repository contains the code to download, analyse, and model the RICA dataset from the French Ministry of Agriculture.

analysis argiculture business data data-analysis data-analytics food python

Last synced: 29 Apr 2026

https://github.com/strata/data

Tools to help you read data from a range of different data providers.

api data data-integration

Last synced: 27 Jan 2026

https://github.com/famarks/grafarg

Grafarg is an interactive data analytics and graphical data visualization application. Grafarg, a progressive fork of Grafana 7.5.17, continues to be available under the open source Apache 2.0 License.

analytics charts data data-analysis data-science data-visualization grafana grafarg graph

Last synced: 19 Jan 2026

https://github.com/davorg/dmp

Data Munging with Perl

book data hacktoberfest munging perl

Last synced: 21 Jan 2026

https://github.com/stdlib-js/ndarray-empty-like

Create an uninitialized ndarray having the same shape and data type as a provided ndarray.

data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 11 Oct 2025

https://github.com/stdlib-js/array-base-assert-is-real-floating-point-data-type

Test if an input value is a supported array real-valued floating-point data type.

array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate

Last synced: 12 Oct 2025

https://github.com/jrmedd/emojinal

An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.

data emojis sentiment user-research ux

Last synced: 19 Jan 2026

https://github.com/rohancyberops/r-language

R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.

caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse

Last synced: 12 Oct 2025

https://github.com/genert/metis

Asynchronous data sender library

analytics asynchronous data dependency-free typescript

Last synced: 27 Jan 2026

https://github.com/anobaka/insidecollector

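This is software that sits between Excel and a plain record-keeping tool. You can freely create various lists, link them together using a variety of rules, and create custom views to help you better understand your data.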

collection data excel-like list list-manager table

Last synced: 19 Jan 2026

https://github.com/R-Mahesh45/HR---Resume-Text-Classification

Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.

data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing

Last synced: 13 Oct 2025

https://github.com/Lemniscate-world/StratAI

This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.

ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading

Last synced: 13 Oct 2025

https://github.com/twistezo/ts-dto-mapper

DTO (Data Transfer Object) to Object Model transformer

data dto map mapper model object transfer transform transformer typescript

Last synced: 05 Feb 2026

https://github.com/player29879/neum-ai

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 18 Apr 2026

https://github.com/athul64/powerbi

Financial Reports Dashboard: This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.

data data-an data-visualization dax dax-expression powerbi

Last synced: 23 Feb 2026

https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis

This project explores the Walmart Sales data to understand top-performing branches and products, sales trends of different products, and customer behaviour. The aim is to study how sales strategies can be improved and optimized. The dataset was obtained from Kaggle.

data database mysql sql walmart

Last synced: 24 Feb 2026

https://github.com/souvik09-tech/adventure-works-kpi-dashboard

This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.

analysis data kpi powerbi visualization

Last synced: 27 Jan 2026

https://github.com/orisai/nette-data-sources

Orisai Data Sources integration for Nette

data decoder encoder file-format files json neon nette orisai parser php yaml

Last synced: 05 Feb 2026

https://github.com/nnavales/desafios-data-engineer

In this project we will tackle common challenges in the role of a Data Engineer using modern technologies.

data data-engineering database dataengineering docker minio scrapping spark

Last synced: 01 Jun 2026

https://github.com/intersystems-ib/workshop-healthcare-interop

Learn the basics in HealthCare Interoperability using InterSystems IRIS for Health

data fhir health hl7 interoperability

Last synced: 14 Apr 2026

https://github.com/open-i18n/data-iso-15924

Git mirror for ISO 15924, Codes for the representation of names of scripts data

data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems

Last synced: 14 Mar 2026

https://github.com/akv3sic/cryptocurrency-charts

Cryptocurrency API data visualizations 📈 with Matplotlib.

cryptocurrency data data-visualization matplotlib python

Last synced: 16 Oct 2025

https://github.com/potreic/etl-fashion-trend-analysis

✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊

airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends

Last synced: 27 Jan 2026

https://github.com/data-forge-notebook/javascript-cheat-sheet

Cheat sheet that accompanies my book Data Wrangling with JavaScript

cheatsheet data data-wrangling javascript nodejs

Last synced: 15 Apr 2026

https://github.com/florianwendelborn/metatypes

Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)

code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript

Last synced: 27 Jan 2026

https://github.com/mscbuild/analysis

🎢 This collection of data analysis projects demonstrates techniques for extracting, transforming, analyzing, and visualizing data. Data Analytics Projects for Beginners 📈 ⚡

anallysis analysis chart csv dashboard data data-science data-science-projects excel google html5 mashine-learning portfolio pyton

Last synced: 19 Oct 2025

https://github.com/divithraju/divith-aju-hadoop-pyspark-pipeline

This project demonstrates the creation of a scalable data processing pipeline for handling and analyzing log data from a hypothetical e-commerce platform. Leveraging Hadoop and PySpark, the pipeline is designed to process large volumes of log files, providing meaningful insights into user behavior, system performance, and sales metrics.

apache-hadoop-framework apache-spark bigdata client data database dataengineering dataingestionframework datapreprocessing documentation ecommerce-platform hdfs pipeline project project-repository pyspark python3 software-engineering

Last synced: 27 Jan 2026

https://github.com/juliaextremes/idfdatacanada.jl

A set of methods to get ECCC IDF data from .txt files

canada climate data julia netcdf

Last synced: 21 Oct 2025

https://github.com/jaldekoa/fiscaldataapi

A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 27 Jan 2026