An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/m0nica/datalogues-outdated

Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll

data pelican pelican-blog pelican-theme

Last synced: 28 Feb 2026

https://github.com/ismail-mouyahada/lodscroljs-library

LodScrolJS Documentation LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks

data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll

Last synced: 13 Feb 2026

https://github.com/stdlib-js/array-base-every-by-right

Test whether all elements in an array pass a test implemented by a predicate function, iterating from right to left.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 13 Feb 2026

https://github.com/frictionlessdata/cardealerdp

Cardealer DP (Car Dealer Data Package) is a data exchange format for car dealerships. It is developed on top of the Data Package standard

car data datapackage dealer exchange extension format

Last synced: 13 Feb 2026

https://github.com/jopanel/factual-scraper

Data scraper for Factual v2 API

data

Last synced: 15 Feb 2026

https://github.com/neomutt/sample-data

📚 Lists of things. Useful for developing and testing.

data list sample

Last synced: 19 Mar 2026

https://github.com/m-rishab/stock_trend-analysis-power-bi-project-

In this project, I've harnessed the robust capabilities of Power BI to analyse, visualize, and uncover the story behind HUL's stock performance.

data datavisualization datavisualization-project powerbi

Last synced: 19 Mar 2026

https://github.com/theonlybeardedbeast/exercise-data

Datasets for workout exercises

data dataset fitness health healthcare

Last synced: 20 Mar 2026

https://github.com/docusign/extension-app-data-io-reference-implementation

Extension App for Data IO Reference Implementation for the Docusign IAM Platform

apps data extension

Last synced: 02 Mar 2026

https://github.com/metapsy-project/data-depression-inpatients

Database of depression psychotherapy trials in inpatient settings

data

Last synced: 27 Mar 2026

https://github.com/stdlib-js/array-base-every-by

Test whether all elements in an array pass a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 03 Mar 2026

https://github.com/rousan/weshare

An application that transfers files between devices

c-sharp data dot-net file lan phone share transfer-data weshare wifi

Last synced: 17 Apr 2026

https://github.com/izam-mohammed/data-source

🌐 A source directory for the data of my projects and experiments.📂 This curated collection simplifies access to diverse data that used in various projects💡

csv-files data data-source zip-files

Last synced: 03 Jun 2026

https://github.com/jinsyin/dataorigin

数据之源 | A data source management framework

data data-source datasource

Last synced: 21 Apr 2026

https://github.com/howtoquitvivek/ai-crop-yeild-prediction

AI-driven crop yield prediction and agricultural optimization system (SIH 2025)

2025 2026 ai crop-yeild data minor-project ml predcition python science sih

Last synced: 23 Apr 2026

https://github.com/sebastianbrzustowicz/collision-detection-ai

Python + TensorFlow. Repository for training a machine learning model for collision detection with an accelerometer sensor data and TensorFlow.

accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow

Last synced: 24 Apr 2026

https://github.com/andygol/osm-diff-state

CLI tool to search OSM diff state files

custom data openstreetmap planet replication

Last synced: 24 Apr 2026

https://github.com/ahmad-ali-rafique/pyviznotebook

PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.

analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization

Last synced: 06 Jun 2026

https://github.com/aidenellis/connectmp

🍰 ConnectMP - An easy way to share data between Processes in Python.

aidenellis connectmp data data-sharing multiprocessing process sharing

Last synced: 27 Apr 2026

https://github.com/iamlucianojr/laravel-api-query-handler

:flashlight: This Laravel package helps to handle a query request properly

api collection data eloquent handler l5x laravel query

Last synced: 28 Apr 2026

https://github.com/saulojoab/crato-ce-json

Nesse repositório irei armazenar todos os bairros (e mais informações, no futuro) de Crato-CE em JSON.

data database geolocation json json-api localization

Last synced: 28 Apr 2026

https://github.com/reubano/ckanny

A Python command line interface (CLI) for interacting with CKAN instances

ckan cli data featured open-data

Last synced: 28 Apr 2026

https://github.com/the-aerospace-corporation/pivt

PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.

analytics dashboards data devops jenkins pipeline python splunk visualization

Last synced: 29 Apr 2026

https://github.com/scarblase/salary-comparison

Submission for the DataCamp Salary Competition(1 level). 🏆

data data-analysis data-science data-visualization engineering python sql structured-data

Last synced: 01 May 2026

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to do a Data analysis in a Real Estate Investment Trust. The Trust would like to start investing in Residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 01 May 2026

https://github.com/danielgiljam/orbit-utils

A collection of utility packages for Orbit.js.

data inference orbit orbitjs schema synchronization type typescript validation zod

Last synced: 01 May 2026

https://github.com/ishaansathaye/data40x-1_2_3

Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab

capstone-prep data data-science ethics lab python

Last synced: 04 May 2026

https://github.com/kucingkode/dmerge

Small javascript library to help you merge same formatted data in a string

cithak data data-merge javascript library lightweight lightweight-javascript-library merge open-source

Last synced: 04 May 2026

https://github.com/nfaltir/dataxplorer

🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.

data data-science python streamlit

Last synced: 05 May 2026

https://github.com/eradical/analytics-unibody

Ansible role that sets up a farm of analytics collectors based on nginx

analytics ansible ansible-role big-data collectors data nginx

Last synced: 06 May 2026

https://github.com/sivas-2/coffee-sales-visualization

This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.

data data-analysis data-science data-visualization python visualization

Last synced: 07 May 2026

https://github.com/augustoarraes/corais

App Python de Monitoramento de vida marinha de Recife de Corais 🪸

coral data iot matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/chompfoods/stub-jaxrs-resteasy

JAX-RS RESTEasy server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jax-rs-server nutrition raw recipe-api recipes resteasy server server-stub stub stub-server

Last synced: 08 May 2026

https://github.com/raynardj/r_notes

Learning notebooks of R

data docker guru99 jupyter learning r

Last synced: 09 May 2026

https://github.com/keanteng/nextjs-directory

🌐A Draft Website For Data Catalogue Using NextJs

catalogue climate-change css data directory html javascript nextjs website

Last synced: 09 May 2026

https://github.com/bastianolea/comisarias_chile

Base de datos con las comisarías, retenes, tenencias y otras instalaciones de Carabineros

chile data estado social

Last synced: 23 Jun 2025

https://github.com/782e616c6d/covid-d.a

Academic project, using Apache Spark for ETL and Data Studio for data analysis.

academic analytics automation cluster covid-19 data database etl python spark sql

Last synced: 10 May 2026

https://github.com/masu-baumgartner/dbsync.net

A c# mysql model sync library

cshap data library mysql

Last synced: 13 May 2026

https://github.com/cdcgov/importsurvey

Import survey: Import data into R, with an application to the National Center for Health Statistics (NCHS)

data import r sas survey survey-data

Last synced: 19 Jun 2026

https://github.com/williamwutq/mappedpages

A fixed-size page provider backed by memory mapping, intended for building higher-level allocators and storage systems

allocation allocator data data-storage database file memory-mapping mmap page rust rust-crate rust-library storage

Last synced: 25 Jun 2026

https://github.com/maccccd/wsoa3029a_2444372

This website serves an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js , a JavaScript library used to create interactive data visualizations. The visualizations in here were used to provide insights on two types of cybersecurity attacks: Phishing & Ransomware.

d3js data hacking visualization

Last synced: 24 Jan 2026

https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali

Työllisyyden kuntakokeilujen palvelutietovarannon manuaali

data drupal drupal-9 unemployment

Last synced: 24 Jan 2026

https://gitlab.com/Native-Coder/d3-react-component

This is a dead-simple React component that makes D3 implementation a breeze.

chart component d3 data react vis visualization viz

Last synced: 24 Jan 2026

https://github.com/metriccoders/metriccoders_datasets

This is the Metric Coders repository containing all the datasets for machine learning.

data datasets machine-learning natural-language-processing scikit-learn

Last synced: 08 Apr 2025

https://github.com/jayantur13/data-bharat

Get states their capital and districts,UTS and other useful information

data js node npmjs package yarn

Last synced: 28 Jan 2026

https://github.com/CheeseWithSauce/HadithsJSONFormat

Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.

api arabic data dev free hadith islam islamic muslim open-source quran sunnah

Last synced: 24 Feb 2026

https://github.com/desktopcleaner/naturemagazinescraper

Scrapes open-access Nature magazine articles and store as txt files.

data nature-magazine python scrapper word-frequency

Last synced: 06 Feb 2026

https://github.com/stdlib-js/ndarray-base-output-policy-str2enum

Return the enumeration constant associated with an output ndarray data type policy string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils

Last synced: 15 Apr 2026

https://github.com/itu-helper/data-updater

Periodically scrapes data related to ITU to be used by anyone. This data powers the ITU Helper web sites.

data istanbul-technical-university scraper selenium-python

Last synced: 29 Jan 2026

https://github.com/sandk21/etude_eau_potable_monde

Etude sur l'accès à l'eau dans le monde - Tableaux de bord avec Tableau

analysis data tableau tableau-public visualization

Last synced: 19 Mar 2026

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 05 Jan 2026

https://github.com/igorskyflyer/npm-adblock-header-extract

✂️ Parse and extract ad-block filter list headers with ease. Works on strings or files, trims whitespace, and returns clean metadata for tooling and automation. 📃

adblock back-end biome data filter header igorskyflyer javascript js metadata node nodejs npm string ts typescript utility

Last synced: 11 Mar 2026

https://github.com/GiveMePseudonyms/PiVisualisations

A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.

data data-visualization pi pygame python self-organising-criticality tkinter

Last synced: 08 Apr 2025

https://github.com/tee8z/noaa-oracle

NOAA data oracle, queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style

data duckdb-wasm noaa-weather parquet-files sql weather

Last synced: 17 Feb 2026

https://github.com/elissorokin/data-analyst-portfolio-rus

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 25 Feb 2026

https://github.com/aniketkkajania/wassupanalyzer

WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.

data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat

Last synced: 25 Feb 2026

https://github.com/cworld1/novel-data

The data repository of novel analysis

analysis data novel

Last synced: 01 Feb 2026

https://github.com/khalyomede/fetch

Quickly retrieve your PHP data

config configuration data fetch php php7

Last synced: 15 Mar 2025

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026

https://github.com/mbolam/DSWS_OpenRefine

Cleaning and Linking Data with OpenRefine

cleaning data metadata openrefine

Last synced: 07 Apr 2025

https://github.com/garcane/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 09 Feb 2026

https://github.com/ajityadav2621/datadoom

Currently working on backend, and as user interaction has been done so updated also deployed for reference. will be adding up many things.

ai data

Last synced: 09 Feb 2026

https://github.com/pharo-ai/data-preprocessing

Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.

data pharo pharo-smalltalk preprocessing smalltalk

Last synced: 09 Feb 2026

https://github.com/zituocn/dean

Task flow framework for data processing

data golang task

Last synced: 18 Jan 2026

https://github.com/jhpoelen/bats

self-documenting data publication on Bat (Chiroptera) specimen

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/lookininward/data-formatter-demo

You have directories containing data files and specification files. The specification files describe the structure of the data files. Write an app that reads format definitions from specification files. Use these definitions to convert the parsed files to NDJSON files.

csv data demo files json ndjson python txt unittest

Last synced: 27 Apr 2026

https://github.com/scottleechua/data

Public datasets under CC-BY-4.0 license.

data public-data

Last synced: 18 Mar 2026

https://github.com/cqllum/schema2dwh

⚡ Automatically produce a data model on your database using its information schema using GenAI.

ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design

Last synced: 13 Mar 2025

https://github.com/mchenryspagg/hng-hire-data-model

The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.

dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema

Last synced: 11 Feb 2026

https://github.com/ewertondrigues02/engenharia-de-dados

Varios Projetos de Engenharia de Dados usando principais ferramentas como: Airflow, Snowflake, dbt, Postrgres, Looker Studio, Power BI

airflow analise-exploratoria analytics aws-ec2 dados data dbt-cloud engenharia-de-dados looker-studio postgres pyspark python3 snowflake spark

Last synced: 16 Apr 2026