An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kledenai/jsonweaver

A powerful and easy-to-use library for transforming JSON data into popular formats such as CSV, XML, Markdown tables, YAML, and JSONLines (NDJSON).

csv data data-transform format json jsonlines jsonweaver markdown markdown-tables xml yaml

Last synced: 24 Feb 2026

https://github.com/ireddragonicy/wascrub

Clean WhatsApp chat export easily.

chat clean data meta whatsapp

Last synced: 03 May 2026

https://github.com/doziestar/datavinci

DataVinci enables you to visualize data from various sources, generate insights, analyze data with AI models, and receive real-time updates on anomalies

data golang logs pipeline

Last synced: 23 Jan 2026

https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights

Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.

analytics data excel sas sasprogramming statistical-analysis

Last synced: 24 Feb 2026

https://github.com/capire/xtravels-java

Travel booking app using master data from xflights built with CAP Java

cap cds data federation flights java reuse

Last synced: 23 Jan 2026

https://github.com/cmda-tt/course-24-25

🎓 tech track · 2024-2025 · curriculum and syllabus 📊

d3 data datavis datavisualization es6 functional javascript programming svelte

Last synced: 28 Jan 2026

https://github.com/imahdimir/githubdata

A very simple Python package to easily download from and manage a GitHub "Data Repository"

data data-repository python-package

Last synced: 23 Jan 2026

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 26 Oct 2025

https://github.com/rnabla/cuda-des

Bruteforcing DES using CUDA

bruteforce cuda data des encryption gpu parallel standard

Last synced: 27 Oct 2025

https://github.com/mihaiconstantin/lavot

A `React` application that allows users to indicate how votes will be redistributed among candidates for the second round of Romanian presidential elections.

data data-visualization elections react sankey typescript

Last synced: 06 Feb 2026

https://github.com/aleenprd/docbt

Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.

ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit

Last synced: 11 Nov 2025

https://github.com/medz/block

A flexible and efficient binary data block handling library for Dart.

binary blob block data streams

Last synced: 24 Feb 2026

https://github.com/timxor/bitcoind-data-ingestion

crypto payments bitcoind data ingestion

bitcoind data ingestion

Last synced: 27 Oct 2025

https://github.com/patrikmasiar/algorythm-of-the-night

Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻‍💻

algorithms data interview-questions logic logic-programming math mathematics science

Last synced: 27 Oct 2025

https://github.com/maccccd/wsoa3029a_2444372

This website serves an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js , a JavaScript library used to create interactive data visualizations. The visualizations in here were used to provide insights on two types of cybersecurity attacks: Phishing & Ransomware.

d3js data hacking visualization

Last synced: 24 Jan 2026

https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali

Työllisyyden kuntakokeilujen palvelutietovarannon manuaali

data drupal drupal-9 unemployment

Last synced: 24 Jan 2026

https://gitlab.com/Native-Coder/d3-react-component

This is a dead-simple React component that makes D3 implementation a breeze.

chart component d3 data react vis visualization viz

Last synced: 24 Jan 2026

https://github.com/zoekelepiri/ota_observatory

A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.

backend data database spring-boot

Last synced: 06 Feb 2026

https://github.com/jayantur13/data-bharat

Get states their capital and districts,UTS and other useful information

data js node npmjs package yarn

Last synced: 28 Jan 2026

https://github.com/CheeseWithSauce/HadithsJSONFormat

Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.

api arabic data dev free hadith islam islamic muslim open-source quran sunnah

Last synced: 24 Feb 2026

https://github.com/ariqf1/learn_data

Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.

data data-engineering mysql

Last synced: 15 Apr 2026

https://github.com/desktopcleaner/naturemagazinescraper

Scrapes open-access Nature magazine articles and store as txt files.

data nature-magazine python scrapper word-frequency

Last synced: 06 Feb 2026

https://github.com/stdlib-js/ndarray-base-output-policy-str2enum

Return the enumeration constant associated with an output ndarray data type policy string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils

Last synced: 15 Apr 2026

https://github.com/fairspec/fairspec-typescript

Fairspec TypeScript is a fast data management framework built on top of the Fairspec standard and Polars DataFrames

ckan csv data dataframe dataset excel fair json ods polars quality schema sqlite table typescript validation zenodo

Last synced: 09 Feb 2026

https://github.com/alejo1630/titanic_kaggle

This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.

data data-science jupyter-notebook notebook python titanic-survival-prediction

Last synced: 03 May 2026

https://github.com/itu-helper/data-updater

Periodically scrapes data related to ITU to be used by anyone. This data powers the ITU Helper web sites.

data istanbul-technical-university scraper selenium-python

Last synced: 29 Jan 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2

Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.

data data-analysis deloitte excel forensic-analysis

Last synced: 06 Feb 2026

https://github.com/jinsyin/datagovernance

公众号:「数据之道」

data data-governance datagovernance governance

Last synced: 30 Jan 2026

https://github.com/sandk21/etude_eau_potable_monde

Etude sur l'accès à l'eau dans le monde - Tableaux de bord avec Tableau

analysis data tableau tableau-public visualization

Last synced: 19 Mar 2026

https://github.com/kirkalyn13/portfolio-dashboard-site

Portfolio Site; Initially a Service Provider Metrics Dashboard using React.

dashboard data data-visualization react

Last synced: 15 Apr 2026

https://github.com/simranjeet97/quotes-analysis

Kaggle Dataset on Quotes Analysis and Visualization With Python, Pandas and MatplotLib Using Jupyter Notebook.

data data-science datavisualization jupyter-notebook kaggle kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python quotes quotes-application

Last synced: 15 Apr 2026

https://github.com/tee8z/noaa-oracle

NOAA data oracle, queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style

data duckdb-wasm noaa-weather parquet-files sql weather

Last synced: 17 Feb 2026

https://github.com/openearth/rws-viewer

This viewer is created by Deltares in cooperation with Voorhoede under OpenEarth GPL License. The viewer can be used via several RWS websites, please visit https://www.informatiehuismarien.nl/, https://waterinfo-extra.rws.nl/ and https://basismonitoringwadden.waddenzee.nl/.

data mapbox-gl-js ogc-services viewer

Last synced: 01 Feb 2026

https://github.com/elissorokin/data-analyst-portfolio-rus

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 25 Feb 2026

https://github.com/aniketkkajania/wassupanalyzer

WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.

data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat

Last synced: 25 Feb 2026

https://github.com/cworld1/novel-data

The data repository of novel analysis

analysis data novel

Last synced: 01 Feb 2026

https://github.com/jub0t/eso

An application to manage all your Encryption & Decryption keys and other related tools.

data encryption encryption-decryption hacking hacking-tool keys pgp privacy private

Last synced: 07 Feb 2026

https://github.com/giladbarnea/to

A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.

conversion data data-conversion json json5 parser script terminal toml yaml

Last synced: 08 Feb 2026

https://github.com/noahweasley/node-user-settings

A universal but simple node library to implement user settings, built to work with Electron.js with little or no configurations

app data electronjs json nodejs persist settings storage sync user

Last synced: 08 Feb 2026

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026

https://github.com/raymondcm/strawberrydata

Tool suite for fast multi-camera strawberry data collection project. The standards document houses cross compatibility/purpose implementation details.

camera cpp data intel multi-camera

Last synced: 08 Feb 2026

https://github.com/jeanmanguy/milk-sci-fi

Census of every mention of milk in sci-fi works.

data milk sci-fi

Last synced: 26 Feb 2026

https://github.com/garcane/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 09 Feb 2026

https://github.com/ajityadav2621/datadoom

Currently working on backend, and as user interaction has been done so updated also deployed for reference. will be adding up many things.

ai data

Last synced: 09 Feb 2026

https://github.com/3squared/smoulder

Smoulder is a really good data pipe

composition data facade-pattern forge-framework object-oriented

Last synced: 25 Apr 2026

https://github.com/pharo-ai/data-preprocessing

Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.

data pharo pharo-smalltalk preprocessing smalltalk

Last synced: 09 Feb 2026

https://github.com/danielbello7/nosql-json-database

Simple and quick database to help development process and speed

data database json json-database models nosql nosql-database nosql-json-database schema

Last synced: 09 May 2026

https://github.com/codenoid/alodokter.com-database

a Alodokter.com Database, collected by Hofesh Bot (Scrapper)

alodokter data extraction hofesh

Last synced: 18 Mar 2026

https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis

Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.

data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn

Last synced: 10 Feb 2026

https://github.com/jhpoelen/rats

self-replicating data publication related to rat (Rattus sp.) specimen.

biodiversity data natural-history-collections provenance

Last synced: 18 Mar 2026

https://github.com/jhpoelen/bats

self-documenting data publication on Bat (Chiroptera) specimen

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/scottleechua/data

Public datasets under CC-BY-4.0 license.

data public-data

Last synced: 18 Mar 2026

https://github.com/stdlib-js/ndarray-base-fliplr

Return a view of an input ndarray in which the order of elements along the last dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 11 Feb 2026

https://github.com/mchenryspagg/hng-hire-data-model

The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.

dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema

Last synced: 11 Feb 2026

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 05 Mar 2026

https://github.com/chompfoods/stub-go-server

Go server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go-server go-swagger grocery ingredients nutrition raw recipe-api recipes

Last synced: 17 Apr 2026

https://github.com/rtmigo/pickledir_py

File-based key-value storage. Serializes keys and values with pickle

cache caching data directory file linux macos package pickle python windows

Last synced: 17 Apr 2026

https://github.com/rousan/weshare

An application that transfers files between devices

c-sharp data dot-net file lan phone share transfer-data weshare wifi

Last synced: 17 Apr 2026

https://github.com/sogful/archive

you will NEVER believe what this repository contains

archive crawl data scrapes sites

Last synced: 03 Jun 2026

https://github.com/aiwithqasim/recommendationengines

Recommendations Engines with IBM a project of DataScientist Nanodegree on Udacity. For this project i will analyze the interactions that users have with articles on the IBM Watson Studio platform, and make recommendations to them about new articles you think they will like.

data data-manging data-science ibm ipython-notebook normalization python3

Last synced: 18 Apr 2026

https://github.com/jinsyin/dataorigin

数据之源 | A data source management framework

data data-source datasource

Last synced: 21 Apr 2026

https://github.com/tkonopka/makealive

Dynamic web content through controlled javascript

conversion-functions d3 data data-science javascript visualization

Last synced: 22 Apr 2026

https://github.com/aidenellis/connectmp

🍰 ConnectMP - An easy way to share data between Processes in Python.

aidenellis connectmp data data-sharing multiprocessing process sharing

Last synced: 27 Apr 2026

https://github.com/reubano/ckanny

A Python command line interface (CLI) for interacting with CKAN instances

ckan cli data featured open-data

Last synced: 28 Apr 2026

https://github.com/sgarciaddev/proyecto-poo

Proyecto de software de gestión de asistencia de alumnos en un colegio, utilizando el lenguaje Java y el paradigma de programación orientada a objetos.

alumnos csv data java mysql poo

Last synced: 29 Apr 2026

https://github.com/aidanjuma/ankideckextractor

A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.

anki apkg cli data decompression extraction flashcards learning python zip

Last synced: 29 Apr 2026

https://github.com/v-mayya/python-sales-data-analysis

Group project with another team member held by CFG to conduct spreadsheet data analysis of fake sales data using Python

analysis data matplotlib numpy python

Last synced: 29 Apr 2026

https://github.com/lucien-loua/libgn

Manipulate geographical and administrative data about Guinea.

data guinea

Last synced: 08 Jun 2026

https://github.com/divanny/academixbackend

🧑‍🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.

backend csharp data net webapi

Last synced: 07 Jun 2026

https://github.com/gdhhgnbnvbn/f1-2025-ai-predict

fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.

agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe

Last synced: 01 May 2026

https://github.com/liuliqiang/laueagle

YAML/JSON Lints and Converters

converter data formater json linter python serialization yaml

Last synced: 02 May 2026

https://github.com/dominhduy09/my-links

All of my links and websites I have been creating - For saving all of my website's links

data database link linked-list linktree list save storage website

Last synced: 25 Jun 2026

https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer

Simple powershell functions to convert from and to json. Very lightweight, will be supported with every powershell version. No dependences.

convert converter data data-science deserialize json lightweight powershell serializer

Last synced: 04 May 2026

https://github.com/kenmwaura1/nuvo-data-cleaning-functions

Collection of scripts and functions to clean and preprocess data using Nuvo SDK.

data nuvo react

Last synced: 04 May 2026

https://github.com/eradical/analytics-unibody

Ansible role that sets up a farm of analytics collectors based on nginx

analytics ansible ansible-role big-data collectors data nginx

Last synced: 06 May 2026