An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/yaoguangduan/protosync

generate go code from protobuf ,sync proto dirty data

data golang protobuf sync

Last synced: 12 Mar 2026

https://github.com/bastianolea/siedu_indicadores_urbanos

Datos del Sistema de Indicadores y Estándares de Desarrollo Urbano, con datos comunales sobre temas como transporte, urbanismo, servicios básicos, calidad de vida y más.

ambiental app chile ciudad comunas data estado social

Last synced: 19 Feb 2026

https://github.com/feltex/datahora-java

Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.

brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime

Last synced: 18 Jun 2026

https://github.com/bastianolea/sinim_info_municipal

Base de datos del Sistema Nacional de Información Municipal, que incluye datos comunales sobre finanzas municipales, recursos humanos, educación, salud, pensiones, organizaciones sociales, y más.

chile comunas data estado laboral politica social tiempo

Last synced: 26 Oct 2025

https://github.com/blakedrumm/scvmm-scripts-and-sql

The Scripts provided here are compatible with System Center Virtual Machine Manager

collector data powershell scripts scvmm sql

Last synced: 11 May 2025

https://github.com/kaos599/apollo-synthetic-data-generator

Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.

data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui

Last synced: 22 Jul 2025

https://github.com/ondata/opensdmx

Python CLI and library for any SDMX 2.1 REST API — Eurostat, ISTAT, OECD, ECB, World Bank and more. AI-ready.

cli data eurostat istat oecd open-data python rest-api sdmx statistics

Last synced: 01 May 2026

https://github.com/jinsyin/datalink

⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/nop-dev/learning-js

Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰

data data-structures functions javascript js

Last synced: 17 Apr 2026

https://github.com/dark-art108/yonk

A cli-utility to streamline data science work by creating templates

data machine-learning python3

Last synced: 08 May 2026

https://github.com/14richa/patient-readmission-analysis

This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.

analytics data jupyter-notebook numpy

Last synced: 29 Apr 2026

https://github.com/jimut123/scrapers

All Scrapers that I'll build

bs4 data python3 real-time-visualisations scrapers scrapy wget

Last synced: 16 Jan 2026

https://github.com/ayoub-amzil/offline-globe

Offline country data for PHP Laravel framework. Over 200 countries, capitals, flags, languages, currencies. No internet needed.

composer data internet laravel offline php

Last synced: 09 May 2026

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/danielgiljam/orbit-utils

A collection of utility packages for Orbit.js.

data inference orbit orbitjs schema synchronization type typescript validation zod

Last synced: 01 May 2026

https://github.com/orisai/nette-data-sources

Orisai Data Sources integration for Nette

data decoder encoder file-format files json neon nette orisai parser php yaml

Last synced: 05 Feb 2026

https://github.com/nafisalawalidris/buybuy-e-commerce-company

The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.

buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql

Last synced: 16 Mar 2025

https://github.com/kledenai/jsonweaver

A powerful and easy-to-use library for transforming JSON data into popular formats such as CSV, XML, Markdown tables, YAML, and JSONLines (NDJSON).

csv data data-transform format json jsonlines jsonweaver markdown markdown-tables xml yaml

Last synced: 24 Feb 2026

https://github.com/gdhhgnbnvbn/f1-2025-ai-predict

fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.

agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe

Last synced: 01 May 2026

https://github.com/GiveMePseudonyms/PiVisualisations

A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.

data data-visualization pi pygame python self-organising-criticality tkinter

Last synced: 08 Apr 2025

https://github.com/ireddragonicy/wascrub

Clean WhatsApp chat export easily.

chat clean data meta whatsapp

Last synced: 03 May 2026

https://github.com/divanny/academixbackend

🧑‍🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.

backend csharp data net webapi

Last synced: 07 Jun 2026

https://github.com/lucien-loua/libgn

Manipulate geographical and administrative data about Guinea.

data guinea

Last synced: 08 Jun 2026

https://github.com/potreic/etl-fashion-trend-analysis

✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊

airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends

Last synced: 27 Jan 2026

https://github.com/data-forge-notebook/javascript-cheat-sheet

Cheat sheet that accompanies my book Data Wrangling with JavaScript

cheatsheet data data-wrangling javascript nodejs

Last synced: 15 Apr 2026

https://github.com/gematik/poc-isik-patient-merge

The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to inform about happened merges within the ISIK context.

data fhir isik poc

Last synced: 19 Oct 2025

https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico

Software em Python para varrer arquivos PDF e extrair parâmetros diretamente para arquivo Excel

analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3

Last synced: 01 May 2026

https://github.com/bredalis/scikitlearn

🤖 Library to create ML models 🤖

data ia learning-python librery ml python

Last synced: 30 May 2026

https://github.com/sadmanca/uoft-pey-coop-job-postings

Code for parsing approximately 1.8k HTML pages of UofT PEY co-op job postings (from September 2023 to May 2024) to a single sqlite3 database file.

co-op data html python singlefile sqlite sqlite3 uoft uoft-pey

Last synced: 17 Apr 2026

https://github.com/juliaextremes/idfdatacanada.jl

A set of methods to get ECCC IDF data from .txt files

canada climate data julia netcdf

Last synced: 21 Oct 2025

https://github.com/chompfoods/sdk-php

PHP SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients php raw recipe-api recipes sdk

Last synced: 30 Apr 2026

https://github.com/stdlib-js/array-base-to-deduped

Copy elements to a new generic array after removing consecutive duplicated values.

array compress copy data dedupe deduplicate deduplication duplicate generic javascript node node-js nodejs stdlib structure types uniq unique

Last synced: 14 Jun 2025

https://github.com/chompfoods/stub-asp-net-core

ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server

Last synced: 30 Apr 2026

https://github.com/inc44/raqua

Raqua 💧, a set of Python scripts and Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures, by creating an overflowing cache

cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows

Last synced: 11 Apr 2026

https://github.com/wu-rymd/pyobjectify

Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects

data objects python3

Last synced: 08 Jun 2026

https://github.com/iamjuniorb/data_structures_and_algorithms

I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.

data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3

Last synced: 08 Jun 2026

https://github.com/rousan/weshare

An application that transfers files between devices

c-sharp data dot-net file lan phone share transfer-data weshare wifi

Last synced: 17 Apr 2026

https://github.com/purarue/git_doc_history

copy/track file history in git, with python bindings to traverse and extract history/files/lines at some date

data git

Last synced: 17 May 2026

https://github.com/v-mayya/python-sales-data-analysis

Group project with another team member held by CFG to conduct spreadsheet data analysis of fake sales data using Python

analysis data matplotlib numpy python

Last synced: 29 Apr 2026

https://github.com/humbertocg18/pucrs-alest-i-2.3-2023.24

Trabalhos, Projetos, Exercícios e aulas realizados em Java na cadeira de Algoritimos e estrutura de dados 1, matéria do segundo semestre.

beecrowd beecrowd-solution-in-js beecrowd-solutions-in-java data data-structures datastructures-algorithms hashmap hashtable java-8 leetcode leetcode-javascript leetcode-solutions leetcodepra pucrs sorting-algorithms

Last synced: 29 Mar 2025

https://github.com/bileljegham/api-sport-cli

Cli for https://api-sports.io/ Retreive data and convert to sql file

cli data database match nodejs sports sports-analytics

Last synced: 08 May 2026

https://github.com/dbriane208/omdena-apprenticeship-project

This is part of my contribution to the Omdena apprenticeship program .

data data-science feature-engineering machine-learning

Last synced: 14 Mar 2026

https://github.com/cqllum/schema2dwh

⚡ Automatically produce a data model on your database using its information schema using GenAI.

ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design

Last synced: 13 Mar 2025

https://github.com/sodascience/open_supply_hub

Processing supply chain data obtained from Open Supply Hub

data global-supply-chain open-supply-hub python

Last synced: 29 Apr 2026

https://github.com/aidanjuma/ankideckextractor

A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.

anki apkg cli data decompression extraction flashcards learning python zip

Last synced: 29 Apr 2026

https://github.com/sgarciaddev/proyecto-poo

Proyecto de software de gestión de asistencia de alumnos en un colegio, utilizando el lenguaje Java y el paradigma de programación orientada a objetos.

alumnos csv data java mysql poo

Last synced: 29 Apr 2026

https://github.com/tatey/list_of_countries

A list of countries, states, and cities in Ruby

cities countries data ruby states

Last synced: 11 Nov 2025

https://github.com/yord/klp-json

A JSON plugin for klp (Kelpie), the small, fast, and magical command-line data processor.

csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv

Last synced: 29 Apr 2026

https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights

Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.

analytics data excel sas sasprogramming statistical-analysis

Last synced: 24 Feb 2026

https://github.com/imahdimir/githubdata

A very simple Python package to easily download from and manage a GitHub "Data Repository"

data data-repository python-package

Last synced: 23 Jan 2026

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 26 Oct 2025

https://github.com/the-aerospace-corporation/pivt

PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.

analytics dashboards data devops jenkins pipeline python splunk visualization

Last synced: 29 Apr 2026

https://github.com/mtingers/opacify

Opacify reads a file and builds a manifest of external sources to rebuild said file.

backup data obfuscation python

Last synced: 18 May 2026

https://github.com/castelao/bufr

BUFR binary data format from WMO

binary data format meteorology oceanography wmo

Last synced: 13 Jul 2025

https://github.com/player29879/sketch

AI code-writing assistant that understands data content

ai codex data dataframe dats-science df ds gpt3 pandas python sketchs

Last synced: 28 Apr 2026

https://github.com/jackosheadev/databasetechproject

This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction

data database mssql sql

Last synced: 18 May 2026

https://github.com/ntia/compound_radar_waveforms-data

Data used by NTIA/ITS TR-23-566 Examining the Effects of Resolution Bandwidth when Measuring Compound Radar Waveforms.

bandwidth data measurement p0n q3n radar resolution stepped waveform

Last synced: 27 Jan 2026

https://github.com/gbowne1/jsonhelix

This is a X11 GUI JSON application for editing, debugging and converting JSON and schemas and API data.

api data gui gui-application json x11

Last synced: 10 Jun 2025

https://github.com/maccccd/wsoa3029a_2444372

This website serves an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js , a JavaScript library used to create interactive data visualizations. The visualizations in here were used to provide insights on two types of cybersecurity attacks: Phishing & Ransomware.

d3js data hacking visualization

Last synced: 24 Jan 2026

https://github.com/rdjarbeng/rdjarbeng

Richard Djarbeng's github profile-computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to website below

data jekyll machine-learning ruby website

Last synced: 28 Apr 2026

https://github.com/jayantur13/data-bharat

Get states their capital and districts,UTS and other useful information

data js node npmjs package yarn

Last synced: 28 Jan 2026

https://github.com/cainmi/data-page-project

A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now

data html javascript python schema

Last synced: 21 Oct 2025

https://github.com/luminati-io/Crunchbase-dataset-samples

A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.

crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping

Last synced: 09 Apr 2025

https://github.com/jtpio/data-playground

Experiments using public APIs and data

data experiments python

Last synced: 28 Apr 2026

https://github.com/hasnocool/war_thunder_data_scraper

A web scraping tool designed to extract valuable data from War Thunder, a popular online game.

data database framework integration multi processing python scraper scraping scrapy sql threaded thunder war

Last synced: 06 May 2026

https://github.com/flowsynx/plugin-csv

FlowSynx plugin to reads and writes CSV files, enabling easy batch data import/export operations and integration with spreadsheet-based data workflows.

comma-separated-values csv data data-platform flowsynx

Last synced: 10 Mar 2026

https://github.com/karthikmprakash/github_repos_scraper

A tool to extract names of github repos of any user

automation bs4 data github python repositories requests webscraping

Last synced: 27 Apr 2026

https://github.com/ispyhumanfly/prowler

Query the web, extract data from the results, and transform that data into a format you can use.

ai analytics business cryptocurrency data extract-data machine-learning mining scraping web

Last synced: 06 Sep 2025

https://github.com/sap-samples/security-research-codegraphsmote

Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.

augmentation data detection learning machine research sample security vulnerability

Last synced: 07 Jun 2026

https://github.com/zalweny26/open_data_unipa

Progetto per l'esame di Laboratorio di Algoritmi 23-24, UniPa, Informatica L-31

data open project python

Last synced: 26 Apr 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2

Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.

data data-analysis deloitte excel forensic-analysis

Last synced: 06 Feb 2026

https://github.com/kuro337/scalamono

Scala Monorepo Tooling for Kafka, Opensearch, Spark, Redpanda, Hadoop - and Lang Reference.

data database duckdb hadoop kafka redpanda sdala spark

Last synced: 13 Apr 2026

https://github.com/kirkalyn13/portfolio-dashboard-site

Portfolio Site; Initially a Service Provider Metrics Dashboard using React.

dashboard data data-visualization react

Last synced: 15 Apr 2026

https://github.com/sebastianbrzustowicz/collision-detection-ai

Python + TensorFlow. Repository for training a machine learning model for collision detection with an accelerometer sensor data and TensorFlow.

accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow

Last synced: 24 Apr 2026

https://github.com/varbrad/mindb

🗄 🔍 ⚡️ Schema-less document-oriented collection model data-store for Node & Browsers.

browser data datastore db document javascript json-schema mongo mongodb nodejs nosql query schema

Last synced: 13 Apr 2026

https://github.com/tee8z/noaa-oracle

NOAA data oracle, queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style

data duckdb-wasm noaa-weather parquet-files sql weather

Last synced: 17 Feb 2026

https://github.com/osiota10/alx-low_level_programming

C Low Level Programming - Data Structures, Linux/Unix System Programming and Algorithms with ALX Software Engineering

algorithms assembly c data data-structures linux shell unix

Last synced: 25 Jun 2025

https://github.com/palewire/nyc-hpd-bronx-lead-paint-violations

Download and process housing code lead paint violations in the Bronx from NYC Open Data

bronx data data-journalism news nyc python

Last synced: 02 Apr 2026

https://github.com/waylonwalker/exceltocsv

A usefull tool to convert excel spreadsheets to csv files without launching excel

csv-converter csv-files data excel python spreadsheet

Last synced: 05 May 2025

https://github.com/jub0t/eso

An application to manage all your Encryption & Decryption keys and other related tools.

data encryption encryption-decryption hacking hacking-tool keys pgp privacy private

Last synced: 07 Feb 2026

https://github.com/howtoquitvivek/ai-crop-yeild-prediction

AI-driven crop yield prediction and agricultural optimization system (SIH 2025)

2025 2026 ai crop-yeild data minor-project ml predcition python science sih

Last synced: 23 Apr 2026

https://github.com/ofelipelucca/cdc-kafka-debezium-pipeline

A real-time event-driven social network API built with CDC (Change Data Capture), Kafka, Debezium, PostgreSQL and MongoDB implementing CQRS-style architecture with streaming data pipelines.

cdc data data-engineering data-integration data-pipeline debezium event-driven fastapi kafka kafka-connect microservices mongodb postgresql python sqlalchemy

Last synced: 05 Jun 2026