An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/seafloor-geodesy/gnatss-test-data

Repository to host test data for GNATSS software

data testing

Last synced: 06 Apr 2026

https://github.com/luminati-io/crunchbase-dataset-samples

A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.

crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping

Last synced: 17 Mar 2025

https://github.com/luminati-io/pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 17 Mar 2025

https://github.com/alhonaut/quant-assigment

Code for quantitative analysis of Morpho Markets and simulation of the reallocation process in MetaMorpho

analysis data defi quantitative-finance

Last synced: 16 May 2026

https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing

Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.

data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates

Last synced: 13 Jul 2025

https://github.com/lucaaszsx/spyder

A powerful schema-based web scraping library for Node.js built for fast, structured, and reliable data extraction.

cheerio crawler data dom dom-manipulation html json json-ld parser scraper web xml

Last synced: 11 Jun 2026

https://github.com/shukkkur/py_dash

Assignment for ETL Course - Dashboard (plotly & dash)

dash dashboard data data-visualization plotly

Last synced: 06 Oct 2025

https://github.com/elvis-not-presley-one/lostcassowary

LostCassowary is a Minecraft data miner that searches region (.MCA) files for game data. It can search for banners, signs, biomes, and blocks.

data data-mining data-science dataminer minecraft nbt nbt-parser scraper

Last synced: 12 Apr 2025

https://github.com/yasir13001/moonai_api

The MoonAI API service, built with FastAPI, calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.

ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python

Last synced: 20 Jun 2025

https://github.com/owengombas/genyus

🐍 Lyrics analysis with genius.com, Python and Jupyter Notebooks

api data data-science genius jupyter-notebook lyrics python statistics

Last synced: 20 May 2026

https://github.com/uvaio/datasets

Notebooks for data processing, scraping, machine learning

data dataset jupyter jupyter-notebook learning machine ml model ontology

Last synced: 21 Mar 2025

https://github.com/sksubhadeep/nashville-housing-data-cleaning-project-using-sql

SQL Data Cleaning Project on Nashville Housing Dataset

data datacleaning sql

Last synced: 19 Mar 2026

https://github.com/michellepellon/jobx

A modern, powerful job scraper for LinkedIn, Indeed and beyond.

compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper

Last synced: 17 Jan 2026

https://github.com/MikeBairdRocks/Fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 02 Apr 2025

https://github.com/LisaKey/convert-csv-to-sav

We used Python 🐍 to convert a CSV file into a SAV file with all the modifications needed to open it in IBM SPSS and analyse our data.

analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations

Last synced: 03 Mar 2025

https://github.com/thelich2112/bluesky-weather-poster

A WordPress plugin that reads data from a clientraw.txt file and posts it to Bluesky, with configurable posting options.

data posting station weather wordpress

Last synced: 17 May 2026

https://github.com/rob-med/data-visualizations-for-python

A collection of useful snippets for clean data visualizations in Python (with matplotlib)

academic-publishing data data-science data-visualization dataviz matplotlib python scientific-publications storytelling visualization

Last synced: 08 May 2026

https://github.com/2kabhishek/pokemon-stats

Gotta stat 'em all 🖲🐭

d3 data emoji pokemon rollup statistics

Last synced: 14 May 2026

https://github.com/junkwaxdata/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 13 Mar 2025

https://github.com/phatdev12/diem-thi-tuyen-sinh-10-da-nang

List of grade 10 entrance exam scores for Đà Nẵng, 2023-2024

data data-science dataanalytics dataset json

Last synced: 28 Jun 2025

https://github.com/tbrowder/classfactory

Provides tools to create a data collection with classes to manipulate the persistent data.

class data persistent raku

Last synced: 04 Apr 2025

https://github.com/pedro-donoso/productoskotlin

App that loads a list of products with ID, Name, Description, Available, Enabled, and Stock; converts each name to uppercase; replaces the Available and Enabled booleans with SI or NO; and sorts the products in descending order by Stock.

class data fun id kotlin kotlin-android list

Last synced: 19 May 2026
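The transformation described in the productoskotlin entry above (uppercase the name, relabel the booleans as SI or NO, sort by stock descending) can be sketched as follows. This is an illustrative Python sketch, not the app's Kotlin code; the `Product` class, its field names, and the sample data are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Product:
    # Field names mirror the entry's description; they are assumptions,
    # not the app's actual data model.
    id: int
    name: str
    description: str
    available: bool
    enabled: bool
    stock: int


def yes_no(flag: bool) -> str:
    """Map a boolean to the SI/NO labels mentioned in the description."""
    return "SI" if flag else "NO"


def present(products: list[Product]) -> list[dict]:
    """Uppercase names, relabel booleans, and sort by stock, highest first."""
    ordered = sorted(products, key=lambda p: p.stock, reverse=True)
    return [
        {
            "id": p.id,
            "name": p.name.upper(),
            "description": p.description,
            "available": yes_no(p.available),
            "enabled": yes_no(p.enabled),
            "stock": p.stock,
        }
        for p in ordered
    ]


# Made-up sample data, for illustration only.
sample = [
    Product(1, "lamp", "Desk lamp", True, False, 3),
    Product(2, "chair", "Office chair", False, True, 12),
]
rows = present(sample)
```

Sorting before formatting keeps the numeric `stock` field available as the sort key, so the display labels never affect the ordering.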

https://github.com/stdlib-js/array-base-filled4d-by

Create a filled four-dimensional nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 07 Sep 2025
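The idea behind the array-base-filled4d-by entry above, building a four-dimensional nested array whose values come from a callback, can be sketched in Python. This is not the stdlib-js API; the function name `filled4d_by`, its parameters, and the choice to pass the index tuple to the callback are assumptions made for illustration.

```python
from typing import Any, Callable, Sequence


def filled4d_by(shape: Sequence[int], fn: Callable[[tuple], Any]) -> list:
    """Build a four-level nested list whose elements are fn((i, j, k, l))."""
    if len(shape) != 4:
        raise ValueError("shape must have exactly four dimensions")
    s0, s1, s2, s3 = shape
    return [
        [
            [[fn((i, j, k, l)) for l in range(s3)] for k in range(s2)]
            for j in range(s1)
        ]
        for i in range(s0)
    ]


# Example callback: each element is the sum of its four indices.
grid = filled4d_by((1, 2, 2, 2), lambda idx: sum(idx))
```

Because the callback runs once per element, every innermost value is created independently rather than being a shared reference to one object.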

https://github.com/sarincr/basics-of-julia-programming-language

Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.

data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning

Last synced: 19 May 2026

https://github.com/velocitatem/cellviz

Cellular Automata inspired by live-data visualization, designed to handle multidimensional and high-throughput data efficiently.

cellular-automata conways-game-of-life data economics

Last synced: 29 Jul 2025

https://github.com/ybelenko/openapi-data-mocker-server-middleware

PSR-15 HTTP Server Middleware to create mock responses from OpenAPI Schemas (OAS 3.0).

data fake faker middleware mock mocker oas oas3 openapi psr-15 swagger

Last synced: 15 Jun 2025

https://github.com/fbraza/paris_airbnb

Analysis of Paris AirBnB data using R and Shiny

analysis data data-analysis paris-airbnb r shiny

Last synced: 21 Mar 2025

https://github.com/utkarshverma439/simple-sms-spam-detector

Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.

data data-science data-visualization spam-detection

Last synced: 20 Jun 2025

https://github.com/tezcatlipoca0000/ayudante

Mainly a program for a store to manage its product data

data javascript scraping self-taught web

Last synced: 09 Apr 2025

https://github.com/asuozzo/medicare-data-analysis

An analysis of Medicare Part D data in Vermont

data python

Last synced: 04 May 2026

https://github.com/tezcatlipoca0000/db-helper_sf

A program tailored for my workplace; it analyzes, visualizes, and manipulates a Firebird 2.0 database

data data-visualization fdb firebird jupyter-notebook pandas python3

Last synced: 09 Apr 2025

https://github.com/ibnz36/arrowpipe

Build complex pipelines easily

cargo crate data pipe rust

Last synced: 13 Apr 2025

https://github.com/uhstray-io/just-dashboards

Light and Easy Rust-Fullstack/WASM application to build dashboards from any data source

analytics data dioxus rust visualization

Last synced: 29 Mar 2025

https://github.com/bredalis/seaborn

📊 Library to create graphics 📊

data graphics-programming librery python seaborn seaborn-plots

Last synced: 04 Mar 2025

https://github.com/penspanic/datra

Datra is a comprehensive data management system for game development.

data game game-development gamedata unity unity-package unity3d-plugin

Last synced: 19 May 2026

https://github.com/ornella-gigante/wildlife-data-analysis-toolkit-ml

A data-driven exploration of Canis lupus signatus (Iberian) and Canis lupus labradorius (Labrador) subspecies, leveraging Jupyter Notebook and pandas to analyze weight distributions (25-56 kg), geographic patterns, and reproductive behaviors. Features size-weight correlations and NaN-handling workflows for robust ecological insights

analysis data datasets jupyter-notebook pandas-dataframe python

Last synced: 15 May 2026

https://github.com/dbrennand/rm-content

A Python 3.7 script to remove a specific string from all files and repos (owned by the user).

content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content

Last synced: 29 Mar 2025

https://github.com/georginapuig/graps-from-csv

📊 Data visualization with c3.js and Papaparse from CSV files.

c3 c3js chart d3 d3js data data-visualization graphs javascript javascript-library visualization

Last synced: 19 May 2026

https://github.com/desilinguist/hanukkah-of-data-2022

My solutions to Hanukkah of Data 2022

2022 data hanukkah pandas python

Last synced: 17 May 2026

https://github.com/exponea/exponea-python-sdk

⚠️ DEPRECATED Python SDK for Exponea Data and Tracking API

api data exponea python sdk

Last synced: 09 Apr 2026

https://github.com/real-veersandhu/cia-country-comparison

Data analysis system on the CIA World Factbook

data

Last synced: 25 Feb 2025

https://github.com/sergkash7/fdc-facade

Facade for The FoodData Central API.

api center data food usda

Last synced: 15 May 2026

https://github.com/alireza29675/goudi

GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.

analysis data goudi visualization

Last synced: 11 Jul 2025

https://github.com/xrahul/android-logs

Get logs of various sensors and events in Android 6.0+

android data events logs

Last synced: 20 May 2026

https://github.com/benji-lewis/archivord

An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.

archive data data-mining discord discord-bot typescript

Last synced: 16 May 2026

https://github.com/lunastev/wson-rust

WSON data serialization parser

data parser serialization

Last synced: 07 Apr 2025

https://github.com/nichtich/wikidata-taxonomy-examples

Extract classifications from Wikidata

coli-conc data knowledge-organization wikidata

Last synced: 12 Jul 2025

https://github.com/snitkin-lab-umich/prewas_manuscript_analysis

Manuscript in support of prewas software

data data-visualisation manuscript r

Last synced: 08 Jul 2025

https://github.com/elazar/pycopyql

Exports a subset of data from a relational database.

data database export relational tool utility

Last synced: 16 May 2026

https://github.com/akatrevorjay/helm-nuke

Nukes all helm releases as well as tiller-owned k8s objects that may be left lying around.

all data destroy helm plugin

Last synced: 19 Sep 2025

https://github.com/kevinsames/spark-fuse

spark-fuse is an open-source toolkit for PySpark — providing utilities, connectors, and tools to fuse your data workflows together.

data databricks fabric pyspark python spark

Last synced: 08 May 2026

https://github.com/mikebairdrocks/fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 17 May 2026

https://github.com/fjc0k/vue-merge-data

Intelligently merge data for Vue render functions.

data merge-data render-functions vue

Last synced: 17 May 2026

https://github.com/ushkinaz/cbn-data

Automated game data extraction and processing for Cataclysm: Bright Nights. Provides JSON mirrors, WebP asset conversion, and unified translation data.

cataclysm-bn data wiki

Last synced: 07 Mar 2026

https://github.com/chalk-ai/roadmap

Chalk public roadmap

chalk data data-science mlops pipeline python

Last synced: 17 Jan 2026

https://github.com/concaption/ksa-lawyers-data

Scraped data on KSA lawyers and law firms

data lawyers

Last synced: 03 Apr 2025

https://github.com/stdlib-js/array-base-any-by-right

Test whether at least one element in an array passes a test implemented by a predicate function, while iterating from right to left.

any array data generic javascript node node-js nodejs predicate some stdlib structure test types validate

Last synced: 14 Apr 2025
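The behaviour described in the array-base-any-by-right entry above, testing whether any element passes a predicate while scanning from the last element to the first, can be sketched in Python. This is not the stdlib-js implementation; the name `any_by_right` and the `is_negative` helper are hypothetical, and the `visited` list exists only to make the iteration order visible.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def any_by_right(items: Sequence[T], predicate: Callable[[T], bool]) -> bool:
    """Return True if any element passes predicate, scanning right to left."""
    for index in range(len(items) - 1, -1, -1):
        if predicate(items[index]):
            return True  # stop at the first match found from the right
    return False


visited = []


def is_negative(value: int) -> bool:
    """Record each inspected value, then test whether it is negative."""
    visited.append(value)
    return value < 0


found = any_by_right([1, -2, 3, 4], is_negative)
```

The scan stops at the first match, so elements to the left of that match are never passed to the predicate.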

https://github.com/jonsafari/toy-data

Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks

data language-data machine-translation nlp sanity-checks toy-data

Last synced: 06 Nov 2025

https://github.com/epogrebnyak/business-conditions-digest-2017

Replicate illustration from Business Conditions Digest

data economics

Last synced: 22 Mar 2025

https://github.com/yazeed44/reform-api

A platform that harnesses the power of multiple data streams, including satellite imagery and drone photos, to visualize multiple urban planning indices and provide descriptive analytics that will empower local Saudi authorities to make data-driven decisions that contribute to neighborhood quality of life.

data geojson python

Last synced: 18 May 2026

https://github.com/flownrecords/flightTracker

A mobile app built to record essential flight data for post-flight review and debriefing.

aviation data gps tracking

Last synced: 23 Jun 2025

https://github.com/giscience/measures-rest-sparql

A SPARQL endpoint for the Measures REST OSHDB App framework.

data osm quality semantics sparql sparql-endpoints

Last synced: 24 Jun 2025

https://github.com/coral/ddp

Distributed Display Protocol (DDP) in Go

data ddp distributed golang led pixel protocol wled

Last synced: 26 Jun 2025

https://github.com/millengustavo/salarios-data-science

Streamlit app for exploring data from the Data Science market survey conducted by Data Hackers

brasil brazil ciencia-de-dados data data-science heroku salarios salary

Last synced: 07 Oct 2025

https://github.com/aruneshbasak/python-dsa-problems-geeksforgeeks-160-days

I will upload my daily solutions to Python DSA problems from GeeksforGeeks here!

algorithms-and-data-structures and data data-structures dsa python python3 structure

Last synced: 08 May 2025

https://github.com/qeeqbox/data-security

Safeguarding your personal information (How your info is protected)

data data-security infosecsimplified qeeqbox security

Last synced: 19 Mar 2026

https://github.com/mouneshgouda/learn_dsa

This repository explores fundamental data structures and their implementations. Learn how to organize and manipulate data efficiently for various programming tasks.

c data queue sorting-algorithms stack structured-data

Last synced: 30 Jul 2025

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025
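The parsing step described in the web-scraping-with-python entry above can be sketched without any network access. The entry names Requests and BeautifulSoup. This sketch swaps in the standard library's `html.parser` so that it runs with no third-party packages, and it parses an inline HTML string rather than a fetched page. The `LinkExtractor` class and the sample markup are assumptions made for illustration, not code from the repository.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, text) pairs for every <a> element that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None  # href of the anchor currently being read, if any
        self._text = []    # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


# Inline stand-in for HTML that would normally come from an HTTP response.
html = '<ul><li><a href="/a">First</a></li><li><a href="/b">Second item</a></li></ul>'
parser = LinkExtractor()
parser.feed(html)
links = parser.links
```

In a real scraper the `html` string would be the body of an HTTP response, and the resulting pairs could be written out as CSV rows.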

https://github.com/evoluteur/madeleinology

Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).

baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization

Last synced: 23 Jun 2025

https://github.com/nia-cloud-official/influx

Influx is a powerful search engine application designed to provide access to personal information of individuals from anywhere in the world. With Influx, users can search for and retrieve personal details of people, enabling them to find and connect with individuals across the globe.

data find people-search search-engine

Last synced: 27 Jun 2025

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/giscience/measures-rest-oshdb-docker

Scripts for starting measures for geospatial datasets in a Docker container, using the OSHDB

data dggs docker geospatial mesure openstreetmap rest

Last synced: 18 Apr 2026

https://github.com/glassflow/pipelines-push-action

This GitHub Action lets you automate GlassFlow pipeline deployments as code

data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing

Last synced: 19 May 2026

https://github.com/codenoid/storial.co-database

A Storial.co database, collected by Hofesh Bot (scraper)

data database

Last synced: 28 Mar 2025

https://github.com/redodo/shipper

Hide encrypted data in files.

audio data images python steganography

Last synced: 26 Mar 2025

https://github.com/gappeah/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 25 Feb 2025

https://github.com/gappeah/beverage-sales-analytics

This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.

data data-visualization powerbi powerbi-report powerbi-visuals

Last synced: 25 Feb 2025

https://github.com/oya163/corteva

Corteva Data Ingestion Pipeline

corteva data engineering etl

Last synced: 25 Jul 2025

https://github.com/gappeah/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 11 Jun 2025

https://github.com/hoaihuongbk/lakeops

A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.

data data-operations dataengineering datalake

Last synced: 07 Mar 2026

https://github.com/speakeasy-sdks/fivetran-python-sdk

Python SDK for accessing Fivetran API.

api connector data fivetran fivetran-connector python sdk

Last synced: 01 Jul 2025

https://github.com/hmeleiro/r_dataviz

Data visualization projects with R

data dataviz r rmd-files social-science survey-data

Last synced: 21 Jun 2026