An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu

Reports and full documentation of the introduction to data science course held at SBU

data data-science python shahid-beheshti-university

Last synced: 27 Mar 2025

https://github.com/antoineaugusti/purchasing-power

Archive daily data about purchasing power parity: how much goods should cost in various countries

archive data purchasing-power-parity

Last synced: 28 Oct 2025

https://github.com/manuelblancovalentin/dynamictable

An easy and user-friendly way to display dynamic data in Python using command-line tables

data deep display dynamic friendly gui interface io keras learning loop machine python python3 pytorch table tensorflow user

Last synced: 20 Jan 2026

https://github.com/lukanedimovic/table_editor

A simple table data editor, with easily scalable functions and operations & a nice GUI

data data-science formula java parser parsing preprocessing swing tokenizer

Last synced: 04 Apr 2025

https://github.com/royruddle/vizdataquality

Python package for visualizing data quality

data data-science data-visualization jupyter-notebook missing-data python

Last synced: 05 May 2025

https://github.com/marlenezw/speech-to-text

Turn any video or audio recording into a written transcript using python

data data-science python speech speech-recognition speech-synthesis speech-to-text

Last synced: 27 Apr 2026

https://github.com/purarue/bleanser

my bleanser modules

data

Last synced: 22 Feb 2026

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/wfamous/fiv_update-data

This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Services (AWS), Terraform (Tofu), Python, Bash, Ansible and GitHub Actions to manage data pipelines efficiently.

ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu

Last synced: 17 Feb 2026

https://github.com/shahiakhilesh1304/dsa

This repository is my personal space for practicing Data Structures and Algorithms. I'll be adding questions, solutions, and explanations as I progress, covering topics from basics to advanced.

algorithms code contribution contributions-and-collaboration contributions-welcome data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice hacktober hacktoberfest hacktoberfest-accepted hacktoberfest-starter practice programming python-3

Last synced: 13 Apr 2025

https://github.com/stdlib-js/ndarray-base-dtype2c

Return the C data type associated with a provided data type string.

array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 18 Jun 2025

https://github.com/gabrielu3/ori-inverted-list

Inverted list made for a college course named Organization and Retrieval of Information

c data data-structures index inverted-index list

Last synced: 24 Feb 2026

https://github.com/frnt-end/weather-app-react

:atom_symbol: React project - Fetch and Toggle display of current weather in Berlin, Paris, New York & London (tabs) - using axios for API fetch. Watch DEMO 🌞 https://Frnt-End.github.io/Weather-App-React 👈

api axios axios-react background card current-weather data fetch gh-pages react reactjs tabs toggle ui usestate usestate-hook weather weather-app weather-information weatherapp

Last synced: 18 Feb 2026

https://github.com/seregpie/vuefort

The state management for Vue.

data model state store vue

Last synced: 13 Apr 2026

https://github.com/zarr-developers/cookiecutter-zarr-store

Cookiecutter for Zarr store implementations

chunked data n-dimensional zarr

Last synced: 16 Jun 2025

https://github.com/dhruvldrp9/simpledht

A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.

cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication

Last synced: 28 Feb 2026

https://github.com/dataopstix/modelt

Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.

airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization

Last synced: 28 Mar 2025

https://github.com/longzheng/southeastwater-usage-scraper

Extract hourly water usage data from South East Water portal website for digital water meters

australia data iot playwright southeastwater victoria water

Last synced: 06 Feb 2026

https://github.com/null-none/jwlibrary2json

Handler for jwlibrary to json

data json jwlibrary library parser

Last synced: 14 May 2026

https://github.com/anonympins/data-primals-engine

Manage and automate your data at scale 🚀 With data-primals-engine you get workflows, dashboards, alerts, i18n, client integration & AI assistant — all open-source, all MongoDB powered.

api automation data data-analysis data-engineer data-visualization database expressjs low-code mongodb nodejs rest-api

Last synced: 07 Mar 2026

https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml

Generate synthetic drilling data that can be used for testing machine learning (ML) models.

classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training

Last synced: 08 Apr 2026

https://github.com/smithsonian/massdigi-tools

Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO

data data-validation digital-files digitization digitization-workflows museums

Last synced: 13 Apr 2025

https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn

These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python

analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn

Last synced: 10 Apr 2026

https://github.com/jasondrawdy/compendio

Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.

compendium converters cryptography data extensions generators hashing library security utilities validation windows

Last synced: 18 May 2026

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/rahulraikwar00/advault

Advault is an Aadhaar data vault generation tool

aadhaar data hacktoberfest uidai vault

Last synced: 05 Apr 2025

https://github.com/hmeleiro/alquilermad

Housing rent map of the Comunidad de Madrid

data data-science data-visualization datascience housing-location-visualization rent renting

Last synced: 13 Sep 2025

https://github.com/johntocci/nullaxe

Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.

data data-analysis data-science datacleaning pandas polars python

Last synced: 06 Apr 2026

https://github.com/uttamo/messenger-data-analysis

Analyse Messenger data exports from Facebook

analysis data facebook json messenger pandas parse python

Last synced: 18 Jul 2025

https://github.com/hasnocool/war_thunder_camouflage_scraper

A concurrent web scraper designed to collect camouflage information for War Thunder aircraft.

asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web

Last synced: 04 Jan 2026

https://github.com/mollybeach/cherryether

CherryEther: Typescript Staking Deposits Ethereum Transactions

blockchain data data-science ethereum typescripts

Last synced: 21 May 2026

https://github.com/askaniy/celestialocationsmaker

Tool for making Celestia location files

celestia data geology locations mapping planetary-science space

Last synced: 14 Mar 2025

https://github.com/zgbjgg/quetzal-examples

Examples using Quetzal :rocket: :bird:

analytics dashboard data data-visualization elixir erlang plotly web-app

Last synced: 24 Apr 2026

https://github.com/ryanmorr/typed

Statically typed properties for object literals

data javascript object properties statically-typed

Last synced: 12 Jun 2026

https://github.com/mawburn/across-a-thousand-dead-worlds-data

Across a Thousand Dead Worlds Data

data json ttrpg

Last synced: 21 Apr 2026

https://github.com/ange007/jquery.mydata

jQuery.myData - Small jQuery & Zepto plugin for two-way data binding.

data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs

Last synced: 19 May 2026

https://github.com/instaclustr/cassandra-parquet-transformer

Transform SSTables from Apache Cassandra to Parquet or Avro files, locally or remotely via Apache Cassandra Sidecar

analytics apache apache-cassandra avro big cassandra data parquet spark sstable transformation

Last synced: 29 Aug 2025

https://github.com/caelean/twittermap

Map of Twitter users' influence as defined by influencetracker

data google-maps maps sparql twitter visualization

Last synced: 14 Jun 2025

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.🔥

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/weisscharlesj/data_scicompforchem

Zipped data for SciCompforChem book for easy download

chemistry chemistry-education data data-visualization python

Last synced: 07 Nov 2025

https://github.com/emrecpp/datapacket-cpp

Send, receive, encrypt, decrypt, and compress data as a Packet and send it over a socket, for C++.

compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage

Last synced: 28 Mar 2025

https://github.com/kocyigitkim/realtime.io

Real time data streaming & socket programming library

data realtime socket streaming

Last synced: 29 Jul 2025

https://github.com/windwalker-io/data

[READ ONLY] A library containing data/collection objects with the null-object pattern.

collection collections data data-object iterator nullobject value-object

Last synced: 12 Mar 2026

https://github.com/tomwhite/chernoff

A visual mood indicator. One of the first Java programs I ever wrote.

chernoff-faces data visualization

Last synced: 20 Apr 2026

https://github.com/andrey-tech/data-storage-php

A simple key-value data store in JSON files, with a shared lock for reading and an exclusive lock for writing.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026

https://github.com/kshitij1235/boxdb

A database management library for Python that works like any other library and is very lightweight: no additional setup is required, and the procedure for creating a project is very easy.

boxdb data database library python

Last synced: 14 Jan 2026

https://github.com/vikashpr/18cse301j_ra2011003010737

This website tells the story of a nation's GDP through data visualization, providing insights on global GDP, state-wise GDP, sector-wise GDP, and the vision for India's economy. It includes data sets and sources for further reference.

css3 d3-visualization d3js data data-vizualisation gephi-visualizations html5 indian-economy indian-gdp information-visualization js python-word-cloud python3 storytelling tableau tableau-public threejs wordcloud-visualization

Last synced: 03 May 2026

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/purarue/listenbrainz_export

Export your scrobbling history from ListenBrainz

data data-export music scrobbling

Last synced: 24 Jan 2026

https://github.com/joaocarmo/react-very-simple-data-table

When all you want is a table

data react simple table

Last synced: 06 Mar 2025

https://github.com/andreaselia/quotes-xd

A plugin for Adobe XD to insert a text element with a random quote and respective author.

adobe adobe-xd data design design-tool design-tools quote random xd

Last synced: 24 Apr 2026

https://github.com/hmeleiro/opencis

R package to import data from the Spanish Sociological Research Center (CIS)

abiertos api centro cis data datos estudios open r sociologicos

Last synced: 31 Jul 2025

https://github.com/muhammadibrahim313/start-your-data-science-journey

In this repo I will be sharing all the resources we will be learning from during the December Data Science Workshops on iCode Guru

btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python

Last synced: 03 Feb 2026

https://github.com/simoneas02/data-science

🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻

data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql

Last synced: 12 Apr 2026

https://github.com/StudyResearchProjects/arrbuffstr

Creates strings from ArrayBuffers and vice versa in Node.js and the browser

arraybuffer browser data node string transform

Last synced: 09 Oct 2025

https://github.com/wonderium/browser-releases

This repository contains release dates for browser versions.

browsers data json releases wonderium

Last synced: 31 Jan 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/priyanka7411/customer-segmentation-churn-dashboard

📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.

churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization

Last synced: 14 Apr 2026

https://github.com/felipesousa/usa-cities-api

A JSON with all USA cities/states.

cities data json nodejs usa

Last synced: 03 May 2026

https://github.com/bradlindblad/quotableoffice

Repo for the quotable office R Shiny app

data datascience golem-apps r shiny shiny-apps text text-mining

Last synced: 26 May 2026

https://github.com/vyahello/fake-employee-api

👨‍🔧 Simple mock employees data parser (responder + heroku + pytest + github/travis CI)

data employee employer mock responder rest-api

Last synced: 09 Jun 2026

https://github.com/georgetdn/syscppcp

Store C++ class data in a file (persistence) and manipulate it programmatically or using Small SQL (included)

class data framework object persistence serialize sql windows

Last synced: 04 Apr 2025

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

We aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/dantesc03/uberpool-case-study

This project was designed to understand the statistical effects of longer wait times on Uber rides, particularly on the user and driver experience with the Uber Pool system.

analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization

Last synced: 16 Apr 2026

https://github.com/Nazaniiin/EDA_QualityofRedWine

:wine_glass: :chart_with_upwards_trend: (EDA) R - Visualization / Performed exploratory analysis and visualization on the Red Wine Quality dataset, mainly answering which chemical properties influence the quality of red wines.

charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization

Last synced: 30 Jul 2025

https://github.com/drkenreid/introductory-data-science

Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.

cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials

Last synced: 28 Jan 2026

https://github.com/mlr-org/mlr3data

Data sets used in the book, gallery, or in examples of mlr3.

data data-science data-sets machine-learning mlr3 r r-package

Last synced: 09 Apr 2025

https://github.com/Duartemartins/dados

Portuguese election results by parish (freguesia)

data elections open-data portugal

Last synced: 20 Nov 2025

https://github.com/xtrendence/comp2001-coursework

Grade: 98%. COMP2001 Coursework by Khodadad (Adrian) Nouchin. A RESTful authentication API, and a linked data application.

api asp-net csharp data dataset linked-data php restful restful-api

Last synced: 13 Apr 2026

https://github.com/steelcake/cherry-pipelines

A collection of pipelines built with cherry

blockchain clickhouse data pipeline pyhton

Last synced: 09 Mar 2026

https://github.com/mrnazu/eth-data-library

eth-data-library is a Node.js library that provides tools for accessing and processing data on the Ethereum blockchain.

blockchain data ethereum nodejs smart-contracts web3

Last synced: 28 Jan 2026

https://github.com/squareslab/probabilisticmodel_saner2018

Paper and supporting materials for the Probabilistic Model paper accepted to SANER 2018

code data mausotog published replication

Last synced: 26 Oct 2025

https://github.com/stefen-taime/open-source-data

This repository contains structured datasets in various categories

csv data json python3 xml

Last synced: 19 Feb 2026

https://github.com/oliver021/entity-dock

A superset of libraries, components, tools and more for working with entities on .NET

api asp-net-core controller data database dotnet entity entity-framework-core library model mvc netstandard orm support webapi

Last synced: 09 May 2026

https://github.com/yash22222/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3

Last synced: 09 Apr 2026

https://github.com/ayoub-amzil/offline-globe

Offline country data for PHP Laravel framework. Over 200 countries, capitals, flags, languages, currencies. No internet needed.

composer data internet laravel offline php

Last synced: 09 May 2026

https://github.com/mrsaeeddev/data-science-roadmap-for-beginners

📈 A minimal and easy road map for beginners who want to dive into the field of Data Science

data data-science datascience python

Last synced: 29 Jun 2025

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/datafold/vhol-demo

Get hands-on examples of dbt + Datafold CI/CD workflows

data data-engineering datafold dbt diff

Last synced: 28 Dec 2025

https://github.com/ymougenel/referencecollector

Helps you gather, store and share reference links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/subnwa/sql

Starter code for creating a SQL database.

base data database master microsoft sql

Last synced: 06 Mar 2025

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 15 May 2025

https://github.com/rdmpage/checklist-of-the-freshwater-snails-of-sabah

Data from A preliminary checklist of the freshwater snails of Sabah (Malaysian Borneo) deposited in the BORNEENSIS collection, Universiti Malaysia Sabah https://doi.org/10.3897/zookeys.673.12544

checklist data gbif google-earth kmz sabah

Last synced: 09 Mar 2026

https://github.com/clinical-genomics/housekeeper

File data orchestrator

data file orchestrator

Last synced: 15 Aug 2025