An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/tobinchilongo/oop-school-library

This project consists of a Ruby script for a school library app. I implemented encapsulation and inheritance in Ruby by creating classes to represent students and teachers in the school.

data database gemfile input-output preserve rspec-testing rubocop unit-test

Last synced: 02 May 2026

https://github.com/kingsley-ezenwaka/app-profile-data-analysis

A Python data analysis project that aims to propose an app profile based on an analysis of the Google Play Store dataset.

analysis data jupyter-notebook matplotlib pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/nitsc/spell-from-threebodytrilogy

Implements the process of extrapolating from Gaia stellar data to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.

3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization

Last synced: 02 May 2026

https://github.com/tupizz/data-processing-pipeline-aws

This project is a serverless application built with the Serverless Framework, TypeScript, and AWS services. It provides an enrichment service that processes contact information and enriches it with additional data.

aws data pipeline serverless typescript

Last synced: 13 May 2026

https://github.com/tbrowder/classfactory

Provides tools to create a data collection with classes to manipulate the persistent data.

class data persistent raku

Last synced: 04 Apr 2025

https://github.com/jonsafari/toy-data

Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks

data language-data machine-translation nlp sanity-checks toy-data

Last synced: 06 Nov 2025

https://github.com/epogrebnyak/business-conditions-digest-2017

Replicates an illustration from Business Conditions Digest

data economics

Last synced: 22 Mar 2025

https://github.com/aruneshbasak/python-dsa-problems-geeksforgeeks-160-days

I will upload the Python DSA problems I solve daily on GeeksforGeeks here!

algorithms-and-data-structures and data data-structures dsa python python3 structure

Last synced: 08 May 2025

https://github.com/qeeqbox/data-security

Safeguarding your personal information (How your info is protected)

data data-security infosecsimplified qeeqbox security

Last synced: 19 Mar 2026

https://github.com/kerlossony/nested-formdata

Nested-FormData is a function designed to handle nested form data structures in a simplified and efficient way. It helps manage complex form data, making it easier to work with forms that require hierarchical data.

data forms javascript nested-structures nextjs reactjs typescript

Last synced: 08 Mar 2026

https://github.com/sixarm/sixarm_ruby_fab

SixArm.com → Ruby → Fab gem to fabricate sample data for testing

data fabrication factory fake gem mock ruby

Last synced: 24 Jul 2025

https://github.com/oya163/corteva

Corteva Data Ingestion Pipeline

corteva data engineering etl

Last synced: 25 Jul 2025

https://github.com/shysolocup/stews

Stews is a Node.js package meant to make storing data easier by mixing parts from common data types.

aepl array arrays data datatypes html javascript js json map maps nodejs object objects package set sets stews

Last synced: 25 Jul 2025

https://github.com/stonecharioteer/renfield

Synchronize and Search through Hard Drives

catalogue data search storage synchronization

Last synced: 09 Feb 2026

https://github.com/patelabhi574/hotel_reservation_analysis

Analyzes data collected by a hotel to predict for the owner which segments are the most profitable, and to identify the patterns and trends seen over past years in bookings at different times of the year and in peak-time price setting on the website according to the availability index.

data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server

Last synced: 19 Feb 2026

https://github.com/public-health-scotland/waiting_times_clinical_prioritisation

This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.

data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time

Last synced: 26 Jul 2025

https://github.com/incubrain/awesome-maharashtra-data

A collection of datasets specific to Maharashtra, India. WIP

ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi

Last synced: 23 May 2026

https://github.com/akatrevorjay/helm-nuke

Nukes all helm releases as well as tiller-owned k8s objects that may be left lying around.

all data destroy helm plugin

Last synced: 19 Sep 2025

https://github.com/discindo/natochak

Analysis of bicycle accidents in Macedonia using Rmarkdown and ggplot2

cycling data macedonia

Last synced: 19 Feb 2026

https://github.com/ayushverma135/accenture-data-analytics-and-visualization

This program provided practical experience in advising a hypothetical social media client as a Data Analyst at Accenture. The simulation involved cleaning, modeling, and analyzing multiple datasets, culminating in the creation of a PowerPoint deck and video presentation to communicate key insights.

accenture analytics data data-visualization forage presentation

Last synced: 19 Sep 2025

https://github.com/akhi07rx/f1-statistics-dashboard

A comprehensive command-line tool for analyzing Formula 1 race data using the FastF1 library.

akhi07rx cli cli-tools data f1 f1-score f1cli f1dashboard f1stats fastf1 formula1 opensource race race-analytics

Last synced: 23 May 2026

https://github.com/velocitatem/cellviz

Cellular Automata inspired by live-data visualization, designed to handle multidimensional and high-throughput data efficiently.

cellular-automata conways-game-of-life data economics

Last synced: 29 Jul 2025

https://github.com/lakecountryhuntclub/dnr-map-data-model

Data Model for the 2023 DNR Pheasant Stocking Property Data

data data-model documentation excel gis hunting mapping powerquery vba

Last synced: 29 Jul 2025

https://github.com/vtalks/youtube_data_api3

A Python 3 library to interact with the YouTube Data API.

api client data library python python3 youtube

Last synced: 09 Apr 2026

https://github.com/asuozzo/medicare-data-analysis

An analysis of Medicare Part D data in Vermont

data python

Last synced: 04 May 2026

https://github.com/flowsynx/plugin-postgresql

FlowSynx plugin that interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.

data database flowsynx postgresql postgresql-database sql

Last synced: 09 May 2026

https://github.com/ajsalemo/python-pandas-datalib

Testing and experimenting with some simple Pandas functionality using Flask to serve the parsed data.

csv data flask json pandas pandas-dataframe pandas-series python tabular tabular-data terminal

Last synced: 09 Apr 2026

https://github.com/v6ntage/sql-sales_data-analytics-project

This repository contains SQL scripts demonstrating analytical techniques.

analytics business-analytics data data-analysis database query sql sql-server

Last synced: 12 Apr 2026

https://github.com/theryston/db-mycro

A Node module with a JSON database that saves data in a specific directory, similar to SQLite, but in JSON

base crud data database db db-mycro javascript json jsondatabase nodejs nosql typescript

Last synced: 09 Apr 2026

https://github.com/ddeutils/ddedocs

📖 Data Developer & Engineer Documents and Hands-On

blogs data data-engineering documents hands-on

Last synced: 08 Aug 2025

https://github.com/rubenhortas/python_examples

Examples of Python code and DSA (data structures and algorithms).

algorithm algorithms data dsa examples python python-3 python3 samples snippets structures

Last synced: 03 Oct 2025

https://github.com/vikjam/ui-policy

Unemployment policy at the state level

data government government-data

Last synced: 13 Feb 2026

https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml

Final project of the Data Girls 2025 Bootcamp, analyzing employee turnover using Machine Learning. Based on the IBM HR Analytics Attrition dataset, the project identifies the main risk factors and builds predictive models (SVC and Random Forest) with up to 89% accuracy to anticipate departures and support strategic HR decisions.

analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc

Last synced: 16 Apr 2026

https://github.com/frefrik/covid19norge-api

API for COVID-19 cases in Norway

api covid covid-19 covid19 data fastapi norge norway

Last synced: 10 May 2026

https://github.com/pradeep221b/turbofan_predictive_maintenance

An R project for predicting turbofan engine RUL using {targets} and {tidymodels}.

data data-science-portfolio machine-learning nasa preditive-maintaince r rstats targets-pipeline tidymodels

Last synced: 04 Oct 2025

https://github.com/garcane/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 08 Apr 2026

https://github.com/giorgiosavastano/process

processing-chain provides a convenient way to seamlessly set up processing chains for large amounts of data.

big-data data data-science parallel parallel-computing process processing processing-chain rust

Last synced: 05 Oct 2025

https://github.com/gusenov/qazaqstan-geography-data

:world_map: Geographic data of Kazakhstan.

data geographic-data geography json kazakhstan qazaqstan regions

Last synced: 20 Feb 2026

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 10 Apr 2026

https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent

Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.

agentic-ai analysis data deepseek langchain nba python streamlit visualization

Last synced: 08 May 2026

https://github.com/sstendahl/giscan

Simple tool to read and analyze existing GISAXS data

cbf data diffraction diffraction-analysis gisans gisaxs physics reflectivity scattering xray

Last synced: 11 Nov 2025

https://github.com/jessielw/parse-fel-master-data

Simple CLI to parse Dolby Vision master data via the RPU/MediaInfo and output data needed for x265

data dolby fel master mediainfo mi parse rpu vision

Last synced: 26 Aug 2025

https://github.com/xdrokra/road-accident-analytics

A data visualization project that maps and analyzes road accidents across major Italian municipalities in 2023

analytics data design italy javascript

Last synced: 30 Aug 2025

https://github.com/stdlib-js/array-base-last

Return the last element of an array-like object.

array data generic javascript last node node-js nodejs stdlib structure types

Last synced: 30 Aug 2025

https://github.com/ukplab/pragtag2023

Code and data for the PragTag-2023 Shared Task

argument-mining data peer-review pragmatics shared-task

Last synced: 18 Jun 2025

https://github.com/n4ze3m/timezone-json

JSON file with the timezones of more than 1642 cities in UTC format.

data json timeszone

Last synced: 19 Jul 2025

https://github.com/marcelo-earth/h5n8-data

🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.

csv data h5n8 h5n8-cases h5n8-virus russia

Last synced: 19 Jan 2026

https://github.com/jackokring/www

Generic www flask server with phinka module

compression data flask phinka python

Last synced: 16 Jan 2026

https://github.com/ngambip/priscilla

About my work and experience

accounting analytics data finance-management

Last synced: 03 Feb 2026

https://github.com/gorhkdwj/da_portfolio

Kim Jae Chun's DA_Portfolio

data data-analysis python sql

Last synced: 20 Feb 2026

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/desmondsanctity/abeona-kafka

A demo showing how to integrate Upstash's serverless Kafka into a Node.js microservice. Presented at Berlin Buzzwords 2024.

berlin-buzzwords data event-driven kafka microservice serverless streaming upstash-kafka

Last synced: 15 May 2025

https://github.com/francescodisalesgithub/data-for-developers

Simple SQL database with problems and solutions found on Stack Overflow, in documentation, or via ChatGPT

chatgpt data database developer hacker hacking knowledge solutions sql targets

Last synced: 22 Mar 2025

https://github.com/stdlib-js/array-base-to-accessor-array

Convert an array-like object to a minimal array-like object supporting the accessor protocol.

accessor accessors array array-like convert data javascript node node-js nodejs object protocol stdlib structure types wrap wrapper

Last synced: 04 Jan 2026

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 10 May 2026

https://github.com/stdlib-js/array-one-to

Generate a linearly spaced numeric array whose elements increment by 1 starting from one.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 26 Feb 2026

https://github.com/neelravi/data-management

A data management plan for computational chemists/physicists and material scientists for a FAIR storage of raw data

data dmp fair management workflows

Last synced: 16 Jan 2026

https://github.com/milandjurdjevic/discriminalizer

.NET library designed for seamless JSON deserialization of objects with complex discrimination requirements, built on top of System.Text.Json.

data deserialization dotnet json

Last synced: 15 Apr 2025

https://github.com/codenoid/webtoons.com-database

A Webtoons.com database, collected by Hofesh Bot (scraper)

data database

Last synced: 28 Mar 2025

https://github.com/castdrian/kdapi

A TypeScript library that scrapes K-pop idol and group information from online sources to create comprehensive JSON datasets.

api data kpop scraper typescript

Last synced: 15 May 2025

https://github.com/stdlib-js/datasets-herndon-venus-semidiameters

Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.

astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus

Last synced: 09 Oct 2025

https://github.com/iguptashubham/walmart-eda

Imagine diving into the fascinating world of Walmart with just a few lines of code! This project lets you do that using MySQL, a powerful tool for data analysts. You can clean up messy data like a detective, uncovering hidden patterns and trends. Data scientists can take it further,.

analysis data dataset eda mysql portfolio-project python sql

Last synced: 10 Apr 2026

https://github.com/nouman6093/advanced-statistical-models

In this repository I will upload everything I have learned about advanced statistical models in data science. There are over 42 statistical models, each of which works on algorithms, and there are over 32 algorithms. Each library has its own way of writing such statistical models. After learning, I will try to upload as many statistical models as possible.

data data-analysis data-science data-visualization

Last synced: 11 Jun 2026

https://github.com/ilejuxepwaduzd/structured-data-extractor

🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.

cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data

Last synced: 10 Apr 2026

https://github.com/aranfononi/h4x0r-news-section-17-project

A SwiftUI-powered app that displays top stories from Hacker News. Users can open articles directly within the app, utilizing SwiftUI’s NavigationLink and custom WebView integration.

app-development data data-binding data-binding-library ios swift swiftui xcode

Last synced: 18 May 2026

https://github.com/alja7dali/swift-bits

A bite-sized library for dealing with bytes.

binary bit bits byte bytes comprehension data manipulation swift

Last synced: 09 Jun 2026

https://github.com/qeeqbox/data-states

Data states refer to structured and unstructured data divided into three categories (At Rest, In Use, and In Transit)

data data-state infosecsimplified qeeqbox

Last synced: 10 Mar 2026

https://github.com/makepath/medaprep

medaprep is a data preparation and feature engineering toolkit for geospatial applications.

data data-science datacleaning eda exploratory-data-analysis xarray

Last synced: 29 Jun 2025

https://github.com/stdlib-js/strided-base-dtype-str2enum

Return the enumeration constant associated with a strided array data type string.

array data dtype dtypes enum javascript multidimensional node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 30 Apr 2025

https://github.com/exoticknight/juhe

simple way to analyze complex data in one chain call

aggregation aggregator analysis data statistic typescript

Last synced: 21 May 2026

https://github.com/rremple/intervalidus

For all your interval-based data needs.

data intervals

Last synced: 21 Feb 2026

https://github.com/bilalmehrban/data-log-monitor

A simple yet elegant desktop C# application based on a 3-tier architecture, designed for viewing logs stored in the database by NLog or other logging frameworks.

csharp data desktop-app logging

Last synced: 14 Mar 2025

https://github.com/jayantur13/kountry

Node module variant of the Country API

api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn

Last synced: 26 Jan 2026

https://github.com/mews-labs/dataframe-memory

This tool aims to provide a simple solution for saving memory when using pandas DataFrames.

data data-science memory-usage pandas-dataframe python3

Last synced: 22 May 2026

https://github.com/themost-framework/memory

MOST Web Framework in-memory data adapter for testing environments

adapter data orm

Last synced: 06 Mar 2025

https://github.com/brianali-codes/github-searcher

A website for API experimentation that uses the GitHub API to search for different users and some of their (public) information

api data github user

Last synced: 21 May 2026

https://github.com/connectomicslab/cmtklib-data

Datalad dataset that stores all data resources of the cmtklib module of Connectome Mapper 3 (https://github.com/connectomicslab/connectomemapper3).

brain data parcellation resources software

Last synced: 16 Jan 2026

https://github.com/mindawei/alimusic-predict

Code for the Ali Music Popularity Trend Prediction Competition (including the preliminary and second rounds)

data java predict pyhton tianchi

Last synced: 22 Mar 2025

https://github.com/coqui123/tradegpt

TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FastAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹

analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript

Last synced: 11 Apr 2026

https://github.com/nikoshet/rust-dms-cdc-operator

The rust-dms-cdc-operator is a Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios.

aws cdc data dms parquet pgdatadiff polars postgres rds rust s3 validation

Last synced: 18 Jan 2026

https://github.com/patrickdavies100/datapipeline37

Some Data Science practice using datasets available online. Currently test data is similar to this dataset: https://www.kaggle.com/datasets/asaniczka/amazon-uk-products-dataset-2023 but the plan is to expand.

data data-science pandas-dataframe python3

Last synced: 08 Oct 2025

https://github.com/pharo-ai/data-imputers

This project contains transformers for missing value imputation

ai data data-science imputer pharo pharo-smalltalk smalltalk

Last synced: 18 Jan 2026

https://github.com/liyakhathshaik/datascout.jl

This is a Julia package

data datascout julia

Last synced: 09 Oct 2025

https://github.com/scienxlab/datasets

Some small datasets for demos, courses, testing, etc.

data open-data sample-data teaching-resources

Last synced: 09 Oct 2025

https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer

Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.

analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob

Last synced: 17 Apr 2026