An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026

https://github.com/jeanmanguy/milk-sci-fi

Census of every mention of milk in sci-fi works.

data milk sci-fi

Last synced: 26 Feb 2026

https://github.com/3squared/smoulder

Smoulder is a really good data pipe

composition data facade-pattern forge-framework object-oriented

Last synced: 25 Apr 2026

https://github.com/danielbello7/nosql-json-database

Simple and quick database to help development process and speed

data database json json-database models nosql nosql-database nosql-json-database schema

Last synced: 09 May 2026

https://github.com/jhpoelen/rats

self-replicating data publication related to rat (Rattus sp.) specimen.

biodiversity data natural-history-collections provenance

Last synced: 18 Mar 2026

https://github.com/lmuffato/project-mongodb-dataflights-trybe

Projeto MongoDB Dataflights - Projeto avaliativo da Trybe do Bloco 23: Introdução ao MongoDB

back-end crud data database filter mongo mongodb query trybe-projects

Last synced: 16 Apr 2026

https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis

Compltete AnalysisData Cleaning,processing and data analysis with interactive dashboard

analysis data data-visualization datacleaning excel excel-vba

Last synced: 19 Mar 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/m0nica/datalogues-outdated

Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll

data pelican pelican-blog pelican-theme

Last synced: 28 Feb 2026

https://github.com/ismail-mouyahada/lodscroljs-library

LodScrolJS Documentation LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks

data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll

Last synced: 13 Feb 2026

https://github.com/stdlib-js/array-base-every-by-right

Test whether all elements in an array pass a test implemented by a predicate function, iterating from right to left.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 13 Feb 2026

https://github.com/saisriramkamineni/e-commerce-sales-analysis-excel-

Conducted an in-depth sales analysis for an e-commerce platform, leveraging Excel for data preprocessing and Power BI for visualization. Identified key sales trends, customer purchasing behavior, and revenue growth patterns to optimize business performance.

analysis analytics data excel sales

Last synced: 14 Feb 2026

https://github.com/blacksujit/shikshamitra

Shiksha Mitra is an innovative MVP designed to reshape the way students learn through gamification. Our platform transforms the traditional approach to education by making learning engaging, interactive, and rewarding. As an MVP, Shiksha Mitra focuses on delivering core features that showcase the value of gamified learning,

ai data gamified-learning hackathon lms ml mlflow mlops mlops-workflow mvp pipeline platforn

Last synced: 28 Feb 2026

https://github.com/garcane/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 19 Mar 2026

https://github.com/stdlib-js/datasets-harrison-boston-house-prices-corrected

A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).

boston data dataset datasets house housing javascript linear-regression node node-js nodejs prediction prices statistics stats stdlib value

Last synced: 15 Feb 2026

https://github.com/neomutt/sample-data

📚 Lists of things. Useful for developing and testing.

data list sample

Last synced: 19 Mar 2026

https://github.com/m-rishab/stock_trend-analysis-power-bi-project-

In this project, I've harnessed the robust capabilities of Power BI to analyse, visualize, and uncover the story behind HUL's stock performance.

data datavisualization datavisualization-project powerbi

Last synced: 19 Mar 2026

https://github.com/linx-software/file-import-to-rest-api

Import a CSV file and make the data available via a REST API.

csv data linx low-code

Last synced: 19 Mar 2026

https://github.com/mohamedhany99/human-voice-identifier-counter

the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)

android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python

Last synced: 27 Mar 2026

https://github.com/droduit/grand-comics-database

EPFL course project to manage a huge database containing hundreds of millions data, and optimize the queries to create a smooth experience on user interface.

big-data data database epfl sql

Last synced: 16 Apr 2026

https://github.com/stdlib-js/array-base-every-by

Test whether all elements in an array pass a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 03 Mar 2026

https://github.com/ismailarilik/react-covid-maps

A global maps application aims to display COVID-19 statistics by countries, written with React

covid-19 data global maps react statistics

Last synced: 16 Apr 2026

https://github.com/denisecase/datakit-lite

Helpful utilities for Python data projects

analysis data education kit lite utils

Last synced: 04 Mar 2026

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 05 Mar 2026

https://github.com/chompfoods/stub-go-server

Go server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go-server go-swagger grocery ingredients nutrition raw recipe-api recipes

Last synced: 17 Apr 2026

https://github.com/rtmigo/pickledir_py

File-based key-value storage. Serializes keys and values with pickle

cache caching data directory file linux macos package pickle python windows

Last synced: 17 Apr 2026

https://github.com/nitrosh/nitro-validate

A powerful, standalone, dependency-free data validation library for Python with extensible rules and a clean, intuitive API.

data python3 validation validation-library

Last synced: 17 Apr 2026

https://github.com/sogful/archive

you will NEVER believe what this repository contains

archive crawl data scrapes sites

Last synced: 03 Jun 2026

https://github.com/izam-mohammed/data-source

🌐 A source directory for the data of my projects and experiments.📂 This curated collection simplifies access to diverse data that used in various projects💡

csv-files data data-source zip-files

Last synced: 03 Jun 2026

https://github.com/timmymatten/spikeball-stat-tracker

Spikeball stat tracking web app built with Streamlit and Python, designed to easily log and analyze player performance over multiple games.

data data-analysis data-visualization dataset matplotlib-pyplot multipage python spikeball statistics streamlit

Last synced: 18 Apr 2026

https://github.com/aiwithqasim/recommendationengines

Recommendations Engines with IBM a project of DataScientist Nanodegree on Udacity. For this project i will analyze the interactions that users have with articles on the IBM Watson Studio platform, and make recommendations to them about new articles you think they will like.

data data-manging data-science ibm ipython-notebook normalization python3

Last synced: 18 Apr 2026

https://github.com/adriweb/wsualizer

Some random code to visualize things coming from a websocket (pronounced 'visualizer')

bootstrap data html jquery real-time visualization visualizer websockets

Last synced: 20 Apr 2026

https://github.com/dataspoclab/dataspoc-lens

Virtual warehouse — SQL + Jupyter + AI over cloud Parquet via DuckDB

cli data data-engineering data-lake duckdb etl parquet python singer sql

Last synced: 20 Apr 2026

https://github.com/sinedied/htf-data

CLI tool to process Hadra Trance Festival database export into valid data for the app

cleaner cli data database hadra tool

Last synced: 20 Apr 2026

https://github.com/mrpudn/maltrends

(mirror) MyAnimeList.net manga and anime trend data.

anime data json jsonl jsonlines manga myanimelist

Last synced: 20 Apr 2026

https://github.com/cicerotcv/br-gen

A browser extension for generating Brazilian placeholder data.

chrome data extension generation hacktoberfest

Last synced: 21 Apr 2026

https://github.com/aravind-selvam/bikeshare-company-analysis

Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company

analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server

Last synced: 22 Apr 2026

https://github.com/stefen-taime/myubereats_datapipeline

Building a Modern Uber Eats Data Pipeline

airflow api data datawarehouse mongodb pipeline powerbi snowflake

Last synced: 22 Apr 2026

https://github.com/tkonopka/makealive

Dynamic web content through controlled javascript

conversion-functions d3 data data-science javascript visualization

Last synced: 22 Apr 2026

https://github.com/ofelipelucca/cdc-kafka-debezium-pipeline

A real-time event-driven social network API built with CDC (Change Data Capture), Kafka, Debezium, PostgreSQL and MongoDB implementing CQRS-style architecture with streaming data pipelines.

cdc data data-engineering data-integration data-pipeline debezium event-driven fastapi kafka kafka-connect microservices mongodb postgresql python sqlalchemy

Last synced: 05 Jun 2026

https://github.com/sebastianbrzustowicz/collision-detection-ai

Python + TensorFlow. Repository for training a machine learning model for collision detection with an accelerometer sensor data and TensorFlow.

accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow

Last synced: 24 Apr 2026

https://github.com/yord/klp-core

A plugin with basic operations for klp (Kelpie), the small, fast, and magical command-line data processor.

csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv

Last synced: 24 Apr 2026

https://github.com/chriseaton/sample-database

A long-term supported sample dataset for file and database unit testing and validation. Simple, straight-forward, raw data shared across formats.

data database examples flat-file samples schema unit-testing

Last synced: 25 Apr 2026

https://github.com/zalweny26/open_data_unipa

Progetto per l'esame di Laboratorio di Algoritmi 23-24, UniPa, Informatica L-31

data open project python

Last synced: 26 Apr 2026

https://github.com/stdlib-js/array-base-slice

Return a shallow copy of a portion of an array.

array copy data generic javascript node node-js nodejs shallow slice stdlib structure types

Last synced: 27 Apr 2026

https://github.com/aero-db/airports

A public and free dataset of all airports in the world

airports aviation csv data dataset json

Last synced: 27 Apr 2026

https://github.com/sap-samples/security-research-codegraphsmote

Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.

augmentation data detection learning machine research sample security vulnerability

Last synced: 07 Jun 2026

https://github.com/aidenellis/connectmp

🍰 ConnectMP - An easy way to share data between Processes in Python.

aidenellis connectmp data data-sharing multiprocessing process sharing

Last synced: 27 Apr 2026

https://github.com/karthikmprakash/github_repos_scraper

A tool to extract names of github repos of any user

automation bs4 data github python repositories requests webscraping

Last synced: 27 Apr 2026

https://github.com/nightroman/farnet.fsharp.data

FSharp.Data package for FarNet.FSharpFar

data farmanager farnet fsharp

Last synced: 27 Apr 2026

https://github.com/jtpio/data-playground

Experiments using public APIs and data

data experiments python

Last synced: 28 Apr 2026

https://github.com/saulojoab/crato-ce-json

Nesse repositório irei armazenar todos os bairros (e mais informações, no futuro) de Crato-CE em JSON.

data database geolocation json json-api localization

Last synced: 28 Apr 2026

https://github.com/ahmetcansolak/developer-insights

New project of ClubRockers from Sarıyer Hills

bitbucket data data-science data-visualization github python3

Last synced: 28 Apr 2026

https://github.com/rdjarbeng/rdjarbeng

Richard Djarbeng's github profile-computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to website below

data jekyll machine-learning ruby website

Last synced: 28 Apr 2026

https://github.com/jackosheadev/databasetechproject

This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction

data database mssql sql

Last synced: 18 May 2026

https://github.com/player29879/sketch

AI code-writing assistant that understands data content

ai codex data dataframe dats-science df ds gpt3 pandas python sketchs

Last synced: 28 Apr 2026

https://github.com/the-aerospace-corporation/pivt

PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.

analytics dashboards data devops jenkins pipeline python splunk visualization

Last synced: 29 Apr 2026

https://github.com/yord/klp-json

A JSON plugin for klp (Kelpie), the small, fast, and magical command-line data processor.

csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv

Last synced: 29 Apr 2026

https://github.com/aidanjuma/ankideckextractor

A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.

anki apkg cli data decompression extraction flashcards learning python zip

Last synced: 29 Apr 2026

https://github.com/sodascience/open_supply_hub

Processing supply chain data obtained from Open Supply Hub

data global-supply-chain open-supply-hub python

Last synced: 29 Apr 2026

https://github.com/iamjuniorb/data_structures_and_algorithms

I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.

data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3

Last synced: 08 Jun 2026

https://github.com/wu-rymd/pyobjectify

Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects

data objects python3

Last synced: 08 Jun 2026

https://github.com/chompfoods/stub-asp-net-core

ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server

Last synced: 30 Apr 2026

https://github.com/chompfoods/sdk-php

PHP SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients php raw recipe-api recipes sdk

Last synced: 30 Apr 2026

https://github.com/alrza2003/alrza2003.github.io

This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.

data data-analysis data-visualization portfolio portfolio-website python

Last synced: 30 Apr 2026

https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico

Software em Python para varrer arquivos PDF e extrair parâmetros diretamente para arquivo Excel

analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3

Last synced: 01 May 2026

https://github.com/gdhhgnbnvbn/f1-2025-ai-predict

fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.

agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe

Last synced: 01 May 2026

https://github.com/henrylin03/china-gdp

Analysis and visualisation of China GDP data using Python.

data data-analysis data-visualisation dataset kaggle pandas

Last synced: 01 May 2026

https://github.com/windomz/gitdate

git commit date trick

data git git-commit trick

Last synced: 02 May 2026

https://github.com/liuliqiang/laueagle

YAML/JSON Lints and Converters

converter data formater json linter python serialization yaml

Last synced: 02 May 2026

https://github.com/rbruinier/mysqlbulkimportbenchmark

Benchmarking some methods to import big data sets into mysql tables

benchmark data database mysql php

Last synced: 02 May 2026

https://github.com/shogunbanik18/budgetify

End-to-End Budget Analysis enables effective budgeting through detailed analysis and strategic planning

analysis data data-engineering data-exploration databricks databricks-notebooks etl etl-process python3

Last synced: 09 Jun 2026

https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer

Simple powershell functions to convert from and to json. Very lightweight, will be supported with every powershell version. No dependences.

convert converter data data-science deserialize json lightweight powershell serializer

Last synced: 04 May 2026

https://github.com/raghavendranhp/credit_card_fraud_detection

This repository contains code for a credit card fraud detection model using autoencoders and logistic regression, achieving 95.3% accuracy.

anomaly-detection autoencoder-neural-network credit-card-fraud data keras logistic-regression machine-learning preprocessing tensorflow

Last synced: 04 May 2026

https://github.com/thenoim/youtubelibrary

Nils little youtube library :)

api browser data nodejs simple youtube

Last synced: 04 May 2026

https://github.com/issacto/animmender

Deployed Web App

angularjs anime data

Last synced: 05 May 2026

https://github.com/eradical/analytics-unibody

Ansible role that sets up a farm of analytics collectors based on nginx

analytics ansible ansible-role big-data collectors data nginx

Last synced: 06 May 2026