An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/stdlib-js/ndarray-base-fliplr

Return a view of an input ndarray in which the order of elements along the last dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 11 Feb 2026

https://github.com/mchenryspagg/hng-hire-data-model

The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.

dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema

Last synced: 11 Feb 2026

https://github.com/mohsinali08000/myportfolio

I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.

css data data-science html

Last synced: 22 Apr 2026

https://github.com/lmuffato/project-mongodb-dataflights-trybe

Projeto MongoDB Dataflights - Projeto avaliativo da Trybe do Bloco 23: Introdução ao MongoDB

back-end crud data database filter mongo mongodb query trybe-projects

Last synced: 16 Apr 2026

https://github.com/sbdk-dev/sbdk.dev

A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.

ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer

Last synced: 27 May 2026

https://github.com/seabbs/estzoonotictb

Explore, Visualise and Estimate the Global Zoonotic Tuberculosis Burden

bovine-tb data estimation package rstats tuberculosis visualisation zoonotic-tb

Last synced: 28 Feb 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/sakshisrivastava-2601/credit-card-fraud-detection

Credit Card Fraud Detection Project Using Machine Learning. This project focuses on leveraging advanced Machine learning techniques to identify fraudulent transactions with high accuracy.

advanced-machine data machine-learning numpy project-repository python pytorch random-forest

Last synced: 16 Apr 2026

https://github.com/garcane/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 13 Feb 2026

https://github.com/obsidianplusplus/5e_play_cs-go

Python工具,分析你在5EPlay的CS:GO比赛数据。抓取、分析、筛选并导出。 | Python tool to analyze your 5EPlay CS:GO match data. Fetches, analyzes, filters, and exports.

5eplay analysis api automation csgo data esports excel json match pandas performance player python reporting scraping stats team

Last synced: 13 Feb 2026

https://github.com/frictionlessdata/cardealerdp

Cardealer DP (Car Dealer Data Package) is a data exchange format for car dealerships. It is developed on top of the Data Package standard

car data datapackage dealer exchange extension format

Last synced: 13 Feb 2026

https://github.com/saisriramkamineni/e-commerce-sales-analysis-excel-

Conducted an in-depth sales analysis for an e-commerce platform, leveraging Excel for data preprocessing and Power BI for visualization. Identified key sales trends, customer purchasing behavior, and revenue growth patterns to optimize business performance.

analysis analytics data excel sales

Last synced: 14 Feb 2026

https://github.com/diddypod/crop-data-converter

A Python script to convert crop data from .txt to .xlsx format

converter crop data openpyxl python

Last synced: 29 Jun 2026

https://github.com/blacksujit/shikshamitra

Shiksha Mitra is an innovative MVP designed to reshape the way students learn through gamification. Our platform transforms the traditional approach to education by making learning engaging, interactive, and rewarding. As an MVP, Shiksha Mitra focuses on delivering core features that showcase the value of gamified learning,

ai data gamified-learning hackathon lms ml mlflow mlops mlops-workflow mvp pipeline platforn

Last synced: 28 Feb 2026

https://github.com/garcane/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 19 Mar 2026

https://github.com/stdlib-js/datasets-harrison-boston-house-prices-corrected

A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).

boston data dataset datasets house housing javascript linear-regression node node-js nodejs prediction prices statistics stats stdlib value

Last synced: 15 Feb 2026

https://github.com/m-rishab/stock_trend-analysis-power-bi-project-

In this project, I've harnessed the robust capabilities of Power BI to analyse, visualize, and uncover the story behind HUL's stock performance.

data datavisualization datavisualization-project powerbi

Last synced: 19 Mar 2026

https://github.com/ghonimo/diode-pn-junction-characterization-psu-ece515

A detailed analysis of the I-V characteristics of a PN junction diode (1N4148) under different temperatures, utilizing Excel for graphical analysis and parameter extraction. This study was conducted as part of the ECE 515: Fundamentals of Semiconductor Devices course at Portland State University.

analysis characterization data device diode diodes excel mosfet-transistor pn-junction

Last synced: 28 Feb 2026

https://github.com/linx-software/file-import-to-rest-api

Import a CSV file and make the data available via a REST API.

csv data linx low-code

Last synced: 19 Mar 2026

https://github.com/stdlib-js/array-base-none-by-right

Test whether all elements in an array fail a test implemented by a predicate function, iterating from right to left.

all array data every generic javascript node node-js nodejs none predicate stdlib structure test types validate

Last synced: 01 Mar 2026

https://github.com/efler/microservice-data-bus

Data bus based on Apache Kafka and consisting of separate components [copied from own private repos]

data data-bus deduplication enrichment filtering kafka microservice mongodb postgresql redis

Last synced: 16 Apr 2026

https://github.com/skywardai/paper_gallery

Papers gallery for using LLMs ability over dataset

ai data data-science llm medicine neural-network research security

Last synced: 19 Mar 2026

https://github.com/anthonybench/datapeek

Peek summary of datafile in a succinct, opinionated manner.

cli data data-analysis

Last synced: 02 Mar 2026

https://github.com/agnosticeng/cli

Agnostic magic is now at your fingertips.

cli clickhouse data datalake datalakehouse

Last synced: 03 Mar 2026

https://github.com/lookininward/data-formatter-demo

You have directories containing data files and specification files. The specification files describe the structure of the data files. Write an app that reads format definitions from specification files. Use these definitions to convert the parsed files to NDJSON files.

csv data demo files json ndjson python txt unittest

Last synced: 27 Apr 2026

https://github.com/stdlib-js/array-base-every-by

Test whether all elements in an array pass a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 03 Mar 2026

https://github.com/ismailarilik/react-covid-maps

A global maps application aims to display COVID-19 statistics by countries, written with React

covid-19 data global maps react statistics

Last synced: 16 Apr 2026

https://github.com/denisecase/datakit-lite

Helpful utilities for Python data projects

analysis data education kit lite utils

Last synced: 04 Mar 2026

https://github.com/mbolam/DSWS_OpenRefine

Cleaning and Linking Data with OpenRefine

cleaning data metadata openrefine

Last synced: 07 Apr 2025

https://github.com/antononcube/raku-data-cryptocurrencies

Raku package of cryptocurrency data retrieval.

crypto cryptocurrency data

Last synced: 02 Apr 2025

https://github.com/chompfoods/stub-go-server

Go server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go-server go-swagger grocery ingredients nutrition raw recipe-api recipes

Last synced: 17 Apr 2026

https://github.com/rtmigo/pickledir_py

File-based key-value storage. Serializes keys and values with pickle

cache caching data directory file linux macos package pickle python windows

Last synced: 17 Apr 2026

https://github.com/rousan/weshare

An application that transfers files between devices

c-sharp data dot-net file lan phone share transfer-data weshare wifi

Last synced: 17 Apr 2026

https://github.com/sadmanca/uoft-pey-coop-job-postings

Code for parsing approximately 1.8k HTML pages of UofT PEY co-op job postings (from September 2023 to May 2024) to a single sqlite3 database file.

co-op data html python singlefile sqlite sqlite3 uoft uoft-pey

Last synced: 17 Apr 2026

https://github.com/gallo13/neuralnetworks-deeplearning-stats-classification

Descriptive Statistics, Classification and Analysis Using Python & Python Libraries (Assignment 1)

analysis data datasets deep-learning jupyter-notebook matplotlib neural-networks numpy pandas plotting python seaborn

Last synced: 17 Apr 2026

https://github.com/timmymatten/spikeball-stat-tracker

Spikeball stat tracking web app built with Streamlit and Python, designed to easily log and analyze player performance over multiple games.

data data-analysis data-visualization dataset matplotlib-pyplot multipage python spikeball statistics streamlit

Last synced: 18 Apr 2026

https://github.com/ktbarrett/scdil

simple configuration and data interchange language

configuration data json python yaml

Last synced: 20 Apr 2026

https://github.com/cicerotcv/br-gen

A browser extension for generating Brazilian placeholder data.

chrome data extension generation hacktoberfest

Last synced: 21 Apr 2026

https://github.com/jinsyin/dataorigin

数据之源 | A data source management framework

data data-source datasource

Last synced: 21 Apr 2026

https://github.com/seguradevinn/data-project

A healthcare data audit demo using CMS SynPUF and DuckDB, showing how raw claims are cleaned, validated, and transformed into a 2009 cohort with descriptives and a RADV-style chase list.

auditing cms data duckdb sql

Last synced: 02 Sep 2025

https://github.com/tkonopka/makealive

Dynamic web content through controlled javascript

conversion-functions d3 data data-science javascript visualization

Last synced: 22 Apr 2026

https://github.com/howtoquitvivek/ai-crop-yeild-prediction

AI-driven crop yield prediction and agricultural optimization system (SIH 2025)

2025 2026 ai crop-yeild data minor-project ml predcition python science sih

Last synced: 23 Apr 2026

https://github.com/sebastianbrzustowicz/collision-detection-ai

Python + TensorFlow. Repository for training a machine learning model for collision detection with an accelerometer sensor data and TensorFlow.

accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow

Last synced: 24 Apr 2026

https://github.com/yord/klp-core

A plugin with basic operations for klp (Kelpie), the small, fast, and magical command-line data processor.

csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv

Last synced: 24 Apr 2026

https://github.com/chriseaton/sample-database

A long-term supported sample dataset for file and database unit testing and validation. Simple, straight-forward, raw data shared across formats.

data database examples flat-file samples schema unit-testing

Last synced: 25 Apr 2026

https://github.com/sap-samples/security-research-codegraphsmote

Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.

augmentation data detection learning machine research sample security vulnerability

Last synced: 07 Jun 2026

https://github.com/karthikmprakash/github_repos_scraper

A tool to extract names of github repos of any user

automation bs4 data github python repositories requests webscraping

Last synced: 27 Apr 2026

https://github.com/nightroman/farnet.fsharp.data

FSharp.Data package for FarNet.FSharpFar

data farmanager farnet fsharp

Last synced: 27 Apr 2026

https://github.com/mikeintoshsystems/dhis2heat

A Comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, optimize, calculate, clean and visualize inequality data.

analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization

Last synced: 28 Apr 2026

https://github.com/saulojoab/crato-ce-json

Nesse repositório irei armazenar todos os bairros (e mais informações, no futuro) de Crato-CE em JSON.

data database geolocation json json-api localization

Last synced: 28 Apr 2026

https://github.com/rdjarbeng/rdjarbeng

Richard Djarbeng's github profile-computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to website below

data jekyll machine-learning ruby website

Last synced: 28 Apr 2026

https://github.com/reubano/ckanny

A Python command line interface (CLI) for interacting with CKAN instances

ckan cli data featured open-data

Last synced: 28 Apr 2026

https://github.com/codeforafrica/ckanext-followy

[ARCHIVED] A CKAN extension to show the datasets a user is following.

ckan ckan-extension ckanext-followy data dataset followy-extension open-data

Last synced: 29 Jun 2026

https://github.com/sgarciaddev/proyecto-poo

Proyecto de software de gestión de asistencia de alumnos en un colegio, utilizando el lenguaje Java y el paradigma de programación orientada a objetos.

alumnos csv data java mysql poo

Last synced: 29 Apr 2026

https://github.com/aidanjuma/ankideckextractor

A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.

anki apkg cli data decompression extraction flashcards learning python zip

Last synced: 29 Apr 2026

https://github.com/sodascience/open_supply_hub

Processing supply chain data obtained from Open Supply Hub

data global-supply-chain open-supply-hub python

Last synced: 29 Apr 2026

https://github.com/wu-rymd/pyobjectify

Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects

data objects python3

Last synced: 08 Jun 2026

https://github.com/chompfoods/stub-asp-net-core

ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server

Last synced: 30 Apr 2026

https://github.com/chompfoods/sdk-php

PHP SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients php raw recipe-api recipes sdk

Last synced: 30 Apr 2026

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to do a Data analysis in a Real Estate Investment Trust. The Trust would like to start investing in Residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 01 May 2026

https://github.com/walderlansena/datastructureinc

:battery: Algoritmos de Estrutura de Dados em C++

c cplusplus data fila list lista pilha stack struct structure structured-data

Last synced: 03 May 2026

https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer

Simple powershell functions to convert from and to json. Very lightweight, will be supported with every powershell version. No dependences.

convert converter data data-science deserialize json lightweight powershell serializer

Last synced: 04 May 2026

https://github.com/ishaansathaye/data40x-1_2_3

Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab

capstone-prep data data-science ethics lab python

Last synced: 04 May 2026

https://github.com/thenoim/youtubelibrary

Nils little youtube library :)

api browser data nodejs simple youtube

Last synced: 04 May 2026

https://github.com/issacto/animmender

Deployed Web App

angularjs anime data

Last synced: 05 May 2026

https://github.com/nfaltir/dataxplorer

🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.

data data-science python streamlit

Last synced: 05 May 2026

https://github.com/hasnocool/war_thunder_data_scraper

A web scraping tool designed to extract valuable data from War Thunder, a popular online game.

data database framework integration multi processing python scraper scraping scrapy sql threaded thunder war

Last synced: 06 May 2026

https://github.com/rrwen/twitter2mongodb

Module for extracting Twitter data to MongoDB databases

api data database geo get location mdb media mongo mongod mongodb oauth post rest sample social stream token tweet twitter

Last synced: 06 May 2026

https://github.com/chompfoods/stub-python-flask

Flask (Python) server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database flask flask-server food grocery ingredients nutrition python raw recipe-api recipes server stub stub-server

Last synced: 07 May 2026

https://github.com/augustoarraes/corais

App Python de Monitoramento de vida marinha de Recife de Corais 🪸

coral data iot matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/themuhd/world-cup-analysis

Analysis of The FIFA World cup from its inception to the recently completed tournament in 2023

data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python

Last synced: 08 May 2026

https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci

Materiály ku kurzu Jazyk SQL 1 pre Analytikov a Dátových Vedcov

analysis analytics data data-analysis data-science database mysql reiter sql

Last synced: 08 May 2026

https://github.com/n0nag0n/flee-intercom

For those of you who like to keep your money after Intercom jacks up the prices year after year, but want to keep an export of your data.

again-and-again api data database export exporter flee high-prices intercom mysql php price run save saver year-over-year

Last synced: 09 May 2026

https://github.com/keanteng/nextjs-directory

🌐A Draft Website For Data Catalogue Using NextJs

catalogue climate-change css data directory html javascript nextjs website

Last synced: 09 May 2026

https://github.com/alechash/rndmzr

Randomizer is a random data generator.

data data-science random random-generation random-number-generators

Last synced: 10 Jun 2026

https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk

The aim of this analysis is to understand the smoking patterns among adults in the UK.

data data-analysis data-visualization python3

Last synced: 09 May 2026

https://github.com/dimitryzub/walmart-stores-coffee-analysis

Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡

analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn

Last synced: 10 May 2026

https://github.com/kouisamine/data-uri-to-image

Convert Data URI into Image(png, jpeg, webp, gif, svg, ...) files.

conversion convert converter data datauri datauri-to-image image js online php script source-code tools uri

Last synced: 10 May 2026

https://github.com/782e616c6d/covid-d.a

Academic project, using Apache Spark for ETL and Data Studio for data analysis.

academic analytics automation cluster covid-19 data database etl python spark sql

Last synced: 10 May 2026