An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/radekbednarik/covid-czech-data-api

Library that makes it easy to work with the REST API of the official Czech Covid data.

api covid-19 data deno library typescript

Last synced: 02 May 2026

https://github.com/vidupriya/aws-glue--data-copy

A function for copying data files (CSV, Parquet, Avro, etc.) from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket.

aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark

Last synced: 02 May 2026

https://github.com/jesuscc1993/data-cleaner-extension

Clears browser data in a single click.

application-data chrome chrome-extension data

Last synced: 02 May 2026

https://github.com/wiseql/wiseql

The wise data browser — run SQL recipes as small, observable, debuggable steps

data debugging duckdb oracle quality sql tui

Last synced: 13 Jun 2026

https://github.com/12458/99co

99co Web Scraping

99co data property scraper website

Last synced: 02 May 2026

https://github.com/asjadnaqvi/stata-tidytuesday

A Stata package for fetching Tidy Tuesday metadata and files

ado data r stata tidytuesday

Last synced: 13 Jun 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/viniddev/active_finance

In this project I set out to solve a common problem: the difficulty of staying up to date on movements in the stock and real-estate investment fund markets. I used Selenium WebDriver to gather information and a Telegram API to send reports to the user.

automation data data-analisis rpa selenium-webdriver telegram-bot

Last synced: 03 May 2026

https://github.com/antoineaugusti/youtubers-tips

Collecting data about tips given to Youtubers

data economy youtube youtubers

Last synced: 03 May 2026

https://github.com/word2vect/beijing-new-house-data-visualization

Beijing New House Data Visualization for Python Programming 2024 Fall Data Visualization Lab

data python visualization

Last synced: 13 Jun 2026

https://git.sheetjs.com/sheetjs/sheetjs

📗 SheetJS Community Edition -- Spreadsheet Data Toolkit

angular bun csv data database deno excel grid html html5 ios javascript json nodejs react spreadsheet table vue xlsx xml

Last synced: 06 Oct 2025

https://github.com/tsbarr/belly-button-challenge

Using front-end development tools (JavaScript, HTML and CSS), I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 04 Mar 2026

https://github.com/flyconnectome/hnf

Documentation for the hierarchical neuron format

annotations data dotprops hdf5 mesh neurons skeleton storage

Last synced: 17 Jan 2026

https://github.com/abdullahashfaqvirk/earth-engine-data-scraper

A Python-based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research and analysis purposes.

beautifulsoup data data-science python requests scraper web-scraping

Last synced: 10 May 2026

https://github.com/vim89/flowforge

Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do it differently.

archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming

Last synced: 14 Apr 2026

https://github.com/pathilink/ebury_case

Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.

bigquery data modeling sql

Last synced: 05 Oct 2025

https://github.com/fuadarradhi/gps_data_reset

Flutter plugin to reset and download GPS data

cache data extra gps reset

Last synced: 23 Feb 2026

https://github.com/eshan-sud/secureit

A Blockchain-based Data Sovereignty Platform

blockchain data decentralised-application platform sovereignty

Last synced: 21 Jan 2026

https://github.com/albanecoiffe/jo2024_visualization

Streamlit dashboard on the Paris 2024 Olympic Games.

data streamlit visualization

Last synced: 30 Apr 2026

https://github.com/prajjwol09/sql_retail_analysis_project

This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.

data data-analysis datacleaning dataexploration pgadmin4 sql

Last synced: 15 Feb 2026

https://github.com/deliprofesor/health-score-prediction-model-the-impact-of-lifestyle-and-demographic-factors

A machine learning project predicting health scores based on lifestyle and demographic factors like age, BMI, diet, and exercise. Techniques include Random Forest, Polynomial Regression, and Linear Regression, with a focus on model performance and actionable health insights.

cross-validation data data-science data-visualization feature-engineering linear-regression machine-learning polynomial-regression random-forest

Last synced: 10 Apr 2025

https://github.com/openwashdata/ugabore

Borehole repair data from central Uganda associated with a project report completed by Joseph Lwere for the “data science for openwashdata” course

analysis borehole data open-data r uganda wash water

Last synced: 17 Jan 2026

https://github.com/charityeverett/gobackfetchit

Award Winning WebXR Data Journalism Storytelling Project

3d aframe ar css data html html-css-javascript nodejs visuzalization vr webxr xr

Last synced: 03 May 2026

https://github.com/stupidcucumber/elephant-crawler

System for mining texts from websites.

data data-mining-python python

Last synced: 25 Apr 2026

https://github.com/stdlib-js/array-base-last-index-of-same-value

Return the index of the last element which equals a provided search element according to the same value algorithm.

array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types

Last synced: 13 Apr 2026
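
The lookup described in the entry above can be illustrated with a minimal Python sketch. This is not the stdlib-js implementation: the names `same_value` and `last_index_of_same_value` are hypothetical, the comparison only approximates the ECMAScript SameValue rules for Python floats (NaN matches NaN, and +0.0 is distinct from -0.0), and returning -1 when nothing matches is an assumed convention borrowed from JavaScript's `lastIndexOf`.

```python
import math
from typing import Sequence


def same_value(a: object, b: object) -> bool:
    """Approximate SameValue comparison for Python values (assumption of this sketch)."""
    if isinstance(a, float) and isinstance(b, float):
        if math.isnan(a) and math.isnan(b):
            return True  # NaN is treated as equal to NaN
        if a == 0.0 and b == 0.0:
            # +0.0 and -0.0 compare equal with ==, so compare their signs instead
            return math.copysign(1.0, a) == math.copysign(1.0, b)
    return a == b


def last_index_of_same_value(items: Sequence[object], target: object) -> int:
    """Scan from the end; return the last matching index, or -1 if none matches."""
    for i in range(len(items) - 1, -1, -1):
        if same_value(items[i], target):
            return i
    return -1
```

Unlike a plain `==` scan, this finds NaN entries and tells the two signed zeros apart, which is the practical difference the SameValue algorithm makes.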

https://github.com/aiwithqasim/project_allocation_system

Project Allocation System (PAS) automates and simplifies the process of allocating projects to students. Teachers enter details when prompted for input and can perform a number of operations, including adding, updating, searching, deleting, and displaying all projects.

algorithms-and-data-structures algorthims c-plus-plus data data-structures linked-list

Last synced: 08 Oct 2025

https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project

Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.

data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface

Last synced: 08 Oct 2025

https://github.com/roshaka/samplr

Samplr is a Python decorator for selecting a subset of items from a list, with options for customisation and informative console printouts.

data data-analysis data-engineering decorators list python sampling

Last synced: 14 Jan 2026
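
The idea of a sampling decorator, as described in the entry above, can be sketched roughly as follows. This is not Samplr's actual API: the decorator name `sample_list` and its `size` and `seed` parameters are hypothetical, chosen only to show how a decorator can hand a function a random subset of its list argument and print a short note about it.

```python
import functools
import random
from typing import Callable, List, Optional


def sample_list(size: int, seed: Optional[int] = None) -> Callable:
    """Hypothetical decorator: replace the first (list) argument with a random subset."""
    def decorator(func: Callable) -> Callable:
        @functools.wraps(func)
        def wrapper(items: List, *args, **kwargs):
            rng = random.Random(seed)      # a fixed seed gives a repeatable sample
            k = min(size, len(items))      # never ask for more items than exist
            subset = rng.sample(items, k)  # sampling without replacement
            print(f"{func.__name__}: using {k} of {len(items)} items")
            return func(subset, *args, **kwargs)
        return wrapper
    return decorator


@sample_list(size=3, seed=0)
def count_items(values: List[int]) -> int:
    """Example function: it only ever sees the sampled subset."""
    return len(values)
```

Calling `count_items(list(range(10)))` passes three of the ten values to the function; the caller's list is left unchanged.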

https://github.com/quonverbat/ordner

A simple, customizable and cross-platform data tracker.

data datatracker javafx management

Last synced: 07 Jul 2025

https://github.com/arkanovicz/skorm

Simple Kotlin Object Relational Mapping

data database model orm sql

Last synced: 19 Apr 2026

https://github.com/tn3w/moviedb-json

A JSON library with 981,530 films.

data database db json movie movie-database movies

Last synced: 03 May 2026

https://github.com/vidushibhadana/covid19-data-exploration-using-sql

Applied diverse SQL techniques to analyze COVID-19 data for an improved understanding of the pandemic's regression.

data database database-management sql

Last synced: 19 Aug 2025

https://github.com/programmer-rd-ai/competitive-programming-solutions

A collection of my solutions to various competitive programming problems from platforms like LeetCode. This repository serves as a personal archive of my problem-solving journey, covering a range of algorithms, data structures, and problem-solving techniques.

algorithm algorithms algorithms-and-data-structures data datastructures dsa javascript pandas python structures

Last synced: 01 Mar 2025

https://github.com/djdhairya/whatsapp-chat-analysis

WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.

data data-science dataanalytics datapreprocessing machine-learning ml

Last synced: 08 Oct 2025

https://github.com/sadratehranian/data-collection-and-machine-learning

First, creates a logistic regression model to predict whether the fire alarm of a smoke detector should sound. Second, predicts whether an electric drive in a production plant may be faulty.

data data-analysis data-science datacollection logistic-regression machine-learning ml nn

Last synced: 05 Jan 2026

https://github.com/udofia2/crudwithdatabase

A simple Node.js app that connects to a database.

crud data databse

Last synced: 08 Oct 2025

https://github.com/paezha/bsantiago

A data package with the results of a travel and well-being survey conducted in Santiago in 2016

data equity package r santiago survey travel well-being

Last synced: 18 Mar 2025

https://github.com/anisimov-anthony/data_forest

Implementation of various types of trees

algorithms-and-data-structures data lib rust tree

Last synced: 28 Apr 2025

https://github.com/welli7ngton/mysql-server-formacao-alura

Repository for storing SQL code written for courses in Alura's MySQL Server training track

data database mysql

Last synced: 19 Apr 2026

https://github.com/boytchev/coursedataviz

Supplementary materials for "Data Visualization" course

data fmi su visualization

Last synced: 16 Mar 2025

https://github.com/anthonybench/convert

A quick way to convert data, document, and image formats.

cli converter data documents images

Last synced: 14 Jan 2026

https://github.com/arnavk-09/phishing-detection

🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI

csv data fastapi flask python scikit-learn

Last synced: 03 May 2026

https://github.com/mightymetrika/holi

holi: Higher Order Likelihood Inference Web Applications

data data-science r statistics

Last synced: 10 Feb 2026

https://github.com/danielrosehill/global-value-factors-explorer-dataset

Derivative database of IFVI Global Value Factors for data analysis and visualization use cases.

data environmental-data sustainability-data

Last synced: 23 Feb 2026

https://github.com/davitshahnazaryan3/data-management-web

Explore datasets with ease using taxonomy filtering, allowing you to quickly identify the specific experimental datasets you need and download them effortlessly

data environmental experiments filtering-data seismic taxonomy

Last synced: 17 Jan 2026

https://github.com/preritdas/covidactnow

A wrapper for the Covid Act Now database of live COVID-19 state-based statistics.

api covid covid-19 data python python3 science wrapper

Last synced: 09 Oct 2025

https://github.com/cburmeister/disc-golf-courses

All the disc golf courses I've played at. Maintained with http://geojson.io/.

data geojson

Last synced: 21 Jan 2026

https://github.com/dcmox/algorithms

General purpose data structures and algorithms

algorithms binary data hash linked list structures tree

Last synced: 10 Jun 2026

https://github.com/luminati-io/jupyter-notebooks-web-scraping

Perform web scraping interactively using Jupyter Notebooks, integrating coding, data analysis, and visualization into one seamless workflow.

beautifulsoup4 data jupyter jupyter-notebook pandas python requests seaborn virtual-environment web-scraper web-scraping

Last synced: 13 Apr 2026

https://github.com/nafisalawalidris/nafisalawalidris

Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and Bitcoin converge.

artifical-intelligence bitcoin config data data-science developer github-config github-pages machine-learning

Last synced: 16 May 2026

https://github.com/cunfuu/network-bubbles

Makes it easier to manage organizations and keep notes about them, in order to organize events and easily look up their needs

data data-visualization organizations organizations-volunteer

Last synced: 31 Jul 2025

https://github.com/anand-sony/mttr-dashboard

Streamlit dashboard for MTTR analysis with shift-wise loss insights and machine-level downtime tracking.

analytics business-analytics dashboard data python statistical-analysis

Last synced: 30 May 2026

https://github.com/itsmeyogesh22/Solved-8-Weeks-SQL-Challenge-Correct-Solutions

Part of the Serious SQL virtual apprenticeship program, this repository contains solutions for all eight case studies crafted by Danny Ma. For more information, please visit: https://8weeksqlchallenge.com/

8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql

Last synced: 29 Aug 2025

https://github.com/laguer/jupyt-nb

Ratios of mathematical and physical constants in cosmology and microphysics

analysis constants cosmology data dimensional julia mathematical micro notebook physical physics python ratios science

Last synced: 13 Apr 2026

https://github.com/rse/nebulize

Nebulize Security-Sensitive Information

data dsgvo gdpr information nebulize security sensitive

Last synced: 16 Mar 2025

https://github.com/newrelic-experimental/newrelic-java-atomikos

Gives status of Atomikos Data Sources since this information is unavailable via JMX

atomikos data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data

Last synced: 30 May 2026

https://github.com/kashifkhan7/cleaning-analysis_cli

Analyze sales data easily with our CLI app. Gain insights on revenue trends and visualize results using Python, Pandas, and Matplotlib. 🚀📊

conditional-statements css data datacleaning exception-handling exiftool html json matplotlib-pyplot metadata metadata-extraction pandas-python python sales-analysis seaborn-python speech-to-text transcription youtube

Last synced: 13 Apr 2026

https://github.com/isaacmaffeis/imad-2023

Model Identification and Data Analysis (IMAD) | University course

data data-analysis data-science model model-identification

Last synced: 09 May 2026

https://github.com/fcoagz/rate-reader-epv

pyDolarVenezuela API utilities, image processing (EnParaleloVzla) to extract currency exchange rates from specific platforms, validating content against expected patterns

data finance json processing-images pydolarvenezuela

Last synced: 14 Jun 2025

https://github.com/rod-persky/sungrowdatacollector

Data collector for a SunGrow SG8.0RT Inverter

data opentelemetry sungrow

Last synced: 19 Jan 2026

https://github.com/jooapa/bytebrother

Byte Brother is watching YOU

data data-analysis security

Last synced: 26 Jan 2026

https://github.com/redgoose-dev/baguni

A web program for storing and browsing images

data explorer file management upload

Last synced: 14 Apr 2026

https://github.com/theopenwebjp/theopenweb-data-loader

Package for loading data to local project

data downloader import javascript typings

Last synced: 10 Oct 2025

https://github.com/dahmansphi/analysis_from_start_to_end

The Big Bang of Data Science- Analysis from the Start to The End- [Book Two]

analysis data data-analytics data-mining data-science hypothesis-testing jamovi machine-learning

Last synced: 08 Jan 2026

https://github.com/nukopian/shell-flatten

Flatten a series into a single record

automation data shell

Last synced: 18 Jun 2025

https://github.com/bastianolea/minsal_suicidios

Cases of attempted suicide and completed suicide in Chile

chile comunas data genero salud tiempo

Last synced: 19 Jan 2026

https://github.com/loggdme/kyro

Collection of utilities and examples for creating efficient data pipelines in Go, with parallel queues, rate limiters, and much more.

data package

Last synced: 14 Jan 2026

https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard

This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.

charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi

Last synced: 23 Feb 2026

https://github.com/soenneker/soenneker.constants.data

A set of commonly used constants related to various types of data

constants csharp data dotnet

Last synced: 12 Mar 2026

https://github.com/bkestelman/dasy-ml

DaSy DataSynthesizer - Create synthetic data with desired statistical properties for machine learning research.

data data-science machine-learning

Last synced: 14 Jan 2026

https://github.com/atiqurcode/scrap-spec

Scrape data from HTML into HTML table code or JSON

data html-table json-data scarp

Last synced: 05 Feb 2026

https://github.com/writetome51/pagination-page-info

Intended to help a separate Paginator class paginate data. Specifically, this class contains the properties `itemsPerPage` and `totalPages`, which will be used by other classes

batch data javascript paginate pagination typescript

Last synced: 09 May 2026
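
The relationship between the two properties named in the entry above can be sketched as follows. This is an illustration in Python and not the package's TypeScript code: the class name `PageInfo`, the `total_items` field, and the snake_case spellings of `itemsPerPage` and `totalPages` are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class PageInfo:
    """Hypothetical holder for the numbers a paginator needs."""
    total_items: int
    items_per_page: int

    def __post_init__(self) -> None:
        if self.items_per_page < 1:
            raise ValueError("items_per_page must be at least 1")
        if self.total_items < 0:
            raise ValueError("total_items must not be negative")

    @property
    def total_pages(self) -> int:
        # Ceiling division in integer arithmetic: a partly filled last page still counts.
        return (self.total_items + self.items_per_page - 1) // self.items_per_page
```

For example, 95 items at 10 per page give 10 pages, because the last page holds the remaining 5 items.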

https://github.com/alexmcvay/uber-data

UBER sql clone

data data-visualization sql

Last synced: 19 Jan 2026

https://github.com/meokullu/prefill

PreFill adds desired characters onto output values to increase their legibility.

alignment data data-analysis data-engineering data-science legibility

Last synced: 17 Jan 2026

https://github.com/nukopian/shell-series

Extract columns from tabular text

automation data shell

Last synced: 11 Oct 2025

https://github.com/aldro61/mmit-data

The data used in the Maximum Margin Interval Trees paper

data machine-learning machine-learning-algorithms reproducible-research

Last synced: 19 Feb 2026

https://github.com/juangesino/research-project

Course files for Research Project @ University of Amsterdam

data data-science economics stata

Last synced: 02 Jan 2026

https://github.com/q-aware-labs/bias-insights

Bias detection project for the Chicago Face Database (CFD)

ai chicago-data-portal data data-science llm statistical-analysis

Last synced: 21 Jan 2026