An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/rremple/intervalidus

For all your interval-based data needs.

data intervals

Last synced: 21 Feb 2026

https://github.com/mews-labs/dataframe-memory

This tool aims to provide a simple way to save memory when using pandas DataFrames.

data data-science memory-usage pandas-dataframe python3

Last synced: 22 May 2026

https://github.com/mohsinali08000/myportfolio

I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.

css data data-science html

Last synced: 22 Apr 2026

https://github.com/ahmadjamil888/facial-recognition-ai-model

A facial recognition AI model powered by a CNN and trained on thousands of images.

ai cnn data data-science facial facial-recognition recognition

Last synced: 30 Jun 2025

https://github.com/antononcube/raku-data-cryptocurrencies

Raku package for cryptocurrency data retrieval.

crypto cryptocurrency data

Last synced: 02 Apr 2025

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 05 Jan 2026

https://github.com/lookininward/data-formatter-demo

You have directories containing data files and specification files. The specification files describe the structure of the data files. Write an app that reads format definitions from specification files. Use these definitions to convert the parsed files to NDJSON files.

csv data demo files json ndjson python txt unittest

Last synced: 27 Apr 2026
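
The conversion described in the data-formatter-demo entry above can be sketched roughly as follows. This is only an illustration under stated assumptions: the listing does not say what the specification files look like, so the CSV spec layout (`name`, `width`, `datatype`), the fixed-width data format, the datatype labels, and the helper names `parse_spec` and `to_ndjson` are all hypothetical and not taken from the repository.

```python
import csv
import io
import json

# Converters for the ASSUMED datatype labels (not taken from the repository).
CONVERTERS = {
    "TEXT": lambda s: s,
    "INTEGER": int,
    "BOOLEAN": lambda s: s == "1",
}


def parse_spec(spec_text):
    """Read a hypothetical CSV spec with name, width, and datatype columns."""
    reader = csv.DictReader(io.StringIO(spec_text))
    return [(row["name"], int(row["width"]), row["datatype"]) for row in reader]


def to_ndjson(spec, data_text):
    """Slice each fixed-width line by the spec; emit one JSON object per line."""
    lines = []
    for raw in data_text.splitlines():
        if not raw.strip():
            continue  # skip blank lines
        record, pos = {}, 0
        for name, width, datatype in spec:
            field = raw[pos:pos + width].strip()
            record[name] = CONVERTERS[datatype](field)
            pos += width
        lines.append(json.dumps(record))
    return "\n".join(lines)


# Made-up sample inputs for illustration only.
SPEC = "name,width,datatype\nlabel,10,TEXT\nvalid,1,BOOLEAN\ncount,3,INTEGER\n"
DATA = "Foonyor   1  1\nBarzane   0-12\n"

if __name__ == "__main__":
    print(to_ndjson(parse_spec(SPEC), DATA))
```

Keeping the spec parsing separate from the row conversion means a new data file only needs a new spec, not new code, which seems to be the point of the exercise.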

https://github.com/emnetdegafe/allesoverfilm-backend

AllesOverFilm-backend is part of the AllesOverFilm mobile app development project and contains the database structure, server query scripts, and Sequelize-cli database structures.

backend data data-model express postgresql sequelize-cli

Last synced: 11 Apr 2026

https://github.com/humbertocg18/pucrs-alest-i-2.3-2023.24

Assignments, projects, exercises, and classes done in Java for the Algorithms and Data Structures 1 course, a second-semester subject.

beecrowd beecrowd-solution-in-js beecrowd-solutions-in-java data data-structures datastructures-algorithms hashmap hashtable java-8 leetcode leetcode-javascript leetcode-solutions leetcodepra pucrs sorting-algorithms

Last synced: 29 Mar 2025

https://github.com/mtingers/opacify

Opacify reads a file and builds a manifest of external sources to rebuild said file.

backup data obfuscation python

Last synced: 18 May 2026

https://github.com/andygeiss/pipeline

Build your own data pipeline to gather, organize and transform data by using protobuf as an intermediate format.

data data-pipeline data-science go golang machine-learning protobuf protobuf-compiler

Last synced: 31 Mar 2025

https://github.com/abdul-rafay19/youngdevinterns_machine-learning_tasks

This internship offers hands-on exposure to real-world Machine Learning applications — from data visualization and preprocessing to model development, evaluation, and deployment. It focuses on real ML workflows, problem-solving, neural networks, and hyperparameter tuning — all within a collaborative, remote, and growth-oriented environment.

ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data data-visualization internship machine-learning machine-learning-algorithms machinelearning ml model model-development neural-network preprocessing programming-language python task tasks youngdevintern

Last synced: 29 Apr 2026

https://github.com/richardschoen/sshnetibmi

This .NET/.NET Core class library interfaces with existing IBM i databases, program calls, CL commands, service programs, and data queues via the PASE-based xmlservice-cli command program or regular qsh/bash commands. qsh/bash commands can be used to interface with any qsh/PASE-based utilities, such as the IBM i db2util utility.

as400 cl command csharp data db2 ddm dotnet drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg xmlservice xmlservice-cli

Last synced: 04 Feb 2026

https://github.com/trstringer/pywave2

:ocean: Get swell buoy data

data ocean python

Last synced: 31 Mar 2025

https://github.com/flowsynx/plugin-csv

FlowSynx plugin that reads and writes CSV files, enabling easy batch data import/export operations and integration with spreadsheet-based data workflows.

comma-separated-values csv data data-platform flowsynx

Last synced: 10 Mar 2026

https://github.com/willdev12/rjson

Encryptable JSON file format for .NET projects!

csharp csharp-library data dotnet json json-data json-plugin variables vbdotnet vbnet

Last synced: 11 Apr 2026

https://github.com/gmersy/data-carbon

Repository accompanying the paper: Toward a Life Cycle Assessment for the Carbon Footprint of Data

carbon-emissions carbon-footprint climate-change data data-science sustainability sustainable-software

Last synced: 31 Mar 2025

https://github.com/inc44/raqua

Raqua 💧, a set of Python scripts and a Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures by creating an overflowing cache.

cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows

Last synced: 11 Apr 2026

https://github.com/mikezange/laravel-encryptable

A simple encryptable trait for encrypting model fields in Laravel

data encrypt field gdpr laravel model trait

Last synced: 16 May 2026

https://github.com/katiesaund/dresden_maps

Contains a data file with locations from The Dresden Files. The data file is to be used for my map tutorial in R.

data

Last synced: 05 Jan 2026

https://github.com/SAP-archive/signavio-qualtrics-di

Setup an SAP Data Intelligence data pipeline to connect Qualtrics surveys data to SAP Signavio Process Intelligence via Ingestion API.

data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio

Last synced: 09 May 2025

https://github.com/ginga1402/chinook_database

Microsoft SQL Server Management Studio

business-query data sql-server

Last synced: 30 Mar 2025

https://github.com/insolite/react-data-frame

Table for huge data sets

data react table

Last synced: 14 May 2026

https://github.com/dev-owdenmag/dataflow-manager

A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.

data data-management data-management-platform data-visualization python

Last synced: 30 Mar 2025

https://github.com/metriccoders/metriccoders_datasets

This is the Metric Coders repository containing all the datasets for machine learning.

data datasets machine-learning natural-language-processing scikit-learn

Last synced: 08 Apr 2025

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 30 Mar 2025

https://github.com/bijx/firestore-data-fetcher

A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.

automation data database downloader exporter fetcher firebase firestore open-source script

Last synced: 12 Apr 2026

https://github.com/castelao/bufr

BUFR binary data format from WMO

binary data format meteorology oceanography wmo

Last synced: 13 Jul 2025

https://github.com/fredhutch/gdscnsoilsites

Homepage for BioDIGS Project. Learn about the project and download data.

biodigs data metagenomics student-research

Last synced: 25 Mar 2025

https://github.com/lmuffato/project-ting-trybe

Ting project: graded Trybe project for Block 37, Data Structures II: Lists, Queues, and Stacks

data data-analysis python queue read-file stack trybe trybe-projects

Last synced: 12 Jun 2025

https://github.com/toransahu/metoffice

Data visualisation - MetOffice

data metoffice uk visualization weather

Last synced: 25 Mar 2025

https://github.com/lane-romuald/iot-irrigation-data-collection-system

An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.

arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi

Last synced: 12 Apr 2026

https://github.com/edugmenes/azure-data-engineering

This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.

azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark

Last synced: 29 Jan 2026

https://github.com/eugenedakin/caesarcipher

Native Xojo code for the Caesar Cipher algorithm with an example program

caesar-cipher data decryption encryption xojo

Last synced: 07 Jan 2026
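
For readers unfamiliar with the Caesar cipher named in the entry above, here is a minimal Python sketch of the general algorithm. It is not the repository's Xojo code; the function name `caesar` and the choice to leave non-letters unchanged are assumptions made for this illustration.

```python
import string


def caesar(text, shift):
    """Shift each ASCII letter by `shift` positions, wrapping around the alphabet.

    Non-letter characters are passed through unchanged (an assumption of this
    sketch). Decrypting is the same operation with the negated shift.
    """
    result = []
    for ch in text:
        if ch in string.ascii_lowercase:
            base = ord("a")
        elif ch in string.ascii_uppercase:
            base = ord("A")
        else:
            result.append(ch)
            continue
        # Python's % always returns a non-negative result, so negative
        # shifts wrap correctly as well.
        result.append(chr((ord(ch) - base + shift) % 26 + base))
    return "".join(result)


if __name__ == "__main__":
    secret = caesar("Hello, World!", 3)
    print(secret)              # Khoor, Zruog!
    print(caesar(secret, -3))  # Hello, World!
```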

https://github.com/bastianolea/campamentos_chile

Data from the 2024 national survey of informal settlements (Catastro de campamentos nacional 2024), from the Ministry of Housing and Urban Planning (Ministerio de Vivienda y Urbanismo)

chile comunas data pobreza social

Last synced: 24 Aug 2025

https://github.com/cleanzr/restaurant

Restaurant data set for entity resolution

data linkage

Last synced: 11 Mar 2026

https://github.com/fiskeben/meetjescraper

HTTP proxy for Meet je stad project

api data go iot meetjestad proxy scraper weather

Last synced: 29 May 2026

https://github.com/fastbolt/excel-writer

Excel-Writer component

data excel excel-export

Last synced: 14 Apr 2025

https://github.com/avahoffman/dataplay

🤸‍♂️ Load data to play with

data data-package r r-package rstats

Last synced: 25 Mar 2025

https://github.com/rrwen/twitter2pg-cli

Command line tool for extracting Twitter data to PostgreSQL databases

api cli cmd command data database geo interface line location media pg postgres postgresql rest social stream tool tweet twitter

Last synced: 12 Apr 2026

https://github.com/thiagopanini/datadelivery

An open source Terraform module that provides a complete infrastructure toolkit so users can start exploring Analytics services on AWS.

analytics athena aws catalog crawler data datamesh glue s3 terraform

Last synced: 29 Nov 2025

https://github.com/whitehathackerpr/data-visualization-tool

This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations.

data data-analysis data-visualization python python3

Last synced: 05 Sep 2025

https://github.com/xpotify/scraper

Scraper designed for Xpotify's client to gather information from websites🌟

axios cheerio data javascript scraper webscraper

Last synced: 07 Jul 2025

https://github.com/desininja/data-engineer-interview-questions

This repository contains all the Data Engineer Interview Questions asked by interviewers.

data data-engineer-interview-questions

Last synced: 31 Mar 2025

https://github.com/bredalis/datastructure

📚 Data Structures in Python

algorithms data data-structure python

Last synced: 12 Apr 2026

https://github.com/eve-ning/osumania_data

Processed osu!mania data from osu!API

data osu rhythm-game vsrg

Last synced: 24 Feb 2026

https://github.com/agavitalis/sample-c-codes

A collection of small projects I carried out on Arduino as an electronic engineering student, despite falling in love with website development.

ageteller atm binary data gpcalculator logging

Last synced: 09 Apr 2025

https://github.com/shawnduong/pacman-digest

Generate a digest of package space usage for Linux systems using pacman.

arch data pacman

Last synced: 13 May 2026

https://github.com/stdlib-js/ndarray-slice-dimension-from

Return a read-only shifted view of an input ndarray along a specific dimension.

copy data javascript matrix ndarray node node-js nodejs shift slice stdlib structure truncate types vector view

Last synced: 24 Apr 2025

https://github.com/dalikewara/typego

typego provides a custom type that can be used to construct information (such as success data, error data, etc.)

custom data golang helper type typego

Last synced: 09 Apr 2025

https://github.com/yasenstar/powerbi_tutorial

Based on the "PowerBI Tutorial" book, provides step-by-step video demos for learning and mastering the Power BI tool

analytics data microsoft powerbi tutorial visualization

Last synced: 07 Jan 2026

https://github.com/sefakcmn00/tensorflow_car_price_analysis

In this project, after extracting the data sets as csv, we tried to represent the car prices graphically and schematically by using data analysis and data visualization methods. We checked the connection of the car prices we analyzed with other data, then we created a 4-layer and 12-neuron system.

data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow

Last synced: 14 Apr 2026

https://github.com/seanowenhayes/recipe-scraper

A simple scraper that uses Puppeteer to scrape recipes and more from the web

crawler crawling data recipes scraping

Last synced: 22 Feb 2026

https://github.com/so-cool/uobrain

My solution to the University of Bristol PURE Data Challenge

competition data modeling

Last synced: 09 Sep 2025

https://github.com/alexscigalszky/palabras-aleatorias-data

This package has a set of datasets of random words, animals, colors, jokes, onomatopoeias, and types

aleatorias data palabras random words

Last synced: 04 Oct 2025

https://github.com/nikhilash45/live_ipl_report

This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.

analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi

Last synced: 19 Mar 2026

https://github.com/hardwario/cloud-fetch

HARDWARIO Cloud Fetch - Data Extraction Tool

cli cloud data excel python

Last synced: 07 Feb 2026

https://github.com/gkapfham/ast2016-paper

Source Code of and Supporting Files for a Paper Published at AST 2016

data latex-document paper research

Last synced: 19 Oct 2025

https://github.com/nodef/infoods

Kit for International Network of Food Data Systems (INFOODS).

component data food identifier infoods international network systems tagnames

Last synced: 11 Mar 2026

https://github.com/harmanveer-2546/supply-chain

Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.

customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis

Last synced: 10 Apr 2026

https://github.com/goncaloperes/datavisualization

Here I will share some of my data visualizations using a variety of datasets, technologies and tools.

d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick

Last synced: 04 Feb 2026

https://github.com/tylerben/data-spring

Easily generate a dummy dataset based on a provided config

data data-spring datagenerator fake-data generator javascript typescript

Last synced: 27 May 2026

https://github.com/luminati-io/Crunchbase-dataset-samples

A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.

crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping

Last synced: 09 Apr 2025

https://github.com/cintia0528/data_cleaning_and_analytics-python

Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.

colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn

Last synced: 08 Jan 2026

https://github.com/marabesi/d3-visualization

Different visualizations using data and d3.js

charts css d3js data html js json timeline-chart visualization

Last synced: 01 May 2026

https://github.com/ayushai/salesfoce-hospital-management

A custom Salesforce-based Hospital Management System with powerful dashboards and data analysis tools. It provides real-time insights into patient care, appointment scheduling, and inventory management, optimizing healthcare operations and decision-making.

analytics dashboard data salesforce-developers visualization

Last synced: 22 Feb 2026

https://github.com/stdlib-js/ndarray-slice

Return a read-only view of an input ndarray.

copy data javascript matrix ndarray node node-js nodejs slice stdlib structure types vector view

Last synced: 10 Mar 2026

https://github.com/spiceai/datasets

Spice AI curated dataset definitions for Spice.ai

ai bitcoin blockchain data ethereum polygon

Last synced: 20 Apr 2026

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/qedsoftware/afsisdb-demos

AfSIS DB Demos

agriculture data soil

Last synced: 27 Oct 2025

https://github.com/izaaccoding36/dados-dinamicos

This repository presents a website built with an API for creating charts, reporting on social media usage on a global scale

api data redes-sociais social-media website

Last synced: 26 Mar 2025

https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas

This Labs session introduces the Pandas library, an open source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data, making it possible to load, manipulate, and combine data quickly, among other functions.

data digital-innovation-one dio jupiter-notebook labs ms-excel panda python

Last synced: 14 May 2026

https://github.com/danielrosehill/monetised-ghg-emissions

Calculating monetised GHG emissions for various companies based upon disclosure data

data sustainability sustainability-data

Last synced: 07 Sep 2025

https://github.com/programmer-rd-ai/library-management-system-oraclesql

The Library Management System project, part of the CI6320 Advanced Data Modelling coursework, features comprehensive SQL scripts utilizing OracleSQL to facilitate efficient data modeling and management.

adm advanced ci6320 cw data icw library management modelling oracle oraclesql report sql system

Last synced: 29 Oct 2025

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/basemax/buskool.com-data

This repository contains the collected product data from the Buskool website (باسکول). The data is stored in 20k+ JSON files, each containing detailed information about products available on the website.

buskool buskoolcom data farsi information ir iran json persian

Last synced: 03 Apr 2025

https://github.com/unownone/spenddy-link

Simple Privacy Friendly chrome extension to track your spends and more!

analytics data extension link

Last synced: 12 Mar 2026

https://github.com/aiwithqasim/competitive-programming

I will add all the material I have worked through, or will work through in the future, to sharpen my programming skills and become a competitive programmer

c-plus-plus code data java programming structured-data

Last synced: 20 May 2026

https://github.com/stdlib-js/array-base-fancy-slice-assign

Assign element values from a broadcasted input array to corresponding elements in an output array.

array assign assignment copy data fancy generic javascript node node-js nodejs shallow slice stdlib structure subseq subsequence types

Last synced: 06 Oct 2025

https://github.com/outofbedlam/tine

TINE, a data pipeline runner.

data pipeline

Last synced: 05 Oct 2025

https://github.com/sycho9/populater

:elephant: PHP script that populates your database tables with fake data using fzaninotto/faker

composer data database fake packagist php populate

Last synced: 13 Apr 2026

https://github.com/dixslyf/nbparts

Unpack a Jupyter notebook into its sources, outputs and metadata.

data haskell jupyter jupyter-notebook nix nix-flake

Last synced: 05 Oct 2025

https://github.com/deva-246/data-analytics-with-walmart-dataset

Power BI dashboard construction, data cleaning using Python, and data projection in SQL.

dashboard data dataquery powerbi python sql

Last synced: 06 Oct 2025