An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/flowsynx/plugin-csv

FlowSynx plugin to reads and writes CSV files, enabling easy batch data import/export operations and integration with spreadsheet-based data workflows.

comma-separated-values csv data data-platform flowsynx

Last synced: 10 Mar 2026

https://github.com/gbowne1/jsonhelix

This is a X11 GUI JSON application for editing, debugging and converting JSON and schemas and API data.

api data gui gui-application json x11

Last synced: 10 Jun 2025

https://github.com/flowsynx/plugin-json

FlowSynx plugin to loads and parses local JSON files. Supports transformation, extraction, and mapping of hierarchical data structures in workflows.

data data-platform flowsynx json

Last synced: 10 Mar 2026

https://github.com/tatey/list_of_countries

A list of countries, states, and cities in Ruby

cities countries data ruby states

Last synced: 11 Nov 2025

https://github.com/zonggen/data-structure

Course notes on data structures and analysis (CSC263)

algo clrs csc263 data

Last synced: 23 Mar 2025

https://github.com/jorgeatgu/apaga-luz

💡 ¿Cuánto cuesta la luz? 💶

data data-visualization flat-data

Last synced: 04 Feb 2026

https://github.com/SAP-archive/signavio-qualtrics-di

Setup an SAP Data Intelligence data pipeline to connect Qualtrics surveys data to SAP Signavio Process Intelligence via Ingestion API.

data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio

Last synced: 09 May 2025

https://github.com/ginga1402/chinook_database

Microsoft SQL Server Management Studio

business-query data sql-server

Last synced: 30 Mar 2025

https://github.com/victorowinoke/after-work-data-science-project-showcase-eda

You work for Lublu as a Data Science Consultant and you have been tasked to perform analysis on pricing, product and assortment of Adidas and Nike. Create a descriptive analysis report, making relevant observations and recommendations that will help Lublu in the launch of such similar products.

adidas analysis data deliverables nike pythonanalysis ranges

Last synced: 28 May 2026

https://github.com/diegoperea20/own_dataset_segmentation_yolov8

Segmentacion y detection de objetos con propio dataset usando YOLOV8 , en el que se utiliza un dataset propio de una moneda de 200 pesos colombianos del año 2023.

coins colombia data opencv own python segmentation tensorflow yolov8

Last synced: 12 Apr 2026

https://github.com/dev-owdenmag/dataflow-manager

A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.

data data-management data-management-platform data-visualization python

Last synced: 30 Mar 2025

https://github.com/metriccoders/metriccoders_datasets

This is the Metric Coders repository containing all the datasets for machine learning.

data datasets machine-learning natural-language-processing scikit-learn

Last synced: 08 Apr 2025

https://github.com/GiveMePseudonyms/PiVisualisations

A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.

data data-visualization pi pygame python self-organising-criticality tkinter

Last synced: 08 Apr 2025

https://github.com/khalyomede/fetch

Quickly retrieve your PHP data

config configuration data fetch php php7

Last synced: 15 Mar 2025

https://github.com/cqllum/schema2dwh

⚡ Automatically produce a data model on your database using its information schema using GenAI.

ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design

Last synced: 13 Mar 2025

https://github.com/castelao/bufr

BUFR binary data format from WMO

binary data format meteorology oceanography wmo

Last synced: 13 Jul 2025

https://github.com/shivam1808/data-cleaning-project

We take raw housing data and transform it in SQL Server to make it more usable for analysis.

analysis data datacleaning sql sqlserver

Last synced: 29 May 2026

https://github.com/agahkarakuzu/datavis_edu

Presented in BrainHack School 2019-2020, QBIN SciComm 2021

binder dashboard data notebooks repo2docker visualization

Last synced: 01 Apr 2025

https://github.com/lmuffato/project-ting-trybe

Projeto ting - Projeto avaliativo da Trybe do Bloco 37: Estrutura de Dados II: Listas, Filas e Pilhas

data data-analysis python queue read-file stack trybe trybe-projects

Last synced: 12 Jun 2025

https://github.com/toransahu/metoffice

Data visualisation - MetOffice

data metoffice uk visualization weather

Last synced: 25 Mar 2025

https://github.com/lane-romuald/iot-irrigation-data-collection-system

An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.

arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi

Last synced: 12 Apr 2026

https://github.com/edugmenes/azure-data-engineering

This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.

azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark

Last synced: 29 Jan 2026

https://github.com/eugenedakin/caesarcipher

Native Xojo code for the Caesar Cipher algorithm with an example program

caesar-cipher data decryption encryption xojo

Last synced: 07 Jan 2026

https://github.com/vapourismo/binary-io

Read and write values of types that implement Binary from and to Handles

data haskell haskell-library io parsing

Last synced: 28 Mar 2025

https://github.com/cleanzr/restaurant

Restaurant data set for entity resolution

data linkage

Last synced: 11 Mar 2026

https://github.com/fiskeben/meetjescraper

HTTP proxy for Meet je stad project

api data go iot meetjestad proxy scraper weather

Last synced: 29 May 2026

https://github.com/grycap/cdmi-client-go

A basic Go library to perform CDMI core operations

cdmi cloud data go

Last synced: 21 Jan 2026

https://github.com/fastbolt/excel-writer

Excel-Writer component

data excel excel-export

Last synced: 14 Apr 2025

https://github.com/rrwen/twitter2pg-cli

Command line tool for extracting Twitter data to PostgreSQL databases

api cli cmd command data database geo interface line location media pg postgres postgresql rest social stream tool tweet twitter

Last synced: 12 Apr 2026

https://github.com/desininja/data-engineer-interview-questions

This repository contains all the Data Engineer Interview Questions asked by interviewers.

data data-engineer-interview-questions

Last synced: 31 Mar 2025

https://github.com/devlive-community/mockaroo

一个轻量级的 HTTP Mock 服务器,用于快速构建模拟数据接口,适用于前后端开发和接口测试场景。

data mock

Last synced: 08 Jul 2025

https://github.com/himel-sarder/web-scraping-it-jobs-dataset

This project is a Python-based web scraping tool that collects job listings from TimesJobs for IT-related positions. It extracts job titles, company names, locations, and experience requirements, and saves the data into a CSV file. The tool uses BeautifulSoup and Pandas for web scraping and data manipulation.

data datascience dataset kaggle-dataset machine-learning machinelearning ml web-scraping

Last synced: 22 Feb 2026

https://github.com/stdlib-js/ndarray-slice-dimension-from

Return a read-only shifted view of an input ndarray along a specific dimension.

copy data javascript matrix ndarray node node-js nodejs shift slice stdlib structure truncate types vector view

Last synced: 24 Apr 2025

https://github.com/dalikewara/typego

typego provides custom type that can be used to construct information (such as success data, error data, etc)

custom data golang helper type typego

Last synced: 09 Apr 2025

https://github.com/ourouimed/github-profile

Simple Github Profile HTML CSS JS Using Github APi data

api css data github html js json

Last synced: 13 Apr 2026

https://github.com/yasenstar/powerbi_tutorial

Base on "PowerBI Tutorial" book, provide step by step video demo on learning and mastering Power BI tool

analytics data microsoft powerbi tutorial visualization

Last synced: 07 Jan 2026

https://github.com/seanowenhayes/recipe-scraper

A simple scraper uses puppeteer to scrape recipes and more from the web

crawler crawling data recipes scraping

Last synced: 22 Feb 2026

https://github.com/geo-y20/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber

Last synced: 13 Apr 2026

https://github.com/so-cool/uobrain

My solution to the University of Bristol PURE Data Challenge

competition data modeling

Last synced: 09 Sep 2025

https://github.com/jaldekoa/fdicapi

A Python wrapper to easily retrieve data from the BankFind Suite official API from FDIC in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 07 Jan 2026

https://github.com/alexscigalszky/palabras-aleatorias-data

This package have a set of datasets of random words, animals, colors, jokes, onomatopeias and types

aleatorias data palabras random words

Last synced: 04 Oct 2025

https://github.com/san089/black-friday-sales-analysis

This Project gives an insight into few statistics related to black Friday Sale.

custom data dataanalysis insights sales statistics

Last synced: 13 Jul 2025

https://github.com/nikhilash45/live_ipl_report

This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.

analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi

Last synced: 19 Mar 2026

https://github.com/hardwario/cloud-fetch

HARDWARIO Cloud Fetch - Data Extraction Tool

cli cloud data excel python

Last synced: 07 Feb 2026

https://github.com/kuro337/scalamono

Scala Monorepo Tooling for Kafka, Opensearch, Spark, Redpanda, Hadoop - and Lang Reference.

data database duckdb hadoop kafka redpanda sdala spark

Last synced: 13 Apr 2026

https://github.com/gkapfham/ast2016-paper

Source Code of and Supporting Files for a Paper Published at AST 2016

data latex-document paper research

Last synced: 19 Oct 2025

https://github.com/ispyhumanfly/prowler

Query the web, extract data from the results, and transform that data into a format you can use.

ai analytics business cryptocurrency data extract-data machine-learning mining scraping web

Last synced: 06 Sep 2025

https://github.com/stdlib-js/array-zero-to

Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 08 Jan 2026

https://github.com/jrcichra/ingestd

HTTP server that easily ingests data into a database

data gin hacktoberfest ingest ingestion restful-api

Last synced: 28 Apr 2026

https://github.com/ayushai/salesfoce-hospital-management

A custom Salesforce-based Hospital Management System with powerful dashboards and data analysis tools. It provides real-time insights into patient care, appointment scheduling, and inventory management, optimizing healthcare operations and decision-making.

analytics dashboard data salesforce-developers visualization

Last synced: 22 Feb 2026

https://github.com/stdlib-js/ndarray-slice

Return a read-only view of an input ndarray.

copy data javascript matrix ndarray node node-js nodejs slice stdlib structure types vector view

Last synced: 10 Mar 2026

https://github.com/spiceai/datasets

Spice AI curated dataset definitions for Spice.ai

ai bitcoin blockchain data ethereum polygon

Last synced: 20 Apr 2026

https://github.com/qedsoftware/afsisdb-demos

AfSIS DB Demos

agriculture data soil

Last synced: 27 Oct 2025

https://github.com/bredalis/scikitlearn

🤖 Library to create ML models 🤖

data ia learning-python librery ml python

Last synced: 30 May 2026

https://github.com/bastianolea/palestina

Visualizador sobre cifras de la masacre que Israel está llevando a cabo en Palestina y la franja de Gaza

app data meses palestina politica shiny social tiempo

Last synced: 06 Jul 2025

https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas

Neste Labs será apresentada a biblioteca Pandas, uma biblioteca Python de código aberto para análise de dados. Ela dá ao Python a capacidade de trabalhar com dados do tipo planilha, permitindo carregar, manipular e combinar dados rapidamente, entre outras funções. Python

data digital-innovation-one dio jupiter-notebook labs ms-excel panda python

Last synced: 14 May 2026

https://github.com/jmcanterafonseca/leaflet-context-information

A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2

context data fiware information leaflet map open visualization web

Last synced: 21 Apr 2026

https://github.com/gagolews/clustering-results-v1

A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)

benchmark benchmark-datasets clustering data dataset datasets machine-learning

Last synced: 16 Mar 2025

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/reubano/swutils

ScraperWiki box utility library

data library

Last synced: 14 Jan 2026

https://github.com/sandipbera35/blogapp.spring.boot

A proof-of-concept Project Of Blog application In Java Spring Boot, Spring Data JPA with mysql Minio Object Storage , it is an Integration with JWT authservice project(written in golang) .

data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot

Last synced: 13 Apr 2026

https://github.com/unownone/spenddy-link

Simple Privacy Friendly chrome extension to track your spends and more!

analytics data extension link

Last synced: 12 Mar 2026

https://github.com/s-raza/csvio

Wrapper for conveniently processing CSV files

csv data file processing wrapper

Last synced: 14 Jan 2026

https://github.com/outofbedlam/tine

TINE a data pipeline runner.

data pipeline

Last synced: 05 Oct 2025

https://github.com/sycho9/populater

:elephant: PHP script that populates your database tables with fake data using fzaninotto/faker

composer data database fake packagist php populate

Last synced: 13 Apr 2026

https://github.com/wangshouh/cryptofinancedata

An ipynb file containing data acquisition of futures, options and other financial derivatives

data financial-data

Last synced: 05 Oct 2025

https://github.com/helins/ex.clj

Java exceptions as clojure data

clojure data exception java java-exceptions

Last synced: 12 Dec 2025

https://github.com/hyperversal-blocks/averveil

Averveil is OpenSea for Data.

blockchain data golang iot privacy zero-knowledge zkp

Last synced: 14 Jan 2026

https://github.com/tushar2704/insurance-cross-sell

This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.

data datascience h20automl machine-learning mlflow python streamlit-tushar2704

Last synced: 08 Oct 2025

https://github.com/patrickdavies100/datapipeline37

Some Data Science practice using datasets available online. Currently test data is similar to this dataset: https://www.kaggle.com/datasets/asaniczka/amazon-uk-products-dataset-2023 but the plan is to expand.

data data-science pandas-dataframe python3

Last synced: 08 Oct 2025

https://github.com/liyakhathshaik/datascout.jl

This is a julia package

data datascout julia

Last synced: 09 Oct 2025

https://github.com/kamal-singh22/ai-driven-emotional-sentiments-analysis

This project leverages machine learning to analyze and classify the emotional sentiment of textual data. The goal is to accurately identify and categorize emotions, aiding applications in customer feedback analysis, social media sentiment analysis, and mental health monitoring.

analysis artificial-intelligence data emotion nlp-machine-learning python sentiment-analysis streamlit text-classification

Last synced: 14 Apr 2026

https://github.com/allwells/climate-change-api

Climate Change API with NodeJS, Axios, Express and Cheerio

api apis axios cheerio climate-change data expressjs heroku herokuapp news newsapi nodejs

Last synced: 11 Apr 2026

https://github.com/jpmens/airports-zonedata

$INCLUDE airport locations

airports data dns

Last synced: 19 Jan 2026

https://github.com/east-empire-trading-company/eetc-data-client

Client library for retrieving data managed by EETC Data Hub.

client-library data data-science finance library python

Last synced: 31 May 2026

https://github.com/strata/data

Tools to help you read data from a range of different data providers.

api data data-integration

Last synced: 27 Jan 2026

https://github.com/davorg/dmp

Data Munging with Perl

book data hacktoberfest munging perl

Last synced: 21 Jan 2026

https://github.com/stdlib-js/array-base-assert-is-real-floating-point-data-type

Test if an input value is a supported array real-valued floating-point data type.

array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate

Last synced: 12 Oct 2025

https://github.com/jrmedd/emojinal

An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.

data emojis sentiment user-research ux

Last synced: 19 Jan 2026

https://github.com/mccarthy-m-g/alda

An R data package for the book "Applied longitudinal data analysis: Modeling change and event occurrence" by Singer and Willett (2003).

data growth-curves longitudinal-data mixed-models nonlinear-mixed-models r r-package structural-equation-modeling survival-analysis time-to-event

Last synced: 19 Jan 2026

https://github.com/rohancyberops/r-language

R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.

caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse

Last synced: 12 Oct 2025

https://github.com/simonbernarding/ml_project_simonbernarding

This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.

data data-science flight-delay-prediction machine-learning machinelearning prediction

Last synced: 12 Oct 2025

https://github.com/Lemniscate-world/StratAI

This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.

ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading

Last synced: 13 Oct 2025

https://github.com/player29879/neum-ai

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors

Last synced: 18 Apr 2026

https://github.com/stdlib-js/strided-base-dtype-resolve-str

Return the data type string associated with a supported strided array data type value.

array data dtype dtypes enum javascript node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 13 Oct 2025

https://github.com/athul64/powerbi

Financial Reports Dashboard This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.

data data-an data-visualization dax dax-expression powerbi

Last synced: 23 Feb 2026