An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/mmaithani/singapore-residents-data-eda

The data contains the population of Singapore by ethnicity, age, and gender from 1957 to 2018.

data data-visualization ethnicity kaggle-dataset python singapore singapore-residents-data

Last synced: 16 Apr 2026

https://github.com/visenger/prada

Profiling Datasets

cleaning data dataset profiling

Last synced: 24 Aug 2025

https://github.com/gappeah/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 31 Jul 2025

https://github.com/derrickbaruga7/python-data-analysis

This project analyzes ORU’s off-season sewer usage using Python, with `pandas` for data handling, histograms and line plots for exploration, and a `scipy`-based model for prediction. Pearson’s correlation and visualizations help reveal key trends and relationships.

analytics data data-science visualization

Last synced: 31 Jul 2025

https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard

An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊

dashboard data data-analysis data-science data-visualization tableau tableau-public

Last synced: 17 Feb 2026

https://github.com/dannyben/datamix

DSL for manipulating tabular data

csv data data-analysis data-engineering gem ruby tabular-data

Last synced: 31 Jul 2025

https://github.com/margostino/job-pulse

PoC to analyse the hiring market

data golang mongodb visualization

Last synced: 16 May 2026

https://github.com/stdlib-js/ndarray-slice-assign

Assign element values from a broadcasted input ndarray to corresponding elements in an output ndarray view.

assign assignment copy data javascript matrix ndarray node node-js nodejs set setitem slice stdlib structure types vector view

Last synced: 11 Apr 2025

https://github.com/clabe45/kaz

Minimalistic local storage CLI

cli data minimalistic storage utility

Last synced: 17 Jul 2025

https://github.com/erinaldi/bmn2-lattice

Data analysis of lattice Monte Carlo simulations of quantum matrix models.

data data-science data-visualisation lattice

Last synced: 27 Mar 2025

https://github.com/mvuorre/psyarxivdb

Datasette serving PsyArXiv preprint metadata

data datasette open-science preprints psyarxiv

Last synced: 14 May 2026

https://github.com/chandraprakash-bathula/keywords_prediction-machine-learning-integration

Keyword prediction model, built by: data cleaning, removing stopwords, constructing Word2vec, and advancing to TF-IDF weighted Word2vec.

algori artifici data machine-learning tf-idf weighted-word2vec word2vec

Last synced: 08 Nov 2025

https://github.com/evoluteur/madeleinology

Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).

baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization

Last synced: 23 Jun 2025

https://github.com/tonykipkemboi/ens_subgraph_data

Query On-Chain Data from Subgraphs by The Graph Protocol using Python

data subgraphs thegraphprotocol web3

Last synced: 17 Sep 2025

https://github.com/LisaKey/convert-csv-to-sav

We used Python 🐍 to convert a CSV file into a SAV file, with all the modifications needed to open it in IBM SPSS and analyse our data.

analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations

Last synced: 03 Mar 2025

https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint

A Bioconductor ExperimentHub data package for flow-sorted, purified whole blood cell types measured using DNA methylation on the WGBS platform from BLUEPRINT.

bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs

Last synced: 25 Sep 2025

https://github.com/tillahoffmann/idxhound

🐶 Track indices across one or more numpy selections.

data numpy scientific-computing

Last synced: 14 May 2026

https://github.com/sergkash7/fdc-facade

Facade for The FoodData Central API.

api center data food usda

Last synced: 15 May 2026

https://github.com/v6ntage/sql-sales_data-analytics-project

This repository contains SQL scripts demonstrating analytical techniques.

analytics business-analytics data data-analysis database query sql sql-server

Last synced: 12 Apr 2026

https://github.com/marians/tour-tracker

Track the development of the Tour de France general classification, stage by stage.

cycling data sports statistics

Last synced: 24 Jun 2025

https://github.com/makosai/covid19datachart

A basic chart for checking corona data. Written in a single HTML file for convenience. Grab the single file and run it anywhere. Or visit the webpage.

chart chartjs corona coronavirus coronavirus-analysis covid-19 covid-2019 covid19 covid19-data data data-analysis datasets

Last synced: 23 Feb 2026

https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian

I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google BigQuery, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively.

data data-visualization tableau

Last synced: 17 Feb 2026

https://github.com/michellepellon/jobx

A modern, powerful job scraper for LinkedIn, Indeed and beyond.

compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper

Last synced: 17 Jan 2026

https://github.com/MikeBairdRocks/Fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 02 Apr 2025

https://github.com/2kabhishek/pokemon-stats

Gotta stat 'em all 🖲🐭

d3 data emoji pokemon rollup statistics

Last synced: 14 May 2026

https://github.com/chalk-ai/roadmap

Chalk public roadmap

chalk data data-science mlops pipeline python

Last synced: 17 Jan 2026

https://github.com/stdlib-js/array-base-filled4d-by

Create a filled four-dimensional nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 07 Sep 2025

https://github.com/derstimmler/aokexporter

Exporter for data from the statutory health insurance company AOK

aok cocona console csharp data dotnet export polly

Last synced: 15 May 2026

https://github.com/simranjeet97/leetcode_practice

Practicing LeetCode problems for competitive programming

algorithms amazon coding competitive-programming data data-structures facebook google leetcode python

Last synced: 03 Aug 2025

https://github.com/theryston/db-mycro

A Node module providing a JSON database that saves data in a specific directory, similar to SQLite but in JSON.

base crud data database db db-mycro javascript json jsondatabase nodejs nosql typescript

Last synced: 09 Apr 2026

https://github.com/undistraction/grid-model

A small API for creating a grid and accessing the positions of the cells, rows and columns within it.

2d calculations cells data grid layout model

Last synced: 04 Aug 2025

https://github.com/hoaihuongbk/lakeops

A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.

data data-operations dataengineering datalake

Last synced: 07 Mar 2026

https://github.com/wahyuwsslah/salary_prediction-aiml

Salary prediction using machine learning with three models: linear regression, decision tree, and random forest.

ai analytics data data-science datascience machine-learning python python3

Last synced: 19 May 2026

https://github.com/gematik/app-fhir-snapshots-package-generator

The repository contains a library and a console application to generate snapshots for StructureDefinitions in FHIR-packages.

data fhir miscellaneous

Last synced: 05 Oct 2025

https://github.com/diddypod/crop-data-comparer

A Python script to compare crop data over years

comparison crop data openpyxl python

Last synced: 31 Oct 2025

https://github.com/shgysk8zer0/schema

A PHP implementation of schema.org structured data objects

data microdata schema seo structured-data

Last synced: 24 Jun 2025

https://github.com/vishwagauravin/screener-scraper-pro

Effortlessly scrape comprehensive financial data from screener.in and use it in your projects. No API key required.

data finance finances market-data scraper scrapers screener screener-in screener-plugin stock stock-data stock-market stocks

Last synced: 18 Feb 2026

https://github.com/dostuffthatmatters/circadian-scp-upload

Resumable, interruptible, SCP upload client for any files or directories generated day by day

checksum daily data directories files library python scp ssh synchronization time-series upload utilities

Last synced: 24 Jun 2025

https://github.com/chompfoods/sdk-go

Go SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go grocery ingredients nutrition raw recipe-api recipes sdk

Last synced: 19 May 2026

https://github.com/tiaanduplessis/country-currency-data

Data about currencies of countries

countries currencies data symbols

Last synced: 08 Aug 2025

https://github.com/ate329/nsl-kdd-feature-extractor

Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.

cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset

Last synced: 30 Oct 2025

https://github.com/rustytake-off/datasets

Various datasets for 🤗 HuggingFace

data datasets docs huggingface

Last synced: 27 Mar 2025

https://github.com/dav009/bqt

Local unit tests for your BigQuery queries

bigquery bq data test unittest

Last synced: 11 Feb 2026

https://github.com/kockarevicivan/dot-net-snippets

Set of .NET code snippets: algorithms, data structures, graph searches etc, created for demonstration purposes.

algorithms binary c-sharp data generics graphs-pathfinding list structures

Last synced: 27 Mar 2025

https://github.com/ddeutils/ddedocs

📖 Data Developer & Engineer Documents and Hands-On

blogs data data-engineering documents hands-on

Last synced: 08 Aug 2025

https://github.com/garcane/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 08 Apr 2026

https://github.com/panukatan/senso

An Interface to the Philippine Census of Population and Housing Data

census data philippines r rstats

Last synced: 31 Oct 2025

https://github.com/bastianolea/fonasa_beneficiarios

Data on beneficiaries of the Fondo Nacional de Salud (National Health Fund), by system tier, age, age bracket, sex, and commune.

chile comunas data estado genero salud social

Last synced: 27 Feb 2026

https://github.com/gcoronelc/ucv_gdi-1_202302-b2

Data and Information Management I workshop with Gustavo Coronel.

data data-science data-structures database databases online oracle query relational-databases security sql sql-server

Last synced: 19 May 2026

https://github.com/millengustavo/salarios-data-science

Streamlit app for exploring data from the Data Science market survey conducted by Data Hackers.

brasil brazil ciencia-de-dados data data-science heroku salarios salary

Last synced: 07 Oct 2025

https://github.com/stdlib-js/ndarray-base-empty-like

Create an uninitialized ndarray having the same shape and data type as a provided ndarray.

base data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 09 Mar 2026

https://github.com/rubenhortas/python_examples

Examples of Python code and DSA (data structures and algorithms).

algorithm algorithms data dsa examples python python-3 python3 samples snippets structures

Last synced: 03 Oct 2025

https://github.com/tpgillam/teafiles.jl

Tea file support for Julia

data julia time-series

Last synced: 03 Oct 2025

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/alhonaut/quant-assigment

Code for quantitative analysis of Morpho Markets and simulation of the reallocation process in MetaMorpho.

analysis data defi quantitative-finance

Last synced: 16 May 2026

https://github.com/chrisru/f1stats

🗄️ Speedy API for Formula 1 statistics

api data fast formula1

Last synced: 20 Mar 2025

https://github.com/vikjam/ui-policy

Unemployment policy at the state level

data government government-data

Last synced: 13 Feb 2026

https://github.com/giorgiosavastano/process

processing-chain provides a convenient way to seamlessly set up processing chains for large amounts of data.

big-data data data-science parallel parallel-computing process processing processing-chain rust

Last synced: 05 Oct 2025

https://github.com/denisecase/nw-network-data-analytics

Network for those earning a NW Masters of Applied Data Science

analytics data

Last synced: 02 Feb 2026

https://github.com/danieljdufour/fast-bin

Quickly Convert an Array of Numbers into their Minimal Binary Representations

array binarize binary bits data nbits numbers unbinarize

Last synced: 13 Apr 2025

https://github.com/yoursrijit/data-structure-with-java

A data structure is a named location that can be used to store and organize data. An algorithm is a collection of steps to solve a particular problem. Learning data structures and algorithms allows us to write efficient and optimized computer programs.

data datastructures dsa-algorithm java linked-list

Last synced: 13 Mar 2025

https://github.com/cont-limno/lagosus-reservoir

Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.

data module

Last synced: 17 Jan 2026

https://github.com/jimbrig/jimstaskviews

CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com

cran data docs rstats shiny-app submodules task-views

Last synced: 06 Mar 2026

https://github.com/habedi/adbis-2023-paper

This repository hosts the code and data used for the experiments reported in the paper titled "Diversification of Top-k Geosocial Queries", published in ADBIS 2023

artifacts conference-paper data experiments graphs java research-paper

Last synced: 19 May 2026

https://github.com/stdlib-js/ndarray-base-zeros-like

Create a zero-filled ndarray having the same shape and data type as a provided ndarray.

base data fill filled javascript matrix ndarray node node-js nodejs stdlib structure types vector zeros

Last synced: 04 Oct 2025

https://github.com/bayer-group/cmc-ontologies

This is a submodule of cmc-knowledge-graph-setup. It contains ontologies and relevant data graph files

data ontologies owl turtle

Last synced: 16 Jun 2025

https://github.com/redodo/shipper

Hide encrypted data in files.

audio data images python steganography

Last synced: 26 Mar 2025

https://github.com/frefrik/covid19norge-api

API for COVID-19 cases in Norway

api covid covid-19 covid19 data fastapi norge norway

Last synced: 10 May 2026

https://github.com/thelich2112/bluesky-weather-poster

A WordPress plugin that takes info from a clientraw.txt file and posts it to Bluesky, with configurable posting options.

data posting station weather wordpress

Last synced: 17 May 2026

https://github.com/geo-c/oct-ckan

The Open City Toolkit (more information about the project: http://geo-c.eu)

cities collaboration data open participation transparency

Last synced: 16 May 2026

https://github.com/stdlib-js/ndarray-base-empty

Create an uninitialized ndarray having a specified shape and data type.

base data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 19 Feb 2026

https://github.com/pedro-donoso/productoskotlin

App that loads a list of products with ID, Name, Description, Available, Enabled, and Stock fields; converts the name to uppercase; replaces the Available and Enabled booleans with SI or NO; and sorts the products in descending order by Stock.

class data fun id kotlin kotlin-android list

Last synced: 19 May 2026

https://github.com/bluecolor/lauda

Cross database data transfer tool

data database etl extract jdbc load

Last synced: 02 May 2026

https://github.com/tsiarokhin/student_bsu_by

Tool for parsing various BSU student information from the student.bsu.by website.

belarus bsu data grades python students study university

Last synced: 28 May 2026

https://github.com/lucaaszsx/spyder

A powerful schema-based web scraping library for Node.js built for fast, structured, and reliable data extraction.

cheerio crawler data dom dom-manipulation html json json-ld parser scraper web xml

Last synced: 11 Jun 2026

https://github.com/m-muecke/isocountry

R package containing ISO codes for countries and currencies

country-codes currency-codes data iso-3166-1 iso-4217 r r-package

Last synced: 20 Mar 2025

https://github.com/stdlib-js/array-one-to-like

Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 20 Feb 2026

https://github.com/rishabh-agarwal/datastructuremachineproblem

Data Structure MP - Clemson University (Language C)

273 alogrithms clemson data ece structure university

Last synced: 26 Oct 2025

https://github.com/nafisalawalidris/elfeenah

Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest in blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.

artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning

Last synced: 11 Sep 2025

https://github.com/zediculz/block

Block is a data structure/collection that uses blockchain principles to manage data.

algorithm data structure

Last synced: 05 Oct 2025

https://github.com/harmonydata/harmony_examples

Example Jupyter notebook and R scripts using Harmony in real research problems

data data-harmonisation data-harmonization harmonisation psychology python r research

Last synced: 11 Jul 2025

https://github.com/panda-official/driftcli

CLI Client for Drift Platform

cli click command-line data

Last synced: 17 Feb 2026

https://github.com/parzibyte/cifrar-descifrar-php

Encrypt and decrypt data with PHP using the php-encryption library; encrypt with a general key or with keys derived from user passwords.

crypto data decrypt encryption password php security

Last synced: 20 May 2026