An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/metriccoders/metriccoders_datasets

This is the Metric Coders repository containing all the datasets for machine learning.

data datasets machine-learning natural-language-processing scikit-learn

Last synced: 08 Apr 2025

https://github.com/the-aerospace-corporation/pivt

PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.

analytics dashboards data devops jenkins pipeline python splunk visualization

Last synced: 29 Apr 2026

https://github.com/reubano/ckanny

A Python command line interface (CLI) for interacting with CKAN instances

ckan cli data featured open-data

Last synced: 28 Apr 2026

https://github.com/gniquyij/tuqiu

Dark side of OS X

data notes osx safari

Last synced: 12 Oct 2025

https://github.com/jackosheadev/databasetechproject

This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction

data database mssql sql

Last synced: 18 May 2026
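The workflow in this entry's description (create tables, populate them, query with selects, then simulate a transaction) can be sketched as follows. This is a minimal illustration, not the repository's code: the listed project targets MSSQL, while the sketch swaps in Python's built-in `sqlite3` so it runs without a server, and the `accounts` table, its columns, and the `transfer` helper are hypothetical names.

```python
import sqlite3

# In-memory database; the schema below is a hypothetical example.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    " id INTEGER PRIMARY KEY,"
    " owner TEXT NOT NULL,"
    " balance INTEGER NOT NULL CHECK (balance >= 0))"
)

# Populate the table.
conn.executemany(
    "INSERT INTO accounts (owner, balance) VALUES (?, ?)",
    [("alice", 100), ("bob", 50)],
)
conn.commit()


def transfer(conn, sender, receiver, amount):
    """Move `amount` between two accounts atomically; return True on success."""
    try:
        # Used as a context manager, the connection commits on success
        # and rolls back if an exception is raised inside the block.
        with conn:
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE owner = ?",
                (amount, sender),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE owner = ?",
                (amount, receiver),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected a negative balance.
        return False


ok = transfer(conn, "alice", "bob", 30)       # fits within alice's balance
failed = transfer(conn, "alice", "bob", 500)  # would overdraw, so it is rolled back

# View the data with a select.
balances = dict(conn.execute("SELECT owner, balance FROM accounts ORDER BY owner"))
```

The second transfer leaves both rows untouched, which is the point of wrapping the two updates in one transaction.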

https://github.com/ilejuxepwaduzd/structured-data-extractor

🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.

cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data

Last synced: 10 Apr 2026

https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml

Final project of the Data Girls 2025 Bootcamp, analyzing employee turnover using Machine Learning. Based on the IBM HR Analytics Attrition dataset, the project identifies the main risk factors and builds predictive models (SVC and Random Forest) with up to 89% accuracy to anticipate departures and support strategic HR decisions.

analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc

Last synced: 16 Apr 2026

https://github.com/stdlib-js/ndarray-base-zeros-like

Create a zero-filled ndarray having the same shape and data type as a provided ndarray.

base data fill filled javascript matrix ndarray node node-js nodejs stdlib structure types vector zeros

Last synced: 04 Oct 2025

https://github.com/nouman6093/advanced-statistical-models

In this repository I will upload everything I have learned about advanced statistical models in data science. There are over 42 statistical models. Each of them works on algorithms, and there are over 32 algorithms. Each library has its own way of writing such statistical models. After learning them, I will try to upload as many statistical models as possible.

data data-analysis data-science data-visualization

Last synced: 11 Jun 2026

https://github.com/khalyomede/fetch

Quickly retrieve your PHP data

config configuration data fetch php php7

Last synced: 15 Mar 2025

https://github.com/iguptashubham/walmart-eda

Imagine diving into the fascinating world of Walmart with just a few lines of code! This project lets you do that using MySQL, a powerful tool for data analysts. You can clean up messy data like a detective, uncovering hidden patterns and trends. Data scientists can take it further.

analysis data dataset eda mysql portfolio-project python sql

Last synced: 10 Apr 2026

https://github.com/sap-samples/security-research-codegraphsmote

Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.

augmentation data detection learning machine research sample security vulnerability

Last synced: 07 Jun 2026

https://github.com/aero-db/airports

A public and free dataset of all airports in the world

airports aviation csv data dataset json

Last synced: 27 Apr 2026

https://github.com/chriseaton/sample-database

A long-term supported sample dataset for file and database unit testing and validation. Simple, straight-forward, raw data shared across formats.

data database examples flat-file samples schema unit-testing

Last synced: 25 Apr 2026

https://github.com/castelao/bufr

BUFR binary data format from WMO

binary data format meteorology oceanography wmo

Last synced: 13 Jul 2025

https://github.com/codenoid/webtoons.com-database

A Webtoons.com database, collected by Hofesh Bot (scraper)

data database

Last synced: 28 Mar 2025

https://github.com/mattqdev/koalaz

Why not use koalas as mock data? With this npm package you can!

data koala lorem-ipsum meme mock placeholder

Last synced: 13 Jan 2026

https://github.com/tkonopka/makealive

Dynamic web content through controlled javascript

conversion-functions d3 data data-science javascript visualization

Last synced: 22 Apr 2026

https://github.com/lmuffato/project-ting-trybe

Ting project - Trybe graded project for Block 37: Data Structures II: Lists, Queues, and Stacks

data data-analysis python queue read-file stack trybe trybe-projects

Last synced: 12 Jun 2025

https://github.com/undistraction/grid-model

A small API for creating a grid and accessing the positions of the cells, rows and columns within it.

2d calculations cells data grid layout model

Last synced: 04 Aug 2025

https://github.com/lane-romuald/iot-irrigation-data-collection-system

An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.

arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi

Last synced: 12 Apr 2026

https://github.com/jinsyin/dataorigin

数据之源 | A data source management framework

data data-source datasource

Last synced: 21 Apr 2026

https://github.com/cicerotcv/br-gen

A browser extension for generating Brazilian placeholder data.

chrome data extension generation hacktoberfest

Last synced: 21 Apr 2026

https://github.com/connectaman/c-and-data-structure

Programs, notes, and explanations on data structures using C++

cpp data data-structures sorting-algorithms

Last synced: 14 Mar 2025

https://github.com/ktbarrett/scdil

simple configuration and data interchange language

configuration data json python yaml

Last synced: 20 Apr 2026

https://github.com/eugenedakin/caesarcipher

Native Xojo code for the Caesar Cipher algorithm with an example program

caesar-cipher data decryption encryption xojo

Last synced: 07 Jan 2026
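For readers unfamiliar with the algorithm named in this entry, a Caesar cipher replaces each letter with the one a fixed number of positions further along the alphabet, wrapping around at the end. The sketch below is a Python illustration only (the listed repository is written in Xojo), and the function name `caesar_shift` is a hypothetical choice.

```python
import string


def caesar_shift(text: str, shift: int) -> str:
    """Shift ASCII letters by `shift` positions, wrapping around the alphabet."""
    result = []
    for ch in text:
        if ch in string.ascii_lowercase:
            result.append(chr((ord(ch) - ord("a") + shift) % 26 + ord("a")))
        elif ch in string.ascii_uppercase:
            result.append(chr((ord(ch) - ord("A") + shift) % 26 + ord("A")))
        else:
            # Digits, spaces, and punctuation pass through unchanged.
            result.append(ch)
    return "".join(result)


encrypted = caesar_shift("Hello, World!", 3)  # "Khoor, Zruog!"
decrypted = caesar_shift(encrypted, -3)       # shifting back restores the input
```

Decryption is the same operation with the negated shift, so one function covers both directions.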

https://github.com/simranjeet97/leetcode_practice

Practicing LeetCode problems for competitive programming

algorithms amazon coding competitive-programming data data-structures facebook google leetcode python

Last synced: 03 Aug 2025

https://github.com/fastbolt/excel-writer

Excel-Writer component

data excel excel-export

Last synced: 14 Apr 2025

https://github.com/quasilyte/phpcorpus

A collection of various PHP code; useful for PHP tool writers to get some insight into what "real-world" PHP code looks like

analysis corpus data php php-corpus

Last synced: 04 Jul 2025

https://github.com/avahoffman/dataplay

🤸‍♂️ Load data to play with

data data-package r r-package rstats

Last synced: 25 Mar 2025

https://github.com/aiwithqasim/recommendationengines

Recommendation Engines with IBM, a project of the Data Scientist Nanodegree on Udacity. For this project I will analyze the interactions that users have with articles on the IBM Watson Studio platform and recommend new articles they are likely to enjoy.

data data-manging data-science ibm ipython-notebook normalization python3

Last synced: 18 Apr 2026

https://github.com/stdlib-js/array-base-reject

Return a shallow copy of an array containing only those elements which fail a test implemented by a predicate function.

array copy data filter generic javascript node node-js nodejs predicate reject stdlib structure test types

Last synced: 26 Dec 2025
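The behavior described for this entry (a shallow copy keeping only the elements that fail a predicate) is the inverse of a filter. The following is a plain-Python sketch of that idea, not the stdlib-js API; the name `reject` mirrors the package name, and the single-argument predicate is a simplifying assumption.

```python
from typing import Callable, List, TypeVar

T = TypeVar("T")


def reject(items: List[T], predicate: Callable[[T], bool]) -> List[T]:
    """Return a new list holding only the elements for which predicate is falsy."""
    return [item for item in items if not predicate(item)]


values = [1, 2, 3, 4, 5]
odds = reject(values, lambda v: v % 2 == 0)  # drops the even numbers
```

The input list is left unmodified, and the elements themselves are not copied, which is what "shallow" means here.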

https://github.com/izam-mohammed/data-source

🌐 A source directory for the data of my projects and experiments. 📂 This curated collection simplifies access to diverse data used in various projects 💡

csv-files data data-source zip-files

Last synced: 03 Jun 2026

https://github.com/sogful/archive

you will NEVER believe what this repository contains

archive crawl data scrapes sites

Last synced: 03 Jun 2026

https://github.com/stdlib-js/array-one-to-like

Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 20 Feb 2026
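As an illustration of the description above, the sketch below builds a 1, 2, 3, ... sequence with the same length and element type as an input array. It is a Python analogue using the standard-library `array` module, not the stdlib-js implementation, and `one_to_like` is a hypothetical function name.

```python
from array import array


def one_to_like(x: array) -> array:
    """Return 1, 2, ..., len(x) stored with the same typecode as `x`."""
    # Reusing the typecode keeps the output's element type equal to the input's.
    return array(x.typecode, range(1, len(x) + 1))


source = array("d", [0.0, 0.0, 0.0, 0.0])  # four double-precision floats
result = one_to_like(source)
```

Only the length and typecode of `source` are read; its contents do not affect the output.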

https://github.com/sadmanca/uoft-pey-coop-job-postings

Code for parsing approximately 1.8k HTML pages of UofT PEY co-op job postings (from September 2023 to May 2024) to a single sqlite3 database file.

co-op data html python singlefile sqlite sqlite3 uoft uoft-pey

Last synced: 17 Apr 2026

https://github.com/whitehathackerpr/data-visualization-tool

This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations

data data-analysis data-visualization python python3

Last synced: 05 Sep 2025

https://github.com/rishabh-agarwal/datastructuremachineproblem

Data Structure MP - Clemson University (Language C)

273 alogrithms clemson data ece structure university

Last synced: 26 Oct 2025

https://github.com/rousan/weshare

An application that transfers files between devices

c-sharp data dot-net file lan phone share transfer-data weshare wifi

Last synced: 17 Apr 2026

https://github.com/desininja/data-engineer-interview-questions

This repository contains all the Data Engineer Interview Questions asked by interviewers.

data data-engineer-interview-questions

Last synced: 31 Mar 2025

https://github.com/devsujay19/knowledgebase

My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.

data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel

Last synced: 10 Apr 2026

https://github.com/mohsinali08000/myportfolio

I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.

css data data-science html

Last synced: 22 Apr 2026

https://github.com/garcane/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 08 Apr 2026

https://github.com/devlive-community/mockaroo

A lightweight HTTP mock server for quickly building mock data APIs, suited to front-end and back-end development and API testing scenarios.

data mock

Last synced: 08 Jul 2025

https://github.com/awesomelistsio/awesome-open-data

A curated list of high-quality open data resources, tools, platforms, and projects across domains.

awesome awesome-list awesome-lists data open open-data

Last synced: 29 Jun 2025

https://github.com/sefakcmn00/tensorflow_car_price_analysis

In this project, after extracting the data sets as csv, we tried to represent the car prices graphically and schematically by using data analysis and data visualization methods. We checked the connection of the car prices we analyzed with other data, then we created a 4-layer and 12-neuron system.

data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow

Last synced: 14 Apr 2026

https://github.com/seanowenhayes/recipe-scraper

A simple scraper that uses Puppeteer to scrape recipes and more from the web

crawler crawling data recipes scraping

Last synced: 22 Feb 2026

https://github.com/bukalapak/bukadata

Data supplier plugin for populating design with real data.

data plugin sketch sketch-plugin

Last synced: 05 Jul 2025

https://github.com/desmondsanctity/abeona-kafka

A demo showing how to integrate Upstash's serverless Kafka into a Node.js microservice. Presented at Berlin Buzzwords 2024.

berlin-buzzwords data event-driven kafka microservice serverless streaming upstash-kafka

Last synced: 15 May 2025

https://github.com/docusign/extension-app-data-io-reference-implementation

Extension App for Data IO Reference Implementation for the Docusign IAM Platform

apps data extension

Last synced: 02 Mar 2026

https://github.com/stdlib-js/array-base-none-by-right

Test whether all elements in an array fail a test implemented by a predicate function, iterating from right to left.

all array data every generic javascript node node-js nodejs none predicate stdlib structure test types validate

Last synced: 01 Mar 2026
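The check described in this entry returns true only when no element passes the predicate, and it walks the array from the last element to the first. A plain-Python sketch of that logic, assuming a single-argument predicate and the hypothetical name `none_by_right`, could look like this:

```python
def none_by_right(items, predicate):
    """Return True if predicate fails for every element, scanning right to left."""
    for index in range(len(items) - 1, -1, -1):
        if predicate(items[index]):
            return False  # stop at the first element that passes
    return True


visited = []


def is_negative(value):
    visited.append(value)  # record the order in which elements are tested
    return value < 0


result = none_by_right([1, -2, 3, 4], is_negative)
```

Because the scan stops at the first passing element, the iteration direction matters for predicates with side effects or uneven cost: here the element `1` is never tested.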

https://github.com/jaldekoa/fdicapi

A Python wrapper to easily retrieve data from the BankFind Suite official API from FDIC in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 07 Jan 2026

https://github.com/ghonimo/diode-pn-junction-characterization-psu-ece515

A detailed analysis of the I-V characteristics of a PN junction diode (1N4148) under different temperatures, utilizing Excel for graphical analysis and parameter extraction. This study was conducted as part of the ECE 515: Fundamentals of Semiconductor Devices course at Portland State University.

analysis characterization data device diode diodes excel mosfet-transistor pn-junction

Last synced: 28 Feb 2026

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/stdlib-js/array-base-assert-is-complex-floating-point-data-type

Test if an input value is a supported array complex-valued floating-point data type.

array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate

Last synced: 14 Feb 2026

https://github.com/anobaka/insidecollector

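This is software that sits between Excel and a pure record-keeping tool: you can freely create all kinds of lists, link them together using various rules, and create custom views to help you better understand your data.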

collection data excel-like list list-manager table

Last synced: 19 Jan 2026

https://github.com/m0nica/datalogues-outdated

Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll

data pelican pelican-blog pelican-theme

Last synced: 28 Feb 2026

https://github.com/freddy03h/immutable-data-structure

Normalize and Merge your application's data store using Immutable.JS objects

data immutable redux store

Last synced: 05 Oct 2025

https://github.com/ispyhumanfly/prowler

Query the web, extract data from the results, and transform that data into a format you can use.

ai analytics business cryptocurrency data extract-data machine-learning mining scraping web

Last synced: 06 Sep 2025

https://github.com/stdlib-js/ndarray-base-fliplr

Return a view of an input ndarray in which the order of elements along the last dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 11 Feb 2026

https://github.com/gematik/app-fhir-snapshots-package-generator

The repository contains a library and a console application to generate snapshots for StructureDefinitions in FHIR-packages.

data fhir miscellaneous

Last synced: 05 Oct 2025

https://github.com/chandraprakash-bathula/keywords_prediction-machine-learning-integration

Keywords prediction model. The model was built by: cleaning the data, removing stopwords, constructing Word2vec, and advancing to TF-IDF weighted Word2vec.

algori artifici data machine-learning tf-idf weighted-word2vec word2vec

Last synced: 08 Nov 2025
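The last step named in this entry, TF-IDF weighted Word2vec, usually means averaging a document's word vectors with each word weighted by its TF-IDF score. The sketch below is one common formulation and not the repository's code: the toy two-dimensional `embeddings` stand in for trained Word2vec vectors, the smoothed IDF formula `ln((1 + n) / (1 + df)) + 1` is an assumed choice, and all names are hypothetical.

```python
import math
from collections import Counter


def tfidf_weighted_vector(doc, corpus, embeddings):
    """Average the word vectors of `doc`, weighting each word by TF-IDF."""
    dim = len(next(iter(embeddings.values())))
    weighted_sum = [0.0] * dim
    total_weight = 0.0
    for token, count in Counter(doc).items():
        if token not in embeddings:
            continue  # skip out-of-vocabulary words
        tf = count / len(doc)
        df = sum(1 for other in corpus if token in other)
        idf = math.log((1 + len(corpus)) / (1 + df)) + 1.0  # smoothed IDF
        weight = tf * idf
        for i, component in enumerate(embeddings[token]):
            weighted_sum[i] += weight * component
        total_weight += weight
    if total_weight == 0.0:
        return weighted_sum  # no known words: return the zero vector
    return [value / total_weight for value in weighted_sum]


# Toy data: stopwords are assumed to have been removed already.
corpus = [["python", "error"], ["python", "install"]]
embeddings = {"python": [1.0, 0.0], "error": [0.0, 1.0], "install": [1.0, 1.0]}
vector = tfidf_weighted_vector(corpus[0], corpus, embeddings)
```

In the toy corpus "error" appears in fewer documents than "python", so it receives the larger weight and pulls the document vector toward its own direction.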

https://github.com/stdlib-js/ndarray-base-empty-like

Create an uninitialized ndarray having the same shape and data type as a provided ndarray.

base data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 09 Mar 2026

https://github.com/viisix/corecat

Core repository of DanceCats project.

data lightweight python3

Last synced: 25 May 2026

https://github.com/jhpoelen/bats

self-documenting data publication on Bat (Chiroptera) specimen

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/ngambip/priscilla

About my work and experience

accounting analytics data finance-management

Last synced: 03 Feb 2026

https://github.com/jrcichra/ingestd

HTTP server that easily ingests data into a database

data gin hacktoberfest ingest ingestion restful-api

Last synced: 28 Apr 2026

https://github.com/danielbello7/nosql-json-database

Simple and quick database to help speed up the development process

data database json json-database models nosql nosql-database nosql-json-database schema

Last synced: 09 May 2026

https://github.com/3squared/smoulder

Smoulder is a really good data pipe

composition data facade-pattern forge-framework object-oriented

Last synced: 25 Apr 2026

https://github.com/garcane/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 09 Feb 2026

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026

https://github.com/dbriane208/omdena-apprenticeship-project

This is part of my contribution to the Omdena apprenticeship program.

data data-science feature-engineering machine-learning

Last synced: 14 Mar 2026