An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/leapfrogtechnology/datamegh

Datamegh - Data Engineering for the cloud.

cloud cloud-native data datamegh docker megha python serverless

Last synced: 14 May 2026

https://github.com/0xdir/relief_web_dart

A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters

api dart data humanitarian jobs

Last synced: 11 Jun 2026

https://github.com/mmaithani/loan-approvel-ml-model-with-insights

This project approves or rejects loan applications. A public API, data insights, and predictive models for the loan prediction project are also provided.

data data-science loan-prediction-analysis machine-learning visualization

Last synced: 16 Aug 2025

https://github.com/financejs/discord-bot

A Discord bot used in the Financejs Discord server

data discord discord-bot discordjs-bot finance financejs financial

Last synced: 13 Apr 2026

https://github.com/quetz-al/quetzal-client

Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 28 Jul 2025

https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection

This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides an end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.

anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning

Last synced: 20 Apr 2026

https://github.com/clinical-genomics/housekeeper

File data orchestrator

data file orchestrator

Last synced: 15 Aug 2025

https://github.com/tomwhite/chernoff

A visual mood indicator. One of the first Java programs I ever wrote.

chernoff-faces data visualization

Last synced: 20 Apr 2026

https://github.com/nixhantb/data-structures-and-algorithms-in-java-

Master Java programming and data structures and algorithms in Java efficiently. Clear explanations of recursion and sorting.

algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming

Last synced: 05 Jul 2025

https://github.com/rdmpage/checklist-of-the-freshwater-snails-of-sabah

Data from "A preliminary checklist of the freshwater snails of Sabah (Malaysian Borneo) deposited in the BORNEENSIS collection, Universiti Malaysia Sabah", https://doi.org/10.3897/zookeys.673.12544

checklist data gbif google-earth kmz sabah

Last synced: 09 Mar 2026

https://github.com/nrennie/londonmarathon

R package containing data relating to the London Marathon.

data r r-package

Last synced: 02 Apr 2025

https://github.com/divithraju/divith-raju-searchengine-wikipedia

Search engine optimization: a complete search engine experience built on top of a 75 GB Wikipedia corpus with subsecond search latency. Results contain wiki pages ordered by TF-IDF relevance to the given search word(s). From optimized code to the K-way merge sort algorithm, this project addresses latency, indexing, and big data challenges.

algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia

Last synced: 16 May 2026

https://github.com/rn0x/aliexpress_product_data

Extract product data from the AliExpress website

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/ymougenel/referencecollector

Helps you gather, store, and share reference links

ansible data docker keycloak kotlin spring-boot thymeleaf

Last synced: 14 Apr 2026

https://github.com/yakupzengin/data-structures-and-algortihms

This repo contains implementations of data structures and algorithms in Java

algorithms algorithms-and-data-structures data structure

Last synced: 03 Dec 2025

https://github.com/bdpedigo/neuropull

A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.

connectome connectomes connectomics data dataset networks networks-biology

Last synced: 05 Oct 2025

https://github.com/richardschoen/ibmixmlservicestd

IBM i XMLSERVICE C# and VB.Net Data Access Service Wrapper for .Net 4.6.1 and above and .Net Core 2.0 and above

as400 cl cobol command data database db2 ddm drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg service xmlservice

Last synced: 18 Apr 2025

https://github.com/sapienzanlp/exploring-srl

Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"

acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl

Last synced: 31 Jan 2026

https://github.com/wonderium/browser-releases

This repository contains release dates for browser versions.

browsers data json releases wonderium

Last synced: 31 Jan 2026

https://github.com/mystpi/crossings

🌉 A tiny library focused on easily connecting JS to HTML.

connect data frontend html javascript reactive simple small tiny

Last synced: 10 Jun 2026

https://github.com/marcuwynu23/phaddress

Data API of regions, provinces, cities/municipalities, and barangays of the Philippines

address address-data-api api barangay city data geolocation municipalities provinces

Last synced: 14 Feb 2026

https://github.com/yashika-malhotra/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Descriptive analysis and visualization of CardioFlex treadmill data to provide insights and recommendations to improve their user base.

colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026

https://github.com/joelllllll/up-sync

Sync account and transaction data from Up Bank to your local environment

accounts bank data postgres sync transactions up upbank

Last synced: 06 Jul 2025

https://github.com/nikolaydubina/aws-s3-reader

Efficient Go Reader for large AWS S3 Objects

aws data golang reader s3 streaming

Last synced: 30 Apr 2026

https://github.com/ismet55555/pdw-asym-2link

A clear and easy way of simulating a passive dynamic walker (PDW) model, derived and executed using MATLAB.

data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator

Last synced: 29 Apr 2026

https://github.com/arcticsnow/climatepy

Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)

climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray

Last synced: 05 Feb 2026

https://github.com/physio/flatten-ts

Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.

array conversion data flatten javascript json object typescript

Last synced: 06 May 2026

https://github.com/automators-com/datamaker-js

The official Node.js / TypeScript library for the DataMaker API

data javascript nodejs typescript

Last synced: 11 Oct 2025

https://github.com/andreaselia/quotes-xd

A plugin for Adobe XD to insert a text element with a random quote and respective author.

adobe adobe-xd data design design-tool design-tools quote random xd

Last synced: 24 Apr 2026

https://github.com/yashmistry-24/ytcomment-iq

YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.

analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube

Last synced: 15 Feb 2026

https://github.com/stefen-taime/open-source-data

This repository contains structured datasets in various categories

csv data json python3 xml

Last synced: 19 Feb 2026

https://github.com/gauravkoradiya/tensorflow-data-and-deployement

This repository demonstrates the use of data and deployment pipelines in TensorFlow.

data deployment machine-learning-algorithms pipline tensorflowjs

Last synced: 06 Oct 2025

https://github.com/mujadded/facebook_scrapper

The Facebook scraper gem that doesn't need the API

data data-mining facebook ruby-gem scrapper selenium-webdriver

Last synced: 28 Oct 2025

https://github.com/wamphlett/input-collection

A smarter and stricter way to capture and validate request data

data dto forms php validation

Last synced: 27 May 2026

https://github.com/frnt-end/weather-app-react

⚛️ React project: fetch and toggle the display of current weather in Berlin, Paris, New York, and London (tabs), using axios for API requests. Watch the demo 🌞 https://Frnt-End.github.io/Weather-App-React 👈

api axios axios-react background card current-weather data fetch gh-pages react reactjs tabs toggle ui usestate usestate-hook weather weather-app weather-information weatherapp

Last synced: 18 Feb 2026

https://github.com/doughtnerd/pod

Read and write Excel data with Java

data excel extract poi-library

Last synced: 08 Apr 2025

https://github.com/j1sk1ss/dateapppc.exmpl

A simple native Windows application demonstrating OOP and SQL databases, using a dating app as the example.

data oop-principles parsing pgadmin4 sql wpf

Last synced: 11 Apr 2026

https://github.com/quin1sue/priceguidesph-bettergov

An economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026

https://github.com/stdlib-js/ndarray-base-dtype-str2enum

Return the enumeration constant associated with an ndarray data type string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 15 Mar 2026

https://github.com/tusharnankani/analysis-2.0

An Exhaustive WhatsApp Chat Data Analysis 2.0

analysis data data-science plots trends visualization

Last synced: 31 Mar 2025

https://github.com/cbartram/advancedai

AdvancedAI Selection Option for Command and Conquer Generals Zero Hour

data games java streams

Last synced: 30 May 2026

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/camara94/data-visualization-with-python

Data visualization and some of the best practices when creating plots and visuals. The history and architecture of Matplotlib, and how to do basic plotting with Matplotlib. Generating different visualization tools using Matplotlib such as line plots, area plots, histograms, bar charts, box plots, and pie charts. Seaborn, another data visualization library in Python, and how to use it to create attractive statistical graphics. Folium, and how to use it to create maps and visualize geospatial data.

data data-science data-structures data-visualization python3

Last synced: 16 May 2026

https://github.com/yanpitangui/iteminfoconverter

Application that converts ragnarok legacy data files to iteminfo.lua

data itemdbconf iteminfo luafiles ragnarok

Last synced: 12 Oct 2025

https://github.com/leeper/mcode

Functions to merge and recode across multiple variables

data data-transformation r recode recoding

Last synced: 16 May 2025

https://github.com/sanand0/imdbscrape

A weekly archive of the IMDb Top 250 results, automatically scraped via GitHub Actions. Useful for seeing trends in the IMDb Top 250.

data

Last synced: 30 May 2026

https://github.com/flrd/standardlastprofile

R Data Package for BDEW Standard Load Profiles in Electricity

data electricity germany r

Last synced: 16 Mar 2026

https://github.com/14richa/patient-readmission-analysis

This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.

analytics data jupyter-notebook numpy

Last synced: 29 Apr 2026

https://github.com/wibosco/modelingformchanges-example

An example project to show how we can implement a model to simplify form validation

data swift unit-testing validator

Last synced: 16 Mar 2025

https://github.com/figuran04/big-data

📃 Big Data practicum

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/pommes-public/pommesdata

A full-featured transparent data preparation routine from raw data to POMMES model inputs

data opensource power raw-data transparent

Last synced: 07 Oct 2025

https://github.com/infinitode/pwlds

A public dataset of over 10 million passwords, with assigned strength levels.

ai classes classification cyber-security data dataset ml open-source password passwords synthetic-data

Last synced: 22 Feb 2026

https://github.com/caelean/twittermap

Map of Twitter users' influence as defined by influencetracker

data google-maps maps sparql twitter visualization

Last synced: 14 Jun 2025

https://github.com/OliverHennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 30 Jul 2025

https://github.com/norton120/dfmock

Python pandas DataFrame mock generator. Need mocked data in a DataFrame? This is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain-text, human-readable, optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/mollybeach/cherryether

CherryEther: Typescript Staking Deposits Ethereum Transactions

blockchain data data-science ethereum typescripts

Last synced: 21 May 2026

https://github.com/zalweny26/tools

Just a bunch of tools made in TypeScript.

algorithms data dimensionality distances helpers reduction sortings structures tools utils

Last synced: 03 Feb 2026

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

A Python project I made to improve my Python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/ingmarboeschen/jatsdecoderevaluation

Evaluation data and code

data evaluation jatsdecoder

Last synced: 04 Feb 2026

https://github.com/woo071002/parcel-management-system

A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.

cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml

Last synced: 08 Oct 2025

https://github.com/stdlib-js/ndarray-base-from-scalar

Convert a scalar value to a zero-dimensional ndarray.

base convert data javascript ndarray node node-js nodejs scalar stdlib structure types wrap

Last synced: 03 Jul 2025

https://github.com/plabayo/datapoints.earth

Earth data liberation for and by its citizens.

data foss free scrape

Last synced: 15 Mar 2026

https://github.com/askaniy/celestialocationsmaker

Tool for making Celestia location files

celestia data geology locations mapping planetary-science space

Last synced: 14 Mar 2025

https://github.com/y0hnn/slack-file-downloader

Download files from Slack servers using an export dataset. Useful when you want to quit Slack but keep your files.

channels data export gdpr privacy slack

Last synced: 27 Apr 2026

https://github.com/dark-art108/yonk

A CLI utility to streamline data science work by creating templates

data machine-learning python3

Last synced: 08 May 2026

https://github.com/lxcoding06/e-gereja

CRUD website for churches, for managing congregation, death, marriage, and baptism records

data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan

Last synced: 15 May 2025

https://github.com/ctechhindi/auto-fill-form-data

Auto-fill and autocomplete user data by key name

autocomplete chrome-extension data extension

Last synced: 17 Apr 2026

https://github.com/andrew-johnson-4/misspeller

Take correctly spelled words and return common spelling mistakes

common-mistakes data language natural nlp processing rust

Last synced: 30 Apr 2025

https://github.com/a3r0id/lightshot-data-miner

A random idea I had a while back to make a data miner for Lightshot. I never released this, but after a friend sent me a post about Lightshot's transparency I figured it'd be a good time to release it. I've included some output from a run before making the repo. I am not responsible for the imagery or its contents.

brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition

Last synced: 19 Oct 2025

https://github.com/oliver021/entity-dock

A superset of libraries, components, tools, and more for working with entities on .NET

api asp-net-core controller data database dotnet entity entity-framework-core library model mvc netstandard orm support webapi

Last synced: 09 May 2026

https://github.com/axa-ch/health-insurance-data

Swiss health insurance data

axa data health insurance swiss

Last synced: 19 Mar 2026

https://github.com/iusztinpaul/airbnb-data-analysis

Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.

airbnb data datanalysis datascience machine-learning numpy pandas python

Last synced: 06 May 2026