An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/jbdesbas/custom-scripts

Custom SQL functions or scripts

data database sql

Last synced: 28 Jun 2026

https://github.com/anuraganalog/365-data-science

A Repository which contains lecture notes, exercise, solutions

365 data exercises ipynb lecture notes pdfs python python3 science solutions sql

Last synced: 15 May 2026

https://github.com/luminati-io/crunchbase-dataset-samples

A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.

crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping

Last synced: 17 Mar 2025

https://github.com/cliffano/volothamp

Random D&D stuffs my son and I dabble with

data dungeons-and-dragons info little-godzilla

Last synced: 06 Apr 2025

https://github.com/josephbarbierdarnal/cieri-analytics.com

CIERI Analytics is the applied research department of the non-profit organization CIERI.

analysis behavior data identity research

Last synced: 12 Jan 2026

https://github.com/tomasfarias/pipeline

A simple data pipeline done as a challenge project

challenge data python

Last synced: 29 Mar 2025

https://github.com/cpanse/tartare

raw file collection recorded on Thermo Fisher Scientific mass spectrometers for extented unit testing

bioconductor blob data r unittesting

Last synced: 03 Apr 2025

https://github.com/sstendahl/giscan

Simple tool to read and analyze existing GISAXS data

cbf data diffraction diffraction-analysis gisans gisaxs physics reflectivity scattering xray

Last synced: 30 Jun 2026

https://github.com/owengombas/genyus

🐍 Lyrics analysis with genius.com, Python and Jupyter Notebooks

api data data-science genius jupyter-notebook lyrics python statistics

Last synced: 20 May 2026

https://github.com/seafloor-geodesy/gnatss-test-data

Repository to host test data for GNATSS software

data testing

Last synced: 06 Apr 2026

https://github.com/shukkkur/py_dash

Assignment for ETL Course - Dashbaord (plotly & dash)

dash dashboard data data-visualization plotly

Last synced: 06 Oct 2025

https://github.com/rob-med/data-visualizations-for-python

A collection of useful snippets for clean data visualizations in Python (with matplotlib)

academic-publishing data data-science data-visualization dataviz matplotlib python scientific-publications storytelling visualization

Last synced: 08 May 2026

https://github.com/tezcatlipoca0000/ayudante

It's mainly a program for a store to manage the products data

data javascript scraping self-taught web

Last synced: 09 Apr 2025

https://github.com/tezcatlipoca0000/db-helper_sf

A program tailored for my workplace; it analyze, visualize and manipulate a Firebird 2.0 database

data data-visualization fdb firebird jupyter-notebook pandas python3

Last synced: 09 Apr 2025

https://github.com/xrahul/android-logs

Get logs of various sensors and events in android 6.0+

android data events logs

Last synced: 20 May 2026

https://github.com/ornella-gigante/wildlife-data-analysis-toolkit-ml

A data-driven exploration of Canis lupus signatus (Iberian) and Canis lupus labradorius (Labrador) subspecies, leveraging Jupyter Notebook and pandas to analyze weight distributions (25-56 kg), geographic patterns, and reproductive behaviors. Features size-weight correlations and NaN-handling workflows for robust ecological insights

analysis data datasets jupyter-notebook pandas-dataframe python

Last synced: 15 May 2026

https://github.com/snitkin-lab-umich/prewas_manuscript_analysis

Manuscript in support of prewas software

data data-visualisation manuscript r

Last synced: 08 Jul 2025

https://github.com/nia-cloud-official/influx

Influx is a powerful search engine application designed to provide access to personal information of individuals from anywhere in the world. With Influx, users can search for and retrieve personal details of people, enabling them to find and connect with individuals across the globe.

data find people-search search-engine

Last synced: 27 Jun 2025

https://github.com/concaption/ksa-lawyers-data

scraped data of ksa lawyers and law firms

data lawyers

Last synced: 03 Apr 2025

https://github.com/gappeah/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 25 Feb 2025

https://github.com/gappeah/beverage-sales-analytics

This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.

data data-visualization powerbi powerbi-report powerbi-visuals

Last synced: 25 Feb 2025

https://github.com/gappeah/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 11 Jun 2025

https://github.com/speakeasy-sdks/fivetran-python-sdk

Python SDK for accessing Fivetran API.

api connector data fivetran fivetran-connector python sdk

Last synced: 01 Jul 2025

https://github.com/rafalwrzeszcz-wrzasqpl/pl.wrzasq.commons

General-purpose data structures and routines.

aws data data-structures library rust

Last synced: 10 Apr 2025

https://github.com/amazingtest/data4test

测试数据构造生成器,you can get useful data here for software testing

data test-automation testdata testdatabuilder testing testing-tools

Last synced: 16 Jan 2026

https://github.com/passidel/weedmap

Konsumverbot Cannabis

cannabis data map visual

Last synced: 14 Mar 2025

https://github.com/antoineaugusti/antennes-free

Historique des antennes relais Free Mobile en maintenance ou en panne

data free-mobile free-mobile-operator mobile-networks

Last synced: 30 Jul 2025

https://github.com/swarchal/morar

Processing phenotypic screening data

biology data data-analysis drug-discovery hts phenotypic

Last synced: 19 Jun 2025

https://github.com/instafluff/acdb

Animal Crossing Database API

animal api crossing data database json open villagers

Last synced: 28 Apr 2026

https://github.com/fairdataihub/fair-amd-oct-paper-code

Code associated with the paper on FAIR assessment of AMD-related datasets containing OCT data

amd biomedical data eye fair oct

Last synced: 03 Apr 2025

https://github.com/realabbas/instagram-user-meta-data

Instagram User Meta Data 📷 can be fetched using this script in an easy to use JSON Object for displaying Instagram Cards.

data instagram javascript metadata nodejs profile user xray

Last synced: 10 May 2026

https://github.com/puzzlef/graph-openmp

Design of high-performance parallel Graph interface supporting efficient Dynamic batch updates.

data digraph directed graph in mtx openmp parallel structure undirected weighted

Last synced: 06 Apr 2025

https://github.com/puzzlef/hybrid-csr

Comparing space usage of regular vs hybrid CSR.

csr data graph hybrid regular space structure usage

Last synced: 06 Apr 2025

https://github.com/cobluestars/dataherd-raika

"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."

big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience

Last synced: 02 Jan 2026

https://github.com/alexandregazagnes/ghisa

ghisa - Github Import Statistic Analyzer is a free and open-source software, app and python package that helps you to analyze the import statistics of your github repositories.

analytics data dependencies git github github-api import package pypi python skills tool

Last synced: 27 Jun 2025

https://github.com/adrian-pasek-prv/data-modeling-with-cassandra

Create a data model in Apache Cassandra for music streaming app

apache-cassandra data data-engineering data-modeling python

Last synced: 02 Jan 2026

https://github.com/beangreen247/osfetch-old.sh

script that fetches system information and displays it to the user

247 bash bean beangreen247 data fetch green information neofetch neofetch-clone os script sh shell storage system tem zsh

Last synced: 02 Nov 2025

https://github.com/ibz-04/data-encryption

Encrypting and Decrypting given data of hospital patients such as: audio & image files

data decryption encryption

Last synced: 23 Jul 2025

https://github.com/mheadd/SamDotNet

:office: A C# wrapper for the SAM.gov API.

api business client data gov-api government

Last synced: 30 Apr 2025

https://github.com/12joan/not-analytics

don't be creepy.

data metrics privacy

Last synced: 30 Apr 2025

https://github.com/avto-dev/data-migrations-laravel

Package for database data migrations

data database laravel migrations package

Last synced: 12 Jul 2025

https://github.com/davorg/data-tree

Perl library for handling trees

data perl tree

Last synced: 02 Apr 2025

https://github.com/katerynazakharova/common-ml

Creating this lib for ML tasks, because I'm bored of copy-pasting the same functions for different projects.

data data-processing deep-learning lib machi

Last synced: 26 Mar 2025

https://github.com/alpheustangs/jder

A standardized structure for JSON responses

api data error json response specification structure

Last synced: 26 Mar 2025

https://github.com/williamzebrowski/assistant-api

OpenAI Assistant API integrated with Elasticsearch, Logstash & Kibana

ai chatapp chatgpt conversational-ai data elasticsearch kibana llm-inference llms openai rag

Last synced: 16 Feb 2026

https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform

This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.

bigquery data data-analysis data-modeling live-streaming sql

Last synced: 23 Jun 2025

https://github.com/bredalis/matplotlib

📊 Library to create graphs in Python 📊

data graphics librery matplotlib matplotlib-pyplot python

Last synced: 30 Mar 2025

https://github.com/vulcalien/vulcdataformat

Simple data storage system for Java.

data data-storage java serialization

Last synced: 25 Feb 2025

https://github.com/e-kotov/mapineqr

Access Mapineq inequality indicators via API

data demogrpahy r rstats socio-economic-indicators

Last synced: 06 Apr 2025

https://github.com/mierune/tinygrib2

(experimental) A tiny toolkit for parsing JMA's GRIB2 files.

data grib grib2 meteorology rust weather

Last synced: 27 Jun 2025

https://github.com/soulyma/web_crawler

A focused web crawler to extract and structure Arabic content from web pages. Designed for researchers, data analysts, and developers working on Arabic language datasets.

beautifulsoup4 crawler csv data json python structured-data

Last synced: 15 May 2026

https://github.com/tobinchilongo/oop-school-library

This project consists of Ruby script for the school library app. I implemented encapsulation and inheritance with Ruby by creating classes to represent students and teachers in the school.

data database gemfile input-output preserve rspec-testing rubocop unit-test

Last synced: 02 May 2026

https://github.com/bhpcv252/dda-binapprox-on-fits

Using the binapprox algorithm to efficiently estimate the median of each pixel from a set of astronomy images in FITS files.

astronomy data median python

Last synced: 22 Mar 2025

https://github.com/jensz12/uhc

Datapack til Minecraft 1.13+ UHC

data minecraft pack

Last synced: 21 Sep 2025

https://github.com/maxnowack/elastic-sync

Connector to sync mongodb documents into a elasticsearch index

data elasticsearch mongodb sync

Last synced: 20 Jan 2026

https://github.com/ferhatgec/tuc

TinyUrl CLI, generate short link/s from terminal.

data little python3 request script

Last synced: 18 Feb 2026

https://github.com/stdlib-js/ndarray-empty

Create an uninitialized ndarray having a specified shape and data type.

data empty javascript matrix ndarray node node-js nodejs stdlib structure types vector

Last synced: 14 May 2025

https://github.com/qetdr/names-genders

Surnames, genders, and gender probabilities data extraction script and dataset

data python

Last synced: 01 May 2026

https://github.com/jen-uis/loan-status-prediction

This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.

data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration

Last synced: 02 Jan 2026

https://github.com/hoangsonww/fred-banking-data-analysis

💸 AI-powered banking data explorer that combines FRED API insights with vector search, regression analysis, and interactive chat via OpenAI, Claude, and Gemini. Built with TypeScript, React, and Express for seamless full-stack performance.

anthropic chartjs claude-ai data data-analysis data-analytics data-science data-visualization fred fred-api gemini google-generative-ai logistic-regression multiple-regression openai pinecone react regression typescript vector-database

Last synced: 09 Apr 2025

https://github.com/stefanpietrusky/facts

Repository for the article in the online magazine Data Science Collective.

ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver

Last synced: 09 May 2026

https://github.com/umbaji/yodi

This is the official repository for Yodi, the speech recognition model for 8 words, in Ewè. The yodi package is also useful for rapid inference inference on speech data, especially on the mini_speech datasets.

data data-visualization keras python3 speech-recognition tensorflow

Last synced: 12 Jan 2026

https://github.com/kingsley-ezenwaka/app-profile-data-analysis

A Python data analysis project that aims to propose an app profile based on analysis of Google Playstore dataset.

analysis data jupyter-notebook matplotlib pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/patelabhi574/hotel_reservation_analysis

Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.

data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server

Last synced: 19 Feb 2026

https://github.com/canelmas/data-producer

Fake data producer for Kafka, console and http endpoints

data fake-content fake-data fakerjs kafka kafka-producer

Last synced: 05 Apr 2025

https://github.com/priyanshubiswas-tech/aws-etl-pipeline-on-cloud-using-glue-athena-lambda-and-redshift

Serverless ETL pipeline on AWS using Glue, Lambda, Athena, and Redshift — automates data ingestion, transformation, and analytics with scalable, event-driven architecture.

athena aws aws-glue data data-engineering etl etl-pipeline lambda redshift

Last synced: 02 May 2026

https://github.com/davidgamero/gatech-covid-chart

Line chart showing COVID19 cases per day at Georgia Tech

covid covid19 data gatech

Last synced: 28 Oct 2025

https://github.com/nitsc/spell-from-threebodytrilogy

Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.

3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization

Last synced: 02 May 2026

https://github.com/priyanka7411/customer-flight-prediction-app-mlflow

A comprehensive project predicting flight prices and customer satisfaction using machine learning models, deployed through interactive Streamlit apps.

classification customer-satisfaction data data-cleaning data-visualization feature-engineering flight-price-prediction machine-learning mlflow python regression streamlit

Last synced: 12 May 2026

https://github.com/mvuorre/psyarxivdb

Datasette serving PsyArXiv preprint metadata

data datasette open-science preprints psyarxiv

Last synced: 14 May 2026

https://github.com/tushar2704/interview-quest

Interview-Quest is comprehensive collection of interview questions and answers that can help you prepare for technical interviews. Whether you're a seasoned developer looking to brush up on your skills or a job seeker preparing for your next big opportunity, this repository aims to provide valuable resources to enhance your interview readiness.

artificial-intelligence data data-science interview interview-questions machine-learning

Last synced: 23 Jan 2026

https://github.com/tillahoffmann/idxhound

🐶 Track indices across one or more numpy selections.

data numpy scientific-computing

Last synced: 14 May 2026

https://github.com/public-health-scotland/waiting_times_clinical_prioritisation

This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.

data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time

Last synced: 26 Jul 2025

https://github.com/incubrain/awesome-maharashtra-data

A collection of datasets specific to Maharashtra, India. WIP

ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi

Last synced: 23 May 2026

https://github.com/eddybrando/peru-year-names

Directory of Peru's official year names

data json peru

Last synced: 23 Jul 2025

https://github.com/michellepellon/jobx

A modern, powerful job scraper for LinkedIn, Indeed and beyond.

compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper

Last synced: 17 Jan 2026

https://github.com/MikeBairdRocks/Fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 02 Apr 2025

https://github.com/dhimmel/erc

Processing human Evolutionary Rate Covariation data

data erc evolution evolutionary-rate-covariation genes hetionet human rephetio

Last synced: 23 Jul 2025

https://github.com/2kabhishek/pokemon-stats

Gotta stat 'em all 🖲🐭

d3 data emoji pokemon rollup statistics

Last synced: 14 May 2026

https://github.com/cyberoctane29/cyclistic-bike-share--analyzing-rider-behavior

Analyzed Cyclistic's bike-share data to uncover usage differences between casual riders and annual members. Utilized SQL and MySQL for data processing, R for visualisation, and Kaggle for collaboration. Insights will guide marketing strategies to convert casual riders into annual members.

data dataanalysis dataanalytics database rlanguage rmarkdown spreadsheet sql

Last synced: 22 May 2026

https://github.com/stdlib-js/array-base-filled4d-by

Create a filled four-dimensional nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 07 Sep 2025

https://github.com/tupizz/data-processing-pipeline-aws

This project is a serverless application built with the Serverless Framework, TypeScript, and AWS services. It provides an enrichment service that processes contact information and enriches it with additional data.

aws data pipeline serverless typescript

Last synced: 13 May 2026

https://github.com/phatdev12/diem-thi-tuyen-sinh-10-da-nang

Danh sách điểm thi tuyển sinh 10 Đà Nẵng 2023-2024

data data-science dataanalytics dataset json

Last synced: 28 Jun 2025

https://github.com/tbrowder/classfactory

Provides tools to create a data collection with classes to manipulate the persistent data.

class data persistent raku

Last synced: 04 Apr 2025

https://github.com/sarincr/basics-of-julia-programming-language

Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.

data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning

Last synced: 19 May 2026

https://github.com/ybelenko/openapi-data-mocker-server-middleware

PSR-15 HTTP Server Middleware to create mock responses from OpenAPI Schemas(OAS 3.0).

data fake faker middleware mock mocker oas oas3 openapi psr-15 swagger

Last synced: 15 Jun 2025

https://github.com/millengustavo/salarios-data-science

Aplicativo Streamlit de exploração dos dados da Pesquisa de mercado de Data Science feita pelo Data Hackers

brasil brazil ciencia-de-dados data data-science heroku salarios salary

Last synced: 07 Oct 2025

https://github.com/raigu/ordered-lists-sync

Library for synchronizing ordered data with the minimum of insert and delete operations. Suitable for lage data sets in isolated environments

data lists ordering sync syncrhonization update

Last synced: 12 Jan 2026

https://github.com/real-veersandhu/cia-country-comparison

Data analysis system on the CIA World Factbook

data

Last synced: 25 Feb 2025