An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/zalweny26/open_data_unipa

Project for the Laboratorio di Algoritmi (Algorithms Lab) exam, 2023-24, UniPa, Computer Science L-31

data open project python

Last synced: 26 Apr 2026

https://github.com/karthikmprakash/github_repos_scraper

A tool to extract the names of any user's GitHub repos

automation bs4 data github python repositories requests webscraping

Last synced: 27 Apr 2026

https://github.com/saulojoab/crato-ce-json

In this repository I will store all the neighborhoods (and more information, in the future) of Crato-CE in JSON.

data database geolocation json json-api localization

Last synced: 28 Apr 2026

https://github.com/rdjarbeng/rdjarbeng

Richard Djarbeng's GitHub profile: computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to the website below.

data jekyll machine-learning ruby website

Last synced: 28 Apr 2026

https://github.com/jackosheadev/databasetechproject

This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction

data database mssql sql

Last synced: 18 May 2026

https://github.com/aidanjuma/ankideckextractor

A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.

anki apkg cli data decompression extraction flashcards learning python zip

Last synced: 29 Apr 2026

https://github.com/chrnthnkmutt/theartofstatistic_python

This repository implements examples from David Spiegelhalter's book The Art of Statistics as Python visualizations.

data data-science data-visualization machine-learning statistics

Last synced: 08 Jun 2026

https://github.com/chompfoods/stub-asp-net-core

ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server

Last synced: 30 Apr 2026

https://github.com/divanny/academixbackend

🧑‍🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.

backend csharp data net webapi

Last synced: 07 Jun 2026

https://github.com/gdhhgnbnvbn/f1-2025-ai-predict

Fully generated by Claude 3.5 Sonnet via Windsurf IDE. Not a single line was written by hand.

agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe

Last synced: 01 May 2026

https://github.com/ggeop/multiple-fields-management

Fields management from/to different data sources. :bulb:

data data-engineering data-organization data-retrieval data-science pandas python

Last synced: 01 May 2026

https://github.com/windomz/gitdate

git commit date trick

data git git-commit trick

Last synced: 02 May 2026

https://github.com/ishaansathaye/data40x-1_2_3

Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab

capstone-prep data data-science ethics lab python

Last synced: 04 May 2026

https://github.com/raghavendranhp/credit_card_fraud_detection

This repository contains code for a credit card fraud detection model using autoencoders and logistic regression, achieving 95.3% accuracy.

anomaly-detection autoencoder-neural-network credit-card-fraud data keras logistic-regression machine-learning preprocessing tensorflow

Last synced: 04 May 2026

https://github.com/thenoim/youtubelibrary

Nils little youtube library :)

api browser data nodejs simple youtube

Last synced: 04 May 2026

https://github.com/satur-io/estoraje

Estoraje is the simplest distributed system for key-value storage in fewer than 800 lines of code. It is temporarily consistent, highly available, lightweight, scalable, and gives good performance.

data database distributed go golang key-value performance training

Last synced: 07 May 2026

https://github.com/yash22222/sync-intern-s-ml-tasks

SYNC INTERN'S Machine Learning internship offers you the chance to enhance your skills through real-life example projects. This internship will increase your knowledge of data and algorithms so you can understand how a machine learns.

bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns

Last synced: 07 May 2026

https://github.com/chompfoods/stub-jaxrs-resteasy

JAX-RS RESTEasy server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jax-rs-server nutrition raw recipe-api recipes resteasy server server-stub stub stub-server

Last synced: 08 May 2026

https://github.com/keanteng/nextjs-directory

🌐A Draft Website For Data Catalogue Using NextJs

catalogue climate-change css data directory html javascript nextjs website

Last synced: 09 May 2026

https://github.com/lmuffato/project-mysql-vocabulary-booster-trybe

MySQL Vocabulary Booster project - Trybe graded project for Block 20: SQL Functions, Joins, and Subqueries

back-end crud data database mysql mysqlworkbench query sql trybe-projects

Last synced: 10 May 2026

https://github.com/dimitryzub/walmart-stores-coffee-analysis

Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡

analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn

Last synced: 10 May 2026

https://github.com/suryavamsi-p/conflict-nlp-topic-modeling-sentiment-analysis-using-llms

Extracts insights from 26K+ protest events using BERTopic, Top2Vec, and LLMs for real-world applications like crisis monitoring, policy research, and social unrest analysis.

all-mpnet-base-v2 bertopic conflict-data data data-science lda llama2 llms machine-learning mistral-7b nlp nltk protest-analysis pyldavis python3 top2vec topic-modeling transformers visualization

Last synced: 11 May 2026

https://github.com/scarblase/russian-military-losses-analysis

This repository provides an in-depth analysis of Russian equipment losses using PySpark and data visualization techniques.

data data-science data-visualization jyputer-notebook matplotlib pyspark python3 seaborn seaborn-plots ukraine ukraine-invasion

Last synced: 12 May 2026

https://github.com/pferreirafabricio/data-immersion

🏊🏻‍♂️ Activities and exercises from 'Imersão Dados' event

data data-analysis data-science dataset jupiter-notebook python

Last synced: 14 May 2026

https://github.com/svetlanam/twitter-ads

Get data about campaigns from Twitter Ads API

api data keboola keboola-extractor twitter twitter-ads twitter-api

Last synced: 12 Jun 2026

https://github.com/fairspec/fairspec-standard

Fairspec is a data exchange format compatible with DataCite for metadata and JSON Schema for structured data

ckan csv data dataset excel fair fairspec json ods polars python quality schema sqlite table typescript validation zenodo

Last synced: 16 Jun 2026

https://github.com/cdcgov/importsurvey

Import survey: Import data into R, with an application to the National Center for Health Statistics (NCHS)

data import r sas survey survey-data

Last synced: 19 Jun 2026

https://github.com/williamwutq/bllist

Durable, crash-safe, checksummed block-based linked list allocators stored in a single file

data data-storage data-structure database file-based linkedlist

Last synced: 25 Jun 2026

https://github.com/seabbs/estzoonotictb

Explore, Visualise and Estimate the Global Zoonotic Tuberculosis Burden

bovine-tb data estimation package rstats tuberculosis visualisation zoonotic-tb

Last synced: 28 Feb 2026

https://github.com/pradeep221b/turbofan_predictive_maintenance

An R project for predicting turbofan engine RUL using {targets} and {tidymodels}.

data data-science-portfolio machine-learning nasa preditive-maintaince r rstats targets-pipeline tidymodels

Last synced: 04 Oct 2025

https://github.com/zediculz/block

Block is a data structure/collection that uses blockchain principles to manage data.

algorithm data structure

Last synced: 05 Oct 2025

https://github.com/dylanhogg/cloud-products

A package for getting cloud products and product descriptions from a cloud provider website.

aws cloud-products crawler data text-processing

Last synced: 05 Oct 2025

https://github.com/DefinetlyNotAI/VulnScan_Data

Logicytics VulnScan Module's Training Data and old model archive

ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data

Last synced: 17 Aug 2025

https://github.com/freddy03h/immutable-data-structure

Normalize and Merge your application's data store using Immutable.JS objects

data immutable redux store

Last synced: 05 Oct 2025

https://github.com/mascanho/ruddit

CLI to interact with Reddit's API to programmatically retrieve data

cli data marketing rust rust-lang rustlang sales

Last synced: 19 Aug 2025

https://github.com/carlotta94c/sql4datascientistsdemo

Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"

analysis data r sqlite tidyverse visualisation

Last synced: 18 Apr 2026

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 10 Apr 2026

https://github.com/labwhatever/leetcode

Collection of LeetCode questions to ace the coding interview!

data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning

Last synced: 22 Aug 2025

https://github.com/jerryfzhang/rockets

A Node + React App that displays space launch missions around the world.

bootstrap data expressjs less momentjs nodejs react reactjs reactstrap

Last synced: 10 Apr 2026

https://github.com/grkndev/twitcher

A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.

api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user

Last synced: 09 Mar 2026

https://github.com/aymane-maghouti/mobile-data-hive-insights

This project demonstrates extracting data from a MySQL database, transferring it with Apache Sqoop, storing it in a Hive data warehouse (the data is actually stored in the Hadoop Distributed File System (HDFS)), and analyzing it with Hive Query Language (HiveQL), a language close to SQL. The data is then visualized in Power BI.

apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi

Last synced: 09 Mar 2026

https://github.com/jessielw/parse-fel-master-data

Simple CLI to parse Dolby Vision master data via the RPU/MediaInfo and output data needed for x265

data dolby fel master mediainfo mi parse rpu vision

Last synced: 26 Aug 2025

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project that develops a classification model for assigning a given customer profile to one of two risk categories (safe or not safe). The final classifier used for this project is the CatBoost classifier. Deployed on AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/xdrokra/road-accident-analytics

A data visualization project that maps and analyzes road accidents across major Italian municipalities in 2023

analytics data design italy javascript

Last synced: 30 Aug 2025

https://github.com/tatey/list_of_baby_names

A list of baby names given to tiny humans in Ruby

data names ruby

Last synced: 11 Nov 2025

https://github.com/ukplab/pragtag2023

Code and data for the PragTag-2023 Shared Task

argument-mining data peer-review pragmatics shared-task

Last synced: 18 Jun 2025

https://github.com/nafisalawalidris/sales-performance-dashboard

Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.

analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business

Last synced: 03 Feb 2026

https://github.com/marcelo-earth/h5n8-data

🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.

csv data h5n8 h5n8-cases h5n8-virus russia

Last synced: 19 Jan 2026

https://github.com/snimmagadda1/stack-exchange-dump-to-mysql

Batch pipeline to import Stack Exchange XML data dumps to relational DB

batch data mysql spring-batch stackoverflow

Last synced: 30 Mar 2025

https://github.com/ngambip/priscilla

About my work and experience

accounting analytics data finance-management

Last synced: 03 Feb 2026

https://github.com/viisix/corecat

Core repository of DanceCats project.

data lightweight python3

Last synced: 25 May 2026

https://github.com/gorhkdwj/da_portfolio

Kim Jae Chun's DA_Portfolio

data data-analysis python sql

Last synced: 20 Feb 2026

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/devsujay19/knowledgebase

My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.

data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel

Last synced: 10 Apr 2026

https://github.com/stdlib-js/array-base-to-accessor-array

Convert an array-like object to a minimal array-like object supporting the accessor protocol.

accessor accessors array array-like convert data javascript node node-js nodejs object protocol stdlib structure types wrap wrapper

Last synced: 04 Jan 2026

https://github.com/stdlib-js/array-base-reject

Return a shallow copy of an array containing only those elements which fail a test implemented by a predicate function.

array copy data filter generic javascript node node-js nodejs predicate reject stdlib structure test types

Last synced: 26 Dec 2025
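
The "reject" operation described in this entry is the inverse of a filter: it keeps the elements for which the predicate fails. A minimal Python sketch of the idea follows. The function name `reject` and its signature here are illustrative assumptions, not the stdlib-js API (that package is JavaScript).

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def reject(items: Sequence[T], predicate: Callable[[T], bool]) -> List[T]:
    """Return a new list holding only the elements for which predicate is falsy.

    Hypothetical helper for illustration; the input sequence is not modified.
    """
    return [item for item in items if not predicate(item)]


values = [1, 2, 3, 4, 5, 6]
odds = reject(values, lambda v: v % 2 == 0)  # drop the even numbers

# The copy is shallow: nested objects are shared, not duplicated.
nested = [[1], [2, 3]]
short_lists = reject(nested, lambda xs: len(xs) > 1)
```

Because the copy is shallow, `short_lists[0]` is the same list object as `nested[0]`, so mutating one is visible through the other.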

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 10 May 2026

https://github.com/neelravi/data-management

A data management plan for computational chemists/physicists and material scientists for FAIR storage of raw data

data dmp fair management workflows

Last synced: 16 Jan 2026

https://github.com/milandjurdjevic/discriminalizer

.NET library designed for seamless JSON deserialization of objects with complex discrimination requirements, built on top of System.Text.Json.

data deserialization dotnet json

Last synced: 15 Apr 2025

https://github.com/stdlib-js/datasets-herndon-venus-semidiameters

Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.

astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus

Last synced: 09 Oct 2025

https://github.com/ilejuxepwaduzd/structured-data-extractor

🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.

cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data

Last synced: 10 Apr 2026

https://github.com/qeeqbox/data-states

Data states refer to structured and unstructured data divided into three categories (At Rest, In Use, and In Transit)

data data-state infosecsimplified qeeqbox

Last synced: 10 Mar 2026

https://github.com/stdlib-js/strided-base-dtype-str2enum

Return the enumeration constant associated with a strided array data type string.

array data dtype dtypes enum javascript multidimensional node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 30 Apr 2025

https://github.com/exoticknight/juhe

simple way to analyze complex data in one chain call

aggregation aggregator analysis data statistic typescript

Last synced: 21 May 2026

https://github.com/rremple/intervalidus

For all your interval-based data needs.

data intervals

Last synced: 21 Feb 2026

https://github.com/bilalmehrban/data-log-monitor

A simple yet elegant desktop C# application based on a 3-tier architecture, designed for viewing logs stored in the database by NLog or other logging frameworks.

csharp data desktop-app logging

Last synced: 14 Mar 2025

https://github.com/jayantur13/kountry

Node module variant of the Country API

api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn

Last synced: 26 Jan 2026

https://github.com/mews-labs/dataframe-memory

This tool aims to provide a simple solution for saving memory when using pandas DataFrames.

data data-science memory-usage pandas-dataframe python3

Last synced: 22 May 2026

https://github.com/brianali-codes/github-searcher

A website for API experimentation that uses the GitHub API to search for different users and some of their (public) information

api data github user

Last synced: 21 May 2026

https://github.com/mindawei/alimusic-predict

Code for the Ali Music Popularity Trend Prediction Competition (including the preliminary and second rounds)

data java predict pyhton tianchi

Last synced: 22 Mar 2025

https://github.com/rohancyberops/rp1

This project performs an analysis of Starbucks (SBUX) stock returns using R. The analysis includes both simple returns and continuously compounded returns (CC returns) for a period of one month. It also calculates the growth of $1 invested in SBUX and provides visual insights through various plots.

analysis cc data r rlanguage sbux

Last synced: 15 Mar 2025
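
The quantities named in this entry have standard definitions: the simple return is P_t / P_{t-1} - 1, the continuously compounded (CC) return is ln(P_t / P_{t-1}), and the growth of $1 is the running product of (1 + simple return). The sketch below shows them in Python rather than the R used by the repository; the function names and the price series are hypothetical, not SBUX data.

```python
import math
from typing import List


def simple_returns(prices: List[float]) -> List[float]:
    """Period-over-period simple returns: P_t / P_{t-1} - 1."""
    return [curr / prev - 1.0 for prev, curr in zip(prices, prices[1:])]


def cc_returns(prices: List[float]) -> List[float]:
    """Continuously compounded returns: ln(P_t / P_{t-1})."""
    return [math.log(curr / prev) for prev, curr in zip(prices, prices[1:])]


def growth_of_one_dollar(prices: List[float]) -> List[float]:
    """Value over time of $1 invested at the first price."""
    growth = []
    value = 1.0
    for r in simple_returns(prices):
        value *= 1.0 + r  # compound the simple return
        growth.append(value)
    return growth


# Hypothetical closing prices, for illustration only.
prices = [100.0, 102.0, 99.96, 104.958]
simple = simple_returns(prices)
cc = cc_returns(prices)
growth = growth_of_one_dollar(prices)
```

A useful check is that CC returns add across periods: `sum(cc)` equals `math.log(prices[-1] / prices[0])` up to floating-point error, which is the log of the final value in `growth`.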

https://github.com/ahmadjamil888/facial-recognition-ai-model

A facial recognition AI model powered by a CNN and trained on thousands of images.

ai cnn data data-science facial facial-recognition recognition

Last synced: 30 Jun 2025

https://github.com/antononcube/raku-data-cryptocurrencies

Raku package for cryptocurrency data retrieval.

crypto cryptocurrency data

Last synced: 02 Apr 2025

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 05 Jan 2026

https://github.com/nesterenko-kv/object-id

ObjectIDs are a special type of identifier mainly used in MongoDB to uniquely identify documents within a collection. They consist of a 12-byte binary value that includes a timestamp, a machine identifier, a process identifier, and a counter.

c-sharp data id net object-id unique-identifier

Last synced: 16 May 2025
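
To make the 12-byte layout in this entry concrete, here is a minimal Python sketch that splits an ObjectID into the four fields the description names. The byte widths (4-byte timestamp, 3-byte machine identifier, 2-byte process identifier, 3-byte counter) are an assumption taken from the legacy MongoDB layout, since the description does not state them; `parse_object_id` and `ObjectIdParts` are hypothetical names, not part of the listed C# library.

```python
import datetime
from typing import NamedTuple


class ObjectIdParts(NamedTuple):
    timestamp: datetime.datetime  # creation time, second resolution, UTC
    machine: bytes                # 3-byte machine identifier
    process: int                  # 2-byte process identifier
    counter: int                  # 3-byte incrementing counter


def parse_object_id(hex_id: str) -> ObjectIdParts:
    """Split a 24-character hex ObjectID into its fields (assumed legacy layout)."""
    raw = bytes.fromhex(hex_id)
    if len(raw) != 12:
        raise ValueError("an ObjectID must be exactly 12 bytes (24 hex characters)")
    seconds = int.from_bytes(raw[0:4], "big")  # big-endian Unix timestamp
    return ObjectIdParts(
        timestamp=datetime.datetime.fromtimestamp(seconds, tz=datetime.timezone.utc),
        machine=raw[4:7],
        process=int.from_bytes(raw[7:9], "big"),
        counter=int.from_bytes(raw[9:12], "big"),
    )


parts = parse_object_id("507f1f77bcf86cd799439011")
```

Because the timestamp occupies the leading bytes in big-endian order, ObjectIDs sort roughly by creation time when compared as raw bytes.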

https://github.com/sbdk-dev/sbdk.dev

A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.

ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer

Last synced: 27 May 2026

https://github.com/spine-tools/metreload

Python application for downloading meteorological reanalysis data

data python reanalysis

Last synced: 01 Jul 2025

https://github.com/cosmos-loops/cosmos-dapper

Cosmos.Dapper is a part of Cosmos.Data, an inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.

dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver

Last synced: 11 Apr 2026

https://github.com/cintia0528/data_analytics_and_visualization-sql_tableau

Evaluate Magist as a strategic partner for Eniac's Brazilian expansion. Use SQL to analyze growth, tech accessory sales potential, delivery times, and customer satisfaction in Magist's database.

data dataanalysis datavisualization sql strategy tableau

Last synced: 31 Mar 2025

https://github.com/idea2app/public-meta-data

HTTP API for Public Meta Data, written in TypeScript & designed for CDN.

api cdn data http meta public typescript

Last synced: 15 Mar 2025

https://github.com/mtingers/opacify

Opacify reads a file and builds a manifest of external sources to rebuild said file.

backup data obfuscation python

Last synced: 18 May 2026

https://github.com/dataship/beam

Get collimate'd data into Frame, in Node or the Browser

column-store data data-science

Last synced: 27 Apr 2026