An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/lakshyakumar266/jee-dpp-manager-app

DPP manager app for students preparing for the JEE

data expo javascript management react-native

Last synced: 07 May 2026

https://github.com/rudxain/xorsum

Get XOR checksum with this command-line tool

binary checksum cli data digest file files hexadecimal rust-crate xor

Last synced: 08 Mar 2026
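
As a rough illustration of what an XOR checksum is, the sketch below folds a byte string into a fixed-length digest by XOR-ing each input byte into a rotating slot. It is not taken from the xorsum source; the function name `xor_checksum` and the `digest_len` parameter are assumptions made for this example, and the real tool's defaults and options may differ.

```python
def xor_checksum(data: bytes, digest_len: int = 8) -> bytes:
    """Fold `data` into `digest_len` bytes using XOR.

    Hypothetical helper for illustration only: byte i of the input is
    XOR-ed into slot (i % digest_len) of the digest.
    """
    if digest_len <= 0:
        raise ValueError("digest_len must be positive")
    digest = bytearray(digest_len)  # starts as all zero bytes
    for i, byte in enumerate(data):
        digest[i % digest_len] ^= byte
    return bytes(digest)


if __name__ == "__main__":
    # Print the digest of a small input as hexadecimal.
    print(xor_checksum(b"hello world", 4).hex())
```

Because XOR is its own inverse, feeding the same aligned block twice cancels out, which is why this kind of checksum detects accidental corruption but offers no protection against deliberate tampering.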

https://github.com/mai-space/design-concept-sharing-recipes

🖼️ Concept for a framework based on state-of-the-art technology and libraries for secure data sharing and online collaboration, with a focus on the UX and UI of said framework

concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories

Last synced: 14 May 2025

https://github.com/rsc-labs/see-open-data

Show data from www.dane.gov.pl in a user-friendly format. Generate Flourish data or other data visualizations.

data data-visualization flourish government poland

Last synced: 04 Apr 2025

https://github.com/jph5396/sumomodel

Data models related to sumo wrestling.

data go sumo

Last synced: 17 Jan 2026

https://github.com/hackolade/yugabytedb-ysql

Hackolade (https://hackolade.com) plugin for the cloud-native Yugabyte database with YSQL API

data data-modeling entity-relationship-diagram schema-design ysql yugabyte yugabytedb

Last synced: 30 Apr 2025

https://github.com/gagolews/clustering-data-v0

Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]

clustering data dataset machine-learning

Last synced: 15 Sep 2025

https://github.com/hivesolutions/repos

Modular repository management system

data python repos storage system

Last synced: 14 May 2026

https://github.com/lu-sketch/chocolate-imports-dataset

Chocolate Imports for South Africa

data eda visualization

Last synced: 18 May 2026

https://github.com/darshjasani/claims-analysis

This repository contains a comprehensive analysis of claims data, detailing the workflow from data preprocessing to model evaluation. The goal of this analysis is to build predictive models to improve claims prediction and management.

analysis data linear machine-learning python

Last synced: 16 May 2026

https://github.com/miss-mhv/data-analysis-for-social-buzz

In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.

data jupyter-notebook python

Last synced: 14 May 2026

https://github.com/canadaluke888/terminaltablebuilder

Build and edit tabular data all from the terminal.

cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables

Last synced: 20 Apr 2026

https://github.com/reubano/pyconza-tutorial

Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial

data functional-programming jupyter-notebook meza pycon python tutorial

Last synced: 17 May 2026

https://github.com/chompfoods/sdk-scala

Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk

Last synced: 17 May 2026

https://github.com/austinv11/pypeline

A simple data pipeline builder for Python 3+

data leveldb pypeline python python3 stream-processing

Last synced: 20 Aug 2025

https://github.com/ppmim/papi4k_old2

PAPI: the PANIC data reduction pipeline

data near-infrared pipeline processing

Last synced: 23 Jun 2025

https://github.com/parmsam/rweekly.data

R package containing data on Rweekly posts

data package rweekly

Last synced: 21 May 2026

https://github.com/kuanjiahong/covid19-analysis

A simple project to familiarize myself with data analysis

data data-science data-visualization pandas python

Last synced: 02 Apr 2025

https://github.com/stdlib-js/array-base-fill-by

Fill all elements within a portion of an array according to a callback function.

accessor array data fill generic javascript map node node-js nodejs set stdlib structure transform typed types

Last synced: 14 May 2026
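
The idea of filling a portion of an array from a callback can be sketched in a few lines. This is a Python approximation, not the stdlib-js API: the name `fill_by`, the half-open `[start, end)` range, and the `(value, index)` callback arguments are all assumptions chosen for the example.

```python
from typing import Any, Callable, List


def fill_by(items: List[Any], start: int, end: int,
            fn: Callable[[Any, int], Any]) -> List[Any]:
    """Overwrite items[start:end] in place with fn(current_value, index).

    Hypothetical helper: indices are clamped to the list bounds, and the
    same list object is returned for convenience.
    """
    start = max(start, 0)
    end = min(end, len(items))
    for i in range(start, end):
        items[i] = fn(items[i], i)
    return items


if __name__ == "__main__":
    # Multiply only the middle two elements by ten.
    print(fill_by([1, 2, 3, 4], 1, 3, lambda value, index: value * 10))
```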

https://github.com/rajlabmssm/echodata

echoverse module: Example data.

data echoverse fine-mapping genomics gwas qtl

Last synced: 17 Jan 2026

https://github.com/jitsasmal/customer-purches-behavior-and-shopping-analysis

Create a dashboard to analyse the data based on total product sales, target, revenue, state, and season-wise analysis to show the current trend in the data.

analytics dashboard data etl powerbi

Last synced: 14 Feb 2026

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/nel-zi/zipco_foods

Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.

airflow apache-spark data dataengineering etl pyspark wsl

Last synced: 03 May 2026

https://github.com/srindot/average_flightdata_collection_fwuav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Sep 2025

https://github.com/sofyan48/wahoo

Data stream library with kinesis

aws data data-stream event kinesis stream

Last synced: 14 May 2026

https://github.com/badawy403/egy.list

A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.

city data egypt javascript nodejs npm package

Last synced: 08 Mar 2026

https://github.com/denisecase/cintel-04-reactive

Interactive analytics, reactive app built with Shiny for Python

analytics bokeh data flights interactive mtcars penguins python relationships shiny

Last synced: 20 Jun 2025

https://github.com/sakan811/gachascope

Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.

data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves

Last synced: 04 May 2026

https://github.com/namescode/hub_harvester

A Python script to gather data on a user's or organisation's Git repos

data github nix nix-flake python python3 sqlite

Last synced: 08 Apr 2026

https://github.com/UznetDev/Smoking-Prediction

This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.

ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking

Last synced: 28 Mar 2025

https://github.com/madhuresh2011/kulturehire-internship

☺️ Hi folks, during my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.

data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql

Last synced: 17 Feb 2026

https://github.com/sajjadanwar0/booking.com-scraping

Scraping booking.com using Selenium and Beautiful Soup

crawler data python scraping selenium

Last synced: 18 Oct 2025

https://github.com/indhra/cats-ijcnn-data-2004

CATS IJCNN Data 2004 Competition of Artificial Time Series

2004 artificial cats data ijcnn time-series

Last synced: 22 Mar 2025

https://github.com/youmenomi/hydreigon

Are you looking for a Hydreigon to classify data for you? Come and catch it!

classify data hydreigon indexer items management pokemon sortable structure typescript

Last synced: 07 May 2025

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/maximkrouk/storage

Lightweight framework for storing data (beta)

cache data keychain memmory storage swift swift5-1 userdefaults

Last synced: 02 Jul 2026

https://github.com/allanotieno254/spss-nutrition-research

This repository contains the results of statistical analyses performed in IBM SPSS Statistics on a child nutrition dataset.

data data-preprocessing dataanalysis spss

Last synced: 17 Feb 2026

https://github.com/bho0920/crime-data-analysis-eu

Crime Data Analysis for Self-Defense Tool Market Entry in the EU.

data data-analysis sql sqlite tableau

Last synced: 21 Jun 2025

https://github.com/istinnew/cook-me-up

[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.

bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning

Last synced: 16 May 2026

https://github.com/skygenesisenterprise/api-service

The Official Sky Genesis Enterprise API Service Ecosystem

api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket

Last synced: 31 Dec 2025

https://github.com/ioboi/obloc-data

Scrape guest counter of O'BLOC 🧗‍♀️

data scraping

Last synced: 04 Nov 2025

https://github.com/sharoonjoseph321/social_media_eda

Data analysis on social media apps, using pandas, Python, and Matplotlib.

data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects

Last synced: 03 Mar 2025

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026

https://github.com/zshn1248/pyfilecrypto

PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.

data decryption encryption file security-tools

Last synced: 07 Apr 2026

https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate

This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.

correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif

Last synced: 17 May 2026

https://github.com/ashishsingh789/hr_analysis_dashboard

The HR Analyst Dashboard is an interactive Power BI tool that provides insights into HR metrics sourced from Excel. It focuses on data cleaning, transformation, and visualization, enabling stakeholders to explore key indicators like employee demographics and performance through intuitive charts.

dashboard data dataanalysis datacleaning powerbi-desktop visualization

Last synced: 06 Mar 2026

https://github.com/biril/audio-test-data

Audio data to use for testing

audio data mpeg test

Last synced: 11 Jan 2026

https://github.com/joseluisq/input-verifier

Some useful functions to check common data input.

data input utils validation

Last synced: 19 Jul 2025

https://github.com/ournet/quotes-data

Ournet quotes data package

data ournet ournet-quotes quotes

Last synced: 04 Apr 2025

https://github.com/dms-codes/scrape_tripsantai

Trip Santai Tour Data Scraper. This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.

beautifulsoup4 data python requests scraper webscraper

Last synced: 21 May 2026

https://github.com/uzinfocom-org/archive

📦 | Archived projects that aren't used anymore

archive archive-data data notused

Last synced: 01 Sep 2025

https://github.com/bfontaine/datatools

📐 Some scripts I use to work with data

data ruby script

Last synced: 23 Jul 2025

https://github.com/ournet/news-data

Ournet news data package

data news news-data news-storage ournet storage

Last synced: 04 Apr 2025

https://github.com/afeiship/data-selection

Data structure for radio/checkbox-group.

checkbox data group radio

Last synced: 17 Jun 2025

https://github.com/fintech-lsi/fintech-credit-risk-prediction

This repository provides a machine learning model for predicting credit risk in the financial sector. The model uses borrower information, such as age, income, employment length, loan amount, and credit history, to assess the likelihood of loan repayment or default.

data fintech machine-learning model prediction risk

Last synced: 12 Oct 2025

https://github.com/jensostertag-archive/charts.js

A JavaScript Plugin to draw Charts to visualize Data and Statistics on Websites

charts data javascript statistics webapplication

Last synced: 22 Jun 2025

https://github.com/dimaa1608/azurecontent

AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.

azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization

Last synced: 10 Apr 2025

https://github.com/sap-samples/sap-bdc-explore-hyperscaler-data

The repository contains detailed steps to integrate external hyperscaler data sources into SAP Datasphere in the SAP Business Data Cloud, per the open data ecosystem integration principles.

aws azure business cloud data databricks datasphere gcp hyperscalers sap

Last synced: 16 May 2026

https://github.com/goutam1511/real-time-covid-19-tracker-for-slack

This automated tracker follows the spread of Covid-19 in real time by scraping data from the Ministry of Health and Family Welfare and posts the updates to Slack

covid-19 data python slack-bot web-scraping

Last synced: 30 Aug 2025

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/omari-kd/recommendation-system-analysis-and-modelling

This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.

data data-science data-science-in-r machine-learning-algorithms recommendation-system

Last synced: 08 Jan 2026

https://github.com/bakangmonei/is_final_assignment

My intelligent systems assignment

data data-science intelligent-systems python

Last synced: 02 May 2026

https://github.com/j-hagedorn/locals

🌐 A collection of tidied, neighborhood-level public datasets

address-dataset census-data census-tract data neighborhood social-sciences

Last synced: 03 Feb 2026

https://github.com/mvuorre/osfdatasette

Harvest, wrangle, and serve preprint data from OSF API with Datasette

data datasette open-science preprints

Last synced: 11 Apr 2025

https://github.com/samharrison7/datamapper

Making mapping between datasets as simple as possible.

data data-mapper data-mapping data-science data-structures

Last synced: 17 Mar 2025

https://github.com/luminati-io/google-search-api

Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.

data google-scraper html python serp-api web-scraping

Last synced: 25 Jun 2025

https://github.com/mrk214/bible-data-es-spa

The Bible in JSON format

api bible biblia data god jesus json spanish

Last synced: 05 Apr 2025

https://github.com/naithikjorapur/practive-tanstacktsx

Practice TanStack with React, Vite, and TypeScript to build fast, type-safe apps. Leverage tools like TanStack Query for data management and Vite for a streamlined development experience.

data exercise fetching html-css-javascript json learning-by-doing practice query router tsx

Last synced: 05 Apr 2025

https://github.com/mekramy/ircity

Iran province, county, and city data in JSON format.

data iran-city json mekramy

Last synced: 05 Apr 2025

https://github.com/fastbolt/entity-importer

Entity-importing library for loading data from files (currently CSV and Excel) or an API into Doctrine.

data doctrine2 excel excel-import

Last synced: 17 Feb 2026

https://github.com/kylepw/multistack

Example of multiple stacks in one array.

algorithms array data data-structures python stack

Last synced: 17 Mar 2025
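
One common way to keep multiple stacks in one array is to give each stack a fixed-size slice of a shared backing list. The sketch below assumes that layout; it is not taken from the multistack repository, and the class name `MultiStack` and its method names are hypothetical.

```python
from typing import Any, List


class MultiStack:
    """Several fixed-capacity stacks stored in one backing list.

    Hypothetical layout: stack k owns slots
    [k * capacity, (k + 1) * capacity) of the shared list.
    """

    def __init__(self, num_stacks: int, capacity: int) -> None:
        self._capacity = capacity
        self._items: List[Any] = [None] * (num_stacks * capacity)
        self._sizes = [0] * num_stacks  # current height of each stack

    def push(self, stack: int, value: Any) -> None:
        if self._sizes[stack] == self._capacity:
            raise OverflowError(f"stack {stack} is full")
        self._items[stack * self._capacity + self._sizes[stack]] = value
        self._sizes[stack] += 1

    def pop(self, stack: int) -> Any:
        if self._sizes[stack] == 0:
            raise IndexError(f"stack {stack} is empty")
        self._sizes[stack] -= 1
        index = stack * self._capacity + self._sizes[stack]
        value = self._items[index]
        self._items[index] = None  # clear the slot so the value can be freed
        return value


if __name__ == "__main__":
    stacks = MultiStack(num_stacks=3, capacity=2)
    stacks.push(0, "a")
    stacks.push(2, "z")
    stacks.push(0, "b")
    print(stacks.pop(0), stacks.pop(2), stacks.pop(0))
```

The fixed partition keeps push and pop at constant time, at the cost of wasting space when one stack fills up while the others stay nearly empty.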

https://github.com/farovictor/mongodbloader

This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.

data go mongodb pipeline

Last synced: 15 May 2026

https://github.com/agusk/ilmudata-book-excel-analytics

Hallo Microsoft Excel: Mastering Data Analytics

analytics data data-analytics excel power-query-editor

Last synced: 06 Jan 2026

https://github.com/josemartinezrdev/logisticadb

Logistica Database

data ddl diagrama dml mysql sql

Last synced: 09 Jul 2025

https://github.com/styd/sd_struct

Searchable Deep Struct

activesupport data gem openstruct rails ruby structure

Last synced: 18 May 2026

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data, including details such as location, company, credit card usage, browser info, job roles, and purchase price. It explores patterns in payment methods, spending behavior, and online transactions. Using Pandas, Matplotlib, and Seaborn, we clean, analyze, and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfield and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/ubc-library-rc/intro-api

Introduction to APIs

data digital-scholarship workshop

Last synced: 01 Jul 2026

https://github.com/eryks1999/data-collection-project_python

This project allowed me to practice working with classes, populating JSON files, and extracting data.

data git json python

Last synced: 16 Apr 2026

https://github.com/ims94/ballerina-tsv-querying

An example Ballerina project to query tsv data using Ballerina language integrated queries

ballerina ballerina-lang data olympics query sql

Last synced: 03 Feb 2026

https://github.com/stdlib-js/array-base-index-of-same-value

Return the index of the first element which equals a provided search element according to the same value algorithm.

array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types

Last synced: 15 May 2026
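
The "same value" comparison mentioned above differs from ordinary equality in two cases: NaN is treated as equal to NaN, and +0 is treated as different from -0. The sketch below approximates that behaviour for Python floats; it is not the stdlib-js implementation, and the names `same_value` and `index_of_same_value` are assumptions for this example. Non-float values simply fall back to `==`.

```python
import math
from typing import Any, Sequence


def same_value(a: Any, b: Any) -> bool:
    """Approximate same-value equality for Python floats (hypothetical helper)."""
    if isinstance(a, float) and isinstance(b, float):
        if math.isnan(a) and math.isnan(b):
            return True  # NaN matches NaN, unlike ==
        if a == 0.0 and b == 0.0:
            # Distinguish +0.0 from -0.0 by comparing their signs.
            return math.copysign(1.0, a) == math.copysign(1.0, b)
    return a == b


def index_of_same_value(items: Sequence[Any], search: Any) -> int:
    """Return the index of the first same-value match, or -1 if none."""
    for i, item in enumerate(items):
        if same_value(item, search):
            return i
    return -1


if __name__ == "__main__":
    print(index_of_same_value([1.0, float("nan"), 3.0], float("nan")))
    print(index_of_same_value([0.0, -0.0], -0.0))
```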