An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/bodfdaf/api

API data service provider

api data detail instagram lazada shopee tiktok video

Last synced: 11 Mar 2025

https://github.com/onyxwizard/coding-challenges

A collection of fundamental recursion problems solved in Java, demonstrating core concepts like base cases, recursive decomposition, and problem-solving strategies for beginners. Perfect for mastering the art of thinking recursively!

algomaster algorithm-challenges algorithms algorithms-and-data-structures coding data datastructures hackerrank java java-8 leetcode neetcode takeuforward w3schools

Last synced: 03 Jul 2026

https://github.com/henryssondaniel/teacup-service-visualization-mysql-java

Connect your Teacup visualization data to a MySQL database

data mysql service teacup visualization

Last synced: 19 May 2026

https://github.com/vin20777/drone-data-layer

Drone Project Data Layer

csharp data drone layer software-design

Last synced: 18 May 2026

https://github.com/apparaomulpuri/readline

Explains the usage of the readLine function in Swift.

data fromkeyboard keyboard reading readline swift

Last synced: 29 Mar 2025

https://github.com/ahabdel/amazon-web-scraper

Amazon web scraper that tracks pricing adjustments and provides day-to-day updates

data web-scraping

Last synced: 04 Jul 2026

https://github.com/nathanieliskandar26/data-analysis-project

This project demonstrates my ability to clean and analyze data using Python and SQL so far. The dataset used for this analysis focuses on general customer information. Through this project, I aimed to uncover meaningful insights and trends by cleaning the data and performing structured queries.

analysis data data-cleaning jupyter-notebook mysql mysql-database python

Last synced: 19 Apr 2026

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/pawlo77/messenger-analyser

Repo for Data Visualization project, part of IAD study program at Faculty of Mathematics and Information Science, Warsaw University of Technology

data visualization

Last synced: 17 May 2026

https://github.com/ramtinsoltani/safe-cli

A simple Command-line Interface which encrypts and decrypts UTF-8 files using AES-256.

aes-256 cli data data-hook decryption encryption generator handlebars hooks markup partial partial-decryption password safe swap temp temporary tool

Last synced: 16 Apr 2026

https://github.com/mightymetrika/scdtb

Single Case Design Toolbox

data math r science statistics

Last synced: 04 Jan 2026

https://github.com/lukaszkn/data-software-engineering-interview-questions

Data and Software engineering interview questions

data engineering interview-questions python

Last synced: 20 Jul 2025

https://github.com/davidkhala/sql

Standard SQL collection

data sql

Last synced: 06 Apr 2025

https://github.com/ember-nexus/reference-dataset

Ember Nexus API backup containing different standardized scenarios

backup data ember-nexus

Last synced: 25 Jan 2026

https://github.com/ahmad-ali-rafique/logistic-regression-modeling

An in-depth exploration of logistic regression models, including data cleaning, model building, and performance evaluation on various datasets.

accuracy confusion-matrix data dataanalytics logistic-regression logistic-regression-classifier machine-learning-algorithms mlmodels model modelling regression-models

Last synced: 11 Sep 2025

https://github.com/wilcotomassen/lorem-datum-core

Java-based data generator for data simulation

data dataset generator java lorem-ipsum simulated-data

Last synced: 11 Jan 2026

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice plotting graphs and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/kaizadp/bbwm_moisture

HOBO data for soil moisture - Bear Brook Watershed in Maine

data hobo-data soil-moisture

Last synced: 17 May 2026

https://github.com/octoenergy/tentaclio-gs

A Python project containing all the dependencies for the gs tentaclio schema.

data

Last synced: 24 Jun 2025

https://github.com/octoenergy/tentaclio-postgres

A Python project containing all the dependencies for the postgresql tentaclio schema.

data

Last synced: 24 Jun 2025

https://github.com/octoenergy/tentaclio-athena

A Python project containing all the dependencies for the awsathena+rest tentaclio schema.

data

Last synced: 24 Jun 2025

https://github.com/octoenergy/tentaclio-s3

A Python project containing all the dependencies for the s3 tentaclio schema.

data

Last synced: 24 Jun 2025

https://github.com/octoenergy/tentaclio-databricks

Module to give tentaclio support to databricks

data

Last synced: 24 Jun 2025

https://github.com/thesfinox/fit-the-data

Data analysis using Wolfram Mathematica

analysis data data-analysis lab mathematica wolfram wolfram-mathematica

Last synced: 24 Jan 2026

https://github.com/nika2811/new-york-city-taxi-fare-prediction

This project uses a New York City taxi dataset to predict the fare price of the next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips. It contains 8 features along with the GPS coordinates of pickup and dropoff.

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/hidayathamir/get-telegram-group-data

With this project you can export data from your Telegram group to a CSV file.

bahasa-indonesia data python3 scrape telegram telethon

Last synced: 13 Sep 2025

https://github.com/octoenergy/tentaclio-gdrive

A Python project containing all the dependencies for the gdrive tentaclio schema.

data

Last synced: 24 Jun 2025

https://github.com/patrikcze/meshtatic_data

Meshtastic Data Transfer: trying something silly, like transferring files over a LoRa network.

data meshtastic meshtastic-python

Last synced: 03 Feb 2026

https://github.com/opengeoshub/vdownload

A Powerful Geospatial Data Downloader

data geospatial opendata

Last synced: 19 May 2026

https://github.com/huspacy/huspacy-resources

Resources for building and evaluating huspacy

data huspacy

Last synced: 21 Mar 2025

https://github.com/prasad-chavan1/bank_data_analysis_r

Bank data analysis in R language

data data-analysis data-science r

Last synced: 24 Feb 2025

https://github.com/questionlp/wwdtm_uniquedates

Script that lists out the unique months and days of months that Wait Wait... Don't Tell Me! shows have aired

data python python3 script wwdtm

Last synced: 17 May 2026

https://github.com/furkantosun1607/cse201-data-structure

This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.

array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure

Last synced: 26 Jun 2025

https://github.com/brayflex/spy-sector-rotation-google-sheet

Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.

data etf google-sheets index price rotation script sector spreadsheet spy stock-market

Last synced: 29 Jun 2026

https://github.com/heitang/fcu-courseapi

Feng Chia University: Course Search System API usage guide

api data fcu project

Last synced: 27 Jul 2025

https://github.com/mysociety/sync-ep-to-jkan

Syncs EveryPolitician data to mySociety's data portal.

data everypolitician jkan politicians

Last synced: 27 Jul 2025

https://github.com/gunn/covid-19-scripts

Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries

covid-19 data geography typescript

Last synced: 17 May 2026

https://github.com/i-rzr-i/domaincommonextensions

This repository/library provides the most relevant and commonly used extension methods for application development. They help improve code quality and writing speed, and free up dev team time for more complex functionality.

api class data datatype extension helper object parser type util

Last synced: 20 Sep 2025

https://github.com/jacopodl/jcollections

Common data structures for the C language

c collections data data-structures jcollections

Last synced: 30 Jul 2025

https://github.com/iankitnegi/statistically_speaking

Explore diverse projects showcasing statistical techniques with real-world data, comprehensive docs, and interactive visualizations.

data excel statistical-analysis statistics

Last synced: 09 Feb 2026

https://github.com/gaemapiracicaba/norma_dec_8468-76

Quality standards and effluent discharge standards for inland waters

data python

Last synced: 19 Apr 2026

https://github.com/cunfuu/network-bubbles

Makes it easier to manage organizations and keep notes about them, in order to organize events and easily access their needs

data data-visualization organizations organizations-volunteer

Last synced: 31 Jul 2025

https://github.com/bastianolea/servel_elecciones_core

Election results from Servel (2024)

chile comunas data elecciones genero

Last synced: 01 Aug 2025

https://github.com/canadaluke888/speedtable

Ultra-fast terminal table renderer written in C

c data datasets fast python python-wrapper python3 tables

Last synced: 01 Mar 2026

https://github.com/edjoukou/human_resources

A data analysis project using MySQL Server database

analysis data mysql powerbi sql visualization

Last synced: 25 Sep 2025

https://github.com/entorb/analyze-ha-energy

Analyze Home Assistant Solar Production Data

data home-assistant pandas photovoltaic pv python

Last synced: 08 May 2026

https://github.com/jun-labs/jq

🧷 Let's practice jq.

data jq json json-data parse

Last synced: 27 Sep 2025

https://github.com/abdullahashfaqvirk/Earth-Engine-Data-Scraper

A Python-based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research and analysis purposes.

beautifulsoup data data-science python requests scraper web-scraping

Last synced: 27 Sep 2025

https://github.com/alecxcode/table-parser

Python Table Parser (data extraction)

automation data extraction python robotic-process-automation

Last synced: 04 May 2026

https://github.com/haimonmon/j3mify

Convert jejemon words into formal words or sentences

data jejemon nlp normalization python regex tagalog tokenization

Last synced: 12 Oct 2025

https://github.com/elissorokin/data-analyst-portfolio

This is a repository where I showcase my skills, share projects, and track my progress in data analysis and data science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 09 Apr 2026

https://github.com/analyst-amitbisht/Pizza-Sales-Report-

A guided project to practice tools such as SSMS and Power BI, along with skills such as data cleaning, data exploration, data analysis, and data visualization.

analytics data data-visualization powerbi sql-server

Last synced: 01 Oct 2025

https://github.com/ahmadjamil888/ink-flow-share

A Medium clone with all the basic features, such as blog generation, auth, history, and user data

articles blogs cs data flow herald ink ink-flow-share journalism medium post react shad shadcn share users vite

Last synced: 09 Apr 2026

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/chompfoods/sdk-java

Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk

Last synced: 09 Apr 2026

https://github.com/fabsdevx/files-to-database-loader-handout

Data Engineering project for learning purposes. Credits to itversity

csv data data-engineering database json pandas python

Last synced: 09 Apr 2026

https://github.com/srindot/fwuav-average-flight-data-collection

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 10 Aug 2025

https://github.com/ahmad-ali-rafique/heart-disease-detection-model

A comprehensive project for detecting heart disease using machine learning, including data processing, model training, and evaluation metrics with AUC curve analysis.

artificial-intelligence data datascience heart-disease machine-learning modeling prediction-model

Last synced: 11 Aug 2025

https://github.com/0xhericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 09 Feb 2026

https://github.com/jleung51/foundations-dags

Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.

data data-engineering etl extract housing load pipeline transform

Last synced: 04 Oct 2025

https://github.com/itsachrafmansari/moroccan-real-estate-analysis

Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.

api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping

Last synced: 13 Aug 2025

https://github.com/rationalprabal/book-management-app

A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation, and documentation using Swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.

data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles

Last synced: 10 Apr 2026

https://github.com/rugwiroparfait/alx_sql

This repo is where I save my queries and learning materials from the ALX Data Science program

anaconda data data-analysis jupyter-notebook sql

Last synced: 19 Aug 2025

https://github.com/giscience/measures-rest-oshdb-app

A frontend for providing measures for geospatial datasets, using the OSHDB

data dggs geospatial measure openstreetmap rest

Last synced: 20 Apr 2026

https://github.com/ahmad-ali-rafique/wine-quality-dataset

Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.

analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model

Last synced: 21 Aug 2025

https://github.com/debjyotisaha/tableau-projects-phase-2

Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.

dashboards data data-analysis data-visualisation tableau

Last synced: 26 Aug 2025

https://github.com/schoolsquirrel/holiday-data

Automatically updated holiday data for SchoolSquirrel

data holidays schoolsquirrel scripts vacation

Last synced: 03 Oct 2025

https://github.com/miozilla/fraudfinder

fraudfinder: Historical Payment Transactions # Fraud Detection # EDA # Feature Store # Model Registry

analysis data exploratory feature-store fraud-detection

Last synced: 29 Aug 2025

https://github.com/chocolateboy/data

Structured data scraped from unstructured (or semi-structured) sources

data dataset datasets json opendata scrape scraped scraper wikipedia

Last synced: 30 Aug 2025

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/gman-au/white-knight-neo4j

Neo4j implementation of White Knight data abstraction library

abstractions data datastore dotnet neo4j repository-pattern specification-pattern

Last synced: 20 Jan 2026

https://github.com/koppalexander/flightdelaychallenge

This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.

data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling

Last synced: 19 Jun 2026

https://github.com/piyushkumar2025/india-general-elections-2024_data-analyst

Analyzed election data for 540+ constituencies and 100+ parties using SQL. Calculated state-wise seat distributions, classified 30+ parties into alliances, identified top 10 candidates by EVM votes, calculated victory margins, and analyzed voting patterns for 300+ candidates to uncover key insights.

analytics data database mysql sql statistics

Last synced: 22 May 2026

https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails

Software | SQL database with automated cloud sync for mitigating data loss across distributed servers. Managed by Ruby on Rails.

cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql

Last synced: 24 Jul 2025

https://github.com/snimmagadda1/luigi-etl-example

🔍 Example of an ETL pipeline using Spotify's Luigi

data luigi luigi-pipeline python spotify

Last synced: 30 Mar 2025

https://github.com/carlosrs14/parallel-data-preprocessig-system

A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.

barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads

Last synced: 24 Jul 2025

https://github.com/nmelgar/birthday_sports_dataviz

An analysis of how the Matthew Effect has influenced professional sports players.

analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau

Last synced: 08 Jan 2026

https://github.com/ttozatto/sparkify

Churn Prediction for music streaming app with PySpark

analysis churn data learning machine predictive pyspark science spark

Last synced: 16 Jan 2026