An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/miroslavvidovic/distribution-graphs

Creating ASCII graphical histograms in the terminal with https://github.com/philovivero/distribution

ascii data graph histogram python terminal

Last synced: 24 Apr 2026

https://github.com/mawiegand/automatic-point-label-placement-data

Test instances for the automatic point label placement problem.

data datastructures generator javascript labeling problem ruby

Last synced: 16 May 2026

https://github.com/meta-llama/synthetic-data-kit

Tool for generating high quality Synthetic datasets

data generation llm python synthetic

Last synced: 08 May 2025

https://github.com/lordzintick/spellcaster-api-1.21.4

A server-side Fabric mod to load JSON spell files from datapacks.

api api-server data fabric fabric-mod fabricmc json magic minecraft-mod server-side small spells

Last synced: 08 May 2026

https://github.com/injamul3798/cpp_stl-discussion

As we know ,STL is mostly used tools is competitive programming.

data list map set structure vector

Last synced: 02 Apr 2025

https://github.com/paulveillard/cybersecurity-analytics

An ongoing collection of awesome software, libraries, learning tutorials, documents and books, technical resources and cool stuff about Analytics Engineering in Cybersecurity.

analytics bigdata bigquery cybernetics cybersecurity data data-engineering data-science encryption encryption-decryption seo seo-friendly seo-optimization

Last synced: 28 Mar 2025

https://github.com/naufalbasara/superstores-pipeline

Data Pipeline on Dummy E-commerce with Apache Airflow

airflow data data-engineering data-pipeline data-warehouse postgresql

Last synced: 16 May 2026

https://github.com/rd-uk/rduk-data-sqlite

SQLite Data Provider implementation for rduk-data

data rduk sqlite

Last synced: 16 May 2026

https://github.com/devbigboy/iti-database

This course will cover the following Topics: joins, Normalization, Aggregate function, Group By, Order By, Select, Ranking Functions, Built-In Functions

analytics data data-analytics mssql-database sql sql-server

Last synced: 03 Nov 2025

https://github.com/gabboraron/datacamp_projects

Here you can find my DataCamp Projects

data datacamp datacamp-projects

Last synced: 14 Jun 2026

https://github.com/wciesialka/top-names

A Python module for scraping the list of top first names in the United States.

data python python3

Last synced: 08 Jun 2026

https://github.com/fridex/real-estate

My machine learning in real estate

data machine-learning real-estate

Last synced: 27 Jun 2025

https://github.com/dhi13man/rca_ace

RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).

analytics data enterprise open-source python python3 rca

Last synced: 10 Sep 2025

https://github.com/ayresgneto/use-case-gcp-etl

ELT pipeline GCP. Tecnologias utilizadas: Postgresql, GCP Storage, Airflow (local), Pyspark (local), BigQuery

airflow big-data bigquery data data-engineering etl gcp pipeline postgresql programming-oriented-object pyspark python spark

Last synced: 03 Jan 2026

https://github.com/ivanshero400/kutub-al-salaf-database

أضخم مكتبة مفتوحة المصدر للكتب الإسلامية التراثية | 7,878 كتابا | 40 تصنيفا | المصدر: مكتبة كيزانه (Kizanah) | تحميل مباشر من بايثون بسطر واحد

arabic books-database data hadith islamic-books islamic-heritage kizanah open-source python sqlite

Last synced: 02 Jul 2026

https://github.com/radekbednarik/att

Python wrapper for calling Apitalks API.

api-wrapper apitalks data python3 rest-api wrapper

Last synced: 05 Apr 2025

https://github.com/talitalobo/statistics-with-python

Repo about statistical concepts and (not always) their python implementation.

data data-science machine-learning statistics

Last synced: 11 Jan 2026

https://github.com/stdlib-js/array-base-banded-filled2d-by

Create a filled two-dimensional banded nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 19 May 2026

https://github.com/csmith0651/ormy

A simple python ORM.

data database python

Last synced: 13 May 2026

https://github.com/stkisengese/numpy-data-fundamentals

A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.

data data-analysis numpy pre-processing

Last synced: 16 May 2026

https://github.com/webobite/fact-chatbot

A Fact chatbot is a project in which it read a txt file which consist all facts ahead of time and answer the user with some useful information regarding the same on the basis of facts provided in text file.

chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp

Last synced: 04 May 2026

https://github.com/iliyasalve/cyclistic_case_study

Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"

bike-sharing data data-analysis data-visualisation r

Last synced: 06 Apr 2025

https://github.com/erkylima/algorithms

Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! 🚀 #Python #Algorithms #DataStructures

algorithms algorithms-and-data-structures data data-structures

Last synced: 19 Jan 2026

https://github.com/emna-chebbi/student-performance

Predictive model for student exam scores based on student performance factors

ai computer-vision data kaggle machine-learning ml mse regression regression-models

Last synced: 15 May 2026

https://github.com/ericgio/history-of-jazz

Data and visualizations based on Ted Gioia's "The History of Jazz"

data jazz

Last synced: 28 Mar 2025

https://github.com/robsteranium/user2022-ldf-talk

Slides from my useR! 2022 talk about the Linked-Data Frames package

data data-frame linked-data r rdf

Last synced: 19 Apr 2025

https://github.com/praveendecode/data-analysis

Implemented data analysis projects with interactive Streamlit UI for user-friendly data exploration and insights presentation

data data-science dataanalysis exploratory-data-analysis insights python streamlit-dashboard tableau tableau-public

Last synced: 04 Apr 2025

https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java

Hash Table is a data structure which stores data in an associative manner. In a hash table, data is stored in an array format, where each data value has its own unique index value. Access of data becomes very fast if we know the index of the desired data. Thus, it becomes a data structure in which insertion and search operations are very fast irrespective of the size of the data. Hash Table uses an array as a storage medium and uses hash technique to generate an index where an element is to be inserted or is to be located from.

arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra

Last synced: 12 Apr 2025

https://github.com/denisecase/buzzline-04-case

Adding live visualizations to streaming data applications

animation data kafka matplotlib python streaming

Last synced: 11 Apr 2025

https://github.com/denisecase/cintel-03-data

Getting started with interactive data analytics in Python

analytics data interactive python shiny

Last synced: 11 Apr 2025

https://github.com/moscatellimarco/webscrap-imdb

🎬 Python scraper for IMDB: Extract movie/TV details for 📊 analysis & 🗃️ storage. Easy setup, 🔧 customizable, with 🖥️ CLI.

css data datascience html movies python scrapy scrapy-crawler scrapy-spider web web-scraping webdata webscraping

Last synced: 15 May 2026

https://github.com/bagustris/dataits

Web for DataITS17: Summer School on Data Science

data data-science

Last synced: 28 Jun 2025

https://github.com/halyusa16/basic-sql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 16 May 2026

https://github.com/sumansuhag/wasserstoff-aiinterntask

Welcome to the AI Pipeline for Image Segmentation and Object Analysis project – a state-of-the-art solution designed to process, segment, identify, and analyze objects within images. This AI-powered pipeline is engineered to deliver precise insights by extracting, mapping, and summarizing data from each segmented object.

artificial-intelligence cdn data data-science modeling pipline

Last synced: 28 Mar 2025

https://github.com/4ment/aiv-rate-heterogeneity

Avian influenza virus data sets

data influenza

Last synced: 24 Jan 2026

https://github.com/phette23/nces-ipeds-archive

download NCES IPEDS data

data datarescue ipeds nces

Last synced: 30 Jun 2026

https://github.com/thesfinox/sql-simple-backup

Simple script to backup data in a MySQL database and store it in a WebDAV server.

backup bash data mysql script sql webdav

Last synced: 18 Apr 2026

https://github.com/shysolocup/fndt

JavaScript package allowing you to see function data like body and arguments from outside of the function

aepl data fndt functions javascript javascript-tools js js-function js-functions lightweight nodejs nodejs-modules package stews

Last synced: 30 Apr 2026

https://github.com/purarue/blizzard_gdpr_parser

Parses date-related information from my blizzard GDPR export.

blizzard data gdpr webscraping

Last synced: 06 Apr 2025

https://github.com/purarue/hpi-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 06 Apr 2025

https://github.com/yourdataarchitect/abyat-scaring-

This Scrapy spider for automates the extraction of product data from the Abyat website using Hidden Backend API, supporting both Arabic and English content.

data database scraper scrapy-crawler

Last synced: 23 Apr 2026

https://github.com/sumansuhag/prediction_model

This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.

algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn

Last synced: 28 Mar 2025

https://github.com/renebentes/2808

Curso 2808 - Fundamentos do Entity Framework

course csharp data ef-core

Last synced: 27 Jun 2025

https://github.com/lakshyakumar266/jee-dpp-manager-app

DPP manager app for JEE preparing Students

data expo javascript management react-native

Last synced: 07 May 2026

https://github.com/rudxain/xorsum

Get XOR checksum with this command-line tool

binary checksum cli data digest file files hexadecimal rust-crate xor

Last synced: 08 Mar 2026

https://github.com/mai-space/design-concept-sharing-recipes

🖼️ Concept for a framework based on state of the art technology and libaries for secure data sharing and online collaboration, as well as focus on the ux and ui of said framework

concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories

Last synced: 14 May 2025

https://github.com/rsc-labs/see-open-data

Show www.dane.gov.pl in user friendly format. Generate flourish data or other data visualizations.

data data-visualization flourish government poland

Last synced: 04 Apr 2025

https://github.com/jph5396/sumomodel

A data models related to sumo wrestling.

data go sumo

Last synced: 17 Jan 2026

https://github.com/hackolade/yugabytedb-ysql

Hackolade(https://hackolade.com) plugin for the Cloud Native Yugabyte database with YSQL API

data data-modeling entity-relationship-diagram schema-design ysql yugabyte yugabytedb

Last synced: 30 Apr 2025

https://github.com/gagolews/clustering-data-v0

Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]

clustering data dataset machine-learning

Last synced: 15 Sep 2025

https://github.com/hivesolutions/repos

Modular repository management system

data python repos storage system

Last synced: 14 May 2026

https://github.com/lu-sketch/chocolate-imports-dataset

Chocolate Imports for South Africa

data eda visualization

Last synced: 18 May 2026

https://github.com/darshjasani/claims-analysis

This repository contains a comprehensive analysis of claims data, detailing the workflow from data preprocessing to model evaluation. The goal of this analysis is to build predictive models to improve claims prediction and management.

analysis data linear machine-learning python

Last synced: 16 May 2026

https://github.com/miss-mhv/data-analysis-for-social-buzz

In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.

data jupyter-notebook python

Last synced: 14 May 2026

https://github.com/canadaluke888/terminaltablebuilder

Build and edit tabular data all from the terminal.

cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables

Last synced: 20 Apr 2026

https://github.com/reubano/pyconza-tutorial

Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial

data functional-programming jupyter-notebook meza pycon python tutorial

Last synced: 17 May 2026

https://github.com/chompfoods/sdk-scala

Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk

Last synced: 17 May 2026

https://github.com/austinv11/pypeline

A simple data pipeline builder for Python 3+

data leveldb pypeline python python3 stream-processing

Last synced: 20 Aug 2025

https://github.com/ppmim/papi4k_old2

PAPI: the PANIC data reduction pipeline

data near-infrared pipeline processing

Last synced: 23 Jun 2025

https://github.com/parmsam/rweekly.data

R package containing data on Rweekly posts

data package rweekly

Last synced: 21 May 2026

https://github.com/kuanjiahong/covid19-analysis

A simple project to familiarize myself with data analysis

data data-science data-visualization pandas python

Last synced: 02 Apr 2025

https://github.com/stdlib-js/array-base-fill-by

Fill all elements within a portion of an array according to a callback function.

accessor array data fill generic javascript map node node-js nodejs set stdlib structure transform typed types

Last synced: 14 May 2026

https://github.com/rajlabmssm/echodata

echoverse module: Example data.

data echoverse fine-mapping genomics gwas qtl

Last synced: 17 Jan 2026

https://github.com/jitsasmal/customer-purches-behavior-and-shopping-analysis

Create dashboard to analyse the data based to total product sales, terget, revenue, state and season wize analyse to show the current treand the data.

analytics dashboard data etl powerbi

Last synced: 14 Feb 2026

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/nel-zi/zipco_foods

Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.

airflow apache-spark data dataengineering etl pyspark wsl

Last synced: 03 May 2026

https://github.com/srindot/average_flightdata_collection_fwuav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Sep 2025

https://github.com/sofyan48/wahoo

Data stream library with kinesis

aws data data-stream event kinesis stream

Last synced: 14 May 2026

https://github.com/badawy403/egy.list

A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.

city data egypt javascript nodejs npm package

Last synced: 08 Mar 2026

https://github.com/denisecase/cintel-04-reactive

Interactive analytics, reactive app built with Shiny for Python

analytics bokeh data flights interactive mtcars penguins python relationships shiny

Last synced: 20 Jun 2025

https://github.com/sakan811/gachascope

Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.

data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves

Last synced: 04 May 2026

https://github.com/namescode/hub_harvester

A python script to gather data on a user or organisations git repos

data github nix nix-flake python python3 sqlite

Last synced: 08 Apr 2026

https://github.com/UznetDev/Smoking-Prediction

This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.

ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking

Last synced: 28 Mar 2025

https://github.com/madhuresh2011/kulturehire-internship

☺️Hi folk, During my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.

data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql

Last synced: 17 Feb 2026

https://github.com/sajjadanwar0/booking.com-scraping

Scraping booking.com using Selenium and Beautiful Soup

crawler data python scraping selenium

Last synced: 18 Oct 2025

https://github.com/indhra/cats-ijcnn-data-2004

CATS IJCNN Data 2004 Competition of Artificial Time Series

2004 artificial cats data ijcnn time-series

Last synced: 22 Mar 2025

https://github.com/youmenomi/hydreigon

Are you looking for a Hydreigon to classify data for you? Come and catch it!

classify data hydreigon indexer items management pokemon sortable structure typescript

Last synced: 07 May 2025

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/maximkrouk/storage

Lightweight framework for storing data (beta)

cache data keychain memmory storage swift swift5-1 userdefaults

Last synced: 02 Jul 2026

https://github.com/allanotieno254/spss-nutrition-research

This repository contains the results of statistical analyses performed in IBM SPSS Statistics on a child nutrition dataset.

data data-preprocessing dataanalysis spss

Last synced: 17 Feb 2026

https://github.com/bho0920/crime-data-analysis-eu

Crime Data Analysis for Self-Defense Tool Market Entry in the EU.

data data-analysis sql sqlite tableau

Last synced: 21 Jun 2025

https://github.com/istinnew/cook-me-up

[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.

bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning

Last synced: 16 May 2026

https://github.com/skygenesisenterprise/api-service

The Official Sky Genesis Enterprise API Service Ecosystem

api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket

Last synced: 31 Dec 2025

https://github.com/ioboi/obloc-data

Scrape guest counter of O'BLOC 🧗‍♀️

data scraping

Last synced: 04 Nov 2025

https://github.com/sharoonjoseph321/social_media_eda

Data Analysis on social media apps ,using pandas, python, matplotlib.

data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects

Last synced: 03 Mar 2025

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026