An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/dev88jerry/cs304

Bishop's University - CS304 Data Structures

bishops bu data data-structures python structure university

Last synced: 11 Jun 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/lamouchi-bayrem/data-matrix-scanner

A dual-interface tool that leverages AI to detect and decode QR codes and Data Matrix codes from images using computer vision

data datamatrix-scanner decoder flask qrcode scanner tkinter-gui webapp

Last synced: 30 Apr 2026

https://github.com/andygol/andygol.github.io

Andrii Holovin – Product & Project Manager Geospatial Expert / OpenStreetMap Consultant / DevOps practitioner

consultant data data-structures devops experience floss gis mapping navigation openstreetmap personal-site personal-website

Last synced: 13 May 2026

https://github.com/priyam-hub/covid-19-data-analysis

Explore COVID-19 case numbers and deaths from the 2019/2020 coronavirus outbreak using pandas in a Jupyter notebook

analysis data data-visualization jupyter-notebook machine-learning python

Last synced: 08 Jun 2026

https://github.com/mmaithani/kaggle-projects

Collection of resources from the competition, kernel, and data sections, plus all the magic code I have been using to get the most out of a problem

computer-vision data data-science image-processing machine-learning python

Last synced: 30 Apr 2026

https://github.com/chompfoods/sdk-jaxrs-cxf

JAXRS-CXF SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

apache-cxf api branded chomp cxf data database food grocery ingredients java jax-rs nutrition raw recipe-api recipes sdk

Last synced: 30 Apr 2026

https://github.com/dnut/json-match-finder

Python application used to match listings against openings via authenticated JSON API access.

data data-structures data-wrangling database json-api python-application python-modules

Last synced: 01 May 2026

https://github.com/alexyiann/finance

This repository contains scripts for pulling and comparing data, as well as simple Python scripts for automating crypto trades and backtesting trading strategies on both crypto and stocks.

api bots data database finance option option-strategies strategy trading trading-algorithms

Last synced: 03 Jan 2026

https://github.com/benmizrahi/reactivejs

microservices event bus for async/sync communications

data microservices nodejs

Last synced: 01 May 2026

https://github.com/mightymetrika/holi

holi: Higher Order Likelihood Inference Web Applications

data data-science r statistics

Last synced: 10 Feb 2026

https://github.com/nafisalawalidris/nafisalawalidris

Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest in blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and Bitcoin converge.

artifical-intelligence bitcoin config data data-science developer github-config github-pages machine-learning

Last synced: 16 May 2026

https://github.com/m0nica/datalogues-refresh

:bar_chart: Programming blog focused on data with an emphasis on exploration in Python.

data jekyll python technical-writing

Last synced: 14 May 2026

https://github.com/kunalshelke90/kunalshelke90

💻 Machine Learning Enthusiast | Data Science Explorer | Eager to solve problems with the help of data.

data data-science dataanalysis database machine-learning mlops

Last synced: 06 Jul 2025

https://github.com/anisimov-anthony/data_forest

Implementation of various types of trees

algorithms-and-data-structures data lib rust tree

Last synced: 28 Apr 2025

https://github.com/lut-ful/ibm-capstone-project-stack-overflow-job-survey

Final project for the IBM Data Analyst Professional Certificate program.

cognos data data-analytics looker power-bi python sql statics

Last synced: 01 May 2026

https://github.com/ahmed-naserelden/astro-success-analytics

This project analyzes key factors influencing success in the Space Race using data science techniques. It includes data collection, machine learning modeling, and insightful visualizations to predict mission outcomes.

data dataanalysis python

Last synced: 01 May 2026

https://github.com/yeti-robotics/past-scouting-data

❄️ Scouting Data from Previous Events/Seasons ❄️

data first frc

Last synced: 06 Jan 2026

https://github.com/flexiui-labs/flexi-grid

Flexi Grid is an advanced, lightweight, and customizable Angular 19 data grid component

angular data filter grid search select sort table

Last synced: 14 May 2026

https://github.com/sadratehranian/data-collection-and-machine-learning

First, creates a logistic regression model to predict whether a smoke detector's fire alarm should sound. Second, predicts whether an electric drive in a production plant may be faulty.

data data-analysis data-science datacollection logistic-regression machine-learning ml nn

Last synced: 05 Jan 2026

https://github.com/zazza123/hamana

A Python library for seamless data extraction, storage, and SQL-based analysis using pandas and SQLite.

analysis data python

Last synced: 14 Jan 2026

https://github.com/programmer-rd-ai/competitive-programming-solutions

A collection of my solutions to various competitive programming problems from platforms like LeetCode. This repository serves as a personal archive of my problem-solving journey, covering a range of algorithms, data structures, and problem-solving techniques.

algorithm algorithms algorithms-and-data-structures data datastructures dsa javascript pandas python structures

Last synced: 01 Mar 2025

https://github.com/vidushibhadana/covid19-data-exploration-using-sql

Applied diverse SQL techniques to analyze COVID-19 data for an improved understanding of the pandemic's regression.

data database database-management sql

Last synced: 19 Aug 2025

https://github.com/stoyank7/football-prediction

This is my Semester 7 Project for my "AI for Society" minor at Fontys University of Applied Sciences.

ai betting data football machine-learning university-project

Last synced: 25 Mar 2025

https://github.com/dineshdhamodharan24/data-analysis

Probability analysis of customers and basic analysis

analysis data powerbi probability python visualization

Last synced: 23 Jun 2026

https://github.com/renebentes/2806

Course 2806 - Data access with C#, .NET 5, Dapper, and SQL Server

csharp dapper data dotnet sqlserver

Last synced: 19 Apr 2026

https://github.com/soenneker/soenneker.constants.data

A set of commonly used constants related to various types of data

constants csharp data dotnet

Last synced: 12 Mar 2026

https://github.com/woctezuma/recent-sales-data

Data available to estimate sales of Steam games during release week.

data sales steam

Last synced: 05 Feb 2026

https://github.com/quonverbat/ordner

A simple, customizable and cross-platform data tracker.

data datatracker javafx management

Last synced: 07 Jul 2025

https://github.com/buffdelta/basketball_ref_webscraper

Python package to make webscraping from basketball-reference easy

basketball data python python-library webscraping

Last synced: 14 Jan 2026

https://github.com/roshaka/samplr

Samplr is a Python decorator for selecting a subset of items from a list, with options for customisation and informative console printouts.

data data-analysis data-engineering decorators list python sampling

Last synced: 14 Jan 2026

https://github.com/fiedsch/data_util

Miscellaneous utilities for data files, such as variable name lists

data helper management php

Last synced: 14 Jun 2025

https://github.com/bkestelman/dasy-ml

DaSy DataSynthesizer - Create synthetic data with desired statistical properties for machine learning research.

data data-science machine-learning

Last synced: 14 Jan 2026

https://github.com/atiqurcode/scrap-spec

Scrape data from HTML into HTML table code or JSON

data html-table json-data scarp

Last synced: 05 Feb 2026

https://github.com/juangesino/research-project

Course files for Research Project @ University of Amsterdam

data data-science economics stata

Last synced: 02 Jan 2026

https://github.com/stupidcucumber/elephant-crawler

System for mining texts from websites.

data data-mining-python python

Last synced: 25 Apr 2026

https://github.com/shadeglare/genum

ESNext tools for processing data in a LINQ-like manner

data linq processing typescript

Last synced: 13 Apr 2026

https://github.com/afolabi022/getting-and-cleaning-data-course-project

Tidy Dataset Creation for Human Activity Recognition. This repository contains the code and files for cleaning and transforming the Human Activity Recognition Using Smartphones dataset into a tidy format. The project demonstrates data wrangling skills in R, including merging datasets

data data-science datacleaning r

Last synced: 25 Mar 2025

https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support

RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.

ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit

Last synced: 01 May 2026

https://github.com/bmcollier/contiguous

Provides COBOL-style contiguous data structures in Python

cobol contiguous data python

Last synced: 14 Jan 2026

https://github.com/krakozaure/pyzzy

Set of packages to simplify development in Python

configuration data formats json library logging logs python3 toml utils yaml

Last synced: 14 Jan 2026

https://github.com/jstafford5380/provausio.testing.generators

Generate fake data for testing and/or mocking

data fake-data generator testing

Last synced: 14 Jan 2026

https://github.com/idhruvs/angular4-smart-table-demo

Angular4 Smart Table Demo Project

angular4 data tables typescript

Last synced: 21 Apr 2026

https://github.com/albanecoiffe/jo2024_visualization

Streamlit dashboard on the Paris 2024 Olympic Games.

data streamlit visualization

Last synced: 30 Apr 2026

https://github.com/nagipragalathan/linkedin_backup_datas

This repository contains the backup data from my previous LinkedIn account. Unfortunately, my old LinkedIn account was compromised and subsequently blocked by LinkedIn. As a result, I created a new account, but that too got blocked for reasons unknown to me.

backup blocked data linkedin linkedin-account memory nagipragalathan recovery storage

Last synced: 18 Jan 2026

https://github.com/scx567888/scx-data

✨ SCX Data

data java scx

Last synced: 05 Apr 2025

https://github.com/otoneko1102/roulette-base

A collection of roulette colors and numbers in JSON format. Use it when making a casino-style roulette.

casino casino-games data json require roulette

Last synced: 16 Mar 2025

https://github.com/ournet/view-data

Ournet view-data nodejs module

data ournet view view-data

Last synced: 04 Apr 2025

https://github.com/desoga10/nety-form

In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.

contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify

Last synced: 03 Jan 2026

https://github.com/samhollings/nhs_data_cleansing

A repo of reusable functions for cleansing data

cleansing data data-cleaning data-cleansing preprocessing pyspark python python3

Last synced: 05 Oct 2025

https://github.com/nel-zi/climainsights

Developed an automated ETL pipeline using Apache Airflow and Python to collect, process, and store weather data from multiple cities via Weatherstack API. Implemented data cleaning, orchestration, and error handling to ensure accuracy and scalability.

airflow apache-spark data data-engineering engineering etl-pipeline

Last synced: 01 May 2026

https://github.com/luminati-io/Google-Maps-dataset-samples

A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.

api data dataset google-maps maps web-scraping

Last synced: 09 Apr 2025

https://github.com/fuadarradhi/gps_data_reset

Flutter plugin to reset and download gps data

cache data extra gps reset

Last synced: 23 Feb 2026

https://github.com/iankitnegi/statistically_speaking

Explore diverse projects showcasing statistical techniques with real-world data, comprehensive docs, and interactive visualizations.

data excel statistical-analysis statistics

Last synced: 09 Feb 2026

https://github.com/posixpascal/apple_appstore_search

📊 Get public App Store data for your app as a Ruby hash. That's it.

appstore data gem ios ruby

Last synced: 16 Mar 2025

https://github.com/stdlib-js/ndarray-vector-uint32

Create an unsigned 32-bit integer vector (i.e., a one-dimensional ndarray).

constructor ctor data javascript ndarray node node-js nodejs stdlib structure types uint32 vec vector

Last synced: 25 Apr 2026

https://github.com/pathilink/ebury_case

Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.

bigquery data modeling sql

Last synced: 05 Oct 2025

https://github.com/musamairshad/dsa-python

This repository contains all the material related to Data Structures and Algorithms implemented in Python.

algorithms data datastructures efficiency python searching-algorithms sorting-algorithms

Last synced: 25 Mar 2025

https://github.com/living-with-machines/zoonyper

Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks

crowdsourcing data data-processing data-science python zooniverse

Last synced: 04 Jul 2025

https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito

This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.

bigquery data data-analysis etl-pipeline tableau

Last synced: 21 Jan 2026

https://github.com/grace-mengke-hu/redditpushshiftapi

This package collects Reddit datasets and organizes the data in a MongoDB database

collection data reddit

Last synced: 13 Jun 2025

https://github.com/davorg/towerbridge

When is Tower Bridge lifting?

data hacktoberfest london perl web-scraping

Last synced: 29 Jun 2026

https://github.com/kashyap-prabhat/sigma

A Scala library for probability and statistics formulas, including rules for probability calculations.

data formulas library mathematics probability scala statistics

Last synced: 06 Oct 2025

https://github.com/pdoup/enegry

Time-Series dataset combining multiple sources to explain the broader Greek energy market

data dataset day-ahead-auction energy-markets exploratory-data-analysis forecasting futures-market greek-energy-market renewable-energy time-series-data weather-data

Last synced: 07 May 2025

https://github.com/tsbarr/belly-button-challenge

Using front-end development tools (JavaScript, HTML, and CSS), I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 04 Mar 2026

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 13 Apr 2026

https://github.com/codegouvfr/codegouvfr-data

🧢 Data for code.gouv.fr

bluehats codegouvfr data

Last synced: 05 Mar 2026

https://github.com/ryanve/i11

CSS named colors list

colors css data dataset

Last synced: 07 Oct 2025

https://github.com/codegouvfr/codegouvfr-sources

🧢 Static web frontend for code.gouv.fr

bluehats codegouvfr data frontend

Last synced: 28 Feb 2025

https://github.com/white-gecko/lineage-dump

RDF dump of the device information from the lineage wiki

data dataset lineageos rdf

Last synced: 28 May 2026

https://github.com/ayush-raj8/godata

Writes data to a file, standardizing the format so that other programs can easily parse and read it.

data golang

Last synced: 18 Jan 2026

https://github.com/unknownsoup/budget_tracker

A personal budget tracker to build my knowledge of working with databases and data analysis. In this case, using SQL and Python for the analysis.

data data-science databases python sql

Last synced: 26 Jan 2026

https://github.com/so-cool/junction

My solution to the University of Bristol "Bristol Journey Time" Data Challenge https://So-Cool.github.io/junction

competition data modelling timeseries

Last synced: 02 Apr 2025

https://github.com/sysread/skewer

A priority queue for Go implemented using a skew heap

binary data go heap min minqueue priority queue skew structure

Last synced: 26 Aug 2025

https://github.com/lexiortiz/advanced-data-analytics

Structured learning notes, code snippets, and key takeaways from the Google Advanced Data Analytics Professional Certificate. Serves as a personal reference for reinforcing concepts and as a resource for others on a similar learning journey.

data data-analysis data-engineering google python-3 sql

Last synced: 29 May 2026

https://github.com/pythoncoderunicorn/startrek

A repo for Star Trek data from the Technical Manuals

data klingon-language star-trek vulcan

Last synced: 07 Oct 2025

https://github.com/bcongdon/nid-data

National Inventory of Dams Data

data datasette government-data

Last synced: 21 Apr 2026

https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson

UK "Counties" Extracted from Office for National Statistics data

data geojson maps uk

Last synced: 29 Mar 2025

https://github.com/ohspc89/better_call_jin

A repository containing mentoring materials for a Ph.D. student in Neuroscience

data matlab spss-statistics visualization visualization-tools wrangling-data

Last synced: 08 Oct 2025