An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/afnanenayet/ds-a

Some interview prep I've been doing. This repo is reimplementations of algorithms and data structures in Python3

algorithms data interview prep python structures

Last synced: 05 Apr 2025

https://github.com/rahul1582/bank-loan-classification

Classifying whether a person is taking personal loan or not using all the Classification Algorithms.

algorithm analysis classi data

Last synced: 08 Oct 2025

https://github.com/udofia2/crudwithdatabase

A simple Nodejs app that connect to a database.

crud data databse

Last synced: 08 Oct 2025

https://github.com/leevilaukka/alkometriikka

Tool to search Alko database and see some fun stats about different beverages

data gh-pages svelte typescript xlsx

Last synced: 18 May 2026

https://github.com/veivel/f1-sentiment-analysis

An entiment analysis project on tweets about Formula 1. To be reworked.

data f1 nlp-library nlp-machine-learning

Last synced: 04 Jul 2025

https://github.com/goto-eof/bitmaptize

Wraps data inside a .bmp and extracts data from .bmp.

bitmap bmp convert data wrap

Last synced: 18 Jan 2026

https://github.com/theopenwebjp/theopenweb-data-loader

Package for loading data to local project

data downloader import javascript typings

Last synced: 10 Oct 2025

https://github.com/bastianolea/minsal_suicidios

Casos de intento de suicidio y suicidio consumado en Chile

chile comunas data genero salud tiempo

Last synced: 19 Jan 2026

https://github.com/chowington/bg-counter-tools

A set of tools that can pull data from Biogents BG-Counter smart mosquito traps and convert them into a Darwin Core compliant format.

bg-counter biogents darwin-core data internet-of-things mosquito-prevalence population-dynamics

Last synced: 10 Oct 2025

https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard

This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.

charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi

Last synced: 23 Feb 2026

https://github.com/aldro61/mmit-data

The data used in the Maximum Margin Interval Trees paper

data machine-learning machine-learning-algorithms reproducible-research

Last synced: 19 Feb 2026

https://github.com/ghomashudson/ao3_style_change

Style change detection dataset using AO3 fics

ao3 data dataset datasets fanfiction long-document style-change-detection

Last synced: 11 Oct 2025

https://github.com/dhruvil-26/tableau-projects

This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, Team and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns

analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public

Last synced: 19 Jan 2026

https://github.com/equinor/sumo-wrapper-python

Thin python wrapper to interact with Sumo API

analytics data fmu python subsurface sumo

Last synced: 19 Jan 2026

https://github.com/ginga1402/data_visualization_on_honey_production_dataset

Data Visualization using Matplotlib & Seaborn Libraries

college-project data data-visualization

Last synced: 25 Aug 2025

https://github.com/thanhleviet/vietnam_antibiotics_bidding

This repo contains data of bidding for multiple drugs and antibiotics reported to Vietnam Ministry of Health in 2015, 2016, 2017.

antibiotics data vietnam

Last synced: 23 Feb 2026

https://github.com/jhpoelen/bees

Content-based iDigBio prototype

biodiversity data ecololgical informatics provenance

Last synced: 18 Mar 2026

https://github.com/axetroy/stone

build data stuck like a stone, Sturdy!

axetroy data stone stuck

Last synced: 04 Jul 2025

https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy

This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.

data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit

Last synced: 19 Apr 2026

https://github.com/mikeschinkel/go-testdata-defaulter

Simple package for Go to set table-driven test data defaults so that tables in tests only need include data that differs from defaults.

data defaults package testing tests

Last synced: 13 Oct 2025

https://github.com/petzi53/repairdata

Open Repair Alliance Datasets 2021

data open-data open-datasets r repair repair-cafe repairs

Last synced: 22 Jun 2026

https://github.com/tttardigrado/fq

Graffs for the MEDEA project

bokehplots data data-science dataanalysis pandas physics python3

Last synced: 12 Apr 2026

https://github.com/yash-chauhan-dev/sf_analytics

Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.

aiml analytics data dbt langchain llm python snowflake streamlit

Last synced: 07 May 2026

https://github.com/denisecase/620-mod6-web-scraping

Notes on how to get started scraping content from the web

beautifulsoup4 data mining python

Last synced: 11 Apr 2025

https://github.com/intersystems-ib/workshop-smart-data-fabric

Learn the main ideas involved in developing a Smart Data Fabric using InterSystems IRIS

analytics data datafabric interoperability smart

Last synced: 14 Apr 2026

https://github.com/yagoluiz/enem-analise-extracao

[PT-BR] Extração e análise de dados do desempenho da região Centro-Oeste

analysis data extraction python3 r

Last synced: 17 Apr 2026

https://github.com/jigyasag18/project-diwali-sales-analysis

This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.

analysis data datapr datapro eda jupyter-notebook python realtimedata

Last synced: 01 Jun 2026

https://github.com/poissonconsulting/klexdatr

An R package of data from the Kootenay Lake Exploitation Study

cran data fish kootenay-lake rstats

Last synced: 16 Oct 2025

https://github.com/fatihilhan42/nba-players-data-1950-to-2021

In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.

data data-analysis data-engineering data-science data-visualization

Last synced: 16 Oct 2025

https://github.com/hlan22/2025-03-18-data-validation

(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:

data validation

Last synced: 23 Jun 2026

https://github.com/enoch208/eventmaster

A user-friendly application that helps you easily record and play back your keyboard and mouse actions. With its modern design using `tkinter` and `ttkthemes`, it provides a smooth and easy-to-use interface. The app combines reliable technical features to give you a great experience.

automation data key keylogging-python replay spy tools

Last synced: 01 Jun 2026

https://github.com/jneidel/animal-names

Dataset of 100 common animal names

animals data dataset json names opendata

Last synced: 25 Mar 2025

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 10 Mar 2026

https://github.com/zoetrope69/website

:tada: my website

data javascript personal

Last synced: 12 Jun 2025

https://github.com/bkataru/spotigo

AI-powered local music intelligence platform with a task runner server core to retrieve and backup spotify account data to storage(s) at set periodic intervals

ai backup cron data go intelligence local-llm music ollama rag runner spotify task-runner tool-calling

Last synced: 16 Jan 2026

https://github.com/octoenergy/tentaclio-snowflake

A python project containing all the dependencies for snowflake tentaclio schema.

data

Last synced: 20 Oct 2025

https://github.com/zanysoft/virtualcolumn

Laravel virtual column

data laravel virtual-column

Last synced: 12 Apr 2026

https://github.com/team-hydrogen/nasa-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 25 Mar 2025

https://github.com/ournet/ournet.web.data

Ournet web data module

data ournet web

Last synced: 04 Apr 2025

https://github.com/andrewl/danelaw

Geopackage containing the boundary of the Danelaw

data geospatial medieval viking

Last synced: 23 Jan 2026

https://github.com/sankooc/validatez

object validation for node

data validate

Last synced: 13 May 2026

https://github.com/harmanveer-2546/reducing-data-entries

Way to delete data entries from csv/excel file using. For excel file, use excel instead of csv in the code.

csv data data-entry delete-data excel numpy pandas python

Last synced: 05 May 2026

https://github.com/ellisgl/geeklab-arraytranslation

Convert an array to another data format or convert a data format to an array.

array data format php php7-2 php72

Last synced: 25 Mar 2025

https://github.com/mikeasilva/api_data

API Data makes working with open data APIs easy.

api data python

Last synced: 23 Jan 2026

https://github.com/prajjwol09/power-bi-project

The Data Survey Breakdown is an interactive Power BI dashboard designed to present insights gathered from a survey of professionals and enthusiasts in the data industry.

dashboard data interactive powerbi survey

Last synced: 15 Mar 2026

https://github.com/alextanhongpin/node-github-api

:page_with_curl: sample github api queries with nodejs for scraping purposes

data github-api nodejs

Last synced: 06 May 2026

https://github.com/tomquirk/sunshine-coast-council-rates-data

Rates data for the Sunshine Coast, Australia

australia data property rates real-estate

Last synced: 24 Feb 2026

https://github.com/encelo/wetpaper-data

Data files for the WetPaper project

data icons ncine

Last synced: 23 Jan 2026

https://github.com/fatihemres/pinch

File reader app with SwiftUI. Using data and models.

data models swift swiftui

Last synced: 17 May 2026

https://github.com/zainea-bogdan/data_engineer_project_wowcinema

WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end Data Infrastructure. A ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting

analytics big-data data datawarehousing etl-pipeline postgres python sql

Last synced: 19 May 2026

https://github.com/raulmaulidhino-dev/ml_modelling_regression

There are many factors that influence the grades/scores of students. One of the factors is study hours. In this mini analysis project, there are 3 models that will learn and predict the relation between study hours of students and their scores in an exam/test. This project will result the best ML model to solve the problem.

data data-analysis-python data-science eda machine-learning scikit-learn

Last synced: 28 Jan 2026

https://github.com/mfurmanczyk/wh-sales

E-commerce analytics data warehouse ETL made with Apache Spark.

airflow data data-engineering data-warehouse kotlin python spark

Last synced: 24 Jan 2026

https://github.com/robertoostenveld/dccn.dsc_3015055.00_583_v1

The FieldTrip-SimBio Pipeline for EEG Forward Solutions [Data set].

data datalad open-data

Last synced: 24 Jan 2026

https://github.com/semcod/code2llm

Python Code Flow Analysis Tool - Static analysis for control flow graphs (CFG), data flow graphs (DFG), and call graph extraction

ast cfg code code2data code2logic code2process data dfg diagram flow graphs llm

Last synced: 01 Jun 2026

https://github.com/eugenedakin/des-encryption-decryption

Encrypt and Decrypt text in Xojo using DES - Written in Native Xojo Language - Cross Platform

data data-encryption-standard decryption des encryption standard xojo

Last synced: 24 Feb 2026

https://github.com/bishtrishu/pizza_sales_analysis_dashboard_sql_bi

Welcome to the Pizza Sales Analysis Dashboard project! This repository contains a comprehensive guide to building an interactive and insightful dashboard for analyzing pizza sales data using SQL and Power BI.

data data-science dataanalyst datavisualization dax dax-query microsoft microsoft-azure microsoft-sql-server msexcel mysql powerbi powerquery project sql

Last synced: 16 Mar 2026

https://github.com/wolfchamane/data-sandbox

Sandbox tool for Front-end developments.

data database front-end nodejs npm rest sandbox tool

Last synced: 28 Oct 2025

https://github.com/cmdrvl/rvl

rvl reveals the smallest set of numeric changes that explain what actually changed between two datasets — or confidently tells you nothing changed.

cli csv data data-quality data-validation diff finance numerical-analysis open-source ops rust tooling

Last synced: 25 Feb 2026

https://github.com/spatialcurrent/go-flat

Recursively flatten a slice of slices.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/nasa-pds/nucleus

Nucleus is a software platform used to create workflows for the Planetary Data (PDS).

data ingestion pds planetary workflow

Last synced: 06 Feb 2026

https://github.com/bearaujus/bdatamatrix

Structured Tabular Data Management in Go

data go golang matrix

Last synced: 30 Jan 2026

https://github.com/wraith13/systematic-metasyntactic-variables

This is a list for that you can express the existence of different serieses when using metasyntax variables.

data

Last synced: 14 Jun 2025

https://github.com/ludwing-mj/manipulacion_ej

Ejercicio utilizado en la seccion numero ocho del manual para ejemplificar las herramientas proporcionadas por el tydyverse para la manipulacion de datos.

data manipulate-data package r

Last synced: 01 Apr 2025

https://github.com/bubblymaps/bubblymaps

The open source bubbler map. Mapping the world's water fountains. Open Code, Open Data.

bubbler bubbly-maps data fountain map open-source water

Last synced: 31 Jan 2026

https://github.com/fgazzelloni/20240930-dwpwr

Data Wrangling Practice with R - 30 September Tutorial for R-Ladies Rome

data data-science data-structures data-wrangling

Last synced: 28 Jun 2026

https://github.com/natanast/euroleaguebasketball

An R package providing data on Euroleague Basketball

data data-science package r

Last synced: 01 Apr 2025

https://github.com/giuleo129/dataanalysis

This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.

data data-analysis data-science statistical-learning

Last synced: 25 Jan 2026

https://github.com/matt-dray/draytasets

:1234::disguised_face: Miscellaneous datasets I've collected or prepared

card-games data phd pokemon

Last synced: 09 Feb 2026

https://github.com/dysnomia-studio/achieve-games-dump

Dump parts of achieve.games database to public including Steam Games List

data dump games steam steam-api steam-game steam-games

Last synced: 27 Feb 2026

https://github.com/beriberikix/senml-zephyr

A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr

codec data iot senml sensor zephyr-rtos

Last synced: 24 Mar 2025

https://github.com/infinitode/pyautoplot

PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.

analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python

Last synced: 16 Mar 2025

https://github.com/ppabam/eda-bam

Navigating data from one thing to another.

cli data eda python

Last synced: 11 Feb 2026

https://github.com/anandanraju/power_bi_dashboard_projects

The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.

amazon dashboard data data-visualization healthcare powerbi project

Last synced: 11 Feb 2026

https://github.com/jsanz/kart-test

Testing Kart repository

data geospatial kart

Last synced: 26 Jan 2026

https://github.com/pawamoy/keycut-data

Keyboard shortcuts data stored in YAML files

data keyboard-shortcuts

Last synced: 12 Feb 2026

https://github.com/bishtrishu/super_store_sales_dashboard

This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by utilizing data analysis technique, specially focusing on time series analysis, to provide valuable insights and accurate sales forecasting.

analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql

Last synced: 28 Feb 2026

https://github.com/petzi53/repair

R Datasets of the Open Repair Alliance (ORA).

data r repair repair-cafe

Last synced: 19 May 2026

https://github.com/e-kotov/albofr-data-archive

Tiger Mosquito Colonisation in France data

aedes-albopictus colonisation data france tiger-mosquito

Last synced: 23 May 2026