An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/softloud/spunk

Nutritional interventions for male infertility: a systematic review and meta-analysis

cochrane data evisynth living

Last synced: 18 Mar 2026

https://github.com/os-climate/data-requests

This repo is used to track issues related to new Data Requests

data data-engineering dataset

Last synced: 27 Feb 2026

https://github.com/paladini/aa-daily-reflections-database

Alcoholics Anonymous (AA) Daily Reflections in English, Spanish, French and Brazilian Portuguese

aa alcoholics-anonymous daily-reflections data database reflections

Last synced: 16 Apr 2026

https://github.com/abhinavrobinson/mc-community-world

Minecraft community world data.

data minecraft server world

Last synced: 27 Feb 2026

https://github.com/vatshayan/songs-datasets

Datasets for Songs and Music for Dancing, Emotional, Happy and scenic view

1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs

Last synced: 18 Mar 2026

https://github.com/utrechtuniversity/momentum-dataflow

Repository for publishing website about data management practices of the Momentum project

data datageneration datamanagement

Last synced: 27 Feb 2026

https://github.com/bastianolea/sicvir_indicadores_rurales

Sistema de Indicadores de Calidad de Vida Rural (Sicvir)

chile comunas data estado rural social

Last synced: 27 Feb 2026

https://github.com/miozilla/snowden

snowden :snowman::video_game: : VR Game # Snowflake # Data Engineering # ELT

data elt engineering snowflake sql vr-game

Last synced: 11 Feb 2026

https://github.com/sweta-kaundilya/power-bi-learning-projects

This repository contains completed exercises while learning Power BI

data datavisualization dax powerbi powerquery

Last synced: 27 Feb 2026

https://github.com/ppabam/eda-bam

Navigating data from one thing to another.

cli data eda python

Last synced: 11 Feb 2026

https://github.com/anandanraju/power_bi_dashboard_projects

The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.

amazon dashboard data data-visualization healthcare powerbi project

Last synced: 11 Feb 2026

https://github.com/praveendecode/retail-revenue-forecasting

Designed an end-to-end ML model pipeline, forecasting department-wide sales by accounting for holiday markdown effects, spanning data collection to inferencing.

azure collection data datapreprocessing docker exploratory-data-analysis feature-engineering featureimportance model modelbuilding modeldeployment modelselction python report tableau

Last synced: 16 Apr 2026

https://github.com/pbinkley/tweets-national-emergency-library

A twarc harvest of tweets related to Internet Archive's National Emergency Library (2020-03-23 to 2021-02-13)

data social

Last synced: 11 Feb 2026

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 16 Apr 2026

https://github.com/project-renard/test-data

Files for testing

data

Last synced: 27 Feb 2026

https://github.com/afeiship/next-object-operator

Object set/get/sets/gets and other operator.

data get gets next operator set sets store

Last synced: 27 Feb 2026

https://github.com/khalyomede/request

Function to validate request data for V.

data function request validate vlang

Last synced: 12 Feb 2026

https://github.com/beastbytes/postal-code-data-php

Implementation of PostalCodeDataInterface using PHP file storage

data php postal-code yii3

Last synced: 27 Feb 2026

https://github.com/vianneymi/amplifai

Amplifai is a package that allows you to transform your raw unstructured text into structured data in a few lines of codes.

data data-mining extraction langchain llm pydantic

Last synced: 27 Feb 2026

https://github.com/kirillsemyonkin/lsd

LSD (Less Syntax Data) configuration/data transfer format.

configuration data java parsing rust

Last synced: 27 Feb 2026

https://github.com/bzekeria/quran_dataset

The Holy Quran (Islam) Dataset

data islam quran religion

Last synced: 12 Feb 2026

https://github.com/pawamoy/keycut-data

Keyboard shortcuts data stored in YAML files

data keyboard-shortcuts

Last synced: 12 Feb 2026

https://github.com/soenneker/soenneker.dtos.requestdataoptions

A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.

controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions

Last synced: 12 Mar 2026

https://github.com/foundationallm/.github

A platform accelerating delivery of secure, trustworthy enterprise copilots.

agent ai data enterprise generative-ai large-language-model llm ml tool

Last synced: 12 Feb 2026

https://github.com/bishtrishu/super_store_sales_dashboard

This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by utilizing data analysis technique, specially focusing on time series analysis, to provide valuable insights and accurate sales forecasting.

analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql

Last synced: 28 Feb 2026

https://github.com/jeswr/blog

My personal blog

ai blog data semantics solid web

Last synced: 13 Feb 2026

https://github.com/namratha2301/sales-orders-analysis

Wanted to experiment with Looker. This dashboard visualizes sales trends across regions, customer segments, and product categories.

business-analytics dashboard data dataanalysis datavisualization excel looker looker-studio

Last synced: 13 Feb 2026

https://github.com/sumaiyyaf/british-airline-dashboard

This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.

analysis dashboard data tableau visualization

Last synced: 13 Feb 2026

https://github.com/j0a0m4/olympics

Final Project for Data Engineering Accelerated LATAM

data olympics spark

Last synced: 13 Feb 2026

https://github.com/krishkumar/scrobbles

all the music 🎸

data music scrobble

Last synced: 13 Feb 2026

https://github.com/infinitode/pywebscrapr

An open-source Python web scraping tool. Supports both image scraping and text scraping.

data data-collection data-science open-source pip scraping web-scraper

Last synced: 14 Feb 2026

https://github.com/sanand0/iss-location

Tracks the International Space Station position. A demo of how to use GitHub Actions to schedule commits weekly.

data

Last synced: 14 Feb 2026

https://github.com/imartinezl/madrid-challenge

Madrid Route Optimization Challenge 🚚♻️🚚

challenge city data optimization routing-algorithm traffic

Last synced: 28 Feb 2026

https://github.com/e-kotov/albofr-data-archive

Tiger Mosquito Colonisation in France data

aedes-albopictus colonisation data france tiger-mosquito

Last synced: 23 May 2026

https://github.com/molinsagustin/cinedata

# CineData Trabajo práctico grupal para la materia Ingeniería de Datos I en la Universidad Argentina de la Empresa. El mismo consistió en el desarrollo de una base de datos relacional en Microsoft SQL Server Managment Studio utilizando metodología Ágil SCRUM, que se utilizó desde el relevamiento de requisitos hasta la implementación final.

agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver

Last synced: 28 Feb 2026

https://github.com/sunnahboy/checkfake_true_news

Building data structures using Linked lists and arrays and find best algorithms for implementing a system for detecting Fake News

algorithms data level low programming structure

Last synced: 28 Feb 2026

https://github.com/gusenov/open-data-scripts

Scripts to explore public datasets. Скрипты для работы с открытыми данными.

charts data data-visualisation data-visualization datavisualization highcharts kazakhstan open-data opendata qazaqstan

Last synced: 28 Feb 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/florianreuth/pit

pit - the private information tracker

data java passwords security vault

Last synced: 28 Feb 2026

https://github.com/lijesh010/roadaccidentanalysisproject

This data analysis project was completed using MS Excel, and includes the creation of a dashboard.

data data-analytics data-exploration data-visualization msexcel

Last synced: 15 Feb 2026

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/mochsyahrizal/jkfkjabar_studycase

First Data Analytics Study Case

data datanalytics studycase

Last synced: 15 Feb 2026

https://github.com/gourab337/karnataka-health-visualizer

Visualizer for Karnataka's district-wise healthcare info built using PHP

analytics data

Last synced: 19 Mar 2026

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/arnocan/yapydata

The yapydata provides miscellaneous low-level Python data access APIs.

data datastructures ini json properties python python2 python3 xml yaml

Last synced: 16 Feb 2026

https://github.com/davidkhala/datasets

sample datasets

data

Last synced: 19 Mar 2026

https://github.com/soenneker/soenneker.attributes.mapto

A C# attribute for generic data mapping translation

attributes columns csharp data datatables dotnet mapping mapto maptoattribute object

Last synced: 02 Mar 2026

https://github.com/badranalyst/covid-deaths-dashboard-with-tableau

This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.

covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization

Last synced: 02 Mar 2026

https://github.com/j2kun/terrorism-usa-post-9-11

A copy of the terror data published by NewAmerica

data politics terrorism transparency

Last synced: 02 Mar 2026

https://github.com/soenneker/soenneker.data.zipcode

US ZIP code data from USPS, updated daily

code csharp data dotnet usps zip

Last synced: 02 Mar 2026

https://github.com/anuppm9917/data-processing-and-csv-to-json-using-python-project

This project guides you through processing data from CSV to JSON format using Python. You'll learn to cleanse, validate, and transform data with pandas, numpy, csv, and json libraries, ensuring it's ready for POS system integration. This will help improve data integrity and streamline integration.

csv-files data data-analysis data-cleaning data-collection data-transformation data-validation python3 transformation

Last synced: 16 Apr 2026

https://github.com/coderjolly/spotify-api-data-analysis

The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.

airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql

Last synced: 27 Mar 2026

https://github.com/nagar2nd/financial-analysis-power-bi

This project analyzes financial and credit card usage data using Power BI and DAX, focusing on customer behavior, credit risk, and financial performance. It includes insights on spending trends, delinquency rates, churn indicators, and satisfaction scores to drive better financial management and customer retention strategies.

analysis data dax dax-functions dax-query excel powerbi

Last synced: 03 Mar 2026

https://github.com/inzhenerka/scooters_data_generator

Generate data of scooter trips for analysis

data dbt generator

Last synced: 02 Jun 2026

https://github.com/shubhamsoni98/excel-practice

Excel-Practice-Questions

analysis data excel formula raw-data xlsx

Last synced: 03 Mar 2026

https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic

Database of transdiagnostic treatment of depression and anxiety

data

Last synced: 01 Apr 2026

https://github.com/jillmpla/kaggle_notebooks

Kaggle-based data analysis, data science, and data visualization.

data data-science data-visualization kaggle machine-learning

Last synced: 16 Apr 2026

https://github.com/anuraganalog/onyx-data

BI Visualizations to the problems in website. All the Visualization can be found at the below link

data onyx public tableau viz

Last synced: 02 Apr 2026

https://github.com/bonnevoyager/quick-storage

Simple key/value storage module with persistency.

browser data fs indexeddb javascript key-value nodejs persistence quick server storage

Last synced: 16 Apr 2026

https://github.com/erickpeirson/jhb-data

Data from the forthcoming paper: Quantitative Perspectives on Fifty Years of the Journal of the History of Biology

data geolocation history-of-biology named-entity-recognition topic-modeling

Last synced: 04 Mar 2026

https://github.com/ashakoen/bls-data-extract

This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.

data government sqlite us

Last synced: 01 Apr 2026

https://github.com/thomasjewson/cci-data-science-textbook

This is a short, interactive textbook aimed at introducing data science to non-IT university undergraduates. Funded by Erasmus+.

data data-science learning python textbook

Last synced: 16 Apr 2026

https://github.com/fastpix/android-data-bitmovin

FastPix Video Data SDK to monitor and analyze video playback metrics within Bitmovin for android

analytics android-sdk bitmovin data fastpix metrics player sdk video

Last synced: 16 Apr 2026

https://github.com/jigyasag18/power-bi-dashboard-project

The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.

dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization

Last synced: 04 Mar 2026

https://github.com/ksimicevic/discord-message-analyzer

Analyzing discord messages in Jupyter notebook

analysis data discord messages

Last synced: 16 Apr 2026

https://github.com/zelon88/motorized_bike_data

A repo to contain data in various formats related to motorized bicycle configurations.

bicycle bikes data data-set engine w

Last synced: 05 Mar 2026

https://github.com/arjunrao87/world-countries-graphql-api

GraphQL API for retrieving information about countries of the world

countries data database geographic-data geography graphql world

Last synced: 10 May 2026

https://github.com/jameshenderson12/data-lists

This respository contains lists of useful data that can be used in a variety of projects.

countries data list names scottish text

Last synced: 05 Mar 2026

https://github.com/sehgal-vishal/world-population-

World Population Sql Analysis

data dataanalysis population sql

Last synced: 05 Mar 2026

https://github.com/udhaya2823/microsoft---classifying-cybersecurity-incidents-with-machine_learning

🚨Microsoft: Classifying Cybersecurity Incidents with Machine Learning🔐 This project leverages the power of Machine Learning to classify cybersecurity incidents 🚨, improving the efficiency of Security Operation Centers (SOCs) at Microsoft. We train a model to predict incident grades, helping analysts prioritize threats with precision🎯.

classification data feature-engineering iqr-method machine-learning matplotlib model-evaluation modelselection predictive-modeling python sklearn

Last synced: 17 Apr 2026

https://github.com/derhuerst/uic-codes

UIC country codes.

data dataviz i18n transit

Last synced: 05 Mar 2026

https://github.com/amethyst-php/collection

Simple as the name, this package allow you to create collection of other models.

amethyst amethyst-package api collection data laravel

Last synced: 17 Apr 2026

https://github.com/jigyasag18/amazon-prime-power-bi-dashboard

The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.

dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 05 Mar 2026

https://github.com/kaungkhantkyaw1997/mock-schema-generator

A tool for generating mock data and implementations based on schema definitions. Ideal for testing and development.

data generator mock schema testing

Last synced: 05 Mar 2026

https://github.com/jwszolek/accelerated-data-generator

Ultra-fast random data generator. It gives you an ability to generate almost 1M of rows in around second.

bash csv data data-generator generator shell

Last synced: 02 Apr 2026

https://github.com/michael-ljn/cirp-lce-2025

Prospective Global Warming Potential of Australian Low-Emission Hydrogen in a Net-Zero Emission Context

data publication

Last synced: 06 Mar 2026

https://github.com/ashfaqalizardariofficial/databasehelper

A C# database helper library to connect with the database server and perform actions insert, update, delete, select data and select multiple data from the database.

ashfaq-ali-zardari ashfaq-ali-zardari-official data database delete helper insert ms-sql-server multiple select-data server sql-server update

Last synced: 02 Apr 2026

https://github.com/doruirimescu/stateful-data-processor

Resumable, checkpointed item processing with graceful interrupts — subclass and go.

data edl processor python python3 stateful

Last synced: 02 Apr 2026

https://github.com/evyatarmeged/mdg

Data mocking web application built with Python & Flask

csv data flask generate json mocking python sql xml

Last synced: 17 Apr 2026

https://github.com/foreteternelle/pokemonstudiodataapi

The GitHub repository of the Pokémon Studio Data Api

api data fangame

Last synced: 02 Apr 2026

https://github.com/joshuagilgallon/cam-data

Large collection of data about digital cameras

camera data

Last synced: 17 Apr 2026

https://github.com/ffatahillah7/eda-dsf-dibimbing-titanic-accident

Data Science Fair 3.0 Dibimbing Portofolio - Analyctics and Learning from titanic dataset

data numpy pandas python science seaborn

Last synced: 17 Apr 2026

https://github.com/snacks02/wobbling-statistics

Audio equipment statistics using Squiglink data

audio data data-visualization headphones iems speakers squiglink statistics

Last synced: 17 Apr 2026