An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/charon25/weatherdata

17 000 weather measurements collected by a weather station created for a college project.

csv data dataset datasets json measurements strasbourg weather weather-data

Last synced: 16 Jan 2026

https://github.com/gabrielcsapo/bluse

⚗️ blend and fuse data with ease

data normalize utility

Last synced: 15 Mar 2025

https://github.com/simonbolivarpy/vault-decode-py

Simple Tools for decode crypto data, from extensions wallet, Metamask, Ronin, TrustWallet, TronLink(old), etc.

data decode decrypt metamask passwords python ronin salt tronlink trustwallet vault

Last synced: 15 Mar 2025

https://github.com/csoren66/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 03 Mar 2025

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/randomfractals/unfolded-map-snippets

Html, CSS, JavaScript, and Python 🐍 vscode snippets ✂️ extension for Unfolded Map 🗺️ and Data SDKs

code data extension map sdk snippets template unfolded vscode

Last synced: 08 May 2026

https://github.com/beastbytes/postal-code-data-php

Implementation of PostalCodeDataInterface using PHP file storage

data php postal-code yii3

Last synced: 27 Feb 2026

https://github.com/vlamug/ratibor

Ratibor is a service for making metrics from data

data metrics prometheus

Last synced: 10 Mar 2026

https://github.com/raphaellaude/usaschooldata

Cleaned and accessible school enrollment data for US schools

data duckdb duckdb-wasm education object-storage oss wasm

Last synced: 12 May 2026

https://github.com/barbosa89/vue-table

A classical data table component in VueJS and Bootstrap 4, optimized for Laravel applications.

bootstrap4 data datatable javascript laravel php table vuejs

Last synced: 11 Apr 2026

https://github.com/officialxviid/gloogia

👓 Make your big ideas come true by building real projects using real data 🌎

api build data gloogia projects xviid

Last synced: 05 Jan 2026

https://github.com/bzekeria/quran_dataset

The Holy Quran (Islam) Dataset

data islam quran religion

Last synced: 12 Feb 2026

https://github.com/soenneker/soenneker.dtos.requestdataoptions

A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.

controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions

Last synced: 12 Mar 2026

https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data

This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.

big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/ersinkoc/minote

Minimal Notation for LLMs

data llm notation token

Last synced: 21 Feb 2026

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026

https://github.com/g3th/fit_file_decoder

Decodes '*.fit' files and returns readable values.

bytes data decoder fit-file hex parsing

Last synced: 30 Jun 2025

https://github.com/namratha2301/sales-orders-analysis

Wanted to experiment with Looker. This dashboard visualizes sales trends across regions, customer segments, and product categories.

business-analytics dashboard data dataanalysis datavisualization excel looker looker-studio

Last synced: 13 Feb 2026

https://github.com/sumaiyyaf/british-airline-dashboard

This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.

analysis dashboard data tableau visualization

Last synced: 13 Feb 2026

https://github.com/filiprokita/tobase64

This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.

base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64

Last synced: 30 Jun 2025

https://github.com/j0a0m4/olympics

Final Project for Data Engineering Accelerated LATAM

data olympics spark

Last synced: 13 Feb 2026

https://github.com/dug22/jjournal

A Jupyter like notebook software for Java

data data-analysis data-science java jshell jshell-repl notebook swing swing-application

Last synced: 11 Apr 2026

https://github.com/gdcmarinho/vaultchat

VaultChat is a end-to-end encryption chat service

chat data e2ee encrypted messaging privacy

Last synced: 23 Mar 2025

https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia

Creating a project about Wikimedia using Kafka involves building a system that leverages Apache Kafka for data streaming and processing related to Wikimedia data.

async broker communication data java kafka message real-time real-time-analytics springboot wikimedia

Last synced: 14 May 2026

https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends

This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.

analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization

Last synced: 16 Feb 2026

https://github.com/infinitode/pywebscrapr

An open-source Python web scraping tool. Supports both image scraping and text scraping.

data data-collection data-science open-source pip scraping web-scraper

Last synced: 14 Feb 2026

https://github.com/fuzzt/location-analyzer

The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.

analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack

Last synced: 11 Apr 2026

https://github.com/sushmashreeps/python

This repository showcases a comprehensive Python project, demonstrating expertise in backend development, data analysis, and machine learning. Built with Python 3.x, the project utilizes popular libraries like Django, Flask, NumPy, pandas, and scikit-learn. The project features efficient data processing, robust API integration, and scalable archite

api data data-science dataanalysis datavisualization game gamedeveloment python

Last synced: 12 May 2026

https://github.com/svetlanam/etl-transformation

ETL data cleaning and transformation for specific use case in own Keboola project

cleaning data etl keboola python rest-api transformation

Last synced: 20 Jun 2026

https://github.com/oliver021/helppad-net

Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities

assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks

Last synced: 15 Jun 2026

https://github.com/guilyx/airplane-booking

Simple airline ticket reservation program.

algorithms data linked-list

Last synced: 25 Jun 2025

https://github.com/writetome51/page-load-access

A TypeScript/Javascript class that loads a batch (array) of data from a larger set too big to be loaded all at once.

batch class data javascript load loader typescript

Last synced: 16 May 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/writetome51/public-data-container-interface

Just a TypeScript interface with 1 property: 'data'

container data interface typescript

Last synced: 15 May 2026

https://github.com/sakshamarora07/whatsapp-chat-analyser

This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.

chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp

Last synced: 04 Jan 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/sourceduty/cults_3d

🔢 Software concept for additional statistics from Python for Cults design data .csv files.

3d 3d-model 3d-model-software 3d-modelling account account-management concept cults cults-3d data idea sourceduty

Last synced: 08 Aug 2025

https://github.com/florianreuth/pit

pit - the private information tracker

data java passwords security vault

Last synced: 28 Feb 2026

https://github.com/kayahr/datastream

Data stream classes for writing and reading all kinds of data types, even single bits

data datastream input output stream typescript

Last synced: 01 Aug 2025

https://github.com/mierune/tinybufr

[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.

bufr data meteorology rust weather wmo

Last synced: 15 May 2025

https://github.com/lijesh010/roadaccidentanalysisproject

This data analysis project was completed using MS Excel, and includes the creation of a dashboard.

data data-analytics data-exploration data-visualization msexcel

Last synced: 15 Feb 2026

https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project

This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.

data machine-learning-algorithms pandas pymysql python sql

Last synced: 11 Apr 2026

https://github.com/itrauco/data-dirtying-tool

a simple command line tool to generate dirty data and do common data things in google cloud

data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning

Last synced: 24 Feb 2025

https://github.com/mochsyahrizal/jkfkjabar_studycase

First Data Analytics Study Case

data datanalytics studycase

Last synced: 15 Feb 2026

https://github.com/gourab337/karnataka-health-visualizer

Visualizer for Karnataka's district-wise healthcare info built using PHP

analytics data

Last synced: 19 Mar 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/petzi53/repairdata

Open Repair Alliance Datasets 2021

data open-data open-datasets r repair repair-cafe repairs

Last synced: 22 Jun 2026

https://github.com/davidkhala/datasets

sample datasets

data

Last synced: 19 Mar 2026

https://github.com/basemax/okala-product-ids

A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.

crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/caiorss/julia-box-docker

Docker that provides a development environment for Julia language, Octave, Python, R (Rlang) with a Jupyter Notebook; Jupyter QtConsole and so on.

data datascience deveops docker julia jupyter octave python rlang scientific

Last synced: 09 May 2026

https://github.com/oniani/miniframe

Minimal data frames with relational algebra

data dataframe-library haskell haskell-library library

Last synced: 04 Mar 2025

https://github.com/mnazlukhanyan/da-projects

Портфолио с работами по аналитике данных, показывающие мои навыки, умения и опыт

data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization

Last synced: 11 Apr 2026

https://github.com/agustinmusanti/sqlchallenge-7

Resolución de un extenso desafío de SQL propuesto por el profesor Diego Moisset De Espanes, quien comparte ejercicios para aprender y practicar SQL Server a través de su canal de YouTube.

challenge data learning sqlserver

Last synced: 15 Apr 2025

https://github.com/wlgs/got-dialogues-data-stats

Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.

data game-of-thrones got r statistics stats

Last synced: 22 May 2026

https://github.com/pythoncoderunicorn/tool-discography

Music Band TOOL albums and songs dataset

data data-science metal-music music r songs

Last synced: 26 May 2026

https://github.com/pawlo77/nos_snowflake

Network Operating Systems course for DS studies in Winter 2024/25

azure data data-science snowflake snowpark streamlit

Last synced: 09 May 2026

https://github.com/badranalyst/covid-deaths-dashboard-with-tableau

This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.

covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization

Last synced: 02 Mar 2026

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/mohamedbilal1800/olympic_history_data_analysis

This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.

analysis data eda matplotlib-pyplot pandas python seaborn visulaization

Last synced: 09 May 2026

https://github.com/open-geodata/sp_bh_pcj-2020-2035

Dados Espaciais da Agência das Bacias PCJ, com informações apresentadas no Plano de Bacias 2020-2035

data python

Last synced: 16 Jan 2026

https://github.com/soenneker/soenneker.data.zipcode

US ZIP code data from USPS, updated daily

code csharp data dotnet usps zip

Last synced: 02 Mar 2026

https://github.com/nicholas-owen/rdm-calendar

A small utility to manage conference and event information

calendar conference data event research

Last synced: 26 May 2026

https://github.com/karashiiro/lodestone-character-data-scraper

Lodestone character data scraper.

data ffxiv ffxiv-character lodestone

Last synced: 23 Apr 2026

https://github.com/ahmadjamil888/ink-flow-share

A medium clone with all basic features such as blog generation , auth and history and user data

articles blogs cs data flow herald ink ink-flow-share journalism medium post react shad shadcn share users vite

Last synced: 09 Apr 2026

https://github.com/coderjolly/spotify-api-data-analysis

The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.

airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql

Last synced: 27 Mar 2026

https://github.com/inzhenerka/scooters_data_generator

Generate data of scooter trips for analysis

data dbt generator

Last synced: 02 Jun 2026

https://github.com/andrii04/ga4-gcs-to-bigquery-etl

Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.

automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql

Last synced: 18 May 2026

https://github.com/skysign/dat

데이터분석을 함께 공부하는 스터디입니다.

data data-analysis data-science

Last synced: 02 Jan 2026

https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic

Database of transdiagnostic treatment of depression and anxiety

data

Last synced: 01 Apr 2026

https://github.com/0xbitx/dedsec_pastebin-cli

allows you to manage your pastes directly from the terminal

code data paste pastebin payload

Last synced: 25 Jan 2026

https://github.com/jillmpla/kaggle_notebooks

Kaggle-based data analysis, data science, and data visualization.

data data-science data-visualization kaggle machine-learning

Last synced: 16 Apr 2026

https://github.com/sharoonjoseph321/insurance_fraud_detection

Fraud Detection using machine learning algorithm-KN Neighbors .Data exploration using Pyspark and matplotlib.

analytics data data-science eda high-performance knn-algorithm knn-classification machine-learning matplotlib-pyplot pyspark python seaborn spark statistics

Last synced: 23 Mar 2025

https://github.com/bonnevoyager/quick-storage

Simple key/value storage module with persistency.

browser data fs indexeddb javascript key-value nodejs persistence quick server storage

Last synced: 16 Apr 2026

https://github.com/cityofnewyork/nyco-wp-open-data-transients

Interface for saving Open Data endpoints as WordPress Transients. Maintained by @NYCOpportunity

civic-tech composer data nycopportunity open-data plugin transients wordpress

Last synced: 10 Apr 2026

https://github.com/ashakoen/bls-data-extract

This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.

data government sqlite us

Last synced: 01 Apr 2026

https://github.com/hit07/fitgpt-hacksc

AI-Powered Fitness Coach; 🥈 Runner up at HackSC's SoCal Tech Week hackathon

data elasticsearch gpt-4o-mini llm pipeline

Last synced: 28 Feb 2025

https://github.com/jigyasag18/power-bi-dashboard-project

The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.

dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization

Last synced: 04 Mar 2026

https://github.com/arjunrao87/world-countries-graphql-api

GraphQL API for retrieving information about countries of the world

countries data database geographic-data geography graphql world

Last synced: 10 May 2026