An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://gitlab.com/pommalabs/htmlark

HtmlArk packs a webpage into a single HTML file: https://htmlark-docs.pommalabs.xyz/

audios css data embed fonts html images javascript uri videos

Last synced: 03 Sep 2025

https://github.com/jigyasag18/amazon-prime-power-bi-dashboard

The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.

dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 05 Mar 2026

https://github.com/kaungkhantkyaw1997/mock-schema-generator

A tool for generating mock data and implementations based on schema definitions. Ideal for testing and development.

data generator mock schema testing

Last synced: 05 Mar 2026

https://github.com/jwszolek/accelerated-data-generator

Ultra-fast random data generator. It can generate almost 1M rows in about a second.

bash csv data data-generator generator shell

Last synced: 02 Apr 2026

https://github.com/smaug6739/data-bit

This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.

bits data nodejs

Last synced: 14 May 2026

https://github.com/michael-ljn/cirp-lce-2025

Prospective Global Warming Potential of Australian Low-Emission Hydrogen in a Net-Zero Emission Context

data publication

Last synced: 06 Mar 2026

https://github.com/ashfaqalizardariofficial/databasehelper

A C# database helper library that connects to a database server and performs insert, update, delete, select, and multi-record select operations.

ashfaq-ali-zardari ashfaq-ali-zardari-official data database delete helper insert ms-sql-server multiple select-data server sql-server update

Last synced: 02 Apr 2026

https://github.com/rahult18/atmo-flow

AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes.

airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark

Last synced: 23 Jun 2026

https://github.com/doruirimescu/stateful-data-processor

Resumable, checkpointed item processing with graceful interrupts — subclass and go.

data edl processor python python3 stateful

Last synced: 02 Apr 2026

https://github.com/turner-kendall/turner-kendall

Turner Kendall - dev, ops, sec.

config data github-config go rust security

Last synced: 31 Oct 2025

https://github.com/pbinkley/mfmcollections

Project to distill data about published collections of microfilms from library lists

data research retro

Last synced: 28 May 2026

https://github.com/joshuagilgallon/cam-data

Large collection of data about digital cameras

camera data

Last synced: 17 Apr 2026

https://github.com/mohamedbilal1800/olympic_history_data_analysis

This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.

analysis data eda matplotlib-pyplot pandas python seaborn visulaization

Last synced: 09 May 2026

https://github.com/snacks02/wobbling-statistics

Audio equipment statistics using Squiglink data

audio data data-visualization headphones iems speakers squiglink statistics

Last synced: 17 Apr 2026

https://github.com/amethyst-php/attendance

Indicate the attendance/absence of an employee in a defined office with a range of dates

amethyst amethyst-package api attendance data laravel

Last synced: 17 Apr 2026

https://github.com/etmendz/mendz.data.sqlserver

Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to SQL Server databases.

ado-net context data database datasettings mendz sql-server

Last synced: 10 May 2026

https://github.com/umrlastig/global-local

The Global-Local loop: bridging the gap between geospatial communities

challenges communities data fusion gaps geospatial perspectives

Last synced: 03 Apr 2026

https://github.com/csoren66/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 03 Mar 2025

https://github.com/madhuresh2011/50-days-sql-challenge

Start a 50-day SQL challenge on the way to SQL mastery and transform how you interact with data!

consistency data data-analytics database problem-solving query question-answering real-world-data sql

Last synced: 03 Jun 2026

https://github.com/cloud-shuttle/drover-sqlforge

The Data Automation Engine. A blazing-fast, pure Go alternative to dbt for data transformations.

ast data drover sql transformation

Last synced: 03 Jun 2026

https://github.com/artcc/coredatademo

Demo for CoreDataGenericModule implementation

core coredata coredata-model data encrypted encrypted-data encryption persist

Last synced: 19 Jun 2026

https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python

This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.

analysis csv data insights matplotlib numpy pandas project python seaborn

Last synced: 06 Apr 2026

https://github.com/vlamug/ratibor

Ratibor is a service for making metrics from data

data metrics prometheus

Last synced: 10 Mar 2026

https://github.com/bhavanachitragar/layoff_analysis

This Streamlit app is designed for Layoff Analysis. It allows users to explore and analyze layoff data from different perspectives, including overall analytics, country-specific insights, and individual company details.

data dataanalysis streamlit streamlit-webapp

Last synced: 18 Apr 2026

https://github.com/zurd46/zurdsynthdatagen

This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).

data data-structures dataset electron json jsonl nodejs openai synthetic

Last synced: 04 Apr 2026

https://github.com/murshidazher/client-side-data-storage

🚌 A workspace containing client-side data storage implementations

cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql

Last synced: 02 Sep 2025

https://github.com/neelamraikwar9/bookdata

This is my first assignment Git repository. I worked with book data and used Express.js to create routes and APIs for Post, Update, Delete, and Get.

api books data database deployment expressjs node nodejs postman postman-api

Last synced: 05 Apr 2026

https://github.com/docuvesta/shiseido_skincare_usa_fr_infographics

Discover the performance indicators tied to reviews of a highly regarded serum from the Japanese luxury beauty brand Shiseido. This comparison covers the USA and FR websites 💯

analysis automatisation data datanalysis graphique infographie pandas plotly python skincare soins

Last synced: 11 Apr 2026

https://github.com/raphaellaude/usaschooldata

Cleaned and accessible school enrollment data for US schools

data duckdb duckdb-wasm education object-storage oss wasm

Last synced: 12 May 2026

https://github.com/jigyasag18/iit-guhawati-final-capstone-project

Smart Dynamic Parking Price Optimization System that adjusts parking fees in real-time based on demand, traffic, and competition. It employs adaptive pricing models and rerouting logic to enhance parking utilization and reduce congestion. The system is visualized via an interactive Streamlit dashboard, enabling users to simulate dynamic pricing.

bokeh bokeh-server bokehplots capstone-project data dataset deployment machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot mlproject normalisation numpy pandas pathway python streamlit

Last synced: 05 Apr 2026

https://github.com/codbex/codbex-hestia-data-sample

Sample data for codbex-hestia

data module sample

Last synced: 05 Apr 2026

https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly

Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool

blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly

Last synced: 18 Apr 2026

https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas

SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.

data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase

Last synced: 18 May 2026

https://github.com/cao7113/datalab

data lab and tools

data tool

Last synced: 18 Apr 2026

https://github.com/mksingh431/free-data-science-courses

Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.

course data data-analysis data-science data-visualization free freecou python

Last synced: 19 Apr 2026

https://github.com/phelipe-sempreboni/certificates

Repository documenting the licenses and certificates I have acquired over time.

certificate certificates certification course data database datascience licences license-management marketing marketing-analytics python sql

Last synced: 16 May 2026

https://github.com/barbosa89/vue-table

A classical data table component in VueJS and Bootstrap 4, optimized for Laravel applications.

bootstrap4 data datatable javascript laravel php table vuejs

Last synced: 11 Apr 2026

https://github.com/ahmad-ali-rafique/decision-tree-classifier-modeling

👏 Comprehensive exploration of decision tree classifiers, including data cleaning, model building 🏩, and performance evaluation on various datasets.

analytics classification classification-models data data-science dataanalytics datacleaning dataset decision-tree-classifier models

Last synced: 20 Apr 2026

https://github.com/henryssondaniel/teacup-java-report-mysql

Report Teacup data to a MySQL database

data logs mysql reports teacup

Last synced: 20 Apr 2026

https://github.com/istinnew/etl-pipeline-ganz-project

End-to-end ETL pipeline project for collecting, transforming, and loading data into a cloud-based database using Python, MySQL, and Google Cloud Analytics

cloud cloud-engineering cloud-services data data-science dataanalytics database database-schema googlecloud mysql mysql-database python python-lambda

Last synced: 20 Apr 2026

https://github.com/stdlib-js/wasm-base-dtype2wasm

Return the WebAssembly data type associated with a provided array data type value.

array base data dtype javascript node node-js nodejs stdlib type types util utilities utility utils wasm webassembly

Last synced: 09 May 2026

https://github.com/anjaliwork20/moodify

Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists and playlists using Machine learning

artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs

Last synced: 20 Apr 2026

https://github.com/arda-guler/binmotion

Convert ANY data to a video file. Sister project of binGallery.

data data-visualization proof-of-concept video

Last synced: 04 Jun 2026

https://github.com/dms-codes/scrape-tokoalvabet-com

Toko Alvabet Data Scraping and Price Comparator. This Python script scrapes data from Toko Alvabet's website and compares prices for the products it finds. It includes features for viewing and analyzing product data, as well as comparing prices with other sellers.

data price python scraping

Last synced: 29 Jul 2025

https://github.com/sebastianbrzustowicz/flight-quality-overview-microservice

Go + Docker. Microservice that uses parallel computation to convert raw vehicle flight data into an overview report with visualisation.

container control csv data docker drone flight go goroutines http microservice parallel-computing pdf quadcopter raport rms sse vehicle

Last synced: 10 May 2026

https://github.com/prashhhant213/data_analysis_and_visualization-_for_streaming_platform

Data analysis and visualization for a streaming platform, providing insights and recommendations to help it grow its user base.

colab-notebook data datavisualization matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/g-schumacher44/analyst_resource_hub

A collection of guidebooks, quickref, and resources for data analysis

analytics bigquery data lookerstudio machine-learning model python sql yaml-configuration

Last synced: 20 Jun 2026

https://github.com/nikoheikkila/maps

A TypeScript collection of specialized map implementations

data javascript maps typescript

Last synced: 20 Apr 2026

https://github.com/juanandres-montero/dataanalysis

Dedicated to data analysis.

costa-rica data

Last synced: 10 Aug 2025

https://github.com/stdlib-js/ndarray-vector-int8

Create a signed 8-bit integer vector (i.e., a one-dimensional ndarray).

constructor ctor data int8 javascript ndarray node node-js nodejs stdlib structure types vec vector

Last synced: 24 Apr 2026

https://github.com/mozzo1000/web-analytics

Website analysis tools and data

analysis analytics data website

Last synced: 21 Apr 2026

https://github.com/pranjaldhamane/social-media-sentiment-analysis

This project aims to analyze sentiment in Twitter data to understand attitudes towards specific topics or entities. It seeks to uncover positive and negative sentiment patterns, detect potential cyberbullying or hate speech, and provide insights into Twitter's overall sentiment landscape.

data dataanalysis logistic-regression nlp-machine-learning python sentiment-analysis twitter

Last synced: 18 Apr 2026

https://github.com/meizuflux/cion

Python minimal data validation library

data minimal python validation

Last synced: 28 May 2026

https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio

This project analyzes global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis explores energy consumption data from various regions worldwide and examines the contribution of renewable energy sources over time.

data data-analysis data-visualization pandas seaborn

Last synced: 10 May 2026

https://github.com/stefen-taime/llm-rag-mtl-public-hospital

This project develops a Retrieval-Augmented Generation (RAG) model to answer questions using public data from Google reviews left for hospitals in Montreal.

data google-reviews hopital hospital hub ia llm montreal open-source quebec rag

Last synced: 21 Apr 2026

https://github.com/jdenn0514/surveycore

Core Survey Analysis Infrastructure

data r resear survey-analysis

Last synced: 21 Apr 2026

https://github.com/adrianoleitedasilva/adrianoleitedasilva

My name is Adriano. I am 35 years old and have dedicated 18 of those years to the fields of Information Technology and Education.

adrianoleitedasilva automation ceo cio cto data data-science dev diretor github mobile professor python readme techlead web

Last synced: 10 May 2026

https://github.com/hemangsharma/assignment-2---classification-models

The Assignment 2 - Classification Models repository contains the project for 36106 Machine Learning Algorithms and Applications.

data datascience-machinelearning machine-learning ml

Last synced: 10 Jun 2026

https://github.com/mevlutcelik/turkey-cities-data

📍 City data package for the cities of Türkiye: includes license plate code, coordinates (lat/lon), population (2024 ADNKS), and geographic region information.

cities coordinates data json nufus plaka turkey turkiye typescript

Last synced: 10 Mar 2026

https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis

A data science notebook project focused on analyzing car features and building a model for car price prediction.

data data-analysis data-visualization jupyter-notebook python

Last synced: 23 Apr 2026

https://github.com/mohammad-malik/covid-visualizations-d3

This project provides a dashboard with five different perspectives on the pandemic, from patient-infection relationships to regional trends and hierarchical distributions. This was developed as part of a project for the course Data Analysis and Visualization (DS3001).

covid-19 d3 d3-visualization d3js data data-analysis data-analytics data-science visualization

Last synced: 28 May 2026

https://github.com/nivasharmaa/genetrack

A Java program for analyzing DNA sequences and identifying individuals based on Short Tandem Repeats (STRs). Features profile database creation, STR analysis, individual identification, and relationship detection.

data data-processing dna-analysis file-io-in-java genetic-analysis java-oop

Last synced: 25 Aug 2025

https://github.com/howwohmm/fetchgram

Era-adjusted Instagram content intelligence: scrape any public profile, OCR every image, measure what actually works. Free, local, no API keys.

analytics cli content-strategy data instagram ocr python scraper

Last synced: 06 Jun 2026

https://github.com/mwelwankuta/image-match

A multi-threaded tool for batch renaming images based on their appearance and a match in a data source.

data openai typescript worker-threads

Last synced: 09 Mar 2025

https://github.com/cyberoctane29/python-for-data-analysis

A repository dedicated to learning Python for data analysis, data science, and data analytics. This collection of Jupyter notebooks covers practical exercises and concepts from the Google Advanced Data Analytics Professional Certificate program.

data data-analysis data-analytics data-science python

Last synced: 24 Apr 2026

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 10 Mar 2026

https://github.com/marielachirinosr/cyclistic-data-analytics-project

This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.

data data-visualization pandas powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/brightway-lca/bw_io

IO tools for Brightway LCA framework

bw3 data life-cycle-assessment python

Last synced: 10 Jun 2026

https://github.com/sakan811/show-leaving-soon-tracker-website

This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.

data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website

Last synced: 18 Mar 2025

https://github.com/rubix982/product-quality-classification

This is an implementation for the CIKM AnalytiCup 2017 on the topic of "Product Title Quality". The goal is to take SKUs and rank their titles' clarity and conciseness. Referenced papers are attached to this repository. The aim is to craft ensemble models that either replicate those results or find new methods for classification.

data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp

Last synced: 25 Apr 2026

https://github.com/mlkav/tri-hita-karana

Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.

bali data data-science g20 science

Last synced: 06 Jun 2026

https://github.com/afeiship/data-arary

Data array with some new methods.

array data data-structure js list

Last synced: 11 May 2026

https://github.com/vishwas-chakilam/hr-dashboard

This project involves creating an interactive HR Dashboard using Power BI for visualization and MySQL for data cleaning and analysis. It provides insights into employee performance, attrition, salary distribution, and hiring trends.

dashboard data datac datacleaning datavisualization mysql powerbi

Last synced: 23 Mar 2025

https://github.com/marielachirinosr/hotel-data-analysis

Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries

data data-analysis matplotlib pandas python

Last synced: 25 Apr 2026

https://github.com/f-ssemwanga/pandas-numpy-repo

This repo contains extensive work I did with the Pandas and NumPy modules during the Advanced Programming module.

cleaning-data-in-python data numpy-arrays pandas visualization

Last synced: 27 Apr 2026

https://github.com/miniql/miniql-inline

A MiniQL query resolver for inline data.

data query query-language

Last synced: 27 May 2026

https://github.com/tsbarr/citi-bikes-challenge

Citibikes NYC Data Analysis: Uncover insights from over a decade of ride data. Jupyter notebook for data aggregation/cleaning & Tableau dashboards for interactive visualization.

data data-visualization pandas-python python tableau

Last synced: 27 Apr 2026

https://github.com/rezapace/newbash

This project involves managing various application shortcuts and configurations primarily for a Linux environment. It includes scripts for creating .desktop entries for applications, managing system configurations, and handling application processes.

automation backup bash data dekstop linux newbash ohmyzsh script testing zsh

Last synced: 11 Apr 2026

https://github.com/schenkd/tweetminer

Data Miner for Twitter Streaming API

data dataminer datamining java twitter twitter-api twitter4j

Last synced: 07 Jun 2026

https://github.com/bhumitbedse/machine-learning-projects

AI, machine learning, deep learning, computer vision, and NLP projects with code

computer-vision data data-science deep-learning machine-learning natural-language-processing python

Last synced: 27 Apr 2026