An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/wiseql/wiseql

The wise data browser — run SQL recipes as small, observable, debuggable steps

data debugging duckdb oracle quality sql tui

Last synced: 13 Jun 2026

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 15 Jun 2026

https://github.com/dineshram0212/youtube-analysis

This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.

data data-visualization pandas python webscraping youtube-api-v3

Last synced: 19 Jun 2026

https://github.com/rylan12/apscores

A quick way to visualize how the AP score distributions have changed from year to year.

advanced-placement analysis ap-exam data scores

Last synced: 19 Jun 2026

https://github.com/artcc/coredatademo

Demo for CoreDataGenericModule implementation

core coredata coredata-model data encrypted encrypted-data encryption persist

Last synced: 19 Jun 2026

https://github.com/svetlanam/kbl-to-csv-s3

Keboola extractor that converts Excel files to CSV based on input mapping criteria and uploads them to an S3 bucket

data data-cleaning data-transformation etl keboola s3-bucket

Last synced: 20 Jun 2026

https://github.com/hlan22/2025-03-18-data-validation

(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:

data validation

Last synced: 23 Jun 2026

https://github.com/dineshdhamodharan24/data-analysis

Probability analysis of customers and basic analysis

analysis data powerbi probability python visualization

Last synced: 23 Jun 2026

https://github.com/agbianchessi/js-struct

C-like Structs for JavaScript.

binary c data struct

Last synced: 23 Jun 2026

https://github.com/anburocky3/cbse-schools-data

Fetch CBSE schools in seconds and use them for your data projects

cbse data data-analysis data-science grabber nextjs

Last synced: 24 Jun 2026

https://github.com/brayflex/spy-sector-rotation-google-sheet

Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.

data etf google-sheets index price rotation script sector spreadsheet spy stock-market

Last synced: 29 Jun 2026

https://github.com/chocolateboy/data

Structured data scraped from unstructured (or semi-structured) sources

data dataset datasets json opendata scrape scraped scraper wikipedia

Last synced: 30 Aug 2025

https://github.com/olekscode/datageneration

Exploring the methods of data generation for different Machine Learning algorithms

data javascript machine-learning

Last synced: 05 Apr 2025

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/mikpom/genomvar

Sequence variant analysis in Python

data genomics

Last synced: 10 Apr 2026

https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project

This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.

cleaning-data data database dataset mysql mysql-database sql

Last synced: 07 Apr 2025

https://github.com/aminnairi/node-decode

Check that your data meet your expectations

check data decode expectations schema

Last synced: 22 Apr 2026

https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails

Software | SQL database with automated cloud sync for mitigating data loss across distributed servers. Managed by Ruby on Rails.

cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql

Last synced: 24 Jul 2025

https://github.com/team-hydrogen/2025-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 11 Apr 2025

https://github.com/goutamhegde002/dsa-roadmap-for-beginners

The "DSA Roadmap for Beginners" repository is a comprehensive guide designed to help beginners learn Data Structures and Algorithms (DSA) efficiently. It provides structured content covering fundamental and advanced topics in DSA, with practical examples, exercises, and coding problems.

beginner beginner-friendly beginner-guide coding-practice data data-science data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice dsa-roadmap interview-preparation interview-resources programming-fundamentals

Last synced: 28 Feb 2026

https://github.com/justinyahin/wpdf

Create, filter, sort and display user data on your WordPress site.

data filtering wordpress

Last synced: 18 Apr 2026

https://github.com/fatihemres/Fruits

Fruit details app built with SwiftUI, using data, models, animation, and a practical onboarding flow.

animations data models onboarding swift swiftui

Last synced: 31 Aug 2025

https://github.com/soenneker/soenneker.dtos.idpartitionpair

A minimal Record type with an Id (string), PartitionKey (string), and maximum JSON compatibility

csharp data dotnet dto id key partition

Last synced: 09 Mar 2026

https://github.com/cmda-tt/course-25-26

🎓 tech track · 2025-2026 · curriculum and syllabus 📊

d3 data datavis functional javascript programming research svelte visualization

Last synced: 20 Jan 2026

https://github.com/equinor/fmu-sumo-uploader

Upload to Sumo in the FMU context

data fmu python subsurface sumo

Last synced: 06 May 2026

https://github.com/themost-framework/mysql

Most Web Framework MySQL Adapter

data database mariadb mysql orm query sql

Last synced: 07 Mar 2026

https://github.com/karashiiro/lodestone-character-data-scraper

Lodestone character data scraper.

data ffxiv ffxiv-character lodestone

Last synced: 23 Apr 2026

https://github.com/iamfrerot/userverse

Creating an API for data analysis

data data-analytics spring-boot users

Last synced: 23 Mar 2025

https://github.com/open-geodata/sp_bh_pcj-2020-2035

Spatial data from the Agência das Bacias PCJ, with information presented in the 2020-2035 Basin Plan (Plano de Bacias)

data python

Last synced: 16 Jan 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather and analyze detailed information about the different entries in order to provide insights about the host and price of a property in a particular area, according to your preferences, type of room, and number of reviews.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/mierune/tinybufr

[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.

bufr data meteorology rust weather wmo

Last synced: 15 May 2025

https://github.com/sakshamarora07/whatsapp-chat-analyser

This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.

chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp

Last synced: 04 Jan 2026

https://github.com/quantumudit/test-store-data-analysis

This repository showcases a web scraper with a pipeline structure for efficient data extraction and transformation from websites. The tool can be tailored to leverage its capabilities for insightful data analysis, providing valuable insights and informed decision-making.

data data-visualization dataanalytics python python-webscraping webscraper webscraping-data

Last synced: 11 Apr 2026

https://github.com/dug22/jjournal

A Jupyter like notebook software for Java

data data-analysis data-science java jshell jshell-repl notebook swing swing-application

Last synced: 11 Apr 2026

https://github.com/diegoperea20/datos-secuenciales-con-ia

One-dimensional signal processing with autoregressive models, 1D convolution, 2D convolution on the spectrogram, and recurrent networks

ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao

Last synced: 06 Feb 2026

https://github.com/apostolissiampanis/weather-app-api

WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.

api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux

Last synced: 05 May 2026

https://github.com/ersinkoc/minote

Minimal Notation for LLMs

data llm notation token

Last synced: 21 Feb 2026

https://github.com/satyam4229/iit-and-nit-college-dataset

The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.

college-data csv data excel iit nit

Last synced: 04 Jan 2026

https://github.com/officialxviid/gloogia

👓 Make your big ideas come true by building real projects using real data 🌎

api build data gloogia projects xviid

Last synced: 05 Jan 2026

https://github.com/sanad343/complete-data-analyst

Data analysis is the process of turning raw data into useful information for decision-making.

data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau

Last synced: 30 Jun 2025

https://github.com/csoren66/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 03 Mar 2025

https://github.com/simonbolivarpy/vault-decode-py

Simple tools for decoding crypto data from wallet extensions: MetaMask, Ronin, TrustWallet, TronLink (old), etc.

data decode decrypt metamask passwords python ronin salt tronlink trustwallet vault

Last synced: 15 Mar 2025

https://github.com/fiddlydigital/anonimizer

A lib to replace and rehydrate sensitive data in text

anonimize anonymize data data-security prompt sanitize string string-manipulation text

Last synced: 15 Mar 2025

https://github.com/gabrielcsapo/bluse

⚗️ blend and fuse data with ease

data normalize utility

Last synced: 15 Mar 2025

https://github.com/e-panourgia/big-data

Big Data Management Systems course assignments

analytics azure bigdata data hadoop json latex mrjob neo4j python redis stream

Last synced: 11 Apr 2026

https://github.com/abirsaha111/ipl-2022-analysis

The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.

data dataana dataanalytics datavi matplotlib python

Last synced: 07 Jun 2026

https://github.com/ybelenko/openapi-data-mocker-interfaces

Package with OpenApiDataMocker interfaces.

data fake faker interface mock mocker oas oas3 openapi swagger

Last synced: 05 Jan 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/bbfh-dev/protox

Go library for (de-)serializing custom protocols

binary data format go library parsing protocol reader writer

Last synced: 01 Jul 2025

https://github.com/mecha-cms/x.route

Custom route files.

custom data extension file folder path route url

Last synced: 23 Mar 2025

https://github.com/gkannan-codes/habitableexos

With Earth’s habitability under strain, we ask: which known exoplanets could humans live on? Using NASA’s Exoplanet Archive, we score planets 0–1 (1 ≈ Earth) from five Earth-normalized features to rank top candidates.

data html kaggle matplotlib-pyplot numpy pandas plotly python seaborn visualization

Last synced: 11 Apr 2026

https://github.com/halyusa16/mysql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 20 Jan 2026

https://github.com/praxtube/dogg

CLI tool to log data manually

data data-logger log logger

Last synced: 10 Jun 2026

https://github.com/nel-zi/nuga_bank

Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.

data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline

Last synced: 16 May 2025

https://github.com/psyteachr/psyteachrdata

Datasets for psyTeachR Books

data

Last synced: 23 Mar 2025

https://github.com/roovedot/unet-cnn-for-road-segmentation

(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation

cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch

Last synced: 01 Jul 2025

https://github.com/lablnet/alibaba_scraper

This is a robust web scraper that extracts data from the Alibaba website. It's multi-threaded and utilizes Playwright to efficiently scrape data from the website. This script is capable of scraping the entire Alibaba site, which would take approximately 4-6 months to complete.

alibaba data ecom mit-license open-source products scraper

Last synced: 15 Mar 2025

https://github.com/d4niee/exifpy

A simple console tool to view image metadata

data exif image meta python

Last synced: 23 Mar 2025

https://github.com/wittyicon29/kritika-iit-b-2023

Selection task for the summer projects of Kritika IIT-B

data data-analysis data-science

Last synced: 15 Mar 2025

https://gitlab.com/hailstorm75/Common

A collection of extension libraries for various use-cases

common core cpp csharp data extensions libraries library math matrix

Last synced: 07 May 2025

https://github.com/mevlutcelik/turkey-cities-data

📍 City data package for the cities of Türkiye: includes license plate code, coordinates (lat/lon), population (2024 ADNKS), and geographic region.

cities coordinates data json nufus plaka turkey turkiye typescript

Last synced: 10 Mar 2026

https://github.com/mubashirsidiki/certifications_work

This repository contains my work, projects, and solutions from various professional certification programs.

analysis coursera data data-science google ibm john-hopkins machine-learning michigan udemy

Last synced: 01 Jul 2025

https://github.com/sasanthns/sql_data_warehouse_project

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver

Last synced: 24 Mar 2025

https://github.com/bertrand31/one-billion-rows-challenge

🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds

competitive-programming competitive-programming-contests data data-engineering data-processing performance scala

Last synced: 05 Sep 2025

https://github.com/plnech/never2late

Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'

dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper

Last synced: 10 Jun 2025

https://github.com/docuvesta/shiseido_skincare_usa_fr_infographics

Explore the performance indicators tied to reviews of a highly regarded serum from the Japanese luxury beauty brand Shiseido. The comparison covers the US and FR websites 💯

analysis automatisation data datanalysis graphique infographie pandas plotly python skincare soins

Last synced: 11 Apr 2026

https://github.com/murshidazher/client-side-data-storage

🚌 A workspace containing client-side data storage implementations

cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql

Last synced: 02 Sep 2025

https://github.com/bdr-pro/graphyml

A powerful, interactive Streamlit application to explore, edit, visualize, and query a graph-based database of YAML nodes — ideal for movie metadata, research articles, or structured knowledge graphs.

data database yaml yml

Last synced: 23 Jul 2025

https://github.com/awpala/udemy-my-courses-data-parser

Download Udemy lists and courses metadata for authenticated student user

data scripts udemy

Last synced: 07 May 2026

https://github.com/rahult18/atmo-flow

AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes

airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark

Last synced: 23 Jun 2026

https://github.com/gustavonav/daily-youtube-extraction

Project that completes the creation of an environment for extracting, storing, and processing YouTube data

airflow data minio python3 spark

Last synced: 21 Feb 2026

https://github.com/abhijeetdasbakshi/ecommerce-insights

A Dockerized end-to-end project that combines unsupervised machine learning for customer segmentation with scalable data pipelines. It uses MongoDB for data ingestion, Scikit-learn for clustering, Airflow for orchestration, and Streamlit for interactive visualization — enabling actionable insights into e-commerce

airflow airflow-dags ci-cd-pipeline clustering dags data data-pipelines docker docker-compose docker-container dockerfile git great-expectations kafka mongodb pca-analysis postgresql pyspark t-sne umap-learn

Last synced: 04 Apr 2026

https://github.com/0xHericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 24 Mar 2025

https://github.com/richardlitt/bird-watching

My birdwatching list and repo

birding data ebird

Last synced: 26 Jan 2026

https://github.com/sauravsrivastav/githubreposearcher

GitHub Repo Searcher 🔍 is a Streamlit web application designed to help you search for GitHub repositories based on a query and view the results in a tabular format. You can also download the results in CSV or Excel format for further analysis. 📊📈

data data-export excel github-api python repository-searcher streamlit webapp

Last synced: 20 Jan 2026

https://github.com/shahsuvarli/election-voters-data-analysis-pandas

Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.

cleaning data grouping matplotlib numpy pandas python visualization

Last synced: 12 Apr 2026

https://github.com/merekat/flight-delay-prediction

This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.

aviation data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling

Last synced: 08 Apr 2025

https://github.com/beriberikix/senml-zephyr

A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr

codec data iot senml sensor zephyr-rtos

Last synced: 24 Mar 2025

https://github.com/keanteng/kaggledata

📊Data Source For Program Testing

data dataset excel

Last synced: 24 Mar 2025

https://github.com/darkogamerz/dhis2heat

A comprehensive data management and health equity assessment and analysis platform that fetches data from DHIS2, then optimizes, calculates, cleans, and visualizes inequality data.

analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization

Last synced: 01 Apr 2025

https://github.com/armand-sauzay/datasets

Datasets for machine learning

ai data datasets machine-learning ml

Last synced: 18 Jan 2026