An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/krakozaure/pyzzy

Set of packages to simplify development in Python

configuration data formats json library logging logs python3 toml utils yaml

Last synced: 14 Jan 2026

https://github.com/mierune/tinybufr

[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.

bufr data meteorology rust weather wmo

Last synced: 15 May 2025

https://github.com/aaisha-nexus/sql_company_insights

A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.

business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms

Last synced: 12 Aug 2025

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/agdturner/ccg-data

A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.

data data-analysis

Last synced: 12 Jan 2026

https://github.com/pchaparro/search-engine

Full stack search-engine created from youtube videos obtained using "web-scraping"

data opensearch python python3 react scraper scraping scraping-websites search search-engine semantic-search sentence-transformers typescript website

Last synced: 17 Apr 2026

https://github.com/samhollings/nhs_data_cleansing

A repo of reusable functions for cleansing data

cleansing data data-cleaning data-cleansing preprocessing pyspark python python3

Last synced: 05 Oct 2025

https://github.com/mnazlukhanyan/da-projects

Портфолио с работами по аналитике данных, показывающие мои навыки, умения и опыт

data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization

Last synced: 11 Apr 2026

https://github.com/fuadarradhi/gps_data_reset

Flutter plugin to reset and download gps data

cache data extra gps reset

Last synced: 23 Feb 2026

https://github.com/pathilink/ebury_case

Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.

bigquery data modeling sql

Last synced: 05 Oct 2025

https://github.com/shreeparab1890/indian-elections-2019-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.

data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization

Last synced: 28 Apr 2026

https://github.com/h-sutiwas/r2de-2025

This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.

data data-engineering data-visualization docker gcp pipeline spark

Last synced: 30 Apr 2026

https://github.com/andrii04/andreamonforte-bi-assignment

Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.

automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql

Last synced: 09 Nov 2025

https://github.com/armand-sauzay/datasets

Datasets for machine learning

ai data datasets machine-learning ml

Last synced: 18 Jan 2026

https://github.com/tsbarr/belly-button-challenge

Using front-end development tools (javascript, html and css) I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 04 Mar 2026

https://github.com/0xhericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 09 Feb 2026

https://github.com/vim89/flowforge

Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently

archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming

Last synced: 14 Apr 2026

https://github.com/ashita-ai/ashita-ai.github.io

Ashita AI - The island of misfit data tools

ai data

Last synced: 19 Feb 2026

https://github.com/ramonmeza/mysteamstats

Visualize your stats from your favorite games on Steam!

data statistics steam steam-api videogame visualization

Last synced: 17 Mar 2025

https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql

Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.

analytics data dataanalytics mssql powerbi sql

Last synced: 26 Jun 2025

https://github.com/apigear-io/template-cpp14

C++14 technology template

conan cpp cpp14 data library

Last synced: 18 Feb 2026

https://github.com/alexis-gss/games-data

Games Data is a library of informations about all games, realised under NuxtJs

css3 data games nuxtjs tailwindcss typescript vuejs

Last synced: 13 Mar 2025

https://github.com/sambhav/fb-insights

A tool to analyze your Facebook data dumps and generate insights

analytics data facebook graphs insights language learning machine natural personal processing

Last synced: 17 Mar 2025

https://github.com/preranarao03/madhav_e-commerce_dashboard

This repository features the Madhav_E-Commerce_Dashboard built with Power BI. It provides interactive visualizations for analyzing e-commerce sales performance, product categories, customer segments, and geographic data, aiding in data-driven business decisions.

analysis data powerbi

Last synced: 30 Jan 2026

https://github.com/jszafran/personal-aws-data-lake

Personal, cloud based (AWS), data lake for experimenting with cloud services.

aws cloud data data-engineering dataengineering datalake etl terraform

Last synced: 20 May 2026

https://github.com/redinfinitypro/scientificsharp

Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.

componentmodel cryptography data drawing forms generic linq system tasks text

Last synced: 06 Apr 2025

https://github.com/antononcube/raku-data-typesystem

Data type system for different data structures.

data data-structures rakulang type-system

Last synced: 09 Jul 2025

https://github.com/mightymetrika/mmirestriktor

Informative Hypothesis Testing Web Applications

data hypothesis infomative power r restriktor statistics testing

Last synced: 17 Mar 2025

https://github.com/alexdonh/adonis-cache

Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!

adonis-framework adonisjs cache data dependency redis storing

Last synced: 15 May 2026

https://github.com/anzerr/storage.ts

Util to store data used in a service

data nodejs storage typescript util

Last synced: 20 May 2026

https://github.com/samharrison7/datamapper

Making mapping between datasets as simple as possible.

data data-mapper data-mapping data-science data-structures

Last synced: 17 Mar 2025

https://github.com/kylepw/multistack

Example of multiple stacks in one array.

algorithms array data data-structures python stack

Last synced: 17 Mar 2025

https://github.com/harrisonwelch/pythondatascience

Repo of code from the linked-in lesson "Python: Data Analysis"

data data-science matplotlib notes numpy python tutorial

Last synced: 12 Apr 2026

https://github.com/farovictor/mongodbloader

This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.

data go mongodb pipeline

Last synced: 15 May 2026

https://github.com/josemartinezrdev/logisticadb

Logistica Database

data ddl diagrama dml mysql sql

Last synced: 09 Jul 2025

https://github.com/stdlib-js/array-base-index-of-same-value

Return the index of the first element which equals a provided search element according to the same value algorithm.

array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types

Last synced: 15 May 2026

https://github.com/dina-hosny/sequence-trigger-pair-for-all-schema-tables-plsql

A PLSQL script that creates Sequence Trigger Pair for all Schema's Tables

data oracle plsql sequence sequencetrigger sql toad trigger

Last synced: 06 Mar 2026

https://github.com/arthurcfranklin/acervo-musical

Este projeto consiste na criação de um banco de dados relacional para auxiliar um DJ na organização e catalogação do seu acervo musical. O objetivo é fornecer um sistema eficiente para armazenar e gerenciar informações sobre cantores, bandas, músicas e suas versões remixadas.

data database mysql mysql-database sql

Last synced: 22 Mar 2025

https://github.com/kinshukjainn/dclue-v1

Dsainone is a highly optimized Data Structures and Algorithms (DSA) library designed to provide efficient implementations of graph algorithms, trees, hashing, and linked lists while maintaining exceptional memory efficiency. The library is designed to be as fast and optimized as possible

data dsa-algorithm python

Last synced: 20 May 2026

https://github.com/erkylima/algorithms

Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! 🚀 #Python #Algorithms #DataStructures

algorithms algorithms-and-data-structures data data-structures

Last synced: 19 Jan 2026

https://github.com/piyushkumar2025/analytical-sql-project-exploring-trends-segmentation-kpis

A complete SQL analytics project using a simulated data warehouse. It analyzes sales, customer, and product data with CTEs, joins, window functions, subqueries, and views to deliver insights on trends, segmentation, and KPIs, showing how SQL enables data-driven decisions without BI tools.

advanced-sql analytics business-intelligence data data-science-projects datascience joins kpi mysql query sql window-functions-in-sql

Last synced: 02 Jul 2025

https://github.com/lord3008/instances-of-data-analysis

This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.

data data-analysis

Last synced: 03 Mar 2025

https://github.com/ournet/places-data

Ournet places data module

data ournet places storage

Last synced: 04 Apr 2025

https://github.com/xmen3em/kaggle-competitions

This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.

data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit

Last synced: 09 Apr 2026

https://github.com/francois-lenne/portofolio_flenne_streamlit

portofolio francois lenne using streamlit

data portofolio python slack-api streamlit

Last synced: 15 May 2026

https://github.com/seesharprun/sample-data-yaml

Example repository illustrating the automatic creation of sample data files from YAML data

csv data dotnet json sample xml yaml

Last synced: 08 Apr 2026

https://github.com/mwoss/poketruth

Application checking facts about Pokemons.

data json pokemon python truth

Last synced: 20 May 2026

https://github.com/madihanazir/ds-using-c

Basic insights into Data Structures (inspired by Abdul Bari course but in C language)

data self-learning structures-in-c

Last synced: 17 Mar 2025

https://github.com/dan149/uselesscontentcreator

Useless Content Creator (UCC) is a fake content generator, text, html and pdf files.

content customizable data easy-to-use fake-data fake-data-generator faker-generator generator lightweight open-source opensource python python3

Last synced: 03 Apr 2025

https://github.com/brunosalerno/osm_data

Ruby objects for dealing with OSM data, and generating XML files

data openstreetmap ruby xml

Last synced: 21 Apr 2026

https://github.com/garcane/layoffs-exploratory-data-analysis

This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.

data dataanalysis eda mysql sql

Last synced: 29 Oct 2025

https://github.com/clagiordano/weblibs-data-export

Library for generic data export to various formats

clagiordano data export weblibs xlsx

Last synced: 22 Mar 2025

https://github.com/webdevcave/collections-php

A PHP library for managing collections of data with support for nested keys.

array collection data helper library nested-keys package php utility utility-classes

Last synced: 23 Feb 2025

https://github.com/jigyasag18/employee-salary-prediction-jigyasa

PayNexus is a machine learning-powered web app that predicts employee salaries based on role, education, and experience. Built using Python, Streamlit, and scikit-learn, it supports both single and batch predictions. The app includes advanced features like resume parsing via NLP and interactive visual analytics. Ideal for job seekers, HR profession

data dataset decision-tree-regressor gradient-boosting-classifier knearest-neighbor-classifier labelencoder lasso-regression linear-regression machine-learning machine-learning-algorithms machinelearning onehot-encoder pipeline random-forest random-forest-classifier ridge-regression standardscaler svr-regression-prediction xgboost xgboost-classifier

Last synced: 15 May 2026

https://github.com/tearth/test-data-generator

The generator of test data for the school project.

data generator test

Last synced: 05 Jul 2025

https://github.com/errea/vet_clinic_database

For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.

data data-analysis data-structures data-visualization database

Last synced: 21 May 2026

https://github.com/danpoynor/data-pagination-and-filtering-project

Data pagination exercise using 'vanilla' JavaScript. This script consumes a JSON array containing any number of objects and adds buttons to a page that users can click to navigate to different pages of data.

data javascript json navigation pagination vanilla-javascript

Last synced: 20 Apr 2026

https://github.com/pbinkley/tweets-online-classes-covid19

A twarc harvest of tweets related to online classes during the COVID-19 outbreak, starting 2020-03-02

data social

Last synced: 06 Mar 2026

https://github.com/kashirin-alex/thither.direct-onamove

an android skeleton-example application for using data from Thither.Direct platform on mobile applications

android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management

Last synced: 27 Apr 2026

https://github.com/fuwn/records-data

🗃 Records Data

data records rust

Last synced: 30 Mar 2025

https://github.com/luminovrym/crawler-tools-js

Crawler Tools Js adalah sebuah aplikasi yang digunakan untuk scrapping data pada sebuah web

crawler crawler-js data js web-scraping

Last synced: 08 Sep 2025

https://github.com/i-rzr-i/domaincommonextensions

The purpose of this repository/library is to provide the most relevant and used extension methods in the life cycle of application development that allow us to improve our code, and writing speed, and use more efficiently dev team time during this period for more complex functionality.

api class data datatype extension helper object parser type util

Last synced: 20 Sep 2025

https://github.com/soenkekluth/micromitter

minimal and performant event emitter / dispatcher

data dispatch dispatcher emit emitter event eventdriven handler on send trigger

Last synced: 02 Nov 2025

https://github.com/yanaksalvo/all-panel-database-sql

Türkiye Cumhuriyeti Devleti'nin verilerini çalarak insanlara satarak para kazanan veya bu paraları kara para aklama şeklinde aklayarak gelir elde eden kişilerin database verileri ve bu sitelere giren kişilerin IP Adres bilgileri

api data database devlet ihbar panel panel-data paneldata panels sorgu sorgulama sorgupanel sql usom usomgovtr

Last synced: 06 Apr 2025

https://github.com/viglino/forets-de-cassini

couche SIG l’ensemble des contours des forêts représentées sur la carte de Cassini (hal-01267936)

cassini data forest

Last synced: 18 Feb 2026

https://github.com/siongui/xemaauj9k5qn34x88m4h

No source code. Only serve JSON files of Pāli words

data go json pali

Last synced: 15 May 2026

https://github.com/devprnvk/pycryptochain

A implementation of a blockchain-based cryptocurrency in Python. This project aims to provide a fundamental understanding of blockchain technology and cryptocurrency by building a basic version from scratch. Features include blockchain creation, transaction handling, mining rewards, simulation.

blockchain crypto data decryption encryption hashing processing py python salting storage

Last synced: 09 Mar 2026

https://github.com/amethyst-php/product

An item that is made to be sold or bought

amethyst amethyst-package api data laravel product

Last synced: 21 May 2026

https://github.com/gui-sitton/bank-loans

In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 21 May 2026

https://github.com/nyxblabs/mimikra

🔄 Sleek data morphing tool from one file to another

data file filesystem morphing node nodejs sleek tool

Last synced: 21 May 2026

https://github.com/rellyson/data-engineering-tools

This repository holds examples and documentation about the most used tools in the data engineering ecosystem.

apache-airflow apache-spark data data-engineering jupyter-notebook python tools

Last synced: 17 Jan 2026

https://github.com/bastianolea/servel_elecciones

Resultados electorales desde Servel (2024)

chile comunas data elecciones genero

Last synced: 08 Jul 2025

https://github.com/bastianolea/mineduc_matriculas_superior

Bases de datos de estudiantes matriculados en Educación Superior

chile comunas data educacion social

Last synced: 16 Jun 2026

https://github.com/arekflo2002/analiza_danych-rstudio-_dyskryminacja_kobiet

Wykorzystując rstudio oraz zestawy dane ze strony https://www.gapminder.org/data/ badam tematykę dyskrminacjii kobiet na poszczególnych kontynentach i wyciągam odpowiednie wnioski

data data-preparation-and-analysis data-visualization rstudio statistics

Last synced: 14 Apr 2025

https://github.com/the-universal-linux-society/sysreport

Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.

analysis bash bash-script bash-scripting data report reporting system

Last synced: 15 May 2026

https://github.com/jun-labs/algorithm

📝 자료구조, 알고리즘 학습 저장소.

algorithm data data-structures leetcode problem-solving programmers ps structure

Last synced: 14 Mar 2025

https://github.com/fastpix/flutter-core-data-sdk

A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.

analyt dart data flutter

Last synced: 15 May 2026

https://github.com/dscamilo/gestion-clientes-springboot

Proyecto de gestión de clientes aplicando Java y Springboot, haciendo uso de Lombok, uso de interface, inyección de dependencias, uso de anotaciones Service, Data, RestController . Consumo de API haciendo uso de Postman.

data interface java lombok-maven restcontroller spring-boot

Last synced: 15 May 2026

https://github.com/shrutakeerti/eye-gaze-detection

This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15

ai aiml data eda eeg eeg-signals eye jodhpur mlflow

Last synced: 17 Mar 2025

https://github.com/mksingh431/sql-complete-notes

SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.

data database sql sql-server

Last synced: 21 Apr 2026