An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/genert/metis

Asynchronous data sender library

analytics asynchronous data dependency-free typescript

Last synced: 27 Jan 2026

https://github.com/anobaka/insidecollector

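This is software that sits between Excel and a pure record-keeping tool. You can freely create all kinds of lists, link them together using various rules, and create custom views to help you understand your data better.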

collection data excel-like list list-manager table

Last synced: 19 Jan 2026

https://github.com/R-Mahesh45/HR---Resume-Text-Classification

Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.

data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing

Last synced: 13 Oct 2025

https://github.com/iamgmujtaba/github-python-daily-trending

This repository provides an automated, daily-updated list of the top trending Python repositories on GitHub. Using a GitHub Actions workflow, it scrapes data from GitHub's trending page, sorts the results by total stars, and generates a clean, well-structured README file.

data data-scraping github-actions tranding tranding-bot

Last synced: 13 Oct 2025

https://github.com/twistezo/ts-dto-mapper

DTO (Data Transfer Object) to Object Model transformer

data dto map mapper model object transfer transform transformer typescript

Last synced: 05 Feb 2026

https://github.com/stdlib-js/strided-base-dtype-resolve-str

Return the data type string associated with a supported strided array data type value.

array data dtype dtypes enum javascript node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 13 Oct 2025

https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis

This project explores Walmart sales data to understand top-performing branches and products, sales trends across different products, and customer behaviour. The aim is to study how sales strategies can be improved and optimized. The dataset was obtained from Kaggle.

data database mysql sql walmart

Last synced: 24 Feb 2026

https://github.com/nnavales/desafios-data-engineer

In this project we tackle common challenges in the Data Engineer role using modern technologies.

data data-engineering database dataengineering docker minio scrapping spark

Last synced: 01 Jun 2026

https://github.com/akv3sic/cryptocurrency-charts

Cryptocurrency API data visualizations 📈 with Matplotlib.

cryptocurrency data data-visualization matplotlib python

Last synced: 16 Oct 2025

https://github.com/potreic/etl-fashion-trend-analysis

✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊

airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends

Last synced: 27 Jan 2026

https://github.com/data-forge-notebook/javascript-cheat-sheet

Cheat sheet that accompanies my book Data Wrangling with JavaScript

cheatsheet data data-wrangling javascript nodejs

Last synced: 15 Apr 2026

https://github.com/nicolasbizzozzero/datagenerator

Randomly generate various commonly used data

data data-generation data-generator data-science

Last synced: 18 Oct 2025

https://github.com/gematik/poc-isik-patient-merge

The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to report merges that have occurred within the ISIK context.

data fhir isik poc

Last synced: 19 Oct 2025

https://github.com/jaldekoa/fiscaldataapi

A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 27 Jan 2026

https://github.com/lemniscate-world/stratai

This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.

ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading

Last synced: 23 Oct 2025

https://github.com/rodekruis/510-data-catalog

A CKAN-based data catalog portal for 510.

catalog ckan data opendata

Last synced: 23 Jan 2026

https://github.com/purarue/git_doc_history

Copy/track file history in Git, with Python bindings to traverse and extract history, files, and lines as of a given date.

data git

Last synced: 17 May 2026

https://github.com/mustika-putri-m/analysis-of-sales-transactions-in-an-online-shop---london

Crucial questions: 1. What was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products do customers purchase in each transaction? 4. Which customer segments are the most profitable? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?

data data-analysis-python data-analytics data-visualization ecommerce

Last synced: 24 Oct 2025

https://github.com/garcane/Income-Prediction-ML

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 24 Oct 2025

https://github.com/farzai/geonames-php

This package provides a simple way to download Geonames data and format it for friendly use.

countries country-codes data geography geonames

Last synced: 24 Oct 2025

https://github.com/cmda-tt/course-24-25

🎓 tech track · 2024-2025 · curriculum and syllabus 📊

d3 data datavis datavisualization es6 functional javascript programming svelte

Last synced: 28 Jan 2026

https://github.com/imahdimir/githubdata

A very simple Python package to easily download from and manage a GitHub "Data Repository"

data data-repository python-package

Last synced: 23 Jan 2026

https://github.com/rnabla/cuda-des

Bruteforcing DES using CUDA

bruteforce cuda data des encryption gpu parallel standard

Last synced: 27 Oct 2025

https://github.com/aleenprd/docbt

Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.

ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit

Last synced: 11 Nov 2025

https://github.com/patrikmasiar/algorythm-of-the-night

Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻‍💻

algorithms data interview-questions logic logic-programming math mathematics science

Last synced: 27 Oct 2025

https://github.com/maccccd/wsoa3029a_2444372

This website serves as an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js, a JavaScript library used to create interactive data visualizations. The visualizations here provide insights into two types of cybersecurity attacks: phishing and ransomware.

d3js data hacking visualization

Last synced: 24 Jan 2026

https://gitlab.com/Native-Coder/d3-react-component

This is a dead-simple React component that makes D3 implementation a breeze.

chart component d3 data react vis visualization viz

Last synced: 24 Jan 2026

https://github.com/zoekelepiri/ota_observatory

A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.

backend data database spring-boot

Last synced: 06 Feb 2026

https://github.com/CheeseWithSauce/HadithsJSONFormat

Free, authentic Hadith data from sunnah.com, organized by book, especially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.

api arabic data dev free hadith islam islamic muslim open-source quran sunnah

Last synced: 24 Feb 2026

https://github.com/ariqf1/learn_data

Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.

data data-engineering mysql

Last synced: 15 Apr 2026

https://github.com/desktopcleaner/naturemagazinescraper

Scrapes open-access Nature magazine articles and stores them as txt files.

data nature-magazine python scrapper word-frequency

Last synced: 06 Feb 2026

https://github.com/fairspec/fairspec-typescript

Fairspec TypeScript is a fast data management framework built on top of the Fairspec standard and Polars DataFrames

ckan csv data dataframe dataset excel fair json ods polars quality schema sqlite table typescript validation zenodo

Last synced: 09 Feb 2026

https://github.com/alejo1630/titanic_kaggle

This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.

data data-science jupyter-notebook notebook python titanic-survival-prediction

Last synced: 03 May 2026

https://github.com/itu-helper/data-updater

Periodically scrapes data related to ITU to be used by anyone. This data powers the ITU Helper web sites.

data istanbul-technical-university scraper selenium-python

Last synced: 29 Jan 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2

Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.

data data-analysis deloitte excel forensic-analysis

Last synced: 06 Feb 2026

https://github.com/sandk21/etude_eau_potable_monde

Study of access to water around the world - dashboards built with Tableau

analysis data tableau tableau-public visualization

Last synced: 19 Mar 2026

https://github.com/simranjeet97/quotes-analysis

Analysis and visualization of a Kaggle quotes dataset with Python, Pandas, and Matplotlib in a Jupyter Notebook.

data data-science datavisualization jupyter-notebook kaggle kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python quotes quotes-application

Last synced: 15 Apr 2026

https://github.com/tee8z/noaa-oracle

NOAA data oracle that is queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style

data duckdb-wasm noaa-weather parquet-files sql weather

Last synced: 17 Feb 2026

https://github.com/openearth/rws-viewer

This viewer is created by Deltares in cooperation with Voorhoede under OpenEarth GPL License. The viewer can be used via several RWS websites, please visit https://www.informatiehuismarien.nl/, https://waterinfo-extra.rws.nl/ and https://basismonitoringwadden.waddenzee.nl/.

data mapbox-gl-js ogc-services viewer

Last synced: 01 Feb 2026

https://github.com/aniketkkajania/wassupanalyzer

WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.

data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat

Last synced: 25 Feb 2026

https://github.com/jub0t/eso

An application to manage all your Encryption & Decryption keys and other related tools.

data encryption encryption-decryption hacking hacking-tool keys pgp privacy private

Last synced: 07 Feb 2026

https://github.com/noahweasley/node-user-settings

A universal but simple Node library to implement user settings, built to work with Electron.js with little or no configuration

app data electronjs json nodejs persist settings storage sync user

Last synced: 08 Feb 2026

https://github.com/raymondcm/strawberrydata

Tool suite for fast multi-camera strawberry data collection project. The standards document houses cross compatibility/purpose implementation details.

camera cpp data intel multi-camera

Last synced: 08 Feb 2026

https://github.com/jeanmanguy/milk-sci-fi

Census of every mention of milk in sci-fi works.

data milk sci-fi

Last synced: 26 Feb 2026

https://github.com/ajityadav2621/datadoom

Currently working on the backend. The user interaction is done, so the project has been updated and deployed for reference. More will be added.

ai data

Last synced: 09 Feb 2026

https://github.com/pharo-ai/data-preprocessing

Project containing data pre-processing algorithms. We aim to include scaling, centering, normalization, and binarization methods.

data pharo pharo-smalltalk preprocessing smalltalk

Last synced: 09 Feb 2026

https://github.com/codenoid/alodokter.com-database

An Alodokter.com database, collected by Hofesh Bot (scraper)

alodokter data extraction hofesh

Last synced: 18 Mar 2026

https://github.com/jhpoelen/rats

Self-replicating data publication related to rat (Rattus sp.) specimens.

biodiversity data natural-history-collections provenance

Last synced: 18 Mar 2026

https://github.com/scottleechua/data

Public datasets under CC-BY-4.0 license.

data public-data

Last synced: 18 Mar 2026

https://github.com/mchenryspagg/hng-hire-data-model

The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.

dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema

Last synced: 11 Feb 2026

https://github.com/lmuffato/project-mongodb-dataflights-trybe

MongoDB Dataflights project - graded project from Trybe, Block 23: Introduction to MongoDB

back-end crud data database filter mongo mongodb query trybe-projects

Last synced: 16 Apr 2026

https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis

Complete analysis: data cleaning, processing, and data analysis with an interactive dashboard

analysis data data-visualization datacleaning excel excel-vba

Last synced: 19 Mar 2026

https://github.com/seabbs/estzoonotictb

Explore, Visualise and Estimate the Global Zoonotic Tuberculosis Burden

bovine-tb data estimation package rstats tuberculosis visualisation zoonotic-tb

Last synced: 28 Feb 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/m0nica/datalogues-outdated

Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll

data pelican pelican-blog pelican-theme

Last synced: 28 Feb 2026

https://github.com/ismail-mouyahada/lodscroljs-library

LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks.

data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll

Last synced: 13 Feb 2026

https://github.com/stdlib-js/array-base-every-by-right

Test whether all elements in an array pass a test implemented by a predicate function, iterating from right to left.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 13 Feb 2026

https://github.com/obsidianplusplus/5e_play_cs-go

Python tool to analyze your 5EPlay CS:GO match data. Fetches, analyzes, filters, and exports.

5eplay analysis api automation csgo data esports excel json match pandas performance player python reporting scraping stats team

Last synced: 13 Feb 2026

https://github.com/frictionlessdata/extensiondp

Extension DP (Data Package Extension Template) is a Git repository template for rapid Data Package extension development

data datapackage exchange extension format

Last synced: 13 Feb 2026

https://github.com/saisriramkamineni/e-commerce-sales-analysis-excel-

Conducted an in-depth sales analysis for an e-commerce platform, leveraging Excel for data preprocessing and Power BI for visualization. Identified key sales trends, customer purchasing behavior, and revenue growth patterns to optimize business performance.

analysis analytics data excel sales

Last synced: 14 Feb 2026

https://github.com/stdlib-js/array-base-assert-is-complex-floating-point-data-type

Test if an input value is a supported array complex-valued floating-point data type.

array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate

Last synced: 14 Feb 2026

https://github.com/jopanel/factual-scraper

Data scraper for Factual v2 API

data

Last synced: 15 Feb 2026

https://github.com/luminati-io/twitter-x-dataset-samples

A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.

api data dataset twitter twitter-api twitter-scraper web-scraping x

Last synced: 19 Mar 2026

https://github.com/ghonimo/diode-pn-junction-characterization-psu-ece515

A detailed analysis of the I-V characteristics of a PN junction diode (1N4148) under different temperatures, utilizing Excel for graphical analysis and parameter extraction. This study was conducted as part of the ECE 515: Fundamentals of Semiconductor Devices course at Portland State University.

analysis characterization data device diode diodes excel mosfet-transistor pn-junction

Last synced: 28 Feb 2026

https://github.com/linx-software/file-import-to-rest-api

Import a CSV file and make the data available via a REST API.

csv data linx low-code

Last synced: 19 Mar 2026

https://github.com/skywardai/paper_gallery

Gallery of papers on applying LLM capabilities to datasets

ai data data-science llm medicine neural-network research security

Last synced: 19 Mar 2026

https://github.com/mohamedhany99/human-voice-identifier-counter

An application developed in Kivy that identifies the users imported into the dataset using a trained support vector machine model. It has two features: importing a new voice, and detection, which detects human voices and counts them.

android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python

Last synced: 27 Mar 2026

https://github.com/droduit/grand-comics-database

EPFL course project to manage a huge database containing hundreds of millions of data items and to optimize the queries for a smooth user interface experience.

big-data data database epfl sql

Last synced: 16 Apr 2026

https://github.com/stdlib-js/array-base-every-by

Test whether all elements in an array pass a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 03 Mar 2026

https://github.com/meineglock20/listtotabledisplay

The List to Table Formatter for .NET is a versatile library designed to convert lists of objects into well-formatted table displays. Ideal for web and console applications, including log files and Word documents.

asp-net asp-net-core console csharp data display dotnet formatter html list logging netstandard20 object-list presentation razor-pages table table-formatter text-table text-to-table utility

Last synced: 04 Mar 2026