An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/neurazum-ai-department/tumor-stages-dataset---v1

Synthetic MRI data generated by the ‘HF’ and 'Vbai' models based on real data.

brain data dataset datasets image mri neuroscience tumor tumor-segmentation

Last synced: 18 Mar 2026

https://github.com/ludreinsalvador/global-covid-19-data-analysis

Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.

analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization

Last synced: 26 Feb 2026

https://github.com/drkane/area-profiles

Produce UK area profiles based on various data sources

dash-plotly data flask statistics uk

Last synced: 27 Apr 2026

https://github.com/nivasharmaa/genetrack

A Java program for analyzing DNA sequences and identifying individuals based on Short Tandem Repeats (STRs). Features profile database creation, STR analysis, individual identification, and relationship detection.

data data-processing dna-analysis file-io-in-java genetic-analysis java-oop

Last synced: 25 Aug 2025

https://github.com/mwelwankuta/image-match

a multi-threaded tool for batch renaming images of their appearance and match in a datasource

data openai typescript worker-threads

Last synced: 09 Mar 2025

https://github.com/kena0ki/dddl

generates test Data from DDL.

data database db ddl generator sql table test

Last synced: 30 Apr 2026

https://github.com/anburocky3/cbse-schools-data

Fetch CBSE Schools in seconds and use it for your data projects

cbse data data-analysis data-science grabber nextjs

Last synced: 24 Jun 2026

https://github.com/os-climate/data-requests

This repo is used to track issues related to new Data Requests

data data-engineering dataset

Last synced: 27 Feb 2026

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 10 Mar 2026

https://github.com/d4niee/exifpy

An simple console tool to view Image meta datas

data exif image meta python

Last synced: 23 Mar 2025

https://github.com/matthewgferrari/covid-contextualizer

A Coronavirus Contextualizer for the USA

data react visualization

Last synced: 26 Jun 2026

https://github.com/bastianolea/sicvir_indicadores_rurales

Sistema de Indicadores de Calidad de Vida Rural (Sicvir)

chile comunas data estado rural social

Last synced: 27 Feb 2026

https://github.com/ppabam/eda-bam

Navigating data from one thing to another.

cli data eda python

Last synced: 11 Feb 2026

https://github.com/anandanraju/power_bi_dashboard_projects

The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.

amazon dashboard data data-visualization healthcare powerbi project

Last synced: 11 Feb 2026

https://github.com/lablnet/alibaba_scraper

This is a robust web scraper that extracts data from the Alibaba website. It's multi-threaded and utilizes Playwright to efficiently scrape data from the website. This script is capable of scraping the entire Alibaba site, which would take approximately 4-6 months to complete.

alibaba data ecom mit-license open-source products scraper

Last synced: 15 Mar 2025

https://github.com/sakan811/show-leaving-soon-tracker-website

This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.

data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website

Last synced: 18 Mar 2025

https://github.com/charlenry/python_data_science

Mes notebooks de travaux pratiques sur Python pour la Data Science

analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation

Last synced: 25 Jun 2026

https://github.com/vishwas-chakilam/hr-dashboard

This project involves creating an interactive HR Dashboard using Power BI for visualization and MySQL for data cleaning and analysis. It provides insights into employee performance, attrition, salary distribution, and hiring trends.

dashboard data datac datacleaning datavisualization mysql powerbi

Last synced: 23 Mar 2025

https://github.com/afeiship/next-object-operator

Object set/get/sets/gets and other operator.

data get gets next operator set sets store

Last synced: 27 Feb 2026

https://github.com/tomcardoso/journalism-data-intersection

A talk on working at the intersection of journalism and data science

data data-journalism journalism

Last synced: 15 May 2025

https://github.com/kirillsemyonkin/lsd

LSD (Less Syntax Data) configuration/data transfer format.

configuration data java parsing rust

Last synced: 27 Feb 2026

https://github.com/ashishsingh789/titanic_dataset_eda_and_visualization

This repository contains an exploratory data analysis (EDA) of the Titanic dataset. Key analyses include survival rates by gender, passenger class, age distribution, family size, and correlation heatmaps.

data data-science dataanalysis matplotlib numpy pandas pandas-dataframe python seborn visualisation

Last synced: 11 Apr 2026

https://github.com/r-mahesh45/india-news-headlines-analysis

Excited to share my latest project: India News Headlines Analysis (2001–2023). This Power BI report dives deep into 21 years of Indian headlines, uncovering: Trends that defined the nation, Key themes that shaped public discourse, Insights into the evolution of media coverage.

data data-science powerbi visualization

Last synced: 05 Jan 2026

https://github.com/dhimmel/adeptus

ADEPTUS -- differential gene expression signatures of disease

adeptus data differential-expression disease gene-expression genes rephetio

Last synced: 05 Jan 2026

https://github.com/bishtrishu/super_store_sales_dashboard

This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by utilizing data analysis technique, specially focusing on time series analysis, to provide valuable insights and accurate sales forecasting.

analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql

Last synced: 28 Feb 2026

https://github.com/avestura/shell-dads

❓ Show a random tip from NIST DADS (https://xlinux.nist.gov/dads) every time you open your terminal

algorithms dads data data-structures ds nist

Last synced: 23 Oct 2025

https://github.com/suryadev99/stream_processing_website_click_data

Stream Processing of website click data using Kafka and monitored and visualised using Prometheus and Grafana

clickdata data dataengineering docker flink-kafka flink-metrics flink-stream-processing git grafana kafka kafka-streams kafka-topic prometheus psql python

Last synced: 10 Mar 2026

https://github.com/sumaiyyaf/british-airline-dashboard

This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.

analysis dashboard data tableau visualization

Last synced: 13 Feb 2026

https://github.com/j0a0m4/olympics

Final Project for Data Engineering Accelerated LATAM

data olympics spark

Last synced: 13 Feb 2026

https://github.com/smeltier/data-structures-c

This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.

algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c

Last synced: 16 May 2025

https://github.com/mecha-cms/x.kick

URL redirection files.

data extension files link redirect tool tsv url

Last synced: 23 Mar 2025

https://github.com/infinitode/pywebscrapr

An open-source Python web scraping tool. Supports both image scraping and text scraping.

data data-collection data-science open-source pip scraping web-scraper

Last synced: 14 Feb 2026

https://github.com/kalaspuff/ready

🎟 [not yet built] Take control of the event loop with simplified task management, queueing and data loading.

asyncio data dataloading event futures python python3 resolver tasks

Last synced: 10 May 2026

https://github.com/ehvenga/data.driven.modeling

Repository to practice data driven modelling

data data-modeling

Last synced: 23 Mar 2025

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/gianlucatruda/titanic

An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.

data database flask info matplotlib python science scrape server titanic visualisation web

Last synced: 10 Apr 2026

https://github.com/ngofilho/scripts-db

Repository containing several dbs scripts samples.

cache data database db mariadb mongodb mysql oracle redis sql-server

Last synced: 11 Apr 2026

https://github.com/s-babaeizadeh/next-mini-app

nextjs mini application

css data nextjs reactjs

Last synced: 11 Apr 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/soenneker/soenneker.dtos.idnamepair

A minimal Record type with an Id (string), Name (string), and maximum JSON compatibility

csharp data dotnet dto id name

Last synced: 12 Mar 2026

https://github.com/gabrielcsapo/bluse

⚗️ blend and fuse data with ease

data normalize utility

Last synced: 15 Mar 2025

https://github.com/fiddlydigital/anonimizer

A lib to replace and rehydrate sensitive data in text

anonimize anonymize data data-security prompt sanitize string string-manipulation text

Last synced: 15 Mar 2025

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/naveenk-ds/redbus_web_screaping.app.py

🚌 Red Bus Project Overview The Red Bus Project is a web scraping and visualization tool built with Selenium to extract bus information from the RedBus website. It stores the data in a MySQL database and provides an interactive visualization interface using Streamlit. The goal is to deliver insights into bus schedules, prices, ratings, etc...

data data-science database-management pandas pyhton selenium-webdriver sql

Last synced: 11 Apr 2026

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/arnocan/yapydata

The yapydata provides miscellaneous low-level Python data access APIs.

data datastructures ini json properties python python2 python3 xml yaml

Last synced: 16 Feb 2026

https://github.com/n-ce/localstorage-data-interchange-manager

Implementation of local storage data interchange using map data structure.

data export import javascript js-maps json localstorage

Last synced: 28 Apr 2026

https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm

Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.

algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model

Last synced: 01 Sep 2025

https://github.com/peterhellberg/bugsnag-data

Dump Bugsnag data using the Data access API

bugsnag data go

Last synced: 22 Jun 2026

https://github.com/gdcmarinho/vaultchat

VaultChat is a end-to-end encryption chat service

chat data e2ee encrypted messaging privacy

Last synced: 23 Mar 2025

https://github.com/badranalyst/covid-deaths-dashboard-with-tableau

This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.

covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization

Last synced: 02 Mar 2026

https://github.com/howz1t/ptypes

This package provides useful data types for use in PHP.

badges composer computer-science data data-structures data-types packagist php types

Last synced: 29 Apr 2026

https://github.com/martinius96/meteostanica-odosielacie-scripty

Meteostanica - Arduino, ESP8266, ESP32 - odosielanie sketche pre reprezentáciu dát vo webovom rozhraní.

arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka

Last synced: 11 Apr 2026

https://github.com/mtalhaofc/nutrition_system

A simple AI-powered web app built using Streamlit that provides personalized weekly meal plans and nutrition recommendations based on user demographics, health goals, and nutritional preferences.

cosine-similarity data data-science food machine-learning model nutrition pandas python streamlit

Last synced: 29 Apr 2026

https://github.com/stdlib-js/array-struct-factory

Return a constructor for creating arrays having a fixed-width composite data type.

array composite data factory javascript node node-js nodejs stdlib struct structure typed typed-array types

Last synced: 29 Apr 2026

https://github.com/etmendz/mendz.data.sqlserver

Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to SQL Server databases.

ado-net context data database datasettings mendz sql-server

Last synced: 10 May 2026

https://github.com/sanchittechnogeek/overscripted-analysis

Geolocation and user language extraction analysis from Mozilla Overscripted dataset

analysis data data-analysis mozilla

Last synced: 23 Mar 2025

https://github.com/martgro/datagrabber

Tool for extracting data points from plots

data extract image plots python3

Last synced: 29 Apr 2026

https://github.com/chandansoren/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 29 Apr 2026

https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling

Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models

Last synced: 17 Apr 2026

https://github.com/iamfrerot/userverse

creating api for data analysis

data data-analytics spring-boot users

Last synced: 23 Mar 2025

https://github.com/zevio/acl

ACL Anthology corpus sample

data dataset scholarly-articles

Last synced: 01 Mar 2026

https://github.com/zurd46/zurdsynthdatagen

This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).

data data-structures dataset electron json jsonl nodejs openai synthetic

Last synced: 04 Apr 2026

https://github.com/oliver021/helppad-net

Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities

assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks

Last synced: 15 Jun 2026

https://github.com/krakozaure/pyzzy

Set of packages to simplify development in Python

configuration data formats json library logging logs python3 toml utils yaml

Last synced: 14 Jan 2026

https://github.com/codbex/codbex-number-generator-data

Number Generator for Documents Module - Data

data module

Last synced: 05 Apr 2026

https://github.com/deliprofesor/health-score-prediction-model-the-impact-of-lifestyle-and-demographic-factors

A machine learning project predicting health scores based on lifestyle and demographic factors like age, BMI, diet, and exercise. Techniques include Random Forest, Polynomial Regression, and Linear Regression, with a focus on model performance and actionable health insights.

cross-validation data data-science data-visualization feature-engineering linear-regression machine-learning polynomial-regression random-forest

Last synced: 10 Apr 2025

https://github.com/stupidcucumber/elephant-crawler

System for mining texts from websites.

data data-mining-python python

Last synced: 25 Apr 2026

https://github.com/mbrsagor/mysql

MySql database command line

data mysql mysql-database sql

Last synced: 14 Jun 2025

https://github.com/mi7773/advanced_sql_data_analytics_project

A hands-on SQL project simulating data analysis using fact and dimension tables, covering trends over time, cumulative metrics, performance breakdowns, segmentation, and reporting via SQL.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics database query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 18 Apr 2026

https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly

Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool

blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly

Last synced: 18 Apr 2026

https://github.com/roshaka/samplr

Samplr is a Python decorator for selecting a subset of items from a list, with options for customisation and informative console printouts.

data data-analysis data-engineering decorators list python sampling

Last synced: 14 Jan 2026

https://github.com/cao7113/datalab

data lab and tools

data tool

Last synced: 18 Apr 2026

https://github.com/abdullahashfaqvirk/earth-engine-data-scraper

A Python based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research, and analysis purposes.

beautifulsoup data data-science python requests scraper web-scraping

Last synced: 10 May 2026

https://github.com/andykee/aurora

A lightweight tool for indexing, cataloging, and browsing data.

catalog data data-catalog data-discovery indexing metadata metadata-extraction search-and-discovery

Last synced: 17 Jan 2026

https://github.com/robthree/cfnreader

Provides a simple way to read FNIRSI's CFN files (*.cfn) produced by the FNIRSI UsbMeter tool

cfn csv data fnirsi usb usb-tester

Last synced: 01 Mar 2025

https://github.com/danielrosehill/global-value-factors-explorer-dataset

Derivative database of IFVI Global Value Factors for data analysis and visualization use cases.

data environmental-data sustainability-data

Last synced: 23 Feb 2026

https://github.com/nafisalawalidris/nafisalawalidris

Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and Bitcoin converge.

artifical-intelligence bitcoin config data data-science developer github-config github-pages machine-learning

Last synced: 16 May 2026

https://github.com/yeti-robotics/past-scouting-data

❄️ Scouting Data from Previous Events/Seasons ❄️

data first frc

Last synced: 06 Jan 2026

https://github.com/laguer/jupyt-nb

Mathematical and Physical Constants ratios in Cosmology and micro physics

analysis constants cosmology data dimensional julia mathematical micro notebook physical physics python ratios science

Last synced: 13 Apr 2026

https://github.com/openwashdata/ugabore

Borehole repair data from central Uganda associated with a project report completed by Joseph Lwere for the “data science for openwashdata” course

analysis borehole data open-data r uganda wash water

Last synced: 17 Jan 2026

https://github.com/kashifkhan7/cleaning-analysis_cli

Analyze sales data easily with our CLI app. Gain insights on revenue trends and visualize results using Python, Pandas, and Matplotlib. 🚀📊

conditional-statements css data datacleaning exception-handling exiftool html json matplotlib-pyplot metadata metadata-extraction pandas-python python sales-analysis seaborn-python speech-to-text transcription youtube

Last synced: 13 Apr 2026

https://github.com/ohspc89/better_call_jin

A repository containing mentoring materials for a Ph.D. student in Neuroscience

data matlab spss-statistics visualization visualization-tools wrangling-data

Last synced: 08 Oct 2025

https://github.com/fcoagz/rate-reader-epv

pyDolarVenezuela API utilities, image processing (EnParaleloVzla) to extract currency exchange rates from specific platforms, validating content against expected patterns

data finance json processing-images pydolarvenezuela

Last synced: 14 Jun 2025

https://github.com/nukopian/shell-flatten

Flatten a series into a single record

automation data shell

Last synced: 18 Jun 2025