An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-set

A curated list of projects in awesome lists tagged with data-set .

https://github.com/gridaco/ui-dataset

A pre labelled dataset for ui element / layout detection

data-set labelling layout-detection reflect ui ui-dataset

Last synced: 01 Mar 2025

https://github.com/mcosovic/matgrid

MATGRID is an easy-to-use power system simulation tool for researchers and educators provided as a MATLAB package.

bad-data data-set gauss-newton-method least-absolute-value measurements newton-raphson observability outlier-detection phasor-measurement-unit power-flow power-systems state-estimation weighted-least-squares

Last synced: 01 Aug 2025

https://github.com/das-group/rba-dataset

Login feature data of more than 33M login attempts and 3M users (IP, UA, RTT)

authentication data-set ip-address login-data risk-based-authentication round-trip-time security-testing user-agent

Last synced: 02 Mar 2026

https://github.com/michidk/myo-dataset

This repository contains sEMG Data of 13 subjects recorded with the Myo Armband.

data-set dataset gesture-detection gesture-recognition myo myo-armband rock-paper-scissors semg

Last synced: 20 Jan 2026

https://github.com/andreped/adverse-events

IEEE BIBM 2021: Bayesian optimization-guided topic modeling for automatic detection of sepsis-related events from free text

adverse-events bayesian-optimization classification data-set detection ieee-bibm latent-dirichlet-allocation lda machine-learning natural-language-processing sepsis

Last synced: 13 Apr 2025

https://github.com/timpulver/netlabel-list

A list of active and inactive netlabels in JSON-format

data data-set json label list music netaudio netlabel

Last synced: 12 May 2025

https://github.com/smappnyu/twitter_elections_public_interest

Dataset of public interest interventions on Twitter for politicians and candidates during the 2020 US General Elections

2020-election content-moderation data-set dataset datasets elections open-source political-science politicians politics social-media twitter

Last synced: 17 Feb 2026

https://github.com/architsingh15/perceptron-algorithm-from-scratch-sonar-dataset

Analysis of the Sonar Dataset using algorithm designed from scratch

analysis data-set neural-network perceptron python

Last synced: 11 Mar 2026

https://github.com/mjethani/scaling-palm-tree

A data set for the Cliqz ad blocker benchmarks

ad-blocking data-set

Last synced: 27 Feb 2026

https://github.com/bazilsuhail/resume-dataset

Resume-Dataset is a Python-based project to generate PDF CVs from a LaTeX template and a CSV dataset, designed for creating a structured dataset for training LayoutLM, a model for document understanding.

data-set latex-document latex-resume-template resume-builder resume-dataset resume-template

Last synced: 30 Aug 2025

https://github.com/sap/task-execution-data-set

The data sets contained in this repository comprise workload metadata of a general task execution platform. The metadata consists of statically defined properties and runtime data.

data-set workload-data

Last synced: 16 Aug 2025

https://github.com/olivr/sql-utility-data

General utility data in relational SQL format for countries, languages, currencies (ISO 3166-1, UN M49, ISO 639-1, ISO 4217)

countries currencies data-set iso-3166-1 iso-4217 iso-639-1 languages postgresql sql un-m49

Last synced: 05 Apr 2025

https://github.com/devinschumacher/serp.media

Movies & TV Show Data / Content for SERP Media

data-set dataset movie-data-analysis movies

Last synced: 14 Oct 2025

https://github.com/kadocolak/turkey-football-super-league-json-data-set

Json data set including general share of Turkey Super and 1st League football teams

data-set json web web-service

Last synced: 09 Apr 2025

https://github.com/michaelfromyeg/data

Data set dump.

data data-set

Last synced: 16 Jan 2026

https://github.com/zelon88/motorized_bike_data

A repo to contain data in various formats related to motorized bicycle configurations.

bicycle bikes data data-set engine w

Last synced: 05 Mar 2026

https://github.com/hrushikesh1058/sub-terahertz-meta-stickers-for-non-invasive-food-sensing-using-machine-learning

Major motive of this project is to find whether the selected fruit ( Apple) is Ripen or medium Ripen or Rotten Using Machine Learning

ansys-hfss data-set jupyter-notebook machine-learning

Last synced: 21 Aug 2025

https://github.com/andresmpa/todo-app

This is the classic TODO app crafted for browser, with a couple of extra features like, localstorage, drag&drop and themes

data-set drag-and-drop html5 javascript localstorage stylus-css stylus-lang

Last synced: 17 Oct 2025

https://github.com/sh4n1d/power-bi

A repository showcasing various data visualizations and dashboards created using Microsoft Power BI. The repository includes interactive reports and analyses, demonstrating my proficiency in transforming data into insightful visualizations, enabling data-driven decision-making.

data-set power-bi power-bi-dashboard report-design

Last synced: 02 Feb 2026

https://github.com/sagargaud01/ai-driven-media-investment-plan-

AI-Driven Media Investment Plan Across Channels for E-commerce

ai business-intelligence data-set juypter python seaborn training training-data

Last synced: 26 Feb 2025

https://github.com/mahmoudwal27/e-commerce-data-analysis

A collection of data analysis and visualization projects focused on ecommerce datasets. Using Python in Google Colab for analysis and Excel for exploration, these projects uncover key insights and trends, showcasing expertise in data manipulation and visualization to inform business decisions.

analytics data-analysis data-analysis-python data-set google-cloud python

Last synced: 08 Mar 2025

https://github.com/vitonsky/iso-639_langs

Repository contains files with ISO-639 language codes in JSON format

data-science data-set iso-639 iso-639-1 iso-639-2 iso-639-3 parsed-data

Last synced: 05 Nov 2025