Projects in Awesome Lists tagged with data-labeling
A curated list of projects in awesome lists tagged with data-labeling .
https://github.com/heartexlabs/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
annotation annotation-tool annotations boundingbox computer-vision data-labeling dataset datasets deep-learning image-annotation image-classification image-labeling image-labelling-tool label-studio labeling labeling-tool mlops semantic-segmentation text-annotation yolo
Last synced: 17 Aug 2025
https://github.com/humansignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
annotation annotation-tool annotations boundingbox computer-vision data-labeling dataset datasets deep-learning image-annotation image-classification image-labeling image-labelling-tool label-studio labeling labeling-tool mlops semantic-segmentation text-annotation yolo
Last synced: 01 Jun 2026
https://github.com/HumanSignal/label-studio?fbclid=IwAR30j2OmVMcB-TenAczkNwwUsObi8JAOpTNxGFzrmMrJ2pd4-gg_S0D3S78
Label Studio is a multi-type data labeling and annotation tool with standardized output format
annotation annotation-tool annotations boundingbox computer-vision data-labeling dataset datasets deep-learning image-annotation image-classification image-labeling image-labelling-tool label-studio labeling labeling-tool mlops semantic-segmentation text-annotation yolo
Last synced: 28 Apr 2025
https://github.com/HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
annotation annotation-tool annotations boundingbox computer-vision data-labeling dataset datasets deep-learning image-annotation image-classification image-labeling image-labelling-tool label-studio labeling labeling-tool mlops semantic-segmentation text-annotation yolo
Last synced: 26 Mar 2025
https://github.com/cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
active-learning annotation data-centric-ai data-cleaning data-curation data-labeling data-profiling data-quality data-science data-validation dataops dataquality datasets exploratory-data-analysis labeling llms noisy-labels out-of-distribution-detection outlier-detection weak-supervision
Last synced: 08 Jan 2026
https://github.com/doccano/doccano
Open source annotation tool for machine learning practitioners.
annotation-tool data-labeling dataset datasets machine-learning natural-language-processing nuxt nuxtjs python text-annotation vue vuejs
Last synced: 12 Jan 2026
https://github.com/heartexlabs/awesome-data-labeling
A curated list of awesome data labeling tools
3d-annotation annotation annotation-tool audio-annotation audio-annotation-tool awesome awesome-list bounding-box data-labeling deep-learning image-annotation image-labeling image-labeling-tool label-images label-videos labeling labeling-tool lidar semantic-segmentation video-annotation
Last synced: 23 Apr 2025
https://github.com/code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
active-learning annotations artificial-intelligence data-centric-ai data-labeling data-science deep-learning human-in-the-loop labeling labeling-tool machine-learning natural-language-processing neural-search nlp python spacy supervised-learning text-annotation text-classification transformers
Last synced: 14 May 2025
https://github.com/alteryx/compose
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
ai automl data-labeling data-science labeling labeling-tool machine-learning prediction-engineering prediction-problem training-data
Last synced: 14 May 2025
https://github.com/shoumikchow/bbox-visualizer
Make drawing and labeling bounding boxes easy as cake
annotation annotations bboxes bounding-box bounding-boxes boundingbox computer-vision computer-vision-tools cv data-labeling deep-learning image-annotation image-labeling image-labeling-tool labeling object-detection object-recognition python3
Last synced: 07 Apr 2025
https://github.com/davidjurgens/potato
potato: the portable annotation tool
agentic-ai agentic-workflow agents annotation annotation-tool audio data-labeling image labeling-tool nlp speech vision
Last synced: 02 Apr 2026
https://github.com/Slava/label-tool
Web application for image labeling and segmentation
boundingbox computer-vision computer-vision-tools data-labeling image-annotation image-label image-labeling image-labeling-tool labelme machine-learning segmentation sematic-segmentation training-data
Last synced: 06 Apr 2025
https://github.com/phurwicz/hover
:speedboat: Label data at scale. Fun and precision included.
annotation-tool audio-classification audio-labeling bokeh bulk-labeling data-labeling image-classification image-labeling labeling labeling-tool machine-learning supervised-learning text-classification text-labeling visualization
Last synced: 21 Feb 2026
https://github.com/dataqa/nlp-labelling
Labelling platform for text using weak supervision.
annotation-tool data-labeling data-science learning-with-limited-labeled-data learning-with-noisy-labels natural-language-processing ner nlp nlp-machine-learning pseudo-labeling search-engine text-annotation-tool text-classification text-mining weak-supervision
Last synced: 18 Feb 2026
https://github.com/samueldobbie/markup
A web-based document annotation tool, powered by GPT-4 :rocket:
active-learning annotation-tool data-labeling data-science gpt-4 machine-learning named-entity-recognition natural-language-processing ner nlp sequence-to-sequence text-annotation text-annotation-tool
Last synced: 15 Mar 2025
https://github.com/expectedparrot/edsl
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
anthropic data-labeling deepinfra domain-specific-language experiments llama2 llm llm-agent llm-framework llm-inference market-research mixtral open-source openai python social-science surveys synthetic-data
Last synced: 15 May 2025
https://github.com/HumanSignal/label-studio-transformers
Label data using HuggingFace's transformers and automatically get a prediction service
bert data-labeling label-studio natural-language-processing natural-language-understanding nlp pytorch-transformers text-labeling transformers
Last synced: 19 Jul 2025
https://github.com/humansignal/label-studio-transformers
Label data using HuggingFace's transformers and automatically get a prediction service
bert data-labeling label-studio natural-language-processing natural-language-understanding nlp pytorch-transformers text-labeling transformers
Last synced: 20 Aug 2025
https://github.com/gereleth/jupyter-bbox-widget
A Jupyter widget for annotating images with bounding boxes
annotations bbox bounding-boxes data-labeling image-annotation jupyter jupyter-widget labeling-tool notebook python
Last synced: 13 Apr 2025
https://github.com/villagecomputing/superpipe
Superpipe - optimized LLM pipelines for structured data
classification data-extraction data-labeling llm llm-evaluation llm-optimization structured-data
Last synced: 04 Apr 2026
https://github.com/doccano/doccano-client
A simple client for doccano API.
active-learning annotation api-client api-wrapper data-labeling dataset doccano machine-learning natural-language-processing python text-annotation upload-file
Last synced: 05 Apr 2025
https://github.com/cyberagent/fast-annotation-tool
FAST is an annotation tool that focuses on mobile devices. https://aclanthology.org/2021.emnlp-demo.41/
annotation-tool data-labeling dataset firebase google-cloud machine-learning natural-language-processing react text-annotation
Last synced: 15 Jun 2025
https://github.com/zhenye-na/crnn-pytorch
βοΈ Convolutional Recurrent Neural Network in Pytorch | Text Recognition
computer-vision crnn-ocr data-augmentations data-labeling deep-learning emnist-dataset full-stack iam-lines-dataset natural-language-processing serving-predictions text-recognition
Last synced: 02 May 2025
https://github.com/datagym-ai/datagym-core
Open source annotation and labeling tool for image and video assets
annotation annotations bounding-box computer-vision data-labeling dataset image-annotation image-labeling image-labeling-tool label-images label-videos labeling labeling-tool semantic-segmentation video-annotation video-labeling
Last synced: 17 Jan 2026
https://github.com/doccano/auto-labeling-pipeline
doccano auto labeling pipeline helps doccano to annotate a document automatically.
annotation-tool data-labeling doccano machine-learning natural-language-processing python text-annotation
Last synced: 20 Sep 2025
https://github.com/megagonlabs/tagruler
Data programming by demonstration for information extraction and span annotation
data-labeling data-programming data-programming-by-demonstration machine-learning weak-supervision
Last synced: 23 Apr 2025
https://github.com/megagonlabs/ruler
Data Programming by Demonstration (DPBD) for Document Classification
data-labeling data-programming data-science machine-learning training-data weak-supervision
Last synced: 07 Jul 2025
https://github.com/microsoft/OneLabeler
A system for building labeling tools
annotation data-labeling interactive-machine-learning visual-programming
Last synced: 05 Apr 2025
https://github.com/explosion/vscode-prodigy
𧬠A VS Code extension for annotating data with Prodigy
annotation-tool data-annotation data-labeling data-labeling-tools data-science labeling-tool nlp prodigy spacy vscode vscode-extension
Last synced: 19 Oct 2025
https://github.com/cleanlab/cleanlab-studio
Client interface to Cleanlab Studio and the Trustworthy Language Model
annotations automl computer-vision data-centric-ai data-cleaning data-curation data-labeling data-profiling data-quality data-science data-validation image-classification llm machine-learning model-deployment natural-language-processing noisy-labels outlier-detection structured-data text-classification
Last synced: 13 Apr 2025
https://github.com/segments-ai/segments-ai
Segments.ai Python SDK
annotation computer-vision data-labeling dataset deep-learning image-annotation labeling-tool panoptic-segmentation pointcloud pointcloud-detection pointcloud-segmentation robotics semantic-segmentation
Last synced: 08 Apr 2026
https://github.com/smrfeld/dash-annotate-cv
Dash components for computer vision annotation tasks
annotations computer-vision dash data-labeling machine-learning plotly plotly-dash
Last synced: 07 Apr 2025
https://github.com/kenken64/examtopics-data-labeler
π ExamTopics Data Labeler Project Summary π― Project Overview The ExamTopics Data Labeler is a comprehensive web application designed for IT certification preparation and management. It combines PDF processing, AI-powered question generation, secure authentication, and sophisticated data management into a unified platform.
data-labeling nextjs nodejs ocr openai pytho react
Last synced: 11 Jun 2026
https://github.com/nisheethjaiswal/Data-Annotator-for-SpaCy
πSpAnnor annotator for Named Entity Recognition easy to use tool. The annotator allows users to quickly assign custom labels to one or more entities in the text. Easy to setup for Data Training for SpaCy π₯.
data-annotation data-annotation-tools data-labeling data-preparation named-entity-recognition nlp spacy-nlp text-labeling
Last synced: 06 Aug 2025
https://github.com/nikitaignatov/csvninja
Tool for annotation and labeling of the time series sensor data for the purpose of machine learning.
data-annotation data-labeling labeling labeling-tool sensor-data time-series
Last synced: 09 Oct 2025
https://github.com/yasho191/SwiftAnnotate
Auto labelling tool for Text, Image, Video
automation computer-vision data-labeling llms nlp vlms
Last synced: 05 Mar 2026
https://github.com/jaxony/action-annotator
Bug reports and feature requests for macOS 11+ video action classification app Action Annotator.
action-classification action-classifier ai apple create-ml data-labeling machine-learning macos ml swift video video-processing
Last synced: 13 Feb 2026
https://github.com/ismailuddin/iris
π₯ Web platform for easy labelling and management of π image data labelling
data-labeling data-management machine-learning python
Last synced: 01 Apr 2026
https://github.com/kennethwussmann/caption.now
Quickly and efficiently caption your image dataset for AI training
ai annotation annotations captioning captioning-images data-labeling dataset dataset-generation datasets image-classification image-labeling image-labelling-tool labeling labeling-tool offline-first progressive-web-app pwa
Last synced: 14 Jun 2025
https://github.com/strickvl/panlabel
Universal annotation converter
annotation annotation-conversion converter data-annotation data-labeling
Last synced: 15 Mar 2026
https://github.com/pv-bhat/meta-labeler
An intuitive conversation labeling tool for extracting insights from sales and customer interaction data, complete with customizable metrics and segmentation control.
conversation-analysis conversational-ai data-labeling meta-tags python-gui-tkinter
Last synced: 03 Aug 2025
https://github.com/twitech/graph-modelling-and-community-detection
This research seeks to explore the discussions surrounding JAK inhibitors on Reddit by utilizing graph modeling and community detection techniques through the application of NetworkX and the Louvain algorithm.
data-labeling graph-model llms louvain-algorithm networkx openapi
Last synced: 27 Apr 2026
https://github.com/cleanlab/tlm
Score the trustworthiness of outputs from any LLM in real-time
ai-agents ai-safety confidence-estimation data-extraction data-labeling error-detection evals evaluation guardrails hallucination hallucination-detection human-in-the-loop-ai llm llm-as-a-judge llm-evaluation rag structured-outputs trustworthy-ai uncertainty-quantification verifiers
Last synced: 23 Feb 2026
https://github.com/monatis/label-snd
Easily label sound datasets!
data-labeling machine-learning sound
Last synced: 10 Oct 2025
https://github.com/multitagging/multitagging
A vulnerable Ethereum smart contract labeling framework
analysis-tools blockchain data-labeling ethereum smart-contracts vulnerabilities
Last synced: 26 Oct 2025
https://github.com/michaelholm6/yoloez
An all-in-one GUI tool for labeling, training, and running YOLO11 models. Built with Python and Ultralytics, this tool simplifies dataset creation, model training, and inference.
computer-vision data-labeling deep-learning-tool gui-application image-annotation inference model-training object-detection python ultralytics yolo
Last synced: 06 Feb 2026
https://github.com/1sarthakbhardwaj/labellerr-mcp-server
MCP Server for Labellerr SDK - Manage annotation projects, monitor progress, and query data via natural language with any MCP Client
annotation api claude-desktop data-labeling labellerr machine-learning mcp sdk
Last synced: 16 May 2026
https://github.com/humansignal/label-studio-plugins
Plugins to extend Label Studio with custom workflows, integrations, and UI components.
annotation-tool boundingbox computer-vision data-labeling dataset datasets deep-learning image-annotation image-classification image-labeling image-labelling-tool label-studio labeling labeling-tool mlops semantic-segmentation text-annotation yolo
Last synced: 29 Oct 2025
https://github.com/shprintsin/defactor
R package for dynamically managing, annotating, and retrieving variable labels in data frames. It provides functions to efficiently set and extract labels based on patterns, column selections, or custom rules, enhancing data cleaning and analysis workflows.
data-annotation data-cleaning data-labeling data-manipulation r r-package
Last synced: 02 Apr 2025
https://github.com/hk669/yolov5
Object Detection Using YOLO with Self-Trained Data
data-labeling labelimg object-detection training yolov5
Last synced: 03 Jan 2026
https://github.com/rasitayaz/data-labeling-system
Gives informative labels to data using various algorithms
data-labeling machine-learning
Last synced: 11 Apr 2025
https://github.com/nv78/anote-ai
Reference Page for Anote
agents ai data-labeling deep-learning evaluation machine-learning nlp reinforcement-learning
Last synced: 12 Jun 2026
https://github.com/ayushsubedi/labelagree
(WIP) Sahamati is a consensus-based data labeling tool. Looking for contributors.
consensus data-labeling django
Last synced: 13 Jun 2026