An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with digitization

A curated list of projects in awesome lists tagged with digitization .

https://github.com/dilawar/plotdigitizer

A Python utility to digitize plots.

data-extraction digitization image-processing python3

Last synced: 06 Apr 2025

https://github.com/ds4v/nomnaocr

Leverage Deep Learning to digitize old Vietnamese handwritten for historical document archiving (Made with national pride in every single line of code): https://www.kaggle.com/datasets/quandang/nomnaocr

digitization history nom ocr optical-character-recognition text-detection text-recognition vietnamese

Last synced: 11 Nov 2025

https://github.com/kitodo/kitodo-production

Kitodo.Production is a workflow management tool for mass digitization and is part of the Kitodo Digital Library Suite.

code4lib digital-library digitisation digitization java kitodo kitodo-production mass-digitization metadata-editor mets mets-xml mods mods-xml workflow-management-system xslt

Last synced: 05 Apr 2025

https://github.com/open-data-rescue/climate-data-rescue

Climate Data Rescue is an archival data rescue platform using Ruby on Rails.

archives citizen-science climate community data-rescue digitization hacktoberfest science transcription

Last synced: 20 Jul 2025

https://github.com/todrobbins/audio-digitization-toolkit

A list of resources for setting up an audio digitization workflow

analog audio digitization hardware workflow

Last synced: 04 Feb 2026

https://github.com/hadro/directory-pipeline

A pipeline for turning digital collections into structured data -- an LLM assisted, IIIF-native tool to jump into working with sources like digitized print directories.

annotations city-directories cultural-heritage digitization directories iiif layout-analysis ner ocr surya-ocr

Last synced: 20 Jun 2026

https://github.com/smithsonian/osprey

Dashboard that displays the file validation results in mass digitization projects. Digitization Program Office, OCIO, Smithsonian.

digitization digitization-workflows mass-digitization museum-collections python3

Last synced: 13 Apr 2025

https://github.com/olimic1000/vinylflow

Digitize vinyl 10x faster. Open source tool for splitting, tagging, and organizing vinyl recordings

aiff audio digitization discogs dj docker flac mp3 music open-source vinyl

Last synced: 02 May 2026

https://github.com/exploring-solver/catalogcraft

This is the repo for prototype of the Build for Bharat hackathon, by team Progmatic , MAIT

buildforbharat catalog digitization django kotlin mongodb nodejs ondc postgresql reactjs

Last synced: 26 Jun 2026

https://github.com/smithsonian/dpomd_dashboard

Dashboard to display general and detailed statistics of digitization projects by DPO Mass Digitization, OCIO, Smithsonian

dashboard digitization mass-digitization museums shiny shiny-apps

Last synced: 03 Sep 2025

https://github.com/efenstakes/lifeboat

Foster care management system server (fostering the fosters for the fostered ones)

digitization foster-care goverment government-project management ngo server

Last synced: 24 Jul 2025

https://github.com/ds4v/nomnasite

Deployment of NomNaOCR: https://youtu.be/o5xpfwalEWw

digitization history vietnamese

Last synced: 19 Oct 2025

https://github.com/smithsonian/massdigi-tools

Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO

data data-validation digital-files digitization digitization-workflows museums

Last synced: 13 Apr 2025

https://github.com/ohidurbappy/tabular-data-digitalizer

Project to convert handwritten tabular data to excel table

digitization firebase-mlkit ocr text-classification textrecognition

Last synced: 08 Oct 2025

https://github.com/mikidi/batch-dia-scan

Utility for batch scanning diapositives on a flatbed scanner in Linux

batch-scanner diapos diapositivas digitization epson linux sane scanning slides

Last synced: 29 Apr 2026

https://github.com/oschulz/struckvmedevices.jl

[WIP] Julia Struck VME Digitizers Package

adc devices digitization instruments julia

Last synced: 11 Jul 2025

https://github.com/smithsonian/virtual-barcodes

R/Shiny app to display barcodes for filenames from a database. Digitization Program Office, OCIO

digitization r-shiny

Last synced: 02 Aug 2025

https://github.com/berat02xz/3d-reconstruction-oko-bridge

3D photogrammetric reconstruction of the statue of "OKO Skopje," creating a virtual model showcasing the statue's appearance in a three-dimensional format, presented on a webpage

3d-reconstruction digitization photogrammetry project reality-capture

Last synced: 07 Feb 2026

https://github.com/kutukvpavel/plotdigitizer

A small helper for digitizing old scientific plots.

charts digitization digitizer image-processing plots

Last synced: 01 Aug 2025

https://github.com/rise-unibas/bildersammlung-buddhismus-public

Dataset of the University of Basel's research seminar "Indexing and Digital Processing of a Historical Image Collection on the Appropriation of Buddhism in the West"

digitization documentation humanities indexing python social-sciences transkribus tropy

Last synced: 17 Jan 2026

https://github.com/hensing/scan-to-paperless

A secure SMB bridge for hardware scanners to automatically upload documents to Paperless-ngx. Features multi-arch support (x86/ARM), non-root execution, and auto-archiving.

alpine-linux automation digitization dms docker inotify paperless-ngx raspberry-pi samba scan-to-paperless scanner scansnap self-hosted smb-shares

Last synced: 17 May 2026

https://github.com/dbetchkal/public_dataglean_i

A Processing based tool to glean useful numeric results that are 'locked up' in a digital image

coordinate-systems digitization processing3 python reverse-engineering scientific-papers scientific-visualization

Last synced: 19 May 2026

https://github.com/smithsonian/osprey_worker

Script to check images from digitization projects and store in the Osprey database. Digitization Program Office, OCIO, Smithsonian.

digitization digitization-workflows exiftool imagemagick jhove mass-digitization museum-collections python3

Last synced: 16 Sep 2025

https://github.com/smithsonian/osprey_misc

Database, scripts, and other files related to the Osprey system. Digitization Program Office, OCIO, Smithsonian.

digitization mass-digitization museum-collections

Last synced: 06 Jul 2025

https://github.com/shashanksingh57/daily-planner-automation-suite

Complete automation system for digitizing handwritten daily planner pages with health analytics and Notion integration. Consolidates OCR, AI parsing, dashboard monitoring, and comprehensive health data analysis.

ai apple-vision automation daily-planner digitization health-analytics notion ocr openai python

Last synced: 15 Jul 2025

https://github.com/ljramones/martial_arts_ocr

Advanced OCR system for digitizing Donn Draeger's martial arts research materials. Features dual-engine OCR (Tesseract+EasyOCR), comprehensive Japanese text processing with romanization/translation, intelligent layout analysis, and academic-quality HTML reconstruction. Specialized for mixed English/Japanese documents with embedded diagrams.

computer-vision digitization easyocr historical-documents image-processsing japanese martial-arts mecab nlp ocr-recognition opencv tesseract

Last synced: 06 Mar 2026