Projects in Awesome Lists tagged with digitization
A curated list of projects in awesome lists tagged with digitization .
https://github.com/zenodotus280/apl-md
architecture digitization markdown obsidian
Last synced: 13 Apr 2026
https://github.com/dilawar/plotdigitizer
A Python utility to digitize plots.
data-extraction digitization image-processing python3
Last synced: 06 Apr 2025
https://github.com/ds4v/nomnaocr
Leverage Deep Learning to digitize old Vietnamese handwritten for historical document archiving (Made with national pride in every single line of code): https://www.kaggle.com/datasets/quandang/nomnaocr
digitization history nom ocr optical-character-recognition text-detection text-recognition vietnamese
Last synced: 11 Nov 2025
https://github.com/kitodo/kitodo-production
Kitodo.Production is a workflow management tool for mass digitization and is part of the Kitodo Digital Library Suite.
code4lib digital-library digitisation digitization java kitodo kitodo-production mass-digitization metadata-editor mets mets-xml mods mods-xml workflow-management-system xslt
Last synced: 05 Apr 2025
https://github.com/open-data-rescue/climate-data-rescue
Climate Data Rescue is an archival data rescue platform using Ruby on Rails.
archives citizen-science climate community data-rescue digitization hacktoberfest science transcription
Last synced: 20 Jul 2025
https://github.com/todrobbins/audio-digitization-toolkit
A list of resources for setting up an audio digitization workflow
analog audio digitization hardware workflow
Last synced: 04 Feb 2026
https://github.com/hadro/directory-pipeline
A pipeline for turning digital collections into structured data -- an LLM assisted, IIIF-native tool to jump into working with sources like digitized print directories.
annotations city-directories cultural-heritage digitization directories iiif layout-analysis ner ocr surya-ocr
Last synced: 20 Jun 2026
https://github.com/smithsonian/osprey
Dashboard that displays the file validation results in mass digitization projects. Digitization Program Office, OCIO, Smithsonian.
digitization digitization-workflows mass-digitization museum-collections python3
Last synced: 13 Apr 2025
https://github.com/olimic1000/vinylflow
Digitize vinyl 10x faster. Open source tool for splitting, tagging, and organizing vinyl recordings
aiff audio digitization discogs dj docker flac mp3 music open-source vinyl
Last synced: 02 May 2026
https://github.com/exploring-solver/catalogcraft
This is the repo for prototype of the Build for Bharat hackathon, by team Progmatic , MAIT
buildforbharat catalog digitization django kotlin mongodb nodejs ondc postgresql reactjs
Last synced: 26 Jun 2026
https://github.com/smithsonian/dpomd_dashboard
Dashboard to display general and detailed statistics of digitization projects by DPO Mass Digitization, OCIO, Smithsonian
dashboard digitization mass-digitization museums shiny shiny-apps
Last synced: 03 Sep 2025
https://github.com/efenstakes/lifeboat
Foster care management system server (fostering the fosters for the fostered ones)
digitization foster-care goverment government-project management ngo server
Last synced: 24 Jul 2025
https://github.com/ds4v/nomnasite
Deployment of NomNaOCR: https://youtu.be/o5xpfwalEWw
digitization history vietnamese
Last synced: 19 Oct 2025
https://github.com/smithsonian/massdigi-tools
Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO
data data-validation digital-files digitization digitization-workflows museums
Last synced: 13 Apr 2025
https://github.com/ohidurbappy/tabular-data-digitalizer
Project to convert handwritten tabular data to excel table
digitization firebase-mlkit ocr text-classification textrecognition
Last synced: 08 Oct 2025
https://github.com/mikidi/batch-dia-scan
Utility for batch scanning diapositives on a flatbed scanner in Linux
batch-scanner diapos diapositivas digitization epson linux sane scanning slides
Last synced: 29 Apr 2026
https://github.com/caltechlibrary/popstar
Phone-Oriented Processing SofTware for ARchives
archiving digitization document-processing iphone libraries scanning shortcuts-app workflow-automation
Last synced: 20 Mar 2026
https://github.com/oschulz/struckvmedevices.jl
[WIP] Julia Struck VME Digitizers Package
adc devices digitization instruments julia
Last synced: 11 Jul 2025
https://github.com/smithsonian/virtual-barcodes
R/Shiny app to display barcodes for filenames from a database. Digitization Program Office, OCIO
Last synced: 02 Aug 2025
https://github.com/berat02xz/3d-reconstruction-oko-bridge
3D photogrammetric reconstruction of the statue of "OKO Skopje," creating a virtual model showcasing the statue's appearance in a three-dimensional format, presented on a webpage
3d-reconstruction digitization photogrammetry project reality-capture
Last synced: 07 Feb 2026
https://github.com/kutukvpavel/plotdigitizer
A small helper for digitizing old scientific plots.
charts digitization digitizer image-processing plots
Last synced: 01 Aug 2025
https://github.com/rise-unibas/bildersammlung-buddhismus-public
Dataset of the University of Basel's research seminar "Indexing and Digital Processing of a Historical Image Collection on the Appropriation of Buddhism in the West"
digitization documentation humanities indexing python social-sciences transkribus tropy
Last synced: 17 Jan 2026
https://github.com/hensing/scan-to-paperless
A secure SMB bridge for hardware scanners to automatically upload documents to Paperless-ngx. Features multi-arch support (x86/ARM), non-root execution, and auto-archiving.
alpine-linux automation digitization dms docker inotify paperless-ngx raspberry-pi samba scan-to-paperless scanner scansnap self-hosted smb-shares
Last synced: 17 May 2026
https://github.com/speciesfilegroup/taxonworks_homepage
Homepage for TaxonWorks
biodiversity collections digitization species taxonomy taxonworks
Last synced: 11 Apr 2025
https://github.com/abikesa/p
Digitization at TVETs
agency-theory conflict contract digitization process programmability shirking uganda
Last synced: 25 Mar 2025
https://github.com/dbetchkal/public_dataglean_i
A Processing based tool to glean useful numeric results that are 'locked up' in a digital image
coordinate-systems digitization processing3 python reverse-engineering scientific-papers scientific-visualization
Last synced: 19 May 2026
https://github.com/smithsonian/osprey_worker
Script to check images from digitization projects and store in the Osprey database. Digitization Program Office, OCIO, Smithsonian.
digitization digitization-workflows exiftool imagemagick jhove mass-digitization museum-collections python3
Last synced: 16 Sep 2025
https://github.com/plantfce/text-to-pid
Text to P&ID: Going from process description to P&ID.
dexpi digital-transformation digitisation digitization digitization-workflows neo4j neo4j-graph pid plantfce python python3
Last synced: 01 May 2026
https://github.com/ajitagupta/user-login-page
A simple user login interface in angularjs
angularjs consulting digitization frontend-programming javascript-frameworks single-page-applications user-management-systems
Last synced: 22 Mar 2025
https://github.com/smithsonian/osprey_misc
Database, scripts, and other files related to the Osprey system. Digitization Program Office, OCIO, Smithsonian.
digitization mass-digitization museum-collections
Last synced: 06 Jul 2025
https://github.com/shashanksingh57/daily-planner-automation-suite
Complete automation system for digitizing handwritten daily planner pages with health analytics and Notion integration. Consolidates OCR, AI parsing, dashboard monitoring, and comprehensive health data analysis.
ai apple-vision automation daily-planner digitization health-analytics notion ocr openai python
Last synced: 15 Jul 2025
https://github.com/zyn10/remote-sensing-assessment
Remote_Sensing_Assessment_1
arc-map digitization geospatial gis-data remote-sensing remotesensing vector-data
Last synced: 27 Mar 2026
https://github.com/ljramones/martial_arts_ocr
Advanced OCR system for digitizing Donn Draeger's martial arts research materials. Features dual-engine OCR (Tesseract+EasyOCR), comprehensive Japanese text processing with romanization/translation, intelligent layout analysis, and academic-quality HTML reconstruction. Specialized for mixed English/Japanese documents with embedded diagrams.
computer-vision digitization easyocr historical-documents image-processsing japanese martial-arts mecab nlp ocr-recognition opencv tesseract
Last synced: 06 Mar 2026