Projects in Awesome Lists tagged with code4lib
A curated list of projects in awesome lists tagged with code4lib.
https://github.com/cdk/cdk
The Chemistry Development Kit
bioinformatics blueobelisk cheminformatics chemistry code4lib java
Last synced: 21 Oct 2025
https://github.com/miku/esbulk
Bulk indexing command line tool for elasticsearch.
code4lib elasticsearch hacktoberfest indexing
Last synced: 16 May 2025
https://github.com/natlibfi/skosmos
Thesaurus and controlled vocabulary browser using SKOS and SPARQL
code4lib php skos sparql vocabularies
Last synced: 06 Feb 2026
https://github.com/WDscholia/scholia
Wikidata-based scholarly profiles
bibliography bibliometrics bibtex citations code4lib datacuration dataviz fairdata hacktoberfest latex linked-open-data literature scientometrics sparql wikicite wikidata
Last synced: 27 Mar 2025
https://github.com/natlibfi/annif
Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.
annif annotation-tool classification code4lib connexion flask-application glam machine-learning multilabel-classification python rest-api subject-indexing text-classification
Last synced: 04 Apr 2025
https://gwu-libraries.github.io/sfm-ui/
Social Feed Manager user interface application.
code4lib social-feed-manager social-media web-archiving
Last synced: 22 Apr 2025
https://github.com/gwu-libraries/sfm-ui
Social Feed Manager user interface application.
code4lib social-feed-manager social-media web-archiving
Last synced: 08 Apr 2025
https://github.com/hectorcorrea/marcli
MARC command line parser
code4lib marc marc-files marc-xml marc21
Last synced: 05 Apr 2025
https://github.com/miku/metha
Command line OAI-PMH harvester and client with built-in cache.
code4lib hacktoberfest harvest oai
Last synced: 04 Jan 2026
https://github.com/tw4l/brunnhilde
Siegfried-based characterization tool for directories and disk images
archives code4lib digital-preservation disk-image format-identification pronom
Last synced: 07 May 2025
https://github.com/opencultureconsulting/openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
bash-script batch-processing code4lib docker etl openrefine
Last synced: 08 Apr 2025
https://github.com/opencultureconsulting/openrefine-client
The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the command line interface (CLI) and is distributed as a convenient one-file-executable (Windows, Linux, Mac). It is also available via Docker Hub, PyPI and Binder.
binder code4lib docker etl openrefine pypi python
Last synced: 07 Apr 2025
https://github.com/hectorcorrea/solr-for-newbies
Materials for the Solr for newbies workshop
code4lib solr tutorial workshop
Last synced: 22 Feb 2025
https://github.com/justinlittman/fbarc
A command-line tool and Python library for archiving data from Facebook using the Graph API.
Last synced: 22 Jul 2025
https://github.com/metafacture/metafacture-core
Core package of the Metafacture tool suite for metadata processing.
code4lib etl fix java-library metadata-processing metafacture metafix metamorph
Last synced: 11 Jan 2026
https://github.com/ropensci/roadoi
Use Unpaywall with R
altmetrics code4lib oadoi open-access peer-reviewed r r-package rstats unpaywall webclient
Last synced: 22 Oct 2025
https://github.com/kitodo/kitodo-production
Kitodo.Production is a workflow management tool for mass digitization and is part of the Kitodo Digital Library Suite.
code4lib digital-library digitisation digitization java kitodo kitodo-production mass-digitization metadata-editor mets mets-xml mods mods-xml workflow-management-system xslt
Last synced: 05 Apr 2025
https://github.com/ub-mannheim/zotkat
Zotero extension for cataloguing
cataloguing code4lib libraries wikicite zotero
Last synced: 13 Apr 2025
https://github.com/miku/solrbulk
SOLR bulk indexing utility for the command line.
Last synced: 24 Aug 2025
https://github.com/LibreCat/LibreCat
A publication management system
catmandu code4lib dancer elasticsearch perl
Last synced: 02 Sep 2025
https://github.com/gbv/cocoda
A web-based tool for creating mappings between knowledge organization systems.
code4lib coli-conc service ts4nfdi web-application
Last synced: 24 Feb 2025
https://github.com/natlibfi/annif-tutorial
Instructions, exercises and example data sets for Annif hands-on tutorial
annif code4lib glam machine-learning multilabel-classification open-educational-resources subject-indexing text-classification tutorial workshop
Last synced: 31 Jul 2025
https://github.com/kitodo/kitodo-presentation
Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.
alto alto-xml apache-solr code4lib digital-library iiif kitodo kitodo-presentation mets mets-xml mods mods-xml openlayers php solarium solr tei tei-xml typo3
Last synced: 04 Apr 2025
https://github.com/ubleipzig/solrdump
Export SOLR documents efficiently with cursors.
Last synced: 18 Jun 2025
https://github.com/atomotic/archiviiify
Download digitized books from Internet Archive and view with IIIF, locally and offline.
code4lib digital-libraries iiif internetarchive
Last synced: 21 Apr 2025
https://github.com/benibela/videlibri
Cross-platform library client to automate any OPAC and library catalog from your local device, e.g. for renewing borrowed books or searching for available books in automated scripts.
bibliography bibliothekskatalog bibtex book-information catalog catalogue cdvost code4lib codingdavinci koha librarian libraries library library-catalogue opac open-library public-libraries public-library voebb vufind
Last synced: 05 Mar 2025
https://github.com/slub/urnlib
Java library for representing, parsing and encoding URNs as in RFC2141 and RFC8141 (Maintained by @claussni)
Last synced: 11 Apr 2025
https://librarycarpentry.github.io/Top-10-FAIR/
Top 10 FAIR Data & Software Things
code4lib fair-data fair-principles fair-software library-carpentry research-software-engineering
Last synced: 25 Sep 2025
https://github.com/natlibfi/bib-rdf-pipeline
Scripts and configuration for converting MARC bibliographic records into RDF
Last synced: 10 Jul 2025
https://github.com/ub-mannheim/malibu
Mannheim library utilities
bnbweekly book-information code4lib isbn mashup
Last synced: 08 Jul 2025
https://github.com/tw4l/metsflask
A web application for human-friendly exploration of Archivematica METS files
archivematica code4lib mets-xml
Last synced: 21 Apr 2025
https://github.com/natlibfi/fingreylit
Data set of Finnish grey literature, containing curated Dublin Core style metadata and links to original PDF publications
code4lib dataset glam grey-literature metadata pdf
Last synced: 10 Jul 2025
https://github.com/code4lib/code4lib.github.io
Jekyll static site for Code4Lib.org
Last synced: 28 Oct 2025
https://github.com/opencultureconsulting/openrefine-docker
OpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.
code4lib docker etl openrefine server
Last synced: 02 Feb 2026
https://github.com/miku/siskin
Tasks around metadata.
code4lib library luigi luigi-pipeline metadata
Last synced: 11 Apr 2025
https://github.com/httpreserve/httpreserve
Digital Preservation of HTTP in documentary heritage.
archives code4lib digipres digital-repositories digitalpreservation documentary-heritage internetarchive wayback waybackmachine webarchiving
Last synced: 17 Jan 2026
https://github.com/jorol/processing-marc
Processing MARC with open source tools
Last synced: 11 Oct 2025
https://github.com/opencultureconsulting/orcli
OpenRefine command-line interface written in Bash (💎+🤖). Supports batch processing (import, transform, export).
bash bashly batch cli code4lib etl openrefine openrefine-api
Last synced: 02 Feb 2026
https://github.com/gbv/daia
Specification of Document Availability Information (DAIA)
api availability code4lib specification
Last synced: 03 Oct 2025
https://github.com/scriptotek/php-sru-client
PHP client for Search/Retrieve via URL (SRU)
Last synced: 11 Jan 2026
https://github.com/miku/span
Span formats.
bibliographic code4lib json metadata xml
Last synced: 11 Apr 2025
https://github.com/corylown/edtf-humanize
This gem adds a humanize method to EDTF dates.
Last synced: 09 Feb 2026
https://github.com/gbv/paia
Specification of Patrons Account Information API (PAIA)
Last synced: 24 Feb 2025
https://github.com/kitodo/kitodo-publication
Kitodo.Publication
code4lib fedora-repository kitodo kitodo-publication mets mets-xml mods mods-xml repository repository-management typo3
Last synced: 20 Aug 2025
https://github.com/wragge/trovebuildabot
Build your own Trove collection Twitter bot.
code4lib troveaustralia twitter-bot
Last synced: 28 Oct 2025
https://github.com/miku/ntto
Small N-Triples to line-delimited JSON converter and prefix cutter.
Last synced: 12 Jul 2025
https://github.com/hbz/lobid-resources
Transformation, web frontend, and API for the hbz catalog as LOD
api code4lib etl gruppe-offene-infrastruktur lobid lod
Last synced: 24 Jan 2026
https://github.com/ksclarke/jiiify-presentation
A Java IIIF Presentation library
code4lib iiif iiif-presentation java
Last synced: 27 Jul 2025
https://github.com/httpreserve/linkstat
CLI implementation of httpreserve that can test links and retrieve internet archive replacements
archives cli code4lib digipres digital-preservation glam internet-archive link-checker wayback-machine web-archiving
Last synced: 17 Jan 2026
https://github.com/httpreserve/tikalinkextract
Tika based link (URL) extractor for httpreserve
archives code4lib digitalpreservation httpreserve iipc tika tika-wrapper url-extractor webarchiving
Last synced: 17 Jan 2026
https://github.com/miku/issnlister
List of valid, registered ISSN
code4lib harvest issn metadata
Last synced: 11 Apr 2025
https://github.com/miku/esdump
Stream documents from elasticsearch with scroll (and HTTP GET only)
code4lib command-line-tool elasticsearch
Last synced: 11 Apr 2025
https://github.com/tw4l/brunnhilde-gui
Graphical user interface for Brunnhilde
archives brunnhilde code4lib digital-preservation
Last synced: 21 Apr 2025
https://github.com/digipres/policies
Digital preservation policies and strategies
archives code4lib digipres digital-preservation policy strategy
Last synced: 17 Jan 2026
https://github.com/uhlibraries-digital/carpenters
Digital preservation workflow utility
code4lib electron metadata preservation
Last synced: 03 Apr 2025
https://github.com/ffdev-info/jsonid
Identification of JSON (JSONL, YAML, and TOML) objects: JSONID
archives code4lib digipres digital-preservation file-formats format-identification glam json jsonl toml yaml
Last synced: 04 Jan 2026
https://github.com/librecat/catmandu-marc
Catmandu modules for working with MARC data
Last synced: 04 May 2025
https://github.com/opencultureconsulting/oai-pmh2
This is a stand-alone OAI-PMH 2.0 data provider. It serves records in any XML metadata format from a SQL database, supports deleted records, resumption tokens and sets.
code4lib data-provider oai oai-pmh php repository
Last synced: 22 Jan 2026
https://github.com/ub-mannheim/vmabookshelf
Creates a virtual bookshelf; includes a proxy script for connecting to other web pages (ebooks) and the Firefox add-on vMaBookShelfHelper.
Last synced: 05 Jul 2025
https://github.com/ross-spencer/sumfolder1
What is the checksum of a directory?
authenticity checksum code4lib digipres digital-preservation merkle-tree pronom
Last synced: 24 Dec 2025
https://github.com/atomotic/isbn-authors
Get author identifiers (VIAF and Wikidata QID) from an ISBN
Last synced: 21 Apr 2025
https://github.com/natlibfi/fintoai
Finto AI suggests subjects for a given text. It's based on Annif, a tool for automated subject indexing.
annif classification code4lib glam multilabel-classification rest-api subject-indexing text-classification
Last synced: 26 Oct 2025
https://github.com/artefactual-labs/pygfried
Siegfried as a Python extension
code4lib digital-preservation format-identification pronom python siegfried
Last synced: 07 May 2025
https://github.com/code4lib/2020.code4lib.org
Website for Code4Lib 2020 in Pittsburgh, PA
Last synced: 04 Aug 2025
https://github.com/peterk/pimmer
Exploratory code for PDF image mining
code4lib datamining humanities image-analysis image-mining opencv
Last synced: 12 Apr 2025
https://github.com/europeana/annotation
Source code for the Europeana Annotations API.
annotation api code4lib cultural-heritage europeana
Last synced: 20 Nov 2025
https://github.com/librecat/docker-catmandu
Docker image for the Catmandu data toolkit
catmandu code4lib docker docker-compose
Last synced: 04 May 2025
https://github.com/miku/wikitools
A few tools for working with Wikipedia XML dumps.
Last synced: 11 Apr 2025
https://github.com/atomotic/epub-linkchecker
Extract and check links from an EPUB file
Last synced: 21 Apr 2025
https://github.com/tvirolai/clj-isbn
An ISBN utility library for Clojure
bibliography books clojure code4lib conversion identifiers isbn
Last synced: 22 Oct 2025
https://github.com/gbv/jskos-cli
Command Line Applications to process JSKOS data format
Last synced: 04 Jul 2025
https://github.com/gbv/catmandu-pica
Catmandu modules for working with PICA+ data
Last synced: 03 Feb 2026
https://github.com/kaitlinnewson/c4l18-workshop-staticweb
Pre-conference workshop for Code4Lib 2018 on static site generators
code4lib reveal-js static-site-generation
Last synced: 12 Apr 2025
https://github.com/gbv/k10plusbot
Interlink Wikidata and K10Plus library union catalog
Last synced: 23 Apr 2025
https://github.com/wragge/recordsearch-functions
Exploring functions in the National Archives of Australia's RecordSearch database
archives code4lib recordsearch
Last synced: 04 Aug 2025
https://github.com/kitodo/kitodo-contrib
Kitodo Community Contributions
code4lib community contribution kitodo
Last synced: 05 May 2025
https://github.com/code4lib/2024.code4lib.org
Website for the 2024 Code4Lib conference
Last synced: 09 Jul 2025
https://github.com/tw4l/addext
Python script to add file extensions based on PRONOM ID (PUID)
code4lib file-extensions pronom
Last synced: 21 Apr 2025
https://github.com/edualb/godmitri
This project provides utilities for chemistry. The name was inspired by Dmitri Ivanovich Mendeleev, creator of the periodic table.
bioinformatics cheminformatics chemistry code4lib golang library scientific
Last synced: 24 Jan 2026
https://github.com/httpreserve/linkscanner
A helper package to tokenize textual content and retrieve hyperlinks
archives code4lib digitalpreservation documentary-heritage httpreserve webarchiving
Last synced: 14 Jan 2026
https://github.com/wragge/troveharvester
Tool for harvesting large collections of digitised newspaper articles from Trove.
code4lib newspapers troveaustralia
Last synced: 28 Oct 2025
https://github.com/opencultureconsulting/ba-sachsen-pica
Transformation from Bibliotheca and Alephino to PICA+ for the Berufsakademie Sachsen
bash-script code4lib etl go-task openrefine
Last synced: 02 Feb 2026
https://github.com/wragge/closed_access
Harvesting and analysing items with the access status of 'Closed' from the National Archives of Australia.
archives code4lib recordsearch
Last synced: 26 Nov 2025
https://github.com/dulibrarytech/digitaldu-frontend
Digital Collections DU front end
code4lib digital-repository nodejs
Last synced: 17 Jan 2026