Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with tei
A curated list of projects in awesome lists tagged with tei .
https://github.com/adbar/trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
article-extractor corpus corpus-builder corpus-tools crawler html-to-markdown html2text news news-aggregator news-crawler nlp readability rss-feed scraping tei text-cleaning text-extraction text-mining text-preprocessing web-scraping
Last synced: 30 Jul 2024
https://github.com/BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
alto alto-xml csharp docstrum document-layout-analysis hocr hocr-documents layout-analysis page-segmentation page-xml pdf pdfpig recursive-xy-cut table-extraction tei xy-cut xycut
Last synced: 03 Aug 2024
https://github.com/freedict/fd-dictionaries
hand-written dictionaries from the FreeDict project
dictionaries dictionary tei tei-xml
Last synced: 02 Aug 2024
https://github.com/eeditiones/tei-publisher-app
The main TEI Publisher app
digital-edition digital-editions digital-humanities exist-db tei tei-publisher tei-xml
Last synced: 08 Aug 2024
https://github.com/open-editions/corpus-joyce-portrait-TEI
The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man
Last synced: 29 Jul 2024
https://github.com/lueck/standoff-mode
a major mode for GNU Emacs for annotations in a stand-off manner
annotation dh digital-humanities emacs markup rdf tagger tagging tei
Last synced: 05 Aug 2024
https://github.com/lucaterre/l-terriel_memoiredestage_m2tnah_enc
Mémoire de stage et annexes pour le Master 2 Technologies numériques appliquées à l'histoire (TNAH) de l'École nationale des chartes.
automatic-transcription flask history htr kraken nlp-machine-learning ocr-recognition python3 tei xml
Last synced: 02 Oct 2024