Projects in Awesome Lists tagged with article-extraction
A curated list of projects in awesome lists tagged with article-extraction .
https://github.com/utrechtuniversity/dataquest
A configurable pipeline for extracting and filtering articles from large corpora, tailored for the Delpher Kranten corpus, with support for features like keyword filtering and tf-idf-based relevance scoring.
article-extraction corpus-processing delpher-kranten information-retrieval keyword-filtering
Last synced: 05 Feb 2026