An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with article-extraction

A curated list of projects in awesome lists tagged with article-extraction .

https://github.com/utrechtuniversity/dataquest

A configurable pipeline for extracting and filtering articles from large corpora, tailored for the Delpher Kranten corpus, with support for features like keyword filtering and tf-idf-based relevance scoring.

article-extraction corpus-processing delpher-kranten information-retrieval keyword-filtering

Last synced: 05 Feb 2026