Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by opendatalab
A curated list of projects in awesome lists by opendatalab .
https://github.com/opendatalab/mineru
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python
Last synced: 25 Sep 2024
https://github.com/opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Last synced: 02 Aug 2024
https://github.com/opendatalab/MinerU
MinerU is a one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Last synced: 31 Jul 2024
https://github.com/opendatalab/labelU
Data annotation toolbox supports image, audio and video data.
Last synced: 02 Aug 2024