Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by opendatalab

A curated list of projects in awesome lists by opendatalab .

https://github.com/opendatalab/mineru

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python

Last synced: 25 Sep 2024

https://github.com/opendatalab/PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Last synced: 02 Aug 2024

https://github.com/opendatalab/WanJuan1.0

万卷1.0多模态语料

Last synced: 02 Aug 2024

https://github.com/opendatalab/MinerU

MinerU is a one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Last synced: 31 Jul 2024

https://github.com/opendatalab/labelU

Data annotation toolbox supports image, audio and video data.

Last synced: 02 Aug 2024