https://github.com/layout-parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
https://github.com/layout-parser/layout-parser
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Last synced: 9 months ago
JSON representation
A Unified Toolkit for Deep Learning Based Document Image Analysis
- Host: GitHub
- URL: https://github.com/layout-parser/layout-parser
- Owner: Layout-Parser
- License: apache-2.0
- Created: 2020-06-10T20:22:54.000Z (over 5 years ago)
- Default Branch: main
- Last Pushed: 2024-08-15T06:26:34.000Z (over 1 year ago)
- Last Synced: 2025-04-24T02:00:05.825Z (9 months ago)
- Topics: computer-vision, deep-learning, detectron2, document-image-processing, document-layout-analysis, layout-analysis, layout-detection, layout-parser, object-detection, ocr
- Language: Python
- Homepage: https://layout-parser.github.io/
- Size: 58.3 MB
- Stars: 5,199
- Watchers: 74
- Forks: 496
- Open Issues: 114
-
Metadata Files:
- Readme: README.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Code of conduct: .github/CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
A unified toolkit for Deep Learning Based Document Image Analysis
---
## What is LayoutParser

LayoutParser aims to provide a wide range of tools that aims to streamline Document Image Analysis (DIA) tasks. Please check the LayoutParser [demo video](https://youtu.be/8yA5xB4Dg8c) (1 min) or [full talk](https://www.youtube.com/watch?v=YG0qepPgyGY) (15 min) for details. And here are some key features:
- LayoutParser provides a rich repository of deep learning models for layout detection as well as a set of unified APIs for using them. For example,
Perform DL layout detection in 4 lines of code
```python
import layoutparser as lp
model = lp.AutoLayoutModel('lp://EfficientDete/PubLayNet')
# image = Image.open("path/to/image")
layout = model.detect(image)
```
- LayoutParser comes with a set of layout data structures with carefully designed APIs that are optimized for document image analysis tasks. For example,
Selecting layout/textual elements in the left column of a page
```python
image_width = image.size[0]
left_column = lp.Interval(0, image_width/2, axis='x')
layout.filter_by(left_column, center=True) # select objects in the left column
```
Performing OCR for each detected Layout Region
```python
ocr_agent = lp.TesseractAgent()
for layout_region in layout:
image_segment = layout_region.crop(image)
text = ocr_agent.detect(image_segment)
```
Flexible APIs for visualizing the detected layouts
```python
lp.draw_box(image, layout, box_width=1, show_element_id=True, box_alpha=0.25)
```
Loading layout data stored in json, csv, and even PDFs
```python
layout = lp.load_json("path/to/json")
layout = lp.load_csv("path/to/csv")
pdf_layout = lp.load_pdf("path/to/pdf")
```
- LayoutParser is also a open platform that enables the sharing of layout detection models and DIA pipelines among the community.
Check the LayoutParser open platform
Submit your models/pipelines to LayoutParser
## Installation
After several major updates, layoutparser provides various functionalities and deep learning models from different backends. But it still easy to install layoutparser, and we designed the installation method in a way such that you can choose to install only the needed dependencies for your project:
```bash
pip install layoutparser # Install the base layoutparser library with
pip install "layoutparser[layoutmodels]" # Install DL layout model toolkit
pip install "layoutparser[ocr]" # Install OCR toolkit
```
Extra steps are needed if you want to use Detectron2-based models. Please check [installation.md](installation.md) for additional details on layoutparser installation.
## Examples
We provide a series of examples for to help you start using the layout parser library:
1. [Table OCR and Results Parsing](https://github.com/Layout-Parser/layout-parser/blob/main/examples/OCR%20Tables%20and%20Parse%20the%20Output.ipynb): `layoutparser` can be used for conveniently OCR documents and convert the output in to structured data.
2. [Deep Layout Parsing Example](https://github.com/Layout-Parser/layout-parser/blob/main/examples/Deep%20Layout%20Parsing.ipynb): With the help of Deep Learning, `layoutparser` supports the analysis very complex documents and processing of the hierarchical structure in the layouts.
## Contributing
We encourage you to contribute to Layout Parser! Please check out the [Contributing guidelines](.github/CONTRIBUTING.md) for guidelines about how to proceed. Join us!
## Citing `layoutparser`
If you find `layoutparser` helpful to your work, please consider citing our tool and [paper](https://arxiv.org/pdf/2103.15348.pdf) using the following BibTeX entry.
```
@article{shen2021layoutparser,
title={LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis},
author={Shen, Zejiang and Zhang, Ruochen and Dell, Melissa and Lee, Benjamin Charles Germain and Carlson, Jacob and Li, Weining},
journal={arXiv preprint arXiv:2103.15348},
year={2021}
}
```