https://github.com/fcakyon/craft-text-detector

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector
https://github.com/fcakyon/craft-text-detector

actions anaconda computer-vision craft deep-learning document hacktoberfest linux macos neural-network ocr pypi python pytorch text text-detection vision windows workflow

Last synced: 3 months ago
JSON representation

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

Host: GitHub
URL: https://github.com/fcakyon/craft-text-detector
Owner: fcakyon
License: mit
Archived: true
Created: 2020-04-14T10:52:26.000Z (about 5 years ago)
Default Branch: master
Last Pushed: 2022-05-09T10:55:28.000Z (about 3 years ago)
Last Synced: 2024-11-03T10:32:05.528Z (8 months ago)
Topics: actions, anaconda, computer-vision, craft, deep-learning, document, hacktoberfest, linux, macos, neural-network, ocr, pypi, python, pytorch, text, text-detection, vision, windows, workflow
Language: Python
Homepage:
Size: 1.48 MB
Stars: 252
Watchers: 7
Forks: 91
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # CRAFT: Character-Region Awareness For Text detection




















Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector | [Paper](https://arxiv.org/abs/1904.01941) |

## Overview

PyTorch implementation for CRAFT text detector that effectively detect text area by exploring each character region and affinity between characters. The bounding box of texts are obtained by simply finding minimum bounding rectangles on binary map after thresholding character region and affinity scores.



## Getting started

### Installation

- Install using pip:

```console

pip install craft-text-detector

```

### Basic Usage

```python

# import Craft class

from craft_text_detector import Craft

# set image path and export folder directory

image = 'figures/idcard.png' # can be filepath, PIL image or numpy array

output_dir = 'outputs/'

# create a craft instance

craft = Craft(output_dir=output_dir, crop_type="poly", cuda=False)

# apply craft text detection and export detected regions to output directory

prediction_result = craft.detect_text(image)

# unload models from ram/gpu

craft.unload_craftnet_model()

craft.unload_refinenet_model()

```

### Advanced Usage

```python

# import craft functions

from craft_text_detector import (

    read_image,

    load_craftnet_model,

    load_refinenet_model,

    get_prediction,

    export_detected_regions,

    export_extra_results,

    empty_cuda_cache

)

# set image path and export folder directory

image = 'figures/idcard.png' # can be filepath, PIL image or numpy array

output_dir = 'outputs/'

# read image

image = read_image(image)

# load models

refine_net = load_refinenet_model(cuda=True)

craft_net = load_craftnet_model(cuda=True)

# perform prediction

prediction_result = get_prediction(

    image=image,

    craft_net=craft_net,

    refine_net=refine_net,

    text_threshold=0.7,

    link_threshold=0.4,

    low_text=0.4,

    cuda=True,

    long_size=1280

)

# export detected text regions

exported_file_paths = export_detected_regions(

    image=image,

    regions=prediction_result["boxes"],

    output_dir=output_dir,

    rectify=True

)

# export heatmap, detection points, box visualization

export_extra_results(

    image=image,

    regions=prediction_result["boxes"],

    heatmaps=prediction_result["heatmaps"],

    output_dir=output_dir

)

# unload models from gpu

empty_cuda_cache()

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/fcakyon/craft-text-detector

Awesome Lists containing this project

README