https://github.com/mindee/doctr-tfjs-demo

Javascript demo of docTR, powered by TensorFlowJS
https://github.com/mindee/doctr-tfjs-demo

computer-vision deep-learning node-js ocr optical-character-recognition reactjs tensorflowjs yarn

Last synced: 7 months ago
JSON representation

Javascript demo of docTR, powered by TensorFlowJS

Host: GitHub
URL: https://github.com/mindee/doctr-tfjs-demo
Owner: mindee
Created: 2021-09-28T12:34:55.000Z (about 4 years ago)
Default Branch: master
Last Pushed: 2024-04-05T14:20:26.000Z (over 1 year ago)
Last Synced: 2024-04-30T17:58:20.101Z (over 1 year ago)
Topics: computer-vision, deep-learning, node-js, ocr, optical-character-recognition, reactjs, tensorflowjs, yarn
Language: TypeScript
Homepage:
Size: 163 MB
Stars: 78
Watchers: 4
Forks: 14
Open Issues: 23
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# docTR Tensorflow.js demo

[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](LICENSE) ![Build Status](https://github.com/mindee/doctr-tfjs-demo/workflows/builds/badge.svg)

This project is based on [docTR](https://github.com/mindee/doctr) and leverages [TensorFlow.js](https://www.tensorflow.org/js) to serve you an end-to-end OCR running directly in your favorite web browser.

![demo](https://github.com/teamMindee/tensorflow-js-demo/releases/download/v0.1-models/demo_illustration.png)

For this project, models were trained with docTR using its TensorFlow back-end, then converted to the TJFS SavedModel format thanks to the [`tensorflowjs_converter`](https://www.tensorflow.org/js/tutorials/conversion/import_saved_model). Just like docTR, under the hood, there are two types of modules:
- **Text detection**: `db_mobilenet_v2` (low resolution) & [`db_resnet50`](https://mindee.github.io/doctr/latest/models.html#doctr.models.detection.db_resnet50) (high resolution) as available architectures, post-processing performed with [OpenCV.js](https://docs.opencv.org/3.4/d5/d10/tutorial_js_root.html).
- **Text recognition**: [`crnn_vgg16_bn`](https://mindee.github.io/doctr/latest/models.html#doctr.models.recognition.crnn_vgg16_bn) as available architecture

Documentation about all the models can be found over [here](https://mindee.github.io/doctr/models.html).

## Using the interface

The interface is divided into five sections:
- **Model settings** (side pannel): select the architectures to use for text detection and for text recognition.
- **Input Image** (top-left pannel): upload your image there by clicking in the area & selecting your file. Uploading a file will automatically run the OCR on it.
- **Text localization** (top-right pannel): the output of the text localization module.
- **Detected word boxes** (bottom-left pannel): visualization of the final predictions of the OCR.
- **Words** (bottom-right pannel): the list of all the detected words. If you hover a prediction on the bottom-left pannel, it will highlight the corresponding text prediction in this section.

## Getting started

### Prerequisites

In order to install this project, you will need [Yarn](https://classic.yarnpkg.com/lang/en/docs/install) and [NPM](https://docs.npmjs.com/downloading-and-installing-node-js-and-npm), which are package managers for [Node.js](https://nodejs.org/en/).

```shell
npm install -g serve
```

### Installation

This demo was built using [React](https://reactjs.org/), a framework for JavaScript development. This demo requires you to install the project from the source code, which will require you to install [Git](https://git-scm.com/book/en/v2/Getting-Started-Installing-Git). First clone the project repository:
```shell
git clone https://github.com/mindee/doctr-tfjs-demo.git
```

Then install the project's dependencies using the following command:

```shell
cd doctr-tfjs-demo
yarn install
```

### Running the app

#### Production mode

Alternatively, if you are looking at a production situation, first build the bundle and serve it:
```shell
yarn build
serve --no-clipboard -s build
```
then navigate to the URL with your favorite web browser

#### Development mode

Once all dependencies have been installed, launch the app using:
```shell
yarn start
```
and navigate with your web browser to the URL in the console.

### Using Docker container

Lucky for you, if you prefer working with containers, we provide a minimal Docker image. You can build it as follows (it might take a few minutes depending on your setup):
```shell
DOCKER_BUILDKIT=1 docker build . -t doctr-tfjs:node12-alpine
```
and then run your image with:
```shell
docker run -p 8001:3000 doctr-tfjs:node12-alpine
```
Feel free to change the port, but by default, you should be able to access the demo at `http://localhost:8001/`. *The `-p 8001:3000` lets Docker know that we want to map the internal port of the container (3000) to port 8001 on the outside.*

## License

Distributed under the Apache 2.0 License. See [`LICENSE`](LICENSE) for more information.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mindee/doctr-tfjs-demo

Awesome Lists containing this project

README