https://github.com/duaibeom/chemOCR

# ChemOCR (OCSR)


Optical Chemical Structure Recognition (OCSR) based on DB (Differentiable Binarization).
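
For context, the binarization step that gives DB its name can be sketched as follows. In the DB method (reference 1), the hard threshold is replaced by a steep sigmoid over the difference between a probability map `P` and a learned threshold map `T`, scaled by an amplifying factor `k` (50 in the DB paper). The function name and toy inputs below are illustrative assumptions, not code from this repository, and whether this project keeps `k = 50` is not stated here.

```python
import math


def differentiable_binarization(prob, thresh, k=50.0):
    """Approximate binary map: B = 1 / (1 + exp(-k * (P - T))).

    prob and thresh are 2D lists of floats in [0, 1] with the same shape.
    """
    return [
        [1.0 / (1.0 + math.exp(-k * (p - t))) for p, t in zip(p_row, t_row)]
        for p_row, t_row in zip(prob, thresh)
    ]


# Toy 2x2 maps (hypothetical values, for illustration only).
prob = [[0.9, 0.2], [0.5, 0.31]]
thresh = [[0.3, 0.3], [0.5, 0.3]]
binary = differentiable_binarization(prob, thresh)
```

Pixels well above the threshold map go to nearly 1 and pixels well below go to nearly 0, while the function stays differentiable, so the threshold map can be trained together with the rest of the network.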

---

Launch the Streamlit demo:

```sh
streamlit run demo.py
```

## Backbone

- Swin
- ResNet
- MobileNetV3

## Parser
**Rule-based method**
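
To illustrate what a rule-based parsing step can look like, here is a minimal sketch that turns detected atoms and bond line segments into a connection table by snapping each bond endpoint to the nearest atom. This is a hypothetical example, not this repository's parser: the function `assemble_graph`, the input layout, and the `max_dist` cutoff are all assumptions made for illustration.

```python
import math


def assemble_graph(atoms, bond_segments, max_dist=10.0):
    """Connect atoms using one simple rule: snap each bond end to the nearest atom.

    atoms: list of (symbol, x, y) tuples in pixel coordinates.
    bond_segments: list of ((x1, y1), (x2, y2)) line segments.
    Returns a sorted list of unique (i, j) atom index pairs with i < j.
    """
    def nearest_atom(point):
        best_idx, best_dist = None, max_dist
        for idx, (_, ax, ay) in enumerate(atoms):
            dist = math.hypot(ax - point[0], ay - point[1])
            if dist <= best_dist:
                best_idx, best_dist = idx, dist
        return best_idx  # None if no atom lies within max_dist

    bonds = set()
    for start, end in bond_segments:
        i, j = nearest_atom(start), nearest_atom(end)
        # Drop segments that do not link two distinct atoms.
        if i is not None and j is not None and i != j:
            bonds.add((min(i, j), max(i, j)))
    return sorted(bonds)


# Hypothetical detections: three atoms in a row and three line segments,
# the last of which is far from every atom and should be discarded.
atoms = [("C", 0, 0), ("C", 30, 0), ("O", 60, 0)]
segments = [((2, 1), (28, -1)), ((33, 0), (57, 2)), ((100, 100), (130, 100))]
bonds = assemble_graph(atoms, segments)
```

A fixed distance cutoff keeps the rule easy to reason about, but it also ties the result to the image scale.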

## Data

- ChEMBL: https://www.ebi.ac.uk/chembl/

### Training data sample

### Limitations
- Only non-charged atoms are handled.
- Complex (inner-bridged) ring systems cannot be parsed.
- Images must be at a specific resolution.
- Atoms rendered as small text (such as iodine) are not recognized.

### TODO
- [x] Character recognition (pretrained model from MMOCR RCNN)
- [x] Web front-end (streamlit)
- [ ] Bond direction analysis

## References

1. https://github.com/MhLiao/DB
2. https://github.com/open-mmlab/mmocr
3. https://github.com/rdkit/rdkit