https://github.com/luopeixiang/im2latex
PyTorch implementation of a deep CNN encoder + LSTM decoder with attention for image-to-LaTeX
- Host: GitHub
- URL: https://github.com/luopeixiang/im2latex
- Owner: luopeixiang
- License: MIT
- Created: 2019-03-26T11:51:02.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2023-10-03T21:29:40.000Z (over 1 year ago)
- Last Synced: 2025-05-01T09:53:54.769Z (23 days ago)
- Topics: encoder-decoder-model, im2latex, imagecaptioning, pytorch, seq2seq, show-and-tell
- Language: Python
- Size: 7.21 MB
- Stars: 193
- Watchers: 6
- Forks: 53
- Open Issues: 24
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Im2Latex

A deep CNN encoder + LSTM decoder with attention for image-to-LaTeX: a PyTorch implementation of the model architecture used in [Seq2Seq for LaTeX generation](https://guillaumegenthial.github.io/image-to-latex.html).
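
The rough shape of such a model, as a minimal PyTorch sketch. The layer sizes, the attention variant, and the `step` interface here are illustrative assumptions, not this repository's exact code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CNNEncoder(nn.Module):
    """Encodes a grayscale formula image into a grid of feature vectors."""
    def __init__(self, enc_dim=256):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(128, enc_dim, 3, padding=1), nn.ReLU(),
        )

    def forward(self, images):                      # images: (B, 1, H, W)
        feats = self.conv(images)                   # (B, C, H', W')
        b, c, h, w = feats.shape
        return feats.view(b, c, h * w).permute(0, 2, 1)   # (B, H'*W', C)

class AttnLSTMDecoder(nn.Module):
    """Emits LaTeX tokens one at a time, attending over the encoder grid."""
    def __init__(self, vocab_size, emb_dim=80, enc_dim=256, hid_dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.cell = nn.LSTMCell(emb_dim + enc_dim, hid_dim)
        self.w_enc = nn.Linear(enc_dim, hid_dim, bias=False)   # additive (Bahdanau-style) attention
        self.w_hid = nn.Linear(hid_dim, hid_dim, bias=False)
        self.v = nn.Linear(hid_dim, 1, bias=False)
        self.out = nn.Linear(hid_dim + enc_dim, vocab_size)

    def step(self, token, state, enc_out):
        h, c = state
        # Score each encoder position against the current hidden state.
        scores = self.v(torch.tanh(self.w_enc(enc_out) + self.w_hid(h).unsqueeze(1))).squeeze(-1)
        alpha = F.softmax(scores, dim=-1)                             # (B, L)
        context = torch.bmm(alpha.unsqueeze(1), enc_out).squeeze(1)   # (B, enc_dim)
        h, c = self.cell(torch.cat([self.embed(token), context], dim=-1), (h, c))
        return self.out(torch.cat([h, context], dim=-1)), (h, c)

# Illustrative usage with random data:
encoder, decoder = CNNEncoder(), AttnLSTMDecoder(vocab_size=500)
enc_out = encoder(torch.randn(2, 1, 64, 256))
h = c = torch.zeros(2, 512)
start = torch.zeros(2, dtype=torch.long)            # hypothetical <start> token id
logits, (h, c) = decoder.step(start, (h, c), enc_out)
```
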
## Sample results from this implementation

## Experimental results on the IM2LATEX-100K test dataset
| BLEU-4 | Edit Distance | Exact Match |
| ------ | ------------- | ----------- |
| 40.80  | 44.23         | 0.27        |

## Getting Started
**Install dependency:**
```bash
pip install -r requirement.txt
```

**Download the dataset for training:**
```bash
cd data
wget http://lstm.seas.harvard.edu/latex/data/im2latex_validate_filter.lst
wget http://lstm.seas.harvard.edu/latex/data/im2latex_train_filter.lst
wget http://lstm.seas.harvard.edu/latex/data/im2latex_test_filter.lst
wget http://lstm.seas.harvard.edu/latex/data/formula_images_processed.tar.gz
wget http://lstm.seas.harvard.edu/latex/data/im2latex_formulas.norm.lst
tar -zxvf formula_images_processed.tar.gz
```

**Preprocess:**
```bash
python preprocess.py
```

**Build vocab:**
```bash
python build_vocab.py
```

**Train:**
```bash
python train.py \
--data_path=[data dir] \
--save_dir=[the dir for saving ckpts] \
--dropout=0.2 --add_position_features \
--epoches=25 --max_len=150
```
**Evaluate:**
```bash
python evaluate.py --split=test \
--model_path=[the path to model] \
--data_path=[data dir] \
--batch_size=32 \
--ref_path=[the file to store reference] \
--result_path=[the file to store decoding result]
```

## Features
- [x] Scheduled sampling from [Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks](https://arxiv.org/pdf/1506.03099.pdf) (see the sketch after this list)
- [x] Positional embedding from [Attention Is All You Need](https://arxiv.org/abs/1706.03762) (see the sketch after this list)
- [x] Batch beam search
- [x] Training from checkpoint
- [ ] Improve the data-loading code for CPU/CUDA memory efficiency
- [ ] **Fine-tune hyperparameters for better performance**
- [ ] An HTML page for uploading an image and decoding it
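
A rough sketch of the scheduled-sampling idea from the first item: with some probability the decoder is fed its own previous prediction instead of the ground-truth token, and that probability grows over training. The inverse-sigmoid decay constant and the `decoder.step` interface reuse the illustrative sketch above and are assumptions, not the repository's exact code:

```python
import math
import random
import torch

def teacher_forcing_prob(step, k=5000.0):
    """Inverse-sigmoid decay from the scheduled-sampling paper: starts near 1, decays toward 0."""
    return k / (k + math.exp(step / k))

def decode_with_scheduled_sampling(decoder, enc_out, targets, state, step):
    """targets: (B, T) gold token ids, position 0 assumed to be <start>."""
    logits_all = []
    token = targets[:, 0]
    for t in range(1, targets.size(1)):
        logits, state = decoder.step(token, state, enc_out)
        logits_all.append(logits)
        if random.random() < teacher_forcing_prob(step):
            token = targets[:, t]                    # teacher forcing: feed the gold token
        else:
            token = logits.argmax(dim=-1)            # feed the model's own prediction
    return torch.stack(logits_all, dim=1)            # (B, T-1, vocab)
```

And a sketch of the sinusoidal positional encoding from "Attention Is All You Need", shown here in its 1-D form added to the flattened encoder grid. The repository's `--add_position_features` option may compute positional features differently; this is only an illustration:

```python
import math
import torch

def sinusoidal_position_encoding(length, dim):
    """(length, dim) table of interleaved sin/cos features; dim must be even."""
    position = torch.arange(length, dtype=torch.float).unsqueeze(1)
    div_term = torch.exp(torch.arange(0, dim, 2, dtype=torch.float) * (-math.log(10000.0) / dim))
    pe = torch.zeros(length, dim)
    pe[:, 0::2] = torch.sin(position * div_term)
    pe[:, 1::2] = torch.cos(position * div_term)
    return pe

# e.g. enc_out = enc_out + sinusoidal_position_encoding(enc_out.size(1), enc_out.size(2))
```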