An open API service indexing awesome lists of open source software.

https://github.com/elliotxx/paper_autotranslation

An automatic translation tool for paper ( PDF => TXT, English => Chinese )
https://github.com/elliotxx/paper_autotranslation

paper-translate pdfminer python requests youdao-fanyi-api

Last synced: 2 months ago
JSON representation

An automatic translation tool for paper ( PDF => TXT, English => Chinese )

Awesome Lists containing this project

README

        

## 论文自动翻译
自动翻译论文(pdf),生成带翻译段落的 txt 文档作参考,以方便阅读英文文献。

## 介绍
PDF 解析成文本采用 pdfminer 库,主要参考:[解决pdfminer ImportError: cannot import name process_pdf](https://blog.csdn.net/MrLevo520/article/details/52136414) 和 [(7)PDFMiner提取PDF文本](https://blog.csdn.net/fighting_no1/article/details/51038942)

翻译部分调用了网友封装的有道翻译的免费 API,参考 github:[Chinese-boy/Many-Translaters](https://github.com/Chinese-boy/Many-Translaters)

ps:有道翻译API现在收费了,一篇4页的论文翻译一次就花了1元。

**日志输出等级**

```
0: 输出最简单,无段落信息,无错误信息
1: 输出段落信息,无错误信息
2: 输出段落信息,输出错误信息
```

## 依赖
* pdfminer
* requests
* execjs

## 用法
1. 安装依赖
```
pip install pdfminer requests
```

2. 运行测试用例
```
paper_translation.py "2017_ICSA_Bidirectional Mapping between Architecture Model and Code for Synchronization.pdf"
```

3. 本地会生成 pdf 同名的 txt 文件

注意:程序基于 python2.7,**python3.x 请使用文件 "paper_translation_3x.py"(感谢 [@Zephyrusvvvv](https://github.com/Zephyrusvvvv) 和我一起修改 3.x 的版本)**

## 效果图

![效果图](./screenshot/1.png)

## 参考资料
* PDFMiner 官方文档
https://euske.github.io/pdfminer/

* PDFMiner Github
https://github.com/euske/pdfminer

* pdfminer API介绍:pdf网页爬虫
https://www.cnblogs.com/rongyux/p/5445723.html

* ~~有道智云(有道翻译官方API,收费)~~
http://ai.youdao.com/gw.s

* ~~Python学习笔记(28)-Python读取word文本~~
https://blog.csdn.net/woshisangsang/article/details/75221723

* ~~python 操作 office~~
https://www.cnblogs.com/Jacklovely/p/5743868.html