Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/jiyfeng/rl4nlp

Reinforcement learning for natural language processing reading group
https://github.com/jiyfeng/rl4nlp

Last synced: 3 months ago
JSON representation

Reinforcement learning for natural language processing reading group

Awesome Lists containing this project

README

        

# RL4NLP Reading Group (Spring 2017)

- Location: CSE 203

## Schedule

### 1. RL Basic and MDP

- Yangfeng
- Time: April 17, Monday, 4:30 - 5:30 PM
- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 01 and 03
- Notes: [Chap 01](notes/01-rl-basic.md) and [Chap 03](notes/02-mdp.md)

### 2. Dynamic Programming and Monte Carlo Methods

- Chenhao
- Time: April 24, Monday, 4:30 - 5:30 PM
- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 04 and 05

### 3. Policy Gradient Methods

- Ji
- Time: May 1, Monday, 4:30 - 5:30 PM
- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 13

### 4. POS tagging and Syntactic Parsing

- Yijia
- Time: May 8, Monday, 4:30 - 5:30 PM
- Suggested reading:
* [EACL imitation learning tutorial](https://sheffieldnlp.github.io/ImitationLearningTutorialEACL2017/)
* [DAGGER](https://www.cs.cmu.edu/~sross1/publications/Ross-AIStats11-NoRegret.pdf)
* [LOLS](https://arxiv.org/pdf/1502.02206.pdf)
* [A Dynamic Oracle for Arc-Eager Dependency Parsing](http://www.aclweb.org/anthology/C12-1059)
* [Noise Reduction and Targeted Exploration in Imitation Learning for Abstract Meaning Representation Parsing](http://aclweb.org/anthology/P16-1001)

### 5. Information Extraction

- Colin
- Time: May 15, Monday, 4:30 - 5:30 PM
- Suggested reading: some papers from Regina's group
- [Learning to Win by Reading Manuals in a Monte-Carlo Framework](http://people.csail.mit.edu/regina/my_papers/civ11.pdf)
- [Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning](http://people.csail.mit.edu/karthikn/assets/pdf/rlie16.pdf)

### 6. Machine Translation and Language Modeling

- Max
- Time: May 22, Monday, 4:30 - 5:30 PM
- Suggested reading:
- [Don’t Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation](https://www.umiacs.umd.edu/~jbg/docs/2014_emnlp_simtrans.pdf)
- [Dual Learning for Machine Translation](https://papers.nips.cc/paper/6469-dual-learning-for-machine-translation.pdf)

### 7. Summarization and Question Answering

- Mandar
- Time: Jun 5, Monday, 4:30 - 5:30 PM
- Suggested reading:
- [A Deep Reinforced Model for Abstractive Summarization](https://arxiv.org/pdf/1705.04304.pdf)
- [Coarse-to-Fine Question Answering for Long Documents](http://homes.cs.washington.edu/~eunsol/papers/acl17eunsol.pdf)