https://github.com/jiyfeng/rl4nlp

Reinforcement learning for natural language processing reading group

Host: GitHub
URL: https://github.com/jiyfeng/rl4nlp
Owner: jiyfeng
Created: 2017-04-16T17:26:46.000Z
Default Branch: master
Last Pushed: 2018-06-13T17:45:26.000Z
Last Synced: 2024-11-19T14:42:34.409Z
Size: 551 KB
Stars: 310
Watchers: 29
Forks: 53
Open Issues: 0
Metadata Files:
- Readme: README.md

# RL4NLP Reading Group (Spring 2017)

- Location: CSE 203

## Schedule

### 1. RL Basics and MDPs

- Yangfeng

- Time: April 17, Monday, 4:30 - 5:30 PM

- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 01 and 03

- Notes: [Chap 01](notes/01-rl-basic.md) and [Chap 03](notes/02-mdp.md)

### 2. Dynamic Programming and Monte Carlo Methods

- Chenhao

- Time: April 24, Monday, 4:30 - 5:30 PM

- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 04 and 05

### 3. Policy Gradient Methods

- Ji

- Time: May 1, Monday, 4:30 - 5:30 PM

- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 13

### 4. POS Tagging and Syntactic Parsing

- Yijia

- Time: May 8, Monday, 4:30 - 5:30 PM

- Suggested reading: 

    * [EACL imitation learning tutorial](https://sheffieldnlp.github.io/ImitationLearningTutorialEACL2017/)

    * [DAGGER](https://www.cs.cmu.edu/~sross1/publications/Ross-AIStats11-NoRegret.pdf)

    * [LOLS](https://arxiv.org/pdf/1502.02206.pdf)

    * [A Dynamic Oracle for Arc-Eager Dependency Parsing](http://www.aclweb.org/anthology/C12-1059)

    * [Noise Reduction and Targeted Exploration in Imitation Learning for Abstract Meaning Representation Parsing](http://aclweb.org/anthology/P16-1001)

### 5. Information Extraction

- Colin

- Time: May 15, Monday, 4:30 - 5:30 PM

- Suggested reading: papers from Regina Barzilay's group

    - [Learning to Win by Reading Manuals in a Monte-Carlo Framework](http://people.csail.mit.edu/regina/my_papers/civ11.pdf)

    - [Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning](http://people.csail.mit.edu/karthikn/assets/pdf/rlie16.pdf)

### 6. Machine Translation and Language Modeling

- Max

- Time: May 22, Monday, 4:30 - 5:30 PM

- Suggested reading:

    - [Don’t Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation](https://www.umiacs.umd.edu/~jbg/docs/2014_emnlp_simtrans.pdf)

    - [Dual Learning for Machine Translation](https://papers.nips.cc/paper/6469-dual-learning-for-machine-translation.pdf)

### 7. Summarization and Question Answering

- Mandar

- Time: June 5, Monday, 4:30 - 5:30 PM

- Suggested reading:

    - [A Deep Reinforced Model for Abstractive Summarization](https://arxiv.org/pdf/1705.04304.pdf)

    - [Coarse-to-Fine Question Answering for Long Documents](http://homes.cs.washington.edu/~eunsol/papers/acl17eunsol.pdf)
