https://github.com/jiyfeng/rl4nlp
Reinforcement learning for natural language processing reading group
- Host: GitHub
- URL: https://github.com/jiyfeng/rl4nlp
- Owner: jiyfeng
- Created: 2017-04-16T17:26:46.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-06-13T17:45:26.000Z (over 6 years ago)
- Last Synced: 2024-04-04T12:33:57.048Z (7 months ago)
- Size: 551 KB
- Stars: 310
- Watchers: 30
- Forks: 54
- Open Issues: 0
Metadata Files:
- Readme: README.md
# RL4NLP Reading Group (Spring 2017)
- Location: CSE 203
## Schedule
### 1. RL Basic and MDP
- Yangfeng
- Time: April 17, Monday, 4:30 - 5:30 PM
- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 01 and 03
- Notes: [Chap 01](notes/01-rl-basic.md) and [Chap 03](notes/02-mdp.md)
### 2. Dynamic Programming and Monte Carlo Methods
- Chenhao
- Time: April 24, Monday, 4:30 - 5:30 PM
- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 04 and 05
### 3. Policy Gradient Methods
- Ji
- Time: May 1, Monday, 4:30 - 5:30 PM
- Reading: [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book-2nd.html) Chap 13
### 4. POS tagging and Syntactic Parsing
- Yijia
- Time: May 8, Monday, 4:30 - 5:30 PM
- Suggested reading:
  * [EACL imitation learning tutorial](https://sheffieldnlp.github.io/ImitationLearningTutorialEACL2017/)
  * [DAGGER](https://www.cs.cmu.edu/~sross1/publications/Ross-AIStats11-NoRegret.pdf)
  * [LOLS](https://arxiv.org/pdf/1502.02206.pdf)
  * [A Dynamic Oracle for Arc-Eager Dependency Parsing](http://www.aclweb.org/anthology/C12-1059)
  * [Noise Reduction and Targeted Exploration in Imitation Learning for Abstract Meaning Representation Parsing](http://aclweb.org/anthology/P16-1001)
### 5. Information Extraction
- Colin
- Time: May 15, Monday, 4:30 - 5:30 PM
- Suggested reading: some papers from Regina's group
  - [Learning to Win by Reading Manuals in a Monte-Carlo Framework](http://people.csail.mit.edu/regina/my_papers/civ11.pdf)
  - [Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning](http://people.csail.mit.edu/karthikn/assets/pdf/rlie16.pdf)
### 6. Machine Translation and Language Modeling
- Max
- Time: May 22, Monday, 4:30 - 5:30 PM
- Suggested reading:
  - [Don’t Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation](https://www.umiacs.umd.edu/~jbg/docs/2014_emnlp_simtrans.pdf)
  - [Dual Learning for Machine Translation](https://papers.nips.cc/paper/6469-dual-learning-for-machine-translation.pdf)
### 7. Summarization and Question Answering
- Mandar
- Time: June 5, Monday, 4:30 - 5:30 PM
- Suggested reading:
  - [A Deep Reinforced Model for Abstractive Summarization](https://arxiv.org/pdf/1705.04304.pdf)
  - [Coarse-to-Fine Question Answering for Long Documents](http://homes.cs.washington.edu/~eunsol/papers/acl17eunsol.pdf)