Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/adityathakker/awesome-rl-nlp
Curated Reinforcement Learning Resources for Natural Language Processing
https://github.com/adityathakker/awesome-rl-nlp
List: awesome-rl-nlp
Last synced: 3 months ago
JSON representation
Curated Reinforcement Learning Resources for Natural Language Processing
- Host: GitHub
- URL: https://github.com/adityathakker/awesome-rl-nlp
- Owner: adityathakker
- License: gpl-3.0
- Created: 2017-09-09T12:15:09.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-04-12T04:44:23.000Z (almost 7 years ago)
- Last Synced: 2024-05-20T09:02:48.173Z (8 months ago)
- Size: 18.6 KB
- Stars: 390
- Watchers: 23
- Forks: 50
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-interesting-topics-in-nlp - NLP
- awesome-deep-rl - awesome-rl-nlp
- awesome-machine-learning-resources - **[List - rl-nlp?style=social) (Table of Contents)
README
# Awesome Reinforcement Learning(RL) for Natural Language Processing(NLP)
Curated List of Reinforcement Learning Resources for Natural Language Processing. I am updating this list as I read and learn about this repositories/papers/tutorials. This repository might not get updated regularly because less research work is being published in the intersection of NLP and RL.## Table of Contents
- [Repositories](#repositories)
- [Tutorials](#tutorials)
- [Interesting Research Papers](#interesting-research-papers)
## Repositories
- [Generative_NLP_RL_GAN](https://github.com/OctThe16th/Generative_NLP_RL_GAN)
- [RL4NLP Reading Group (Spring 2017)](https://github.com/jiyfeng/rl4nlp)
- [Recent Deep Learning papers in NLU and RL](https://github.com/madrugado/deep-learning-nlp-rl-papers)## Tutorials
- [CMU Neural Nets for NLP 2017 (16): Reinforcement Learning](https://www.youtube.com/watch?v=F1hZfoh-wX4)
- [Reinforcement Learning for NLP](http://www.umiacs.umd.edu/~jbg/teaching/CSCI_7000/11a.pdf)## Interesting Research Papers
- [Learning to Organize Knowledge with N-Gram Machines](https://arxiv.org/abs/1711.06744v1)
- [Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision](https://arxiv.org/abs/1611.00020v4)
- [Deep Reinforcement Learning for Mention-Ranking Coreference Models](https://arxiv.org/abs/1609.08667v3)
- [Coarse-to-Fine Question Answering for Long Documents](https://homes.cs.washington.edu/~eunsol/papers/acl17eunsol.pdf)
- [A Deep Reinforced Model for Abstractive Summarization](https://arxiv.org/pdf/1705.04304.pdf)
- [Reinforcement Learning for Simultaneous Machine Translation](https://www.umiacs.umd.edu/~jbg/docs/2014_emnlp_simtrans.pdf)
- [Dual Learning for Machine Translation](https://papers.nips.cc/paper/6469-dual-learning-for-machine-translation.pdf)
- [Learning to Win by Reading Manuals in a Monte-Carlo Framework](http://people.csail.mit.edu/regina/my_papers/civ11.pdf)
- [Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning](http://people.csail.mit.edu/karthikn/assets/pdf/rlie16.pdf)
- [Deep Reinforcement Learning with a Natural Language Action Space](http://www.aclweb.org/anthology/P16-1153)
- [Deep Reinforcement Learning for Dialogue Generation](https://arxiv.org/pdf/1606.01541.pdf)
- [Reinforcement Learning for Mapping Instructions to Actions](http://people.csail.mit.edu/branavan/papers/acl2009.pdf)
- [Language Understanding for Text-based Games using Deep Reinforcement Learning](https://arxiv.org/pdf/1506.08941.pdf)
- [End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning](https://arxiv.org/pdf/1606.01269v1.pdf)
- [End-to-End Reinforcement Learning of Dialogue Agents for Information Access](https://arxiv.org/pdf/1609.00777v1.pdf)
- [Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning](https://arxiv.org/pdf/1702.03274.pdf)
- [Scalable Sentiment for Sequence-to-sequence Chatbot Response with Performance Analysis](https://arxiv.org/abs/1804.02504)