An open API service indexing awesome lists of open source software.

https://github.com/neulab/Text-Summarization-Papers

An Exhaustive Paper List for Text Summarization
https://github.com/neulab/Text-Summarization-Papers

Last synced: about 2 months ago
JSON representation

An Exhaustive Paper List for Text Summarization

Awesome Lists containing this project

README

        

# Text Summarization Papers
by [Pengfei Liu](http://pfliu.com/), Yiran Chen, [Jinlan Fu](https://scholar.google.com/citations?hl=en&user=D4vtw8QAAAAJ), [Hiroaki Hayashi](https://hiroakih.me/), Danqing Wang and other contributors.

An exhaustive paper list for **Text Summarization**,
covering papers from eight top conferences (*ACL / EMNLP / NAACL / ICML / ICLR / AAAI / IJCAI / NeurIPS*) in the last eight years (*2013-2020*).

## What can I get here?

### 1. [Paper Retrieval System **[Click Me !!!]** 🔽](http://pfliu.com/pl-summarization/summ_paper.html)

* Find the **top-cited** summarization papers! [The latest update on: 02.25/2020]
* Track the **latest** summarization papers!
* Find the **milestone** summarization papers for beginners.
* Search papers by [**research concepts**](https://github.com/neulab/Text-Summarization-Papers/blob/master/summ_concept.md) or your interested **keywords**.

### 2. What are the recent Research Concepts and which are HOT?

We first define the typology of essential [concepts](https://github.com/neulab/Text-Summarization-Papers/blob/master/summ_concept.md) for the summarization task. We then plot the number of papers for each concept below.

before 2019 denotes the number of papers **before 2019**.

from 2019 denotes the number of papers **since 2019**.

Summary

#### `Trends in 2019`:
HOT Concepts in red suggest HOT topics, and we can observe:

* HOT**Task**: Scientific paper-based summarization has gain growing interests.
* HOT**Data**: More new datasets are constructed.
* HOT**Architecture**: Pretrained models and graph neural networks prevail.
* HOT**Evaluation**: Evaluation of the generated summary's factuality attracts recent attention.

```
Hot topic: when the proportion of papers on a concept since 2019 is greater than a certain threshold (0.4), we define this concept as a hot topic.
```

### 3. Recommended Papers

#### Papers with Hot Topics
* [`pre-X`](http://pfliu.com/pl-summarization/summ_paper_preX.html): summarizer with unsupervised pretrained models.
* [`task-sci`](http://pfliu.com/pl-summarization/summ_paper_task-sci.html): scientifc paper-based summarization.
* [`eval-factuality`](http://pfliu.com/pl-summarization/summ_paper_factuality.html): factuality evaluation on generated summaries.
* [`arch-gnn`](http://pfliu.com/pl-summarization/summ_paper_arch-gnn.html): graph neural network-based summarizers.
* [`data-new`](http://pfliu.com/pl-summarization/summ_paper_data-new.html): more new datasets are constructed.

#### Milestone Papers
* [10 must-read papers for neural **extractive** summarization](http://pfliu.com/pl-summarization/summ_paper_gen-ext.html)
* [10 must-read papers for neural **abstractive** summarization](http://pfliu.com/pl-summarization/summ_paper_gen-abs.html)
* [Top 10 **most-cited** summarization papers since 2014](http://pfliu.com/pl-summarization/summ_paper_topcited.html)

### 4. Mainstream [Dataset List 🔽](http://pfliu.com/pl-summarization/summ_data.html)

## What can I do here?
* ### If you have a new "research concept" -- Tell us
* **Update** the file [`summ_concept.md`](https://github.com/neulab/Text-Summarization-Papers/blob/master/summ_concept.md) and send us a `Pull request`.
* Or you could open an `Issue`.

* ### If you have a new "paper" or want to modify our inaccurate annotations of concepts:
* **Update** your paper into the file [`summ_paper.crowdsource`](https://github.com/neulab/Text-Summarization-Papers/blob/master/summ_paper.crowdsource) and send us a `Pull request`.
* Or you could open an `Issue`.

* ### If you have a new "dataset" or want to modify our inaccurate annotations:
* **Add** your dataset (If possible, with a brief description) into [`summ_data.crowdsource`](https://github.com/neulab/Text-Summarization-Papers/blob/master/summ_data.crowdsource) and send us a `Pull request`.
* Or you could open an `Issue`.

## Related Work

* [Concepts in Neural Networks for NLP](https://github.com/neulab/nn4nlp-concepts)
* [Named Entity Recognition Paper List](https://github.com/pfliu-nlp/Named-Entity-Recognition-NER-Papers)
* [Historiography of Text Summarization](http://pfliu.com/Historiography/summarization/summ-eng.html)

## Future Work
Hopefully, you will see our version-2.0 covering papers from 1980 to 2020.

## Acknowledgments
* Thanks Prof. [Graham Neubig](http://www.phontron.com/)'s idea on the "concept" and other comments.
* Thanks Prof. [Jackie C. K. Cheung](https://www.cs.mcgill.ca/~jcheung/)'s useful idea about the "old" papers.
* Thanks Prof. [Fei Liu](http://www.cs.ucf.edu/~feiliu/) for providing us with a bunch of interesting work and description, which enriches our concept file.
* Thanks [Peter J. Liu](https://research.google/people/105182/) a lot for the crowdsourcing idea of the paper and dataset annotations. Feel free to correct our wrong annotations by updating [`summ_paper.crowdsource`](https://github.com/neulab/Text-Summarization-Papers/blob/master/summ_paper.crowdsource) and [`summ_data.crowdsource`](https://github.com/neulab/Text-Summarization-Papers/blob/master/summ_data.crowdsource).
* Thanks Prof. [Mohit Bansal](http://www.cs.unc.edu/~mbansal/)'s feedback about this summary.
* Thanks for [Richard Socher](https://www.socher.org/index.php/Main/HomePage)'s invitation to giving a talk in salesforce and talking more about this project.