https://github.com/huggingface/awesome-papers

Papers & presentation materials from Hugging Face's internal science day
https://github.com/huggingface/awesome-papers

Last synced: 19 days ago
JSON representation

Papers & presentation materials from Hugging Face's internal science day

Host: GitHub
URL: https://github.com/huggingface/awesome-papers
Owner: huggingface
Created: 2020-03-11T15:42:41.000Z (almost 6 years ago)
Default Branch: master
Last Pushed: 2020-10-31T14:19:22.000Z (over 5 years ago)
Last Synced: 2026-02-09T17:57:46.686Z (29 days ago)
Homepage:
Size: 7.05 MB
Stars: 2,053
Watchers: 340
Forks: 118
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-artificial-intelligence-research - Awesome NLP Paper Discussions (by The Hugging Face team)
ultimate-awesome - awesome-papers - Papers & presentation materials from Hugging Face's internal science day. (Other Lists / TeX Lists)
jimsghstars - huggingface/awesome-papers - Papers & presentation materials from Hugging Face's internal science day (Others)

README

          # Awesome NLP Paper Discussions

The Hugging Face team believes that we can reach our goals in NLP by building powerful open source tools and by conducting impactful research. Our team has begun holding regular internal discussions about awesome papers and research areas in NLP. In the spirit of open science, we've decided to share these discussion materials with the community.

_Note: These science day discussions are held offline with no physical presentation or discussion to provide. However, some presentation materials do include limited comments from our team or summaries of internal discussions._

See [planned future discussions](#planned-discussions) below.

#### August 12, 2020

- **Paper**: [Pre-training via Paraphrasing](https://arxiv.org/abs/2006.15020)

- **Authors**: [Mike Lewis](https://twitter.com/ml_perception), [Marjan Ghazvininejad](https://twitter.com/gh_marjan), [Gargi Ghosh](https://twitter.com/gargighosh), Armen Aghajanyan, [Sida Wang](https://twitter.com/sidawxyz), [Luke Zettlemoyer](https://twitter.com/lukezettlemoyer)

- **Presenter**: [Sam Shleifer](https://twitter.com/sam_shleifer)

- **Presentation**: [Forum Summary](https://discuss.huggingface.co/t/science-tuesday-marge/685)

- **[Community Discussion](https://discuss.huggingface.co/t/science-tuesday-marge/685)**



#### June 23, 2020

- **Paper**: [Weight Poisoning Attacks on Pre-trained Models](https://arxiv.org/abs/2004.06660)

- **Authors**: Keita Kurita, [Paul Michel](https://twitter.com/pmichelX), [Graham Neubig](https://twitter.com/gneubig)

- **Presenter**: [Joe Davison](https://twitter.com/joeddav)

- **Presentation**: [Colab notebook/post](https://colab.research.google.com/drive/1BzdevUCFUSs_8z_rIP47VyKAlvfK1cCB?usp=sharing)

- **[Community Discussion](https://github.com/huggingface/awesome-papers/discussions/8)**



#### June 18, 2020

- **Paper**: [Linformer: Self-Attention with Linear Complexity](https://arxiv.org/abs/2006.04768)

- **Authors**: [Sinong Wang](https://twitter.com/sinongwang), [Belinda Li](https://twitter.com/belindazli), Madian Khabsa, [Han Fang](https://twitter.com/Han_Fang_), Hao Ma 

- **Presenter**: [Teven Le Scao](https://twitter.com/Fluke_Ellington)

- **Presentation**: [Tutorial Blog Post](https://tevenlescao.github.io/blog/fastpages/jupyter/2020/06/18/JL-Lemma-+-Linformer.html)

- **[Community Discussion](https://github.com/huggingface/awesome-papers/discussions/7)**



#### June 9, 2020

- **Paper**: [Evaluating NLP Models via Contrast Sets](https://arxiv.org/abs/2004.02709)

- **Authors**: [Matt Gardner](https://twitter.com/nlpmattg), [Yoav Artzi](https://twitter.com/yoavartzi), Victoria Basmova, [Jonathan Berant](https://twitter.com/JonathanBerant), [Ben Bogin](https://twitter.com/ben_bogin), [Sihao Chen](https://twitter.com/soshsihao), [Pradeep Dasigi](https://twitter.com/pdasigi), [Dheeru Dua](https://twitter.com/ddua17), [Yanai Elazar](https://twitter.com/yanaiela), Ananth Gottumukkala, [Nitish Gupta](https://twitter.com/yanaiela), [Hanna Hajishirzi](https://twitter.com/HannaHajishirzi), [Gabriel Ilharco](https://twitter.com/gabriel_ilharco), [Daniel Khashabi](https://twitter.com/DanielKhashabi), [Kevin Lin](https://twitter.com/nlpkevinl), Jiangming Liu, [Nelson F. Liu](https://twitter.com/nelsonfliu), Phoebe Mulcaire, [Qiang Ning](https://twitter.com/qiangning), [Sameer Singh](https://twitter.com/sameer_), [Noah A. Smith](https://twitter.com/nlpnoah), [Sanjay Subramanian](https://twitter.com/sanjayssub), [Reut Tsarfaty](https://twitter.com/rtsarfaty), [Eric Wallace](https://twitter.com/Eric_Wallace_), Ally Zhang, [Ben Zhou](https://twitter.com/BenZhou96)

- **Presenter**: [Victor Sanh](https://twitter.com/SanhEstPasMoi)

- **Presentation**: [Slides](https://docs.google.com/presentation/d/1DfA2xi0JBSbqQ0hJrhI0jzANwjSaxV7odOA73lPfHjo/edit?usp=sharing)



#### May 18, 2020

- **Paper**: [Movement Pruning: Adaptive Sparsity by Fine-Tuning](https://arxiv.org/abs/2005.07683)

- **Authors**: [Victor Sanh](https://twitter.com/SanhEstPasMoi), [Thomas Wolf](https://twitter.com/Thom_Wolf), [Alexander M. Rush](https://twitter.com/srush_nlp)

- **Presenter**: [Victor Sanh](https://twitter.com/SanhEstPasMoi)

- **Presentation**: [Slideshare](https://www.slideshare.net/VictorSanh/movement-pruning-explain-like-im-five-234205241)



#### May 5, 2020

- **Paper**: [Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs](https://arxiv.org/abs/1812.04616) 

- **Authors**: [Sachin Kumar](https://twitter.com/shocheen), Yulia Tsvetkov

- **Presenter**: [Victor Sanh](https://twitter.com/SanhEstPasMoi)

- **Presentation**: [Colab notebook](https://colab.research.google.com/drive/1040xlv5WkLo_Xli0FpA2_bxyfsMouZ-w)



#### April 22, 2020

- **Topic**: Transfer Learning in Natural Language Processing (NLP): Open questions, current trends, limits, and future directions

- **Presenter**: [Thomas Wolf](https://twitter.com/Thom_Wolf)

- **Presentation**: [Video](https://www.youtube.com/watch?v=G5lmya6eKtc)



#### April 7, 2020

- **Topic**: Overview of recent work on: Indexing and Retrieval for Open Domain Question Answering

- **Presenter**: [Yacine Jernite](https://twitter.com/YJernite)

- **Presentation**: [Slides](https://docs.google.com/presentation/d/1A5wJEzFYGdNem7egJ-BTm6EMI3jGNe1lalyChYL54gw)



#### March 24, 2020

- **Paper**: [Scaling Laws for Neural Language Models](https://arxiv.org/abs/2001.08361)

- **Authors**: Jared Kaplan, Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, [Scott Gray](https://twitter.com/scottgray76), [Alec Radford](https://twitter.com/AlecRad), Jeffrey Wu, Dario Amodei

- **Presenter**: [Teven Le Scao](https://twitter.com/Fluke_Ellington)

- **Presentation**: [Google doc paper tutorial](https://docs.google.com/document/d/1Rye61octaEF6FPHN3E7Bn2s-W3AWgMi1hukxrbkBmgY/edit#heading=h.s0a83j1o76km)



#### March 17, 2020

- **Paper**: [Representation Learning with Contrastive Predictive Coding](https://arxiv.org/abs/1807.03748) 

- **Authors**: [Aaron van den Oord](https://twitter.com/avdnoord), Yazhe Li, Oriol Vinyals

- **Presenter** [Patrick von Platen](https://twitter.com/PatrickPlaten)

- **Presentation**: [Slides](https://docs.google.com/presentation/d/1qxt7otjFI8iQSCpwzwTNei4_n4e4CIczC6nwy3jdiJY/edit?usp=sharing)



#### March 10, 2020

- **Paper**: [Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference

](https://arxiv.org/abs/1902.01007)

- **Authors**: [R. Thomas McCoy](https://twitter.com/RTomMcCoy), Ellie Pavlick, [Tal Linzen](https://twitter.com/tallinzen)

- **Presenter**: [Victor Sanh](https://twitter.com/SanhEstPasMoi)

- **Presentation**: [Slides](https://docs.google.com/presentation/d/15waw0-rr4RmPx0dhEzhNhkSiFnNqhvjm66IufWbRLyw/edit?usp=sharing)



#### March 3, 2020

- **Paper**: [REALM: Retrieval-Augmented Language Model Pre-Training](https://arxiv.org/abs/2002.08909)

- **Authors**: [Kelvin Guu](https://twitter.com/kelvin_guu), [Kenton Lee](https://twitter.com/kentonctlee), Zora Tung, [Panupong Pasupat](https://twitter.com/IcePasupat), [Ming-Wei Chang](https://twitter.com/mchang21)

- **Presenter**: [Joe Davison](https://twitter.com/joeddav)

- **Presentation**: [Write-up](https://joeddav.github.io/blog/2020/03/03/REALM.html)



#### February 25, 2020

- **Paper**: [Adaptively Sparse Transformers](https://arxiv.org/abs/1909.00015) 

- **Authors**: Gonçalo M. Correia, [Vlad Niculae](https://twitter.com/vnfrombucharest), André F.T. Martins

- **Presenter**: [Sasha Rush](https://twitter.com/srush_nlp)

- **Presentation**: [Colab notebook](https://colab.research.google.com/drive/1EB7MI_3gzAR1gFwPPO27YU9uYzE_odSu)



### Planned Discussions

No planned discussions for the moment, check back soon.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/huggingface/awesome-papers

Awesome Lists containing this project

README