Awesome-Long-Context-Language-Modeling
Papers on Long Context Language Modeling
https://github.com/davendw49/Awesome-Long-Context-Language-Modeling
Introduction (Draft by ChatGPT😄)
- ChatGPT-3.5 has a maximum context window of **4,096** tokens. This limitation poses challenges for longer pieces of text, since any relevant information beyond the context window is simply cut off; a short sketch after this list illustrates the truncation.
- Figure taken from Longformer
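To make the limitation concrete, here is a minimal, illustrative sketch of context-window truncation. It approximates tokens by whitespace splitting purely for illustration (real models use subword tokenizers, and the 4,096 budget is just the figure quoted above):

```python
def truncate_to_context_window(text: str, max_tokens: int = 4096) -> str:
    """Crude illustration of context-window truncation.

    Whitespace "tokens" stand in for real subword tokens; anything past
    the budget is silently dropped, which is how relevant information
    at the end of a long document gets lost.
    """
    tokens = text.split()
    return " ".join(tokens[:max_tokens])

long_report = "section " * 10_000           # ~10k pseudo-tokens
prompt = truncate_to_context_window(long_report)
print(len(prompt.split()))                   # 4096: the tail never reaches the model
```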
PaperList
Memory/Cache-Augmented Models
- Dual Cache for Long Document Neural Coreference Resolution
- Augmenting Language Models with Long-Term Memory
- In-context Autoencoder for Context Compression in a Large Language Model
- LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models
- Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
- Context Compression for Auto-regressive Transformers with Sentinel Tokens
- Learned Token Pruning for Transformers
- Block-Recurrent Transformers
- Recurrent Memory Transformer
- Memorizing Transformers
- Compressive Transformers for Long-Range Sequence Modelling
- Focused Transformer: Contrastive Training for Context Scaling
- Compressing Context to Enhance Inference Efficiency of Large Language Models
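Several of the papers above (e.g., Memorizing Transformers, Recurrent Memory Transformer, Block-Recurrent Transformers) extend the effective context by caching keys/values or compressed states from earlier segments. Below is a minimal single-head sketch of segment-level KV caching in the spirit of that family; the function name, shapes, and `mem_len` are assumptions for illustration, not any paper's reference implementation.

```python
import torch
import torch.nn.functional as F

def attention_with_memory(q, k, v, mem_k=None, mem_v=None, mem_len=512):
    """Single-head attention over the current segment plus cached memory.

    q, k, v: (T, d) tensors for the current segment.
    mem_k, mem_v: keys/values cached from earlier segments (or None).
    Causal masking is omitted to keep the sketch short.
    """
    if mem_k is not None:
        k = torch.cat([mem_k, k], dim=0)       # (M + T, d)
        v = torch.cat([mem_v, v], dim=0)
    scores = q @ k.T / k.shape[-1] ** 0.5       # (T, M + T)
    out = F.softmax(scores, dim=-1) @ v         # (T, d)
    # Keep only the most recent `mem_len` entries as the next segment's
    # memory, detached so gradients do not flow across segments.
    return out, k[-mem_len:].detach(), v[-mem_len:].detach()

# Process a long sequence as a stream of fixed-size segments.
d, mem_k, mem_v = 64, None, None
for segment in torch.randn(4, 128, d):          # 4 segments of 128 tokens
    q = k = v = segment
    out, mem_k, mem_v = attention_with_memory(q, k, v, mem_k, mem_v)
```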
Transformer Variants (redesign the attention/KV structure or positional embeddings of the Transformer)
- Adapting Language Models to Compress Contexts
- LONGNET: Scaling Transformers to 1,000,000,000 Tokens
- Blockwise Parallel Transformer for Long Context Large Models
- ETC: Encoding Long and Structured Inputs in Transformers
- Improving Long Context Document-Level Machine Translation
- Extending Context Window of Large Language Models via Positional Interpolation
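One lightweight recipe in this category, from "Extending Context Window of Large Language Models via Positional Interpolation", rescales position indices so that a longer input is squeezed back into the positional range seen during pre-training instead of extrapolating beyond it. A rough sketch of the idea on RoPE-style angles follows; `train_len`, `dim`, and the base are illustrative assumptions, not values from any particular model.

```python
import torch

def rope_angles(positions, dim, base=10000.0):
    """Rotary-position-embedding angles for (possibly fractional) positions."""
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
    return torch.outer(positions, inv_freq)          # (T, dim // 2)

def interpolated_positions(seq_len, train_len=2048):
    """Positional interpolation: compress positions of a long sequence into
    [0, train_len) so the model never sees out-of-range positions."""
    scale = min(1.0, train_len / seq_len)
    return torch.arange(seq_len, dtype=torch.float32) * scale

# An 8k-token input reuses the same angle range as a 2k-token training input.
angles = rope_angles(interpolated_positions(8192), dim=128)
print(angles.shape)                                   # torch.Size([8192, 64])
```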
Window-Based/On-the-fly Methods
- Efficient Long-Text Understanding with Short-Text Models
- LongCoder: A Long-Range Pre-trained Language Model for Code Completion
- Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
- Train short, test long: Attention with linear biases enables input length extrapolation
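A representative idea from this group, "Train Short, Test Long" (ALiBi), replaces learned position embeddings with a per-head linear penalty on attention scores that grows with the query-key distance, which lets recent tokens dominate and helps extrapolate past the training length. The sketch below builds such a bias; the slope schedule follows the geometric form described in the paper, while shapes and variable names are illustrative.

```python
import torch

def alibi_bias(seq_len, num_heads):
    """ALiBi-style additive bias: head h penalizes score(i, j) by
    slope_h * (j - i), i.e. linearly in how far back token j is."""
    # Geometric slope schedule 2^(-8/H), 2^(-16/H), ... for H heads.
    slopes = 2.0 ** (-8.0 * torch.arange(1, num_heads + 1) / num_heads)
    # dist[i, j] = j - i  (negative for past tokens, 0 on the diagonal).
    pos = torch.arange(seq_len)
    dist = (pos[None, :] - pos[:, None]).clamp(max=0)
    # A separate causal mask still has to hide future tokens (j > i).
    return slopes[:, None, None] * dist               # (H, T, T), added to scores

bias = alibi_bias(seq_len=16, num_heads=8)
print(bias.shape, bias[0, -1, 0].item())              # oldest token gets the largest penalty
```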
Analysis
- Lost in the Middle: How Language Models Use Long Contexts
- Do Long-Range Language Models Actually Use Long-Range Context?
- Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Reinforcement Learning
Benchmark
CV-Inspired
Contact Me