Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-llm-papers
😎 Awesome lists about all kinds of LLM related papers
https://github.com/InfiniteAICreations/awesome-llm-papers
-
LLMs
- Code Llama: Open Foundation Models for Code
- An Introduction to Vision-Language Modeling
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
- Advancing Multimodal Medical Capabilities of Gemini
- CodeGemma: Open Code Models Based on Gemma
-
Agents
-
Evaluation
-
Vision
-
Text / Information / Knowledge
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks - Explores a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG): models that combine pre-trained parametric and non-parametric memory for language generation.
- Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering
-
Image
-
Architecture
- Attention is All You Need - Introduces the Transformer architecture, built on a multi-head attention mechanism.
- Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention - Introduces an efficient method to scale Transformer-based Large Language Models (LLMs) to infinitely long inputs with bounded memory and computation.
- A Primer on the Inner Workings of Transformer-based Language Models - A concise technical introduction to techniques for interpreting the inner workings of Transformer-based language models.
- KAN: Kolmogorov-Arnold Networks - Inspired by the Kolmogorov-Arnold representation theorem, the paper proposes Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs).
-
Survey
- A Survey of Large Language Models
- A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
- A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT
- Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
- A Comprehensive Overview of Large Language Models
- RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
-
Operating System
-
Text To SQL
- Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
- Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
- DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction
- KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers
- Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
-
LLM fine-tuning
-
Security
-
Audio