Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code and related datasets.
https://github.com/codefuse-ai/Awesome-Code-LLM

Last synced: 5 days ago
JSON representation

5. Methods/Models for Downstream Tasks
- Code RAG
  - 2024-11
  - 2024-05
  - 2024-05
  - 2025-03
  - 2024-09
  - 2024-11
  - 2024-10
  - 2024-10
  - 2024-06
  - 2025-02
  - 2024-12
  - 2025-01
  - 2025-03
  - 2025-03
  - 2025-02
- Code Commenting and Summarization
  - 2024-10
  - 2024-10
  - 2020-05
  - 2020-12
  - 2021-04
  - 2022-03
  - 2023-03
  - 2023-05
  - 2023-08
  - 2023-08
  - 2024-04
  - 2024-10
  - 2025-01
  - 2024-10
  - 2025-02
  - 2025-02
  - 2024-06
  - 2024-06
  - 2024-09
  - 2024-07
  - 2024-08
  - 2024-04
  - 2024-10
  - 2024-08
  - 2024-10
  - 2022-05
  - 2024-10
  - 2025-01
  - 2024-09
  - 2024-08
  - 2024-10
  - 2024-04
  - 2024-10
  - 2024-12
  - 2024-05 - Mint/DocuMint)]
  - 2024-05
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-05
  - 2024-10
  - 2024-10
  - 2024-06
  - 2024-07
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2025-01
  - 2025-02
  - 2024-10
  - 2024-10
  - 2024-10
  - 2025-02
- Code Similarity and Embedding (Clone Detection, Code Search)
  - 2024-11
  - 2024-11
  - 2024-11
  - 2024-11
  - 2024-11
  - 2024-07
  - 2024-05
  - 2024-04
  - 2024-01
  - 2024-08
  - 2024-08
  - 2024-08
  - 2024-09
  - 2024-10
  - 2024-08
  - 2024-08
  - 2025-03
  - 2023-05
  - 2020-09
  - 2024-12
  - 2024-10
  - 2024-04
  - 2024-05
  - 2024-10
  - 2024-06
  - 2024-10
  - 2024-06
  - 2024-07
  - 2024-11
- Text-To-SQL
  - 2024-02
  - 2024-07
  - 2024-05
  - 2024-05
  - 2024-05
  - 2024-11
  - 2024-11
  - 2024-02
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-02
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-08
  - 2024-08
  - 2024-08
  - 2024-07
  - 2024-07
  - 2024-04
  - 2024-04
  - 2024-04
  - 2024-05
  - 2024-08
  - 2024-08
  - 2024-02
  - 2024-08
  - 2024-08
  - 2025-03
  - 2025-03
  - 2025-03
  - 2024-07
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-08
  - 2024-02
  - 2024-02
  - 2024-02
  - 2024-08
  - 2024-08
  - 2025-03
  - 2024-08
  - 2024-11
  - 2024-11
  - 2024-04
  - 2021-09
  - 2022-04
  - 2022-09
  - 2022-10
  - 2022-10
  - 2023-03
  - 2023-04
  - 2023-05
  - 2023-05
  - 2023-05
  - 2023-07
  - 2023-08
  - 2024-03
  - 2024-05
  - 2024-10
  - 2024-09
  - 2024-09
  - 2024-12
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-02
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-05
  - 2024-05
  - 2024-10
  - 2024-11
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-10
  - 2024-12
  - 2024-12
  - 2023-12
  - 2024-10
  - 2024-10
  - 2024-07
  - 2024-07
  - 2024-07
  - 2024-07
  - 2024-11
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-02
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-03
  - 2025-03
  - 2025-03
  - 2025-02
  - 2025-03
  - 2025-03
  - 2025-03
- Vulnerability Detection
  - 2024-05
  - 2024-11
  - 2024-11
  - 2024-11
  - 2024-12
  - 2025-02
  - 2024-06
  - 2024-09
  - 2024-07
  - 2024-07
  - 2024-07
  - 2021-05
  - 2021-06
  - 2021-10
  - 2022-01
  - 222-04
  - 2022-05
  - 2022-05
  - 2022-09
  - 2022-12
  - 2023-05
  - 2023-06
  - 2023-08
  - 2023-08
  - 2023-10
  - 2023-11
  - 2023-12
  - 2024-01
  - 2024-01
  - 2024-02
  - 2024-03
  - 2024-04
  - 2024-05
  - 2024-05
  - 2024-08
  - 2024-08
  - 2025-03
  - 2024-08
  - 2024-07
  - 2024-07
  - 2024-07
  - 2024-07
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-10
  - 2024-02
  - 2025-01
  - 2025-01
  - 2024-08
  - 2024-08
  - 2024-11
  - 2024-11
  - 2024-09
  - 2024-04
  - 2024-04
  - 2024-04
  - 2024-03
  - 2018-04
  - 2020-01
  - [paper
  - 2024-05
  - 2024-10
  - 2019-10
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-12
  - 2024-12
  - 2023-01
  - 2024-10
  - 2024-10
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-05
  - 2024-11
  - 2024-06
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-06
  - 2024-07
  - 2024-11
  - 2024-11
  - 2025-02
  - 2025-02
  - 2024-12
  - 2024-12
  - 2024-12
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-03
  - 2025-03
  - 2025-02
  - 2025-02
- Program Repair
  - 2024-05
  - 2024-05
  - 2024-05
  - 2024-04
  - 2021-05
  - 2021-06
  - 2022-05
  - 2022-07
  - 2022-08
  - 2022-10
  - 2023-01
  - 2023-02
  - 2023-03
  - 2023-04
  - 2023-04
  - 2023-06
  - 2025-01
  - 2025-01
  - 2024-04
  - 2024-04
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-08
  - 2024-09
  - 2024-05
  - 2024-04
  - 2024-04
  - 2024-05
  - 2025-03
  - 2024-07
  - 2025-01
  - 2024-09
  - 2024-08
  - 2024-08
  - 2024-08
  - 2025-03
  - 2024-08
  - 2022-11
  - 2023-12
  - 2024-04
  - 2024-04
  - 2024-10
  - 2024-09
  - 2024-09
  - 2024-09
  - 2025-02
  - 2025-02
  - 2024-11
  - 2024-06
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
  - 2021-02
  - 2025-03
  - 2025-02
- Code Review
  - 2024-05
  - 2024-02
  - 2024-11
  - 2025-01
  - 2025-01
  - 2025-02
  - 2024-07
  - 2022-01
  - 2022-08
  - 2023-02
  - 2023-08
  - 2024-04
  - 2024-08
  - 2025-01
  - 2024-11
  - 2024-09
  - 2024-12
  - 2024-12
  - 2024-12
  - 2024-11
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-11
  - 2024-12
  - 2024-06
  - 2024-07
  - 2024-07
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-02
  - 2025-03
- Code Translation
  - 2024-05
  - 2025-01
  - 2025-01
  - 2024-08
  - 2024-08
  - 2024-06
  - 2024-04
  - 2024-04
  - 2024-07
  - 2023-10
  - 2024-03
  - 2018-02
  - 2018-07
  - 2021-10
  - 2022-06
  - 2022-07
  - 2023-02
  - 2023-06
  - 2023-08
  - 2023-11
  - 2024-05
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-12
  - 2024-12
  - 2024-11
  - 2024-07
  - 2024-11
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-03
  - 2025-03
- Repository-Level Coding
  - 2023-05
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-07
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-03
  - 2024-04
  - 2024-05
  - 2025-03
  - 2025-04
  - 2024-09
  - 2024-03
  - 2022-06
  - 2022-12
  - 2024-03
  - 2023-12
  - 2024-08 - philia/CoEdPilot)]
  - 2024-10
  - 2024-10
  - 2024-06
  - 2024-06
  - 2024-05
  - 2024-11
  - 2024-01
  - 2024-12
  - 2024-12
  - 2024-07
  - 2024-12
  - 2025-03
  - 2025-02
- Compiler Optimization
  - 2023-06
  - 2024-02
  - 2024-11
  - 2024-08
  - 2025-03
  - 2024-08
  - 2024-08
  - 2023-09
  - 2023-10
  - 2024-12
  - 2024-06
  - 2024-06
  - 2024-10
  - 2024-06
  - 2024-12
  - 2024-12
  - 2025-01
- Oracle Generation
  - 2024-05
  - 2024-11
  - 2024-07
  - 2024-05
  - 2024-07
  - 2025-01
  - 2024-11
  - 2020-09
  - 2021-09
  - 2025-02
  - 2024-10
  - 2024-11
  - 2025-01
  - 2025-02
  - 2025-02
- Frontend Development
  - 2024-11
  - 2024-04
  - 2025-02
  - 2024-07
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-06
  - 2024-10
  - 2024-03
  - 2024-09
  - 2024-09
  - 2024-11
  - 2024-11
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - 2024-03
  - [paper
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-05
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-11
  - 2024-12
  - 2025-03
- Program Proof
  - 2024-11
  - 2025-01
  - 2025-02
  - 2025-02
  - 2023-03
  - 2023-10
  - 2024-02
  - 2024-09
  - 2024-05
  - 2024-05
  - 2024-09
  - 2023-11
  - 2024-10
  - 2024-10
  - 2024-12
  - 2025-02
  - 2024-12
  - 2024-12
  - 2025-02
  - 2025-03
  - 2025-03
  - 2025-02
- Code Generation
  - 2024-11
  - 2024-11
  - 2024-06
  - 2024-12
  - 2024-04
  - 2024-03
  - 2025-01
  - 2024-06
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-08
  - 2023-11
  - 2024-04
  - 2023-09
  - 2024-01
  - 2024-08
  - 2024-07
  - 2024-07
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-10
  - 2024-08
  - 2024-08
  - 2024-03
  - 2024-08
  - 2024-08
  - 2024-08
  - 2024-09
  - 2024-09
  - 2024-11
  - 2024-11
  - 2024-11
  - 2024-04
  - 2024-04
  - 2024-11
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2025-02
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-05
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-11
  - 2024-06
  - 2024-12
  - 2024-06
  - 2024-11
  - 2024-10
  - 2025-02
  - 2025-02
  - 2024-12
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-03
  - 2025-03
  - 2025-03
  - 2025-03
  - 2025-02
  - 2025-03
  - 2025-03
- Fuzz Testing
  - 2024-11
  - 2022-12
  - 2023-08
  - 2023-10
  - 2024-06
  - 2024-09
  - 2024-09
  - 2025-01
  - 2024-11
  - 2024-12
  - 2024-10
  - 2024-11
- Malicious Code Detection
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - 2024-08
  - 2024-04
  - [paper
  - [paper
  - 2024-09
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - 2023-08
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - 2023-03
  - 2023-05
  - 2023-08
  - 2023-12
  - 2023-12
  - 2024-03
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - 2024-07
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - 2025-03
  - [paper
- Requirement Engineering
  - 2024-11
  - 2024-05
  - 2023-09
  - 2023-10
  - 2023-11
  - 2024-04
  - 2024-04
  - 2024-04
  - 2024-08
  - 2024-09
  - 2024-09
  - 2024-09
  - 2025-01
  - 2024-08
  - 2024-08
  - 2024-09
  - 2024-09
  - 2024-12
  - 2024-12
  - 2024-05
  - 2024-10
  - 2024-10
  - 2024-09
  - 2024-10
  - 2024-11
  - 2024-11
  - 2024-12
  - 2025-01
- Type Prediction
  - 2021-08
  - 2024-04
  - 2023-07
  - 2024-10
- Test Generation
  - 2024-04
  - 2025-01
  - 2023-05
  - 2025-02
  - 2024-06
  - 2024-06
  - 2024-09
  - 2024-04
  - 2024-08
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-10
  - 2024-08
  - 2024-11
  - 2024-09
  - 2024-04
  - 2023-02
  - 2023-02
  - 2023-04
  - 2023-10
  - 2024-04
  - 2023-08
  - 2023-08
  - 2020-09
  - 2023-02
  - 2023-05
  - 2023-05
  - 2023-07
  - 2023-07
  - 2023-10
  - 2024-03
  - 2024-12
  - 2024-05
  - 2024-04
  - 2024-04
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-12
  - 2025-02
  - 2025-01
  - 2024-11
  - 2024-11
  - 2024-06
  - 2024-06
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-12
  - 2024-12
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-11
  - 2024-07
  - 2024-07
  - 2024-12
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-02
  - 2023-10
  - 2025-03
  - 2025-03
  - 2025-03
  - 2025-03
- Code Refactoring and Migration
  - 2025-01
  - 2024-11
  - 2025-03
  - 2024-11
  - 2024-11
  - 2024-12
  - 2025-02
  - 2024-11
  - 2024-11
  - 2025-02
  - 2025-01
  - 2025-01
  - 2025-03
  - 2025-03
- Mutation Testing
  - 2025-01
  - 2024-06
  - 2022-03
  - 2023-01
  - 2024-04
  - 2024-10
  - 2025-01
- Binary Analysis and Decompilation
  - 2025-01
  - 2025-02
  - 2025-02
  - 2025-03
  - 2019-05
  - 2019-06
  - 2020-09
  - 2021-03
  - 2022-12
  - 2023-01
  - 2023-05
  - 2023-11
  - 2024-03
  - 2018-03
  - 2018
  - 2024-10
  - 2024-06
  - 2024-11
  - 2024-11
  - 2025-02
  - 2024-12
  - 2025-03
- Automated Machine Learning
  - 2024-10
  - 2024-05
  - 2025-03
  - 2024-10
  - 2024-11
  - 2024-11
  - 2025-02
  - 2024-10
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-03
  - 2025-04
- Software Configuration
  - 2025-01
  - 2025-03
  - 2024-02
  - 2023-10
  - 2023-11
  - 2023-12
  - 2024-11
  - 2025-02
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-03
  - 2025-03
- Log Analysis
  - 2024-06
  - 2024-06
  - 2024-09
  - 2024-09
  - 2024-08
  - 2022-08
  - 2023-02
  - 2023-06
  - 2023-08
  - 2023-09
  - 2023-09
  - 2023-10
  - 2024-04
  - 2024-05
  - 2024-12
  - 2024-10
  - 2024-06
  - 2024-06
  - 2024-11
  - 2024-10
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
- Code Ranking
  - 2022-06
  - 2024-08
  - 2024-08
  - 2024-08
  - 2023-10
  - 2024-09
  - 2022-11
  - 2023-02
  - 2024-12
  - 2024-10
  - 2025-02
  - 2025-02
- Commit Message Generation
  - 2024-04
  - 2024-10
  - 2025-01
  - 2025-03
  - 2025-02
- Software Modeling
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-10
  - 2024-10
  - 2022-12
  - 2024-04
  - 2024-04
  - 2024-06
- Code QA & Reasoning
  - 2024-12
  - 2025-02
  - 2025-03
3. When Coding Meets Reasoning
- 3.1 Coding for Reasoning
  - 2024-02
  - 2024-02
  - 2024-11
  - 2024-01
  - 2024-01
  - 2023-08
  - 2023-10
  - 2024-05
  - 2024-05
  - 2024-12
  - 2024-08
  - 2024-08
  - 2024-07
  - 2024-01
  - 2024-02
  - 2024-07
  - 2024-03
  - 2024-02
  - 2024-03
  - 2024-07
  - 2024-07
  - 2022-11 - machines/pal)]
  - 2022-11 - of-Thoughts)]
  - 2023-12
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-01
  - 2023-05
  - 2024-11
  - 2024-11
  - 2025-02
  - 2025-02
  - 2025-02
  - 2024-04
  - 2024-09
  - 2025-02
  - 2025-02
  - 2024-11
  - 2024-10
  - 2024-09
  - 2024-10
  - 2024-05
  - 2024-05
  - 2024-10
  - 2024-06
  - 2024-10
  - 2024-12
  - 2024-07
  - 2024-11
  - 2025-02
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-03
- 3.3 Code Agents
  - 2024-11
  - 2023-10
  - 2024-04
  - 2024-04
  - 2025-01
  - 2025-02
  - 2025-02
  - 2024-06
  - 2024-03
  - 2024-05
  - 2024-09
  - 2024-08
  - 2024-06
  - 2024-03
  - 2024-01
  - 2024-07
  - 2023-04
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-10
  - 2024-09 - websoft/PairCoder)]
  - 2024-11
  - 2024-11
  - 2025-02
  - 2023-07
  - 2023-08
  - 2024-03
  - 2024-03
  - 2024-05
  - 2024-10
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-05
  - 2024-05
  - 2024-10
  - 2024-10
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-10
  - 2024-11
  - 2024-12
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-02
  - 2025-03
  - 2025-03
  - 2025-03
  - 2025-03
- 3.5 Frontend Navigation
  - 2024-11
  - 2024-11
  - 2024-11
  - 2024-11
  - 2025-01
  - 2024-04
  - 2021-10
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-04
  - 2025-03
  - 2024-04
  - 2023-07
  - 2021-10
  - 2021-12
  - 2022-01
  - 2022-01
  - 2022-02
  - 2022-02
  - 2022-07
  - 2022-10
  - 2022-10
  - 2023-01
  - 2023-06
  - 2023-07
  - 2023-12
  - 2024-01
  - 2024-01
  - 2024-02
  - 2024-02
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-09
  - 2024-12
  - 2024-11
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-06
  - 2024-12
  - 2024-12
  - 2024-10
  - 2024-11
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-01
- 3.4 Interactive Coding
  - 2024-11
  - 2024-11
  - 2024-11
  - 2024-05
  - 2023-06
  - 2020-06
  - 2022-08
  - 2023-03
  - 2023-03
  - 2023-04
  - 2023-05
  - 2024-03
  - 2025-02
  - 2024-06
  - 2024-08
  - 2024-07
  - 2023-11
  - 2017-03
  - 2023-06
  - 2024-02
  - 2024-11
  - 2024-04
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-03
  - 2023-05
  - 2024-03
  - 2024-09
  - 2024-12
  - 2025-02
  - 2025-02
  - 2025-02
  - 2024-10
  - 2024-05
  - 2024-05
  - 2024-05
  - 2024-10
  - 2024-11
  - 2025-02
  - 2024-12
  - 2024-12
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-02
  - 2025-02
  - 2025-03
  - 2025-03
- 3.2 Code Simulation
  - 2025-02
  - 2024-07
  - 2024-04
  - 2025-02
  - 2024-01
  - 2024-02
  - 2024-02
  - 2024-03
  - 2024-03
  - 2025-02
  - 2024-10
  - 2024-10
  - 2025-01
  - 2025-03
4. Datasets
- 4.2 Benchmarks
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - dot-jar/bugs-dot-jar)] |
  - [paper
  - [paper
  - [paper - tan/CoCoNut-Artifact)] |
  - [paper
  - [paper - USZ/FixJS)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - code-search)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
8. Datasets
- 8.2 Benchmarks
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - benchmarks/tree/main/MBUPP)] |
  - [paper - Eval)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - li/CleanVul)] |
  - [paper - swe-bench/multi-swe-bench.github.io)] |
  - [paper - code)] |
  - [paper
  - [paper
  - [paper - data/)] |
  - [paper - lily.github.io/spider)] |
  - [paper
  - [paper - codechanges)] |
  - [paper
  - [paper - group/mineSStuBs)] |
  - [paper
  - [paper - KTH/megadiff)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - docstring-corpus)] |
  - [paper
  - [paper
  - [paper
  - [paper - group/diversevul)] |
  - [paper
  - [paper - Targaryen/MC-Evaluation)] |
  - [paper - 810A)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - Coder/tree/main/qwencoder-eval/instruct/CodeArena)] |
  - [paper - Lab/BookSQL)] |
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2025-02
  - [paper - dougherty/fvapps)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - NLP/novicode)] |
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - sri/TFix)] |
  - [paper
  - 2024-02
  - [paper
  - [paper - bench/SciCode)] |
  - [paper
  - [paper
  - [paper - team/coir)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2023-11
  - 2024-08
  - [paper
  - [paper
  - [paper - ai/WebApp1K-React)] |
  - [paper
  - [paper
  - [paper - research/google-research/tree/master/mbpp)] [[MathQA-Python](https://github.com/google/trax/blob/master/trax/examples/MathQA_Python_generation_notebook.ipynb)] |
  - [paper
  - [paper
  - [paper
  - [paper - fixes)] |
  - [paper - bugs/bears-benchmark)] |
  - [paper
  - [paper - hu/TL-CodeSum)] |
  - [paper
  - [paper - kb/tree/main/MSR2019)] |
  - [paper
  - [paper
  - [paper - lab.org/projects/TypeWriter/data.tar.gz)] |
  - [paper
  - [paper
  - [paper - types-4-py-dataset)] |
  - [paper - group/CoDiSum)] |
  - [paper
  - [paper - autosuggestions)] |
  - [paper
  - [paper - Research/commit_message_generation)] |
  - [paper
  - [paper
  - [paper
  - [paper - jie-Huang/CoCoNote)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - V/HumanEval-V-Benchmark)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - us/download/103554)] |
  - [paper - EA6F/)] |
  - [paper - bench.github.io/)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - group/TypeT5)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - eval)] |
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - Research/CodeJudge-Eval)] |
  - [paper
  - [paper
  - [paper - dot-jar/bugs-dot-jar)] |
  - [paper
  - [paper - USZ/FixJS)] |
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - X/cruxeval-x)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2023-10 - ai/codefuse-evaluation)]
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper - AES-AI4Code/CodeQuestionAnswering)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2024-04
  - [paper
  - [paper
  - [paper - nlp/USACO)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper - lily.github.io/sparc)] |
  - [paper - lily.github.io/cosql)] |
  - [paper
  - [paper
  - [paper
  - [paper - lab-code-research/XLCoST)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - Question-Code-Dataset)] |
  - [paper - corpus.github.io/)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - conala)] |
  - [paper - plugin/nl2code-dataset)] |
  - [paper - E)] |
  - [paper - science/mxeval)] |
  - [paper - jie-Huang/ExeDS)] |
  - [paper - ai/DS-1000)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - DK)] |
  - [paper - SpiderCG)] |
  - [paper - bench.github.io/)] |
  - [paper
  - [paper
  - [paper - Code-Search-Evaluation-Dataset)] |
  - [paper - LAB-SJTU/CosBench/wiki)] |
  - [paper
  - [paper
  - [paper - Code/NL-code-search-WebQuery)] |
  - [paper
  - [paper
  - [paper
  - [paper - easel/StudentEval)] |
  - [paper
  - [paper - bench.github.io/)] |
  - [paper
  - [paper - 0/commit0)] |
  - 2024-11
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - gmu/mHumanEval)] |
  - [paper
  - [paper
  - [paper - nl2sql)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2025-02
  - [paper - eval)] |
  - [paper - deepmind/code_contests)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - xl)] |
  - [paper
  - [paper - ai/Spider2)] |
  - [paper - level-Vulnerability-Detection)] |
  - 2025-02 - deepmind/bbeh)]
  - 2025-02
  - 2025-02
  - 2025-02
  - [paper - Bench-D65E/README.md)] |
  - [paper - XL)] |
  - [paper - ai/geospatial-code-llms-dataset)] |
  - [paper - AI4Code/CodeMMLU)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - Bench)] |
  - [paper
  - [paper
  - 2024-10
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - Eval-Team/M2RC-Eval)] |
  - [paper
  - 2024-06 - rag-bench/code-rag-bench)]
  - [paper - bench/JavaBench)] |
  - [paper
  - 2024-06 - Research/lca-baselines)]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2024-12 - eval)]
  - [paper
  - [paper - codes/VulDeePecker)] |
  - [paper
  - [paper - Research/plot_bench)] |
  - [paper - Coder/tree/main/qwencoder-eval/instruct/CodeArena)] |
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2024-06
  - [paper - project/bigcodebench)] |
  - [paper
  - [paper - AI/RES-Q)] |
  - [paper
  - [paper
  - [paper
  - [paper - eval)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - liuzy/CodeUpdateArena)] |
  - [paper
  - [paper
  - [paper
  - [paper - tan/CoCoNut-Artifact)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - 2025-02
  - [paper - Benchmark)] |
  - 2020-09
  - 2023-02
  - [paper - sudo/DependEval)] |
  - [paper - lab-code-research/MuST-CoST)] |
  - [paper - TransEval)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - 2025-02
  - 2025-02
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - Pro/CodeEval-Pro/tree/main)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - Bench)] |
  - 2025-01
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - code-search)] |
  - 2025-01
  - [paper
  - [paper
  - [paper
  - [paper - hu/DeepCom)] |
  - [paper - level-Vulnerability-Detection)] |
  - [paper - benchmark)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - Benchmark/Tests-C250)] |
  - [paper - 7B74/README.md)] |
  - [paper - 9/probench)] |
  - [paper
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
- 8.1 Pretraining
  - 2025-01
  - [data
  - 2022-11 - stack)]
  - 2023-03 - data/roots)]
  - 2024-02 - stack-v2-dedup)]
2. Models
- 2.5 Reinforcement Learning on Code
  - 2024-11
  - 2023-10
  - 2022-03
  - 2022-07
  - 2023-01 - lab-code-research/PPOCoder)]
  - 2023-07 - scut/RLTF)]
  - 2024-04
  - 2024-02
  - 2024-09
  - 2025-02
  - 2024-11
  - 2024-10
  - 2024-10
  - 2024-01
  - 2025-02
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-06
  - 2024-06
  - 2025-02
  - 2025-02
- 2.2 Existing LLM Adapted to Code
  - 2023-10
  - 2022-06
  - 2023-05
  - 2024-03
  - 2024-03
  - 2024-03
  - 2023-08
  - 2024-04
  - 2024-09
  - 2024-09
  - 2024-06
  - 2024-11
  - 2025-03
- 2.1 Base LLMs and Pretraining Strategies
  - 2024-05 - 7B)]
  - 2025-01
  - 2024-06
  - 2024-04 - ai/JetMoE)]
  - 2022-04 - neox)]
  - 2023-03
  - 2023-09 - 1_5)]
  - 2023-09 - inc/Baichuan2)]
  - 2023-09
  - 2024-01 - of-experts/)]
  - 2024-01
  - 2024-06
  - 2024-11
  - 2025-01
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-07
  - 2024-07
  - 2024-09
  - 2024-07
  - 2024-05 - ai/DeepSeek-V2)]
  - 2024-04
  - 2024-04
  - 2024-04 - FLM)]
  - 2024-07
  - 2024-04 - llama/llama3)] [[paper](https://arxiv.org/abs/2407.21783)]
  - 2023-10 - src)]
  - 2023-12
  - 2023-12
  - 2023-12 - research/YAYI2)]
  - 2024-01 - ai/DeepSeek-LLM)]
  - 2024-07
  - 2024-02
  - 2024-01 - ai/DeepSeek-MoE)]
  - 2022-01
  - 2023-07
  - 2024-02 - open-models/)]
  - 2024-04
  - 2024-06
  - 2024-09
  - 2024-08
  - 2024-09
  - 2024-11
  - 2024-04 - 34B)]
  - 2024-12
  - 2024-03 - ai/Yi)]
  - 2024-03 - 3-family)]
  - 2025-02
  - 2025-03
  - 2024-12
  - 2024-12
  - 2025-02
  - 2024-11
  - 2024-10
  - 2024-05 - art-projection/MAP-NEO)]
  - 2024-10
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-12
  - 2024-12
  - 2024-12
  - 2024-12
  - 2024-06
  - 2024-11
  - 2025-03
  - 2024-12
  - 2024-12
  - 2024-12
  - 2025-02
  - 2025-04
  - 2025-03
- 2.4 (Instruction) Fine-Tuning on Code
  - 2023-09
  - 2023-11
  - 2024-05
  - 2024-05
  - 2023-06
  - 2023-07
  - 2023-11 - ai/MFTCoder)]
  - 2024-04
  - 2024-02
  - 2024-06
  - 2024-06
  - [paper
  - 2024-07
  - 2023-12
  - 2024-02
  - 2024-04
  - 2024-04 - uiuc/xft)]
  - 2024-07
  - 2024-08
  - ACL 2024 Findings
  - 2024-09
  - 2024-09
  - 2024-09
  - 2024-11
  - 2025-03
  - 2023-12
  - 2024-01
  - 2024-03
  - 2025-03
  - 2024-12
  - 2025-02
  - 2025-02
  - 2025-02
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-05
  - 2024-05
  - 2024-10
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-06
  - 2024-10
  - 2024-07
  - 2024-07
  - 2024-11
  - 2025-01
  - 2024-06
  - 2025-03
  - 2025-04
  - 2025-04
- 2.3 General Pretraining on Code
  - 2019-12 - research/google-research/tree/master/cubert)]
  - 2020-02
  - 2020-09
  - 2021-08
  - 2021-10
  - 2022-05
  - 2020-05
  - 2021-12
  - 2022-02 - LMs)]
  - 2022-03
  - 2022-04
  - 2022-06
  - 2022-07
  - 2023-01
  - 2020-10
  - 2021-02 - mastropaolo/TransferLearning4Code)]
  - 2024-02
  - 2024-09
  - 2021-02
  - 2021-03
  - 2021-09
  - 2022-01
  - 2022-06
  - 2023-05
  - 2020-12
  - 2022-03
  - 2024-05 - 07] [[paper](https://arxiv.org/abs/2407.13739)]
  - 2024-01 - ai/DeepSeek-Coder)]
  - 2024-02
  - 2024-01
  - 2024-10
  - 2024-11
  - 2023-05
  - 2023-06 - 1)]
  - 2024-04
  - 2024-03
  - 2022-12 - code)]
  - 2024-07
  - 2025-01
  - 2025-03
7. Human-LLM Interaction
- Others
  - 2024-05
  - 2024-05
  - 2024-06
  - 2024-06
  - 2024-04
  - 2024-04
  - 2025-01
  - 2024-04
  - 2024-05
  - 2024-04
  - 2024-07
  - 2024-08
  - 2024-07
  - 2024-09
  - 2024-09
  - 2024-04
  - 2024-09
  - 2024-10
  - 2024-07
  - 2025-01
  - 2025-01
  - 2024-09
  - 2022-06
  - 2022-10
  - 2023-02
  - 2023-02
  - 2023-04
  - 2023-08
  - 2023-09
  - 2023-09
  - 2023-10
  - 2024-04
  - 2024-11
  - 2024-05
  - 2024-10
  - 2024-10
  - 2022-04
  - 2024-09
  - 2024-11
  - 2025-02
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-10
  - 2024-05
  - 2024-06
  - 2024-05
  - 2024-05
  - 2024-05
  - 2024-06
  - 2024-10
  - 2024-10
  - 2024-12
  - 2024-12
  - 2024-06
  - 2024-07
  - 2024-11
  - 2025-01
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-01
  - 2025-02
  - 2025-02
  - 2025-03
6. Analysis of AI-Generated Code
- AI-Generated Code Detection
  - 2024-05
  - 2024-09
  - 2023-10
  - 2024-04
  - 2023-05
  - 2025-01
  - 2024-11
  - 2024-11
  - 2024-12
  - 2024-12
  - 2025-02
  - 2025-02
  - 2025-02
  - 2024-05
  - 2024-12
  - 2025-02
  - 2025-03
- Robustness
  - 2023-10
  - 2024-02
  - 2025-03
  - 2024-04
  - 2024-11
  - 2024-11
  - 2024-12
  - 2024-06
  - 2024-07
- Others
  - 2024-05
  - 2023-12
  - 2024-11
  - 2025-02
  - 2024-07
  - 2024-08
  - 2024-08
  - 2024-08
  - 2024-07
  - 2024-04
  - 2024-09
  - 2025-04
  - 2025-01
  - 2025-01
  - 2024-11
  - 2024-04
  - 2024-12
  - 2024-12
  - 2024-11
  - 2024-11
  - 2025-02
  - 2025-03
  - 2025-03
  - 2024-09
  - 2024-09
  - 2024-06
  - 2024-06
  - 2024-12
  - 2025-01
  - 2025-01
  - 2025-03
  - 2025-03
- Correctness
  - 2024-11
  - 2024-06
  - 2025-02
  - 2024-06
  - 2024-07
  - 2024-08
  - 2024-08
  - 2024-08
  - 2024-09
  - 2024-09
  - 2024-02
  - 2024-09
  - 2024-09
  - 2024-03
  - 2022-05
  - 2023-04
  - 2023-08
  - 2024-03
  - 2023-03
  - 2024-11
  - 2024-10
  - 2024-09
  - 2024-12
  - 2024-11
  - 2024-06
  - 2024-11
  - 2024-07
  - 2024-11
  - 2025-02
  - 2025-02
  - 2025-03
  - 2025-03
- Security and Vulnerabilities
  - 2024-11
  - 2024-10
  - 2024-06
  - 2025-01
  - 2025-02
  - 2024-07
  - 2024-07
  - 2024-07
  - 2024-05
  - 2023-02
  - 2023-12
  - 2024-04
  - 2024-04
  - 2024-04
  - 2024-08
  - 2024-08
  - 2024-09
  - 2024-10
  - 2024-08
  - 2024-03
  - 2024-08
  - 2024-03
  - 2024-04
  - 2021-08
  - 2022-04
  - 2022-08
  - 2022-1
  - 2024-05
  - 2024-10
  - 2024-09
  - 2024-09
  - 2024-10
  - 2024-10
  - 2024-11
  - 2024-12
  - 2024-07
  - 2024-11
  - 2025-02
  - 2025-02
  - 2025-02
  - 2025-03
  - 2025-03
  - 2025-03
- Efficiency
  - 2024-06
  - 2024-06
  - 2024-04
  - 2024-07
  - 2024-05
  - 2024-08
  - 2024-07
  - 2024-02
  - 2024-02
  - 2025-03
  - 2024-10
  - 2024-12
  - 2024-11
  - 2024-10
  - 2024-11
  - 2025-02
- Bias
  - 2025-01
  - 2024-04
  - 2025-01
  - 2025-01
  - 2024-11
  - 2024-10
  - 2025-03
- Interpretability
  - 2025-02
  - 2024-07
  - 2024-07
  - 2025-02
  - 2024-06
  - 2024-12
  - 2024-07
  - 2025-01
  - 2025-03
- Privacy
  - 2025-02
  - 2025-01
  - 2024-12
  - 2024-04
  - 2024-10
  - 2024-10
- API Usage
  - 2024-06
  - 2024-08
  - 2024-09
  - 2024-09
  - 2025-03
  - 2024-12
  - 2025-02
  - 2025-03
- Hallucination
  - 2024-07
  - 2024-04
  - 2024-08
  - 2024-04
  - 2024-10
  - 2024-09
  - 2024-06
  - 2024-10
  - 2024-07
4. Code LLM for Low-Resource, Low-Level, and Domain-Specific Languages
- 3.5 Frontend Navigation
5. Datasets
- 5.2 Benchmarks
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper
  - [paper - level-Vulnerability-Detection)] |
1. Surveys
- 2022-12
- 2022-12
- 2023-08
- 2023-08
- 2023-10
- 2024-05
- 2024-10
- 2023-02
- 2023-12
- 2024-04
- 2023-12
- 2024-03
- 2024-10
- 2025-03
- 2025-03
9. Recommended Readings
- 8.2 Benchmarks
  - Mixed Precision Training
  - Neural Machine Translation by Jointly Learning to Align and Translate - decoder RNN |
  - Neural Machine Translation of Rare Words with Subword Units - pair encoding: split rare words into subword units |
  - Attention Is All You Need - attention for long-range dependency and parallel training |
  - Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models - knowledge and complex reasoning benchmark |
  - Emergent Abilities of Large Language Models
  - Scaling Instruction-Finetuned Language Models
  - Self-Instruct: Aligning Language Models with Self-Generated Instructions - generated data |
  - The Pile: An 800GB Dataset of Diverse Text for Language Modeling
  - GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
  - Improving Language Understanding by Generative Pre-Training - finetuning paradigm applied to Transformer decoder |
  - BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  - Language Models are Unsupervised Multitask Learners
  - SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
  - RoBERTa: A Robustly Optimized BERT Pretraining Approach
  - Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
  - ZeRO: Memory Optimizations Toward Training Trillion Parameter Models - efficient distributed optimization |
  - Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer - decoder pretrained with an MLM-like denoising objective |
  - Language Models are Few-Shot Learners - 2 (175B), they discovered a new learning paradigm: In-Context Learning (ICL) |
  - Measuring Massive Multitask Language Understanding - knowledge and complex reasoning benchmark |
  - LoRA: Low-Rank Adaptation of Large Language Models - efficient finetuning |
  - Finetuned Language Models Are Zero-Shot Learners - finetuning |
  - Multitask Prompted Training Enables Zero-Shot Task Generalization
  - Scaling Language Models: Methods, Analysis & Insights from Training Gopher
  - Chain-of-Thought Prompting Elicits Reasoning in Large Language Models - of-Though reasoning |
  - Training language models to follow instructions with human feedback - 3 instruction finetuned with RLHF (reinforcement learning from human feedback) |
  - Training Compute-Optimal Large Language Models
  - Large Language Models are Zero-Shot Reasoners
  - RoFormer: Enhanced Transformer with Rotary Position Embedding
  - PaLM: Scaling Language Modeling with Pathways
  - BLOOM: A 176B-Parameter Open-Access Multilingual Language Model - source dense LLM, trained on 46 languages, with detailed discussion about training and evaluation |
  - LLaMA - 4](https://arxiv.org/abs/2303.08774) or [PaLM 2](https://arxiv.org/abs/2305.10403). For comprehensive reviews on these more general topics, we refer to other sources such as [Awesome-LLM](https://github.com/Hannibal046/Awesome-LLM), [Awesome AIGC Tutorials](https://github.com/luban-agi/Awesome-AIGC-Tutorials), or for LLM applications in other specific domains: [Awesome Domain LLM](https://github.com/luban-agi/Awesome-Domain-LLM), [Awesome Tool Learning](https://github.com/luban-agi/Awesome-Tool-Learning#awesome-tool-learning), [Awesome-LLM-MT](https://github.com/hsing-wang/Awesome-LLM-MT), [Awesome Education LLM](https://github.com/Geralt-Targaryen/Awesome-Education-LLM).
News
7. User-LLM Interaction
- Others
  - 2024-05
6. Datasets
- 6.2 Benchmarks
  - [paper
  - [paper - level-Vulnerability-Detection)] |
  - [paper - level-Vulnerability-Detection)] |
  - [paper
Star History
- 5.2 Benchmarks
  - ![Star History Chart - history.com/#codefuse-ai/Awesome-Code-LLM&Date)
- 8.2 Benchmarks
  - ![Star History Chart - history.com/#codefuse-ai/Awesome-Code-LLM&Date)

Programming Languages

Python 2

Awesome-Code-LLM

5. Methods/Models for Downstream Tasks

Code RAG

Code Commenting and Summarization

Code Similarity and Embedding (Clone Detection, Code Search)

Text-To-SQL

Vulnerability Detection

Program Repair

Code Review

Code Translation

Repository-Level Coding

Compiler Optimization

Oracle Generation

Frontend Development

Program Proof

Code Generation

Fuzz Testing

Malicious Code Detection

Requirement Engineering

Type Prediction

Test Generation

Code Refactoring and Migration

Mutation Testing

Binary Analysis and Decompilation

Automated Machine Learning

Software Configuration

Log Analysis

Code Ranking

Commit Message Generation

Software Modeling

Code QA & Reasoning

3. When Coding Meets Reasoning

3.1 Coding for Reasoning

3.3 Code Agents

3.5 Frontend Navigation

3.4 Interactive Coding

3.2 Code Simulation

4. Datasets

4.2 Benchmarks

8. Datasets

8.2 Benchmarks

8.1 Pretraining

2. Models

2.5 Reinforcement Learning on Code

2.2 Existing LLM Adapted to Code

2.1 Base LLMs and Pretraining Strategies

2.4 (Instruction) Fine-Tuning on Code

2.3 General Pretraining on Code

7. Human-LLM Interaction

Others

6. Analysis of AI-Generated Code

AI-Generated Code Detection

Robustness

Others

Correctness

Security and Vulnerabilities

Efficiency

Bias

Interpretability

Privacy

API Usage

Hallucination

4. Code LLM for Low-Resource, Low-Level, and Domain-Specific Languages

3.5 Frontend Navigation

5. Datasets

5.2 Benchmarks

1. Surveys

9. Recommended Readings

8.2 Benchmarks

News

7. User-LLM Interaction

Others

6. Datasets

6.2 Benchmarks

Star History

5.2 Benchmarks

8.2 Benchmarks