Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome_codegeneration
A list of papers and resources dedicated to code generation
https://github.com/LIANGQINGYUAN/awesome_codegeneration
Last synced: 1 day ago
JSON representation
-
Datasets
-
Dataset for imperative programming language generation
- Latent Predictor Networks for Code Generation
- Learning to generate pseudo-code from source code using statistical machine translation
- Latent Predictor Networks for Code Generation
- Learning to Mine Aligned Code and Natural Language Pairs from Stack Overflow
- Mapping language to code in programmatic context
- Measuring coding challenge competence with apps
- Competition-Level Code Generation with AlphaCode
- Lyra: A Benchmark for Turducken-Style Code Generation
-
Dataset for Text-to-SQL generation
- Learning a neural semantic parser from user feedback
- Evaluation of spoken language systems: The ATIS domain
- Learning to parse database queries using inductive logic programming
- Automated construction of database interfaces: Intergrating statistical and relational learning for semantic parsing
- Learning a neural semantic parser from user feedback
- Improving Text-to-SQL Evaluation Methodology
- Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning
- Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-sql task
- A Pilot Study for Chinese SQL Semantic Parsing
- On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries
- Tableqa: a large-scale chinese text-to-sql dataset for table-aware sql generation
- DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset
- KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers
- Sparc: Cross-domain semantic parsing in context
- CoSQL: A conversational text-to-SQL challenge towards cross-domain natural language interfaces to databases
- Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL
- Constructing an interactive natural language interface for relational databases
- SQLizer: query synthesis from natural language
-
Pretrained models
- Convolutional Neural Networks over Tree Structures for Programming Language Processing
- Towards a Big Data Curated Benchmark of Inter-project Code Clones
- Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree
- Convolutional Neural Networks over Tree Structures for Programming Language Processing
-
-
Techniques
-
Code representation
-
Generation architectures
-
Pretrained models
- CodeBERT: A Pre-Trained Model for Programming and Natural Languages
- GraphCodeBERT: Pre-training Code Representations with Data Flow
- CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
- CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
- Unified Pre-training for Program Understanding and Generation
- PANGU-CODER: Program Synthesis with Function-Level Language Modeling
- CoditT5: Pretraining for Source Code and Natural Language Editing
- Evaluating large language models trained on code
-
Variable representation
-
Code and comment
-
Understanding with GNNs
-
Understanding with pre-trained models
-
-
Tools
-
Pretrained models
- Captain Stack
- Copilot - time, right from your editor.
-
Programming Languages
Categories
Sub Categories