Abstract

Pre-trained models for programming languages have recently demonstrated great success on code intelligence. To support both code-related understanding and generation tasks, recent works attempt to pre-train unified encoder-decoder models. However, such encoder-decoder framework is sub-optimal for auto-regressive tasks, especially code completion that requires a decoder-only manner for efficient inference. In this paper, we present UniXcoder, a unified cross-modal pre-trained model for programming language. The model utilizes mask attention matrices with prefix adapters to control the behavior of the model and leverages cross-modal contents like AST and code comment to enhance code representation. To encode AST…

Citation impact

538
total citations
FWCI
73.10
Percentile
100%
References
28
Citations per year

Authors

6

Topics & keywords

Keywords
  • Computer science
  • Code generation
  • Code (set theory)
  • Representation (politics)
  • Encoder
  • ENCODE
  • Programming language
  • Encoding (memory)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding