articleJan 1, 2016GOLD OA

Summarizing Source Code using a Neural Attention Model

University of Washington

Indexed incrossref

Abstract

High quality source code is often paired with high level summaries of the computation it performs, for example in code documentation or in descriptions posted in online forums. Such summaries are extremely useful for applications such as code search but are expensive to manually author, hence only done for a small fraction of all code that is produced. In this paper, we present the first completely datadriven approach for generating high level summaries of source code. Our model, CODE-NN , uses Long Short Term Memory (LSTM) networks with attention to produce sentences that describe C# code snippets and SQL queries. CODE-NN is trained on a new corpus that is automatically collected from StackOverflow, which we…

Citation impact

722
total citations
FWCI
51.24
Percentile
100%
References
49
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Automatic summarization
  • Code (set theory)
  • Source code
  • Benchmark (surveying)
  • Margin (machine learning)
  • Information retrieval
  • Artificial intelligence
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding