Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models

Google (United States)

Indexed incrossref

Abstract

We provide the first exploration of sentence embeddings from text-to-text transformers (T5) including the effects of scaling up sentence encoders to 11B parameters. Sentence embeddings are broadly useful for language processing tasks. While T5 achieves impressive performance on language tasks, it is unclear how to produce sentence embeddings from encoder-decoder models. We investigate three methods to construct Sentence-T5 (ST5) models: two utilize only the T5 encoder and one using the full T5 encoderdecoder. We establish a new sentence representation transfer benchmark, SentGLUE, which extends the SentEval toolkit to nine tasks from the GLUE benchmark Our encoder-only models outperform the previous best…

Citation impact

259
total citations
FWCI
24.30
Percentile
100%
References
42
Citations per year

Authors

7

Topics & keywords

Keywords
  • Computer science
  • Sentence
  • Encoder
  • Natural language processing
  • Artificial intelligence
  • Benchmark (surveying)
  • Transformer
  • Scalability
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.