Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Indexed incrossref
Abstract
We provide the first exploration of sentence embeddings from text-to-text transformers (T5) including the effects of scaling up sentence encoders to 11B parameters. Sentence embeddings are broadly useful for language processing tasks. While T5 achieves impressive performance on language tasks, it is unclear how to produce sentence embeddings from encoder-decoder models. We investigate three methods to construct Sentence-T5 (ST5) models: two utilize only the T5 encoder and one using the full T5 encoderdecoder. We establish a new sentence representation transfer benchmark, SentGLUE, which extends the SentEval toolkit to nine tasks from the GLUE benchmark Our encoder-only models outperform the previous best…
Citation impact
259
total citations
- FWCI
- 24.30
- Percentile
- 100%
- References
- 42
Citations per year
Authors
7Topics & keywords
Topics
Keywords
- Computer science
- Sentence
- Encoder
- Natural language processing
- Artificial intelligence
- Benchmark (surveying)
- Transformer
- Scalability
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.