LaMDA: Language Models for Dialog Applications

Thoppilan, Romal; Freitas, Daniel De; Hall, Jamie; Shazeer, Noam; Kulshreshtha, Apoorv; Cheng, Heng-Tze; Jin, Alicia; Bos, Taylor; Baker, Leslie L.; Du, Yu; Li, YaGuang; Lee, Hongrae; Zheng, Huaixiu; Ghafouri, Amin; Menegali, Marcelo; Huang, Yanping; Krikun, Maxim; Lepikhin, Dmitry; Qin, James; Chen, Dehao; Xu, Yuanzhong; Chen, Zhifeng; Roberts, Adam; Bosma, Maarten; Zhao, Vincent; Zhou, Yanqi; Chang, Chung-Ching; Krivokon, Igor; Rusch, Will; Pickett, Marc; Srinivasan, Pranesh; Man, Laichee; Meier-Hellstern, K.S.; Morris, Meredith Ringel; Doshi, Tulsee; Santos, Renelito Delos; Duke, Toju; Søraker, Johnny Hartz; Zevenbergen, Ben; Prabhakaran, Vinodkumar; Díaz, Mark; Hutchinson, Ben; Olson, Kristen; Molina, Alejandra; Hoffman-John, Erin; Lee, Josh; Aroyo, Lora; Rajakumar, Ravi; Butryna, Alena; Lamm, Matthew S.; Kuzmina, Viktoriya; Fenton, Joe; Cohen, Aaron J.; Bernstein, Rachel; Kurzweil, Ray; Aguera-Arcas, Blaise; Cui, Claire; Croak, Marian; H., Ed; Le, Quoc V.

doi:10.48550/arxiv.2201.08239

preprintarXiv (Cornell University)Jan 20, 2022GREEN OA

LaMDA: Language Models for Dialog Applications

RTRomal Thoppilan DDDaniel De Freitas JHJamie Hall NSNoam Shazeer AKApoorv Kulshreshtha

Indexed inarxivdatacite

Abstract

We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of Transformer-based neural language models specialized for dialog, which have up to 137B parameters and are pre-trained on 1.56T words of public dialog data and web text. While model scaling alone can improve quality, it shows less improvements on safety and factual grounding. We demonstrate that fine-tuning with annotated data and enabling the model to consult external knowledge sources can lead to significant improvements towards the two key challenges of safety and factual grounding. The first challenge, safety, involves ensuring that the model's responses are consistent with a set of human values, such as preventing harmful…

Citation impact

705

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

60

Topics & keywords

Topics

Keywords

Computer science
Dialog box
Language model
Artificial intelligence
Classifier (UML)
Natural language processing
Set (abstract data type)
Consistency (knowledge bases)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.