Design Guidelines for Prompt Engineering Text-to-Image Generative Models

Liu, Vivian; Chilton, Lydia B.

doi:10.1145/3491102.3501825

articleCHI Conference on Human Factors in Computing SystemsApr 28, 2022Closed access

Design Guidelines for Prompt Engineering Text-to-Image Generative Models

VLVivian Liu LBLydia B. Chilton

Columbia University · University of Missouri

Indexed incrossref

Abstract

Text-to-image generative models are a new and powerful way to generate visual artwork. However, the open-ended nature of text as interaction is double-edged; while users can input anything and have access to an infinite range of generations, they also must engage in brute-force trial and error with the text prompt when the result quality is poor. We conduct a study exploring what prompt keywords and model hyperparameters can help produce coherent outputs. In particular, we study prompts structured to include subject and style keywords and investigate success and failure modes of these prompts. Our evaluation of 5493 generations over the course of five experiments spans 51 abstract and concrete subjects as well…

Citation impact

541

total citations

FWCI: 29.44
Percentile: 100%
References: 25

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Generative grammar
Computer science
Hyperparameter
Image (mathematics)
Quality (philosophy)
Style (visual arts)
Range (aeronautics)
Generative model

UN Sustainable Development Goals

No poverty

No related works found for this paper.

Funding

NS
National Science Foundation
Award: DGE - 1644869