Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Shankar, Shreya; Zamfirescu-Pereira, J.D.; Hartmann, Bjoern; Parameswaran, Aditya; Arawjo, Ian

doi:10.1145/3654777.3676450

articleOct 11, 2024GOLD OA

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

SSShreya Shankar JZJ.D. Zamfirescu-Pereira BHBjoern Hartmann APAditya Parameswaran IAIan Arawjo

Epic Sciences (United States) · University of California, Berkeley · +3 more institutions

Indexed incrossref

Abstract

Due to the cumbersome nature of human evaluation and limitations of code-based evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in evaluating LLM outputs. Yet LLM-generated evaluators simply inherit all the problems of the LLMs they evaluate, requiring further human validation. We present a mixed-initiative approach to “validate the validators”—aligning LLM-generated evaluation functions (be it prompts or code) with human requirements. Our interface, EvalGen, provides automated assistance to users in generating evaluation criteria and implementing assertions. While generating candidate implementations (Python functions, LLM grader prompts), EvalGen asks humans to grade a…

Citation impact

126

total citations

FWCI: 39.65
Percentile: 100%
References: 29

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science

No related works found for this paper.