The Promises and Pitfalls of Large Language Models as Feedback Providers: A Study of Prompt Engineering and the Quality of AI-Driven Feedback

Jacobsen, Lucas Jasper; Weber, Kira Elena

doi:10.3390/ai6020035

articleAIFeb 12, 2025GOLD OA

The Promises and Pitfalls of Large Language Models as Feedback Providers: A Study of Prompt Engineering and the Quality of AI-Driven Feedback

LJLucas Jasper Jacobsen KEKira Elena Weber

Universität Hamburg

Indexed incrossrefdoaj

Abstract

Methods

To address these questions, we developed a theory-driven manual to evaluate prompt quality and designed three prompts of varying quality. Feedback generated by ChatGPT-4 was assessed alongside feedback from novices and experts, who were provided with the highest-quality prompt.

Results

Our findings reveal that only the best prompt consistently produced high-quality feedback. Additionally, LLM feedback outperformed novice feedback and, in the categories explanation, questions, and specificity, even surpassed expert feedback in quality while being generated more quickly.

Citation impact

43

total citations

FWCI: 81.95
Percentile: 100%
References: 47

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Quality (philosophy)
Computer science
Data science
Epistemology
Philosophy

No related works found for this paper.

Funding

DF
Deutsche Forschungsgemeinschaft