The Promises and Pitfalls of Large Language Models as Feedback Providers: A Study of Prompt Engineering and the Quality of AI-Driven Feedback
Indexed incrossrefdoaj
Abstract
Methods
To address these questions, we developed a theory-driven manual to evaluate prompt quality and designed three prompts of varying quality. Feedback generated by ChatGPT-4 was assessed alongside feedback from novices and experts, who were provided with the highest-quality prompt.
Results
Our findings reveal that only the best prompt consistently produced high-quality feedback. Additionally, LLM feedback outperformed novice feedback and, in the categories explanation, questions, and specificity, even surpassed expert feedback in quality while being generated more quickly.
Citation impact
43
total citations
- FWCI
- 81.95
- Percentile
- 100%
- References
- 47
Citations per year
Authors
2Topics & keywords
Keywords
- Quality (philosophy)
- Computer science
- Data science
- Epistemology
- Philosophy
No related works found for this paper.