Abstract

In this paper, we experimentally evaluate the zero-shot performance of GPT-4 against prior generations of GPT on the entire uniform bar examination (UBE), including not only the multiple-choice multistate bar examination (MBE), but also the open-ended multistate essay exam (MEE) and multistate performance test (MPT) components. On the MBE, GPT-4 significantly outperforms both human test-takers and prior models, demonstrating a 26% increase over ChatGPT and beating humans in five of seven subject areas. On the MEE and MPT, which have not previously been evaluated by scholars, GPT-4 scores an average of 4.2/6.0, compared with much lower scores for ChatGPT. Graded across the UBE components, in the manner in…

Citation impact

  • Total citations: 184
  • FWCI: 353.64
  • Percentile: 100%
  • References: 67

Authors

4

Topics & keywords

Keywords
  • Bar (unit)
  • Test (biology)
  • Corporate governance
  • Subject (documents)
  • Theme (computing)
  • Computer science
  • Psychology
  • Physics
UN Sustainable Development Goals
  • Peace, Justice and strong institutions