Law Professors Prefer AI Over Peer Answers

Abstract

Large language models (LLMs) are increasingly promoted as educational tutors,
yet most evaluations focus on domains with a single ground truth. Many disciplines,
however, hinge on judgment: reasoning, weighing ambiguity, and reaching defensi-
ble conclusions. Law provides a sharp test. We conducted a blinded evaluation of
short-answer tutoring in contracts courses with sixteen U.S. law professors. Partici-
pants created 40 representative questions, wrote answers, and judged 2,918 anonymized
comparisons between human and LLM responses. Professors rated LLMs far higher
than their peers (average win rate = 75.33%), with models performing similarly to the
best instructor. LLM responses were also rarely flagged as harmful (3.53%, vs 12.06%
for professors). Preferences for LLM answers were consistent across evaluators and
reflected shared professional standards. Our evaluation can be reliably extended to ad-
ditional models by employing a separate LLM as a judge, rendering expert agreements
an effective, scalable method to evaluate AI tutors in judgment-rich domains.

Details

Author(s):
  • Julian Nyarko
  • Alejandro Salinas
  • Carly Frieders
  • Neel Guha
  • Sibo Ma
  • Ralph Anzivino
  • Ian Ayres
  • Oren Bar-Gill
  • Omri Ben-Shahar
  • Stephen Friedman
  • George Geis
  • Sue Guan
  • Christoph Henkel
  • Stephanie Hoffer
  • Gregory Klass
  • Larasz Moody-Villarose
  • Sarath Sanga
  • Keith Sharfman
  • Justin Simard
  • Rebecca Stone
  • David Wishnick
Publish Date:
May 27, 2026
Publication Title:
Social Science Research Network
Format:
Journal Article Page(s) 61
Citation(s):
  • Julian Nyarko, Alejandro Salinas, Carly Frieders, Neel Guha, Sibo Ma, Ralph Anzivino, Ian Ayres, Oren Bar-Gill, Omri Ben-Shahar, Stephen Friedman, George Geis, Sue Guan, Christoph Henkel, Stephanie Hoffer, Gregory Klass, Larasz Moody-Villarose, Sarath Sanga, Keith Sharfman, Justin Simard, Rebecca Stone & David Wishnick, Law Professors Prefer AI Over Peer Answers, Social Science Research Network 61 (2026).
Related Organization(s):

Other Publications By