//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
LEXRUBRIC is a new benchmark for evaluating open-ended legal tasks in Chinese, featuring over 12,000 expert criteria. It caters to the demand for reliable legal AI, demonstrating language models' varying capacities and limitations in resolving complex legal queries. https://arxiv.org/abs/2606.09389
ArXiv link for LexRubric: A Rubric-Guided Diagnostic Benchmark for Open-Ended Legal Tasks
arxiv.org
LexRubric: A Rubric-Guided Diagnostic Benchmark for Open-Ended Legal Tasks
3h
AI Firehose