LexRubric is a new benchmark for evaluating open-ended legal tasks in Chinese, featuring over 12,000 expert criteria. It addresses the demand for reliable legal AI and shows how language models differ in their capabilities and limitations when resolving complex legal queries. https://arxiv.org/abs/2606.09389
arXiv link for LexRubric: A Rubric-Guided Diagnostic Benchmark for Open-Ended Legal Tasks