Inlay

Profile

Professor @ucsd & Scholar @CodeMetal. Helps people write code they can trust

Loris D'Antoni

Why is it called a Large Language Model and not a Word Wide Web?

Constrained Adaptive Rejection Sampling (CARS) exploits this information to incrementally build a representation of the constrained distribution. As more samples are processed, CARS learns to avoid large classes of failures, while still producing samples from the exact target distribution. 4/

In our ICML26 paper, we show that autoregressive constraint checkers reveal rich information that can be used to drastically reduce the rejection rate any autoregressive sampling algorithm. 2/

On some of our benchmarks, acceptance rates improve from roughly 1 valid sample out of every 100 attempts to nearly 1 out of every 3, while preserving exact sampling guarantees. 5/

Getting LLMs to generate samples that satisfy constraints is a fundamental problem in machine learning. The standard approach is rejection sampling: generate a sample, check whether it satisfies the constraint, and if not, try again. 1/

Every interaction with the checker, whether a sample is accepted or rejected, provides information about the structure of the constrained space and where future failures are likely to occur. 3/

This work shows that the structure of constraint checkers can be exploited to dramatically accelerate sampling without sacrificing correctness; reminiscent of how conflict-driven clause learning transformed SAT solving by extracting and reusing information from failed search attempts. 6/

Paper: Constrained Adaptive Rejection Sampling arxiv.org/abs/2510.01902 Paweł Parys, Sairam Vaidya, Taylor Berg-Kirkpatrick, and Loris D'Antoni

12d

Language Models (LMs) are increasingly used in applications where generated outputs must satisfy strict semantic or syntactic constraints. Existing approaches to constrained generation fall along a sp...

arxiv.org

Constrained Adaptive Rejection Sampling

Loris D'Antoni