//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
In evaluation, we find that people agree with labels from off-the-shelf LLMs less than a random other person! But fine-tuning and then applying our personalization method yields a 66% relative improvement in agreement compared to human-human agreement rates, leading to SOTA performance.
2mo
Ziv Epstein