//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
These days RAG systems have gotten popular for boosting LLMsโ€”but they're brittle๐Ÿ’”. Minor shifts in phrasing (โœ๏ธ style, politeness, typos) can wreck the pipeline. Even advanced components donโ€™t fix the issue. Check out this extensive eval by @neelbhandari.bsky.social and @tianyucao.bsky.social!
Apr 18, 2025
Akhila Yerukola
1/๐Ÿšจ ๐—ก๐—ฒ๐˜„ ๐—ฝ๐—ฎ๐—ฝ๐—ฒ๐—ฟ ๐—ฎ๐—น๐—ฒ๐—ฟ๐˜ ๐Ÿšจ RAG systems excel on academic benchmarks - but are they robust to variations in linguistic style? We find RAG systems are brittle. Small shifts in phrasing trigger cascading errors, driven by the complexity of the RAG pipeline ๐Ÿงต
Apr 17, 2025
Neel Bhandari