Preprint Alert!
We present new strategies to accelerate large-scale document comparison using MinHash-like sketches.
A thread:
Antoine Limasset
Compressed inverted indexes for scalable sequence similarity: https://www.biorxiv.org/content/10.1101/2025.11.21.689685v1