//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
Fresh from bioRxiv our latest work introducing The Embedded Alphabet (TEA), a powerful new representation for protein sequences obtained by discretising ESM2 embeddings into 20 characters. Pre-print: www.biorxiv.org/content/10.1... ๐Ÿงต๐Ÿ‘‡(1/n)
6mo
www.biorxiv.org
Detecting remote homology with speed and sensitivity is crucial for tasks like function annotation and structure prediction. We introduce a novel approach using contrastive learning to convert protein...
Rewriting protein alphabets with language models
Lorenzo Pantolini