Cody also put in a ton of extra work to make the code organized and usable in the GitHub repo: github.com/Anantharaman...
It links to a Colab notebook for model inference, training data, and pretrained models.
github.com
Protein Set Transformer (PST) framework for training protein-language-model-based genome language models. Inference is possible for viral genomes using our pretrained viral foundation model. - Anan...