New family of generative DNA language models trained on 1T 6-mer DNA tokens in long genomic contexts. Open weights, open source, open training data...
h/t @mmitchell.bsky.social
huggingface.co/spaces/Huggi...
Enter a DNA string and the Carbon model can continue the sequence, score genetic variants, or predict the protein’s 3‑D structure. You can also explore gene‑embedding visualisations and see the ful...