Improving life science with programmable experiments. 🧬 🤖 Home of the Protein Engineering Tournament & Open Datasets Initiative.
The Align Foundation
Loading...
This release was made possible by a strong cross-team effort, with key contributions from @erika-alden.bsky.social, Anjali Chadha, Dave Ross’s team at National Institute of Standards and Technology (NIST) and the DAMP Lab at @bostonu.bsky.social.
🔗 Access the dataset: data.alignbio.org
The Align Foundation
📢 Data Release Tuesday: Align TEV Protease Dataset
📊 We’re expanding the The Align Foundation data ecosystem again with ~30,000 high-quality GROQ-seq data points capturing TEV protease sequence–function relationships at scale.
📢 Public Data Release: Align T7 RNA Polymerase Dataset.
📊The data keeps coming at Align! We’re excited to release our T7 RNA polymerase dataset, adding ~35,000 unique GROQ-seq data points to the growing Align data ecosystem, capturing sequence–function relationships across variants at scale.
To our knowledge, this is the largest mutational dataset on TEV protease to date. Notably, no comprehensive deep mutational scanning study across the full protein has been reported in over three decades, leaving key aspects of its functional landscape unexplored.
TEV protease is a cornerstone tool in biotechnology, known for its high substrate specificity, and this dataset provides a rich resource for enzyme engineering and ML-driven protein design.
To our knowledge, this is the largest mutational dataset on T7 RNA polymerase to date!
🔗 Access the dataset on the Align Data Portal: hubs.la/Q04802KK0
#OpenScience #SyntheticBiology #ProteinEngineering #BioAI #MachineLearning #AlignData #GROQSEQ #RNApolymerase