A Rosetta-based protein design protocol converging to natural sequences

Bruno Emanuel Ferreira De Sousa Correia, Stéphane Rosset, Zander Harteveld, Giulia Sormani
2021
Journal paper

Abstract

Computational protein design has emerged as a powerful tool capable of identifying sequences compatible with pre-defined protein structures. The sequence design protocols, implemented in the Rosetta suite, have become widely used in the protein engineering community. To understand the strengths and limitations of the Rosetta design framework, we tested several design protocols on two distinct folds (SH3-1 and Ubiquitin). The sequence optimization, when started from native structures and natural sequences or polyvaline sequences, converges to sequences that are not recognized as belonging to the fold family of the target protein by standard bioinformatic tools, such as BLAST and Hmmer. The sequences generated from both starting conditions (native and polyvaline) are instead very similar to each other and recognized by Hmmer as belonging to the same "family." This demonstrates the capability of Rosetta to converge to similar sequences, even when sampling from distinct starting conditions, but, on the other hand, shows intrinsic inaccuracy of the scoring function that drifts toward sequences that lack identifiable natural sequence signatures. To address this problem, we developed a protocol embedding Rosetta Design simulations in a genetic algorithm, in which the sequence search is biased to converge to sequences that exist in nature. This protocol allows us to obtain sequences that have recognizable natural sequence signatures and, experimentally, the designed proteins are biochemically well behaved and thermodynamically stable.

Official source

https://infoscience.epfl.ch/record/284322?ln=en

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

A Rosetta-based protein design protocol converging to natural sequences

Graph Chatbot

Chat with Graph Search

Opportunities and challenges in design and optimization of protein function

From Sequence to Dynamics to Function: Computational Design of Allostery and Ligand Selectivity in G-Protein Coupled Receptors

Engineering novel protein interactions with therapeutic potential using deep learning-guided surface design

From Sequence to Dynamics to Function: Computational Design of Allostery and Ligand Selectivity in G-Protein Coupled Receptors

Engineering novel protein interactions with therapeutic potential using deep learning-guided surface design

Opportunities and challenges in design and optimization of protein function