The intricate relationship between metal ions andprotein scaffolds plays a major role in most processes that are fundamental to Life. Metal cofactorsare often essential for the proper folding of polypeptide chains or function of folded proteins.The birth and exponential growth of the PDB over the past 50 years has provided the scientificcommunity with an invaluable resource for the study of such relationships, allowing for exampleto unveil the coordination number and binding propensity of coordinating residues. It has also beenexploited by Machine Learning studies aimed at ab initio identifying and differentiatingstructural and functional metal sites. However, the full potential of protein ligands to bind anygiven metal cofactors and tune their chemistry has not been exhaustively explored by evolution. Tothis aim, the de novo design of metalloproteins by leveraging the wealth of knowledgecontained in the PDB represents a promising strategy for the discovery of new catalysts.Ithas been shown that the protein structural space is highly degenerate and can be recapitulatedwith a finite set of structural units, such as elements of tertiary structure calledTERMs (TERtiaryMotifs).1 Here, we build on this concept and report the developmentof a dataset of highly specialized metal-binding elements: MetalTERMs. Over 100’000 MetalTERMswere identified from sites in which the metal was bound only by protein residuesand water molecules. Subsequently, MetalTERMs were clustered according to their root-mean-square-deviation, the total number of residues and the number of non-contiguous segments.We find that the number of clusters rapidly drops with the increase in complexity of thetertiary arrangement, and that MetalTERMs composed of at most three segments can recapitulateabout 90% of the whole dataset. This would indicate that medium to long-range mutationshave most likely only a marginal effect on the metal coordination sphere, and would thereforecorroborate the well-established adoption of the miniaturization approach for designing metalloproteinsfrom scratch. Our analyses have also enabled us to identify new-to-nature combinationsof most recurring MetalTERMs, which could finally lead to the design of unprecedented catalysts. 1Mackenzie et al, PNAS, 2016, 113 (47), E7438-E7447
Developing a dataset of tertiary structural motifs for metalloprotein design / Pirro, Fabio; Lemmin, Thomas; Chino, Marco. - (2021), p. 127. (Intervento presentato al convegno Virtual symposium celebrating the 50th anniversary of the Protein Data Bank).
Developing a dataset of tertiary structural motifs for metalloprotein design
Fabio Pirro
Primo
;Marco ChinoCo-ultimo
2021
Abstract
The intricate relationship between metal ions andprotein scaffolds plays a major role in most processes that are fundamental to Life. Metal cofactorsare often essential for the proper folding of polypeptide chains or function of folded proteins.The birth and exponential growth of the PDB over the past 50 years has provided the scientificcommunity with an invaluable resource for the study of such relationships, allowing for exampleto unveil the coordination number and binding propensity of coordinating residues. It has also beenexploited by Machine Learning studies aimed at ab initio identifying and differentiatingstructural and functional metal sites. However, the full potential of protein ligands to bind anygiven metal cofactors and tune their chemistry has not been exhaustively explored by evolution. Tothis aim, the de novo design of metalloproteins by leveraging the wealth of knowledgecontained in the PDB represents a promising strategy for the discovery of new catalysts.Ithas been shown that the protein structural space is highly degenerate and can be recapitulatedwith a finite set of structural units, such as elements of tertiary structure calledTERMs (TERtiaryMotifs).1 Here, we build on this concept and report the developmentof a dataset of highly specialized metal-binding elements: MetalTERMs. Over 100’000 MetalTERMswere identified from sites in which the metal was bound only by protein residuesand water molecules. Subsequently, MetalTERMs were clustered according to their root-mean-square-deviation, the total number of residues and the number of non-contiguous segments.We find that the number of clusters rapidly drops with the increase in complexity of thetertiary arrangement, and that MetalTERMs composed of at most three segments can recapitulateabout 90% of the whole dataset. This would indicate that medium to long-range mutationshave most likely only a marginal effect on the metal coordination sphere, and would thereforecorroborate the well-established adoption of the miniaturization approach for designing metalloproteinsfrom scratch. Our analyses have also enabled us to identify new-to-nature combinationsof most recurring MetalTERMs, which could finally lead to the design of unprecedented catalysts. 1Mackenzie et al, PNAS, 2016, 113 (47), E7438-E7447File | Dimensione | Formato | |
---|---|---|---|
PDB50_day1-abstracts.pdf
solo utenti autorizzati
Descrizione: Book of Abstracts
Tipologia:
Versione Editoriale (PDF)
Licenza:
Copyright dell'editore
Dimensione
2.23 MB
Formato
Adobe PDF
|
2.23 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.