Abstract
The native conformation of structured proteins is stabilized by a complex network of interactions. We analyzed the elementary patterns that constitute such network and ranked them according to their importance in sha** protein sequence design. To achieve this goal, we employed a cluster expansion of the partition function in the space of sequences and evaluated numerically the statistical importance of each cluster. An important feature of this procedure is that it is applied to a dense finite system. We found that patterns that contribute most to the partition function are cycles with even numbers of nodes, while cliques are typically detrimental. Each cluster also gives a contribute to the sequence entropy, which is a measure of the evolutionary designability of a fold. We compared the entropies associated with different interaction patterns to their abundances in the native structures of real proteins.
Graphical abstract
![](http://media.springernature.com/lw685/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Figa_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1140%2Fepje%2Fs10189-022-00250-x/MediaObjects/10189_2022_250_Fig8_HTML.png)
Similar content being viewed by others
Data Availability Statement
All data and codes used in this work can be downloaded from https://github.com/guidotiana/cluster-expansion.
References
L. Di Paola, M. De Ruvo, P. Paci, D. Santoni, A. Giuliani, Protein contact networks: an emerging paradigm in chemistry. Chem. Rev. 113, 1598–1613 (2013)
M. Vendruscolo, N.V. Dokholyan, E. Paci, M. Karplus, Small-world view of the amino acids that play a key role in protein folding. Phys. Rev. E 65, 061910 (2002)
N. Kannan, S. Vishveshwara, Identification of side-chain clusters in protein structures by a graph spectral method. J. Mol. Biol. 292, 441–464 (1999)
M. Vendruscolo, E. Paci, C.M. Dobson, M. Karplus, Three key residues form a critical contact network in a protein folding transition state. Nature 409, 641–645 (2001)
V.I. Abkevich, A.M. Gutin, E.I. Shakhnovich, Specific nucleus as the transition state for protein folding: evidence from the lattice model. Biochemistry 33, 10026–10036 (1994)
G. Tiana, F. Simona, G.M.S. De Mori, R.A. Broglia, G. Colombo, Understanding the determinants of stability and folding of small globular proteins from their energetics. Protein Sci. 13, 113–124 (2004). ISSN 0961-8368
E.I. Shakhnovich, A.M. Gutin, Engineering of stable and fast-folding sequences of model proteins. Proc. Natl. Acad. Sci. USA 90, 7195–7199 (1993)
J.D. Bryngelson, P.G. Wolynes, Spin glasses and the statistical mechanics of protein folding. Proc. Natl. Acad. Sci. USA 84, 7524–7528 (1987)
E.I. Shakhnovich, A.M. Gutin, A new approach to the design of stable proteins. Protein Eng. 6, 793–800 (1993)
B. Rost, Protein structures sustain evolutionary drift. Fold. Des. 2, S19–S24 (1997)
H. Li, R. Helling, C. Tang, N. Wingreen, Emergence of preferred structures in a simple model of protein folding. Science 273, 666–669 (1996). ISSN 00368075
S. Govindarajan, R.A. Goldstein, Why are some proteins structures so common? Proc. Natl. Acad. Sci. 93, 3341–3345 (1996)
P.G. Wolynes, Symmetry and the energy landscapes of biomolecules. Proc. Natl. Acad. Sci. USA 93, 14249–14255 (1996)
A. Maritan, C. Micheletti, J.R. Banavar, Role of secondary motifs in fast folding polymers: a dynamical variational principle. Phys. Rev. Lett. 84, 3009–3012 (2000). ISSN 00319007
K.W. Plaxco, K.T. Simons, D. Baker, Contact order, transition state placement and the refolding rates of single domain proteins. J. Mol. Biol. 277, 985–994 (1998)
J.L. England, E.I. Shakhnovich, Structural determinant of protein designability. Phys. Rev. Lett. 90, 218101 (2003)
I.N. Berezovsky, A.Y. Grosberg, E.N. Trifonov, Closed loops of nearly standard size: common basic element of protein structure. FEBS Lett. 466, 283–286 (2000)
M. Negri, G. Tiana, R. Zecchina, Native state of natural proteins optimizes local entropy. Phys. Rev. E 104, 064117 (2021). ISSN 2470-0045
K. Huang, Statistical Mechanics. Wiley, London (1987)
S. Miyazawa, R. Jernigan, Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules 18, 534–552 (1985)
P.D. Thomas, K.A. Dill, Statistical potentials extracted from protein structures: how accurate are they? J. Mol. Biol. 257, 457–469 (1996)
G. Tiana, L. Sutto, Equilibrium properties of realistic random heteropolymers and their relevance for globular and naturally unfolded proteins. Phys. Rev. E 84, 61910 (2011)
S. Ramanathan, E.I. Shakhnovich, Statistical mechanics of proteins with evolutionary selected sequences. Phys. Rev. E 50, 1303–1312 (1994)
G. Franco, M. Cagiada, G. Bussi, G. Tiana, Statistical mechanical properties of sequence space determine the efficiency of the various algorithms to predict interaction energies and native contacts from protein coevolution. Phys. Biol. 16, 046007 (2019). ISSN 1478-3975
T.E. Creighton. Proteins: Structures and Molecular Properties. W. H. Freeman and Co. (1992)
Author information
Authors and Affiliations
Contributions
MT performed the numerical calculations and helped to develop the theoretical calculations; AT and GT developed the theoretical calculations; GT wrote the manuscript with the help of MT and AT.
Corresponding author
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tajana, M., Trovato, A. & Tiana, G. Key interaction patterns in proteins revealed by cluster expansion of the partition function. Eur. Phys. J. E 45, 95 (2022). https://doi.org/10.1140/epje/s10189-022-00250-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epje/s10189-022-00250-x