close
Where is the PDF?
The PDF is loading from the publisher - it may take some time. If you see a login, please first login into your institution. If there is a problem, please click problem report and we will look into this.
arXiv:1008.4938 [q-bio.QM] 29 Aug 2010
Towards Solving the Inverse Protein Folding Problem
Yoojin Hong, Kyung Dae Ko, Gaurav Bhardwaj, Zhenhai Zhang, Damian B van Rossum and Randen L Patterson
Abstract
Accurately assigning folds for divergent protein sequences is a major
obstacle to structural studies and underlies the inverse protein folding
problem. Herein, we outline our theories for fold-recognition in the
"twilight-zone" of sequence similarity (<25% identity). Our analyses
demonstrate that structural sequence profiles built using Position-Specific
Scoring Matrices (PSSMs) significantly outperform multiple popular
homology-modeling algorithms for relating and predicting structures given only
their amino acid sequences. Importantly, structural sequence profiles
reconstitute SCOP fold classifications in control and test datasets. Results
from our experiments suggest that structural sequence profiles can be used to
rapidly annotate protein folds at proteomic scales. We propose that encoding
the entire Protein DataBank (~1070 folds) into structural sequence profiles
would extract interoperable information capable of improving most if not all
methods of structural modeling.