journal article Jun 23, 2006

Statistical potential‐based amino acid similarity matrices for aligning distantly related protein sequences

View at Publisher Save 10.1002/prot.21020
Abstract
AbstractAligning distantly related protein sequences is a long‐standing problem in bioinformatics, and a key for successful protein structure prediction. Its importance is increasing recently in the context of structural genomics projects because more and more experimentally solved structures are available as templates for protein structure modeling. Toward this end, recent structure prediction methods employ profile–profile alignments, and various ways of aligning two profiles have been developed. More fundamentally, a better amino acid similarity matrix can improve a profile itself; thereby resulting in more accurate profile–profile alignments. Here we have developed novel amino acid similarity matrices from knowledge‐based amino acid contact potentials. Contact potentials are used because the contact propensity to the other amino acids would be one of the most conserved features of each position of a protein structure. The derived amino acid similarity matrices are tested on benchmark alignments at three different levels, namely, the family, the superfamily, and the fold level. Compared to BLOSUM45 and the other existing matrices, the contact potential‐based matrices perform comparably in the family level alignments, but clearly outperform in the fold level alignments. The contact potential‐based matrices perform even better when suboptimal alignments are considered. Comparing the matrices themselves with each other revealed that the contact potential‐based matrices are very different from BLOSUM45 and the other matrices, indicating that they are located in a different basin in the amino acid similarity matrix space. Proteins 2006. © 2006 Wiley‐Liss, Inc.
Topics

No keywords indexed for this article. Browse by subject →

References
71
[1]
[2]
Physics-based protein-structure prediction using a hierarchical protocol based on the UNRES force field: Assessment in two blind tests

Stanisław Ołdziej, C. Czaplewski, A. Liwo et al.

Proceedings of the National Academy of Sciences 10.1073/pnas.0502655102
[3]
New Monte Carlo algorithms for protein folding

Ulrich H.E. Hansmann, Yuko Okamoto

Current Opinion in Structural Biology 10.1016/s0959-440x(99)80025-6
[5]
The relation between the divergence of sequence and structure in proteins.

C. Chothia, A.M. Lesk

The EMBO Journal 10.1002/j.1460-2075.1986.tb04288.x
[7]
Local sequence-structure correlations in proteins

Christopher Bystroff, Kim T Simons, Karen F Han et al.

Current Opinion in Biotechnology 10.1016/s0958-1669(96)80117-0
[8]
HMMSTR: a hidden Markov model for local sequence-structure correlations in proteins 1 1Edited by J. Thornton

Christopher Bystroff, Vesteinn Thorsson, David Baker

Journal of Molecular Biology 10.1006/jmbi.2000.3837
[9]
Predicting novel protein folds by using FRAGFOLD

David T. Jones

Proteins: Structure, Function, and Bioinformatics 10.1002/prot.1171
[10]
TOUCHSTONE: An ab initio protein structure prediction method that uses threading-based tertiary restraints

Daisuke Kihara, Hui Lu, Andrzej Kolinski et al.

Proceedings of the National Academy of Sciences 10.1073/pnas.181328398
[11]
Automated structure prediction of weakly homologous proteins on a genomic scale

Yang Zhang, Jeffrey Skolnick

Proceedings of the National Academy of Sciences 10.1073/pnas.0305695101
[12]
De Novo Prediction of Three-dimensional Structures for Major Protein Families

Richard Bonneau, Charlie E.M Strauss, Carol A Rohl et al.

Journal of Molecular Biology 10.1016/s0022-2836(02)00698-8
[14]
Protein Structure Prediction and Structural Genomics

David Baker, Andrej Šali

Science 10.1126/science.1065659
[15]
Comparative Protein Structure Modeling of Genes and Genomes

Marc A. Marti-Renom, Ashley C. Stuart, András Fiser et al.

Annual Review of Biophysics and Biomolecular Struc... 10.1146/annurev.biophys.29.1.291
[16]
Fold Recognition Methods

Adam Godzik

Methods of Biochemical Analysis 10.1002/0471721204.ch26
[18]
Defrosting the frozen approximation: PROSPECTOR? A new approach to threading

Jeffrey Skolnick, Daisuke Kihara

Proteins: Structure, Function, and Bioinformatics 10.1002/1097-0134(20010215)42:3<319::aid-prot30>3.0.co;2-a
[19]
Development and large scale benchmark testing of the PROSPECTOR_3 threading algorithm

Jeffrey Skolnick, Daisuke Kihara, Yang Zhang

Proteins: Structure, Function, and Bioinformatics 10.1002/prot.20106
[21]
CASP5 assessment of fold recognition target predictions

Lisa N. Kinch, James O. Wrabl, S. Sri Krishna et al.

Proteins: Structure, Function, and Bioinformatics 2003 10.1002/prot.10557
[22]
A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction

John Moult

Current Opinion in Structural Biology 10.1016/j.sbi.2005.05.011
[23]
Ab initio protein structure prediction

Corey Hardin, Taras V Pogorelov, Zaida Luthey-Schulten

Current Opinion in Structural Biology 10.1016/s0959-440x(02)00306-8
[26]
Multiple sequence alignments

Iain M Wallace, Gordon Blackshields, Desmond G. Higgins

Current Opinion in Structural Biology 10.1016/j.sbi.2005.04.002
[28]
Alignment of protein sequences by their profiles

Marc A. Marti‐Renom, M.S. Madhusudhan, Andrej Šali

Protein Science 10.1110/ps.03379804
[29]
Scoring profile‐to‐profile sequence alignments

Guoli Wang, Roland L. Dunbrack

Protein Science 10.1110/ps.03601504
[31]
FORTE: a profile–profile comparison tool for protein fold recognition

Kentaro Tomii, Yutaka Akiyama

Bioinformatics 10.1093/bioinformatics/btg474
[32]
Comparison of sequence profiles. Strategies for structural predictions using sequence information

Leszek Rychlewski, Weizhong Li, Lukasz Jaroszewski et al.

Protein Science 10.1110/ps.9.2.232
[35]
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

S. Altschul

Nucleic Acids Research 10.1093/nar/25.17.3389
[36]
Amino acid substitution matrices from protein blocks.

S Henikoff, Jorja G. Henikoff

Proceedings of the National Academy of Sciences 10.1073/pnas.89.22.10915
[42]
Calculation of conformational ensembles from potentials of mena force

Manfred J. Sippl

Journal of Molecular Biology 10.1016/s0022-2836(05)80269-4
[46]
Mattice WL (1994)
[48]
Press WH (1988)
[49]
Improved tools for biological sequence comparison.

W R Pearson, D J Lipman

Proceedings of the National Academy of Sciences 10.1073/pnas.85.8.2444
[50]
Basic local alignment search tool

Stephen F. Altschul, Warren Gish, Webb Miller et al.

Journal of Molecular Biology 10.1016/s0022-2836(05)80360-2

Showing 50 of 71 references

Cited By
36
Proteins: Structure, Function, and...
Proteins: Structure, Function, and...
International Journal of Molecular...
Bioinformatics
Proteins: Structure, Function, and...
Proteins: Structure, Function, and...
Proteins: Structure, Function, and...
Metrics
36
Citations
71
References
Details
Published
Jun 23, 2006
Vol/Issue
64(3)
Pages
587-600
License
View
Cite This Article
Yen Hock Tan, He Huang, Daisuke Kihara (2006). Statistical potential‐based amino acid similarity matrices for aligning distantly related protein sequences. Proteins: Structure, Function, and Bioinformatics, 64(3), 587-600. https://doi.org/10.1002/prot.21020
Related

You May Also Like

Improved side‐chain torsion potentials for the Amber ff99SB protein force field

Kresten Lindorff‐Larsen, Stefano Piana · 2010

5,733 citations

Structure validation by Cα geometry: ϕ,ψ and Cβ deviation

Simon C. Lovell, Ian W. Davis · 2003

4,087 citations

Essential dynamics of proteins

Andrea Amadei, Antonius B. M. Linssen · 1993

3,134 citations