journal article Sep 21, 2001

Linear programming optimization and a double statistical filter for protein threading protocols

View at Publisher Save 10.1002/prot.1145
Abstract
AbstractThe design of scoring functions (or potentials) for threading, differentiating native‐like from non‐native structures with a limited computational cost, is an active field of research. We revisit two widely used families of threading potentials: the pairwise and profile models. To design optimal scoring functions we use linear programming (LP). The LP protocol makes it possible to measure the difficulty of a particular training set in conjunction with a specific form of the scoring function. Gapless threading demonstrates that pair potentials have larger prediction capacity compared with profile energies. However, alignments with gaps are easier to compute with profile potentials. We therefore search and propose a new profile model with comparable prediction capacity to contact potentials. A protocol to determine optimal energy parameters for gaps, using LP, is also presented. A statistical test, based on a combination of local and globalZ‐scores, is employed to filter out false‐positives. Extensive tests of the new protocol are presented. The new model provides an efficient alternative for threading with pair energies, maintaining comparable accuracy. The code, databases, and a prediction server are available athttp://www.tc.cornell.edu/CBIO/loopp. Proteins 2001;45:241–261. © 2001 Wiley‐Liss, Inc.
Topics

No keywords indexed for this article. Browse by subject →

References
52
[1]
A Method to Identify Protein Sequences That Fold into a Known Three-Dimensional Structure

James U. Bowie, Roland Lüthy, David Eisenberg

Science 10.1126/science.1853201
[2]
A new approach to protein fold recognition

David T. Jones, W. R. Taylort, J. M. Thornton

Nature 10.1038/358086a0
[3]
Detection of native‐like models for amino acid sequences of unknown three‐dimensional structure in a data base of known protein conformations

Manfred J. Sippl, Sabine Weitckus

Proteins: Structure, Function, and Bioinformatics 10.1002/prot.340130308
[4]
Topology fingerprint approach to the inverse protein folding problem

Adam Godzik, Andrzej Kolinski, Jeffrey Skolnick

Journal of Molecular Biology 10.1016/0022-2836(92)90693-e
[5]
Prediction of Protein Structure by Evaluation of Sequence-structure Fitness

Christos Ouzounis, Chris Sander, Michael Scharf et al.

Journal of Molecular Biology 10.1006/jmbi.1993.1433
[6]
An empirical energy function for threading protein sequence through the folding motif

Stephen H. Bryant, Charles E. Lawrence

Proteins: Structure, Function, and Bioinformatics 10.1002/prot.340160110
[8]
Protein structure prediction by threading. why it works and why it does not 1 1Edited by F. Cohen

Leonid A. Mirny, Eugene I. Shakhnovich

Journal of Molecular Biology 10.1006/jmbi.1998.2092
[9]
GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences

David T. Jones

Journal of Molecular Biology 10.1006/jmbi.1999.2583
[10]
Combination of threading potentials and sequence profiles improves fold recognition 1 1Edited by B. Honig

Anna R. Panchenko, Aron Marchler-Bauer, Stephen H. Bryant

Journal of Molecular Biology 10.1006/jmbi.2000.3541
[11]
Progress in protein structure prediction: Assessment of CASP3

Michael JE Sternberg, Paul A Bates, Lawrence A Kelley et al.

Current Opinion in Structural Biology 10.1016/s0959-440x(99)80050-5
[12]
A united-residue force field for off-lattice protein-structure simulations. I. Functional forms and parameters of long-range side-chain interaction potentials from protein crystal data

A. Liwo, S. O?dziej, M. R. Pincus et al.

Journal of Computational Chemistry 10.1002/(sici)1096-987x(199705)18:7<849::aid-jcc1>3.0.co;2-r
[13]
Ab initio construction of protein tertiary structures using a hierarchical approach

Yu Xia, Enoch S. Huang, Michael Levitt et al.

Journal of Molecular Biology 10.1006/jmbi.2000.3835
[15]
Babajide A "Exploring protein sequence space using knowledge based potentials" J Compar Biol (1999)
[16]
A study of combined structure/sequence profiles

Arne Elofsson, Daniel Fischer, Danny W. Rice et al.

Folding and Design 10.1016/s1359-0278(96)00061-2
[17]
A general method applicable to the search for similarities in the amino acid sequence of two proteins

Saul B. Needleman, Christian D. Wunsch

Journal of Molecular Biology 10.1016/0022-2836(70)90057-4
[18]
Identification of common molecular subsequences

Temple F Smith, M.S. Waterman

Journal of Molecular Biology 10.1016/0022-2836(81)90087-5
[19]
Alignment and Searching for Common Protein Folds Using a Data Bank of Structural Templates

Mark S. Johnson, John P. Overington, Tom L. Blundell

Journal of Molecular Biology 10.1006/jmbi.1993.1323
[20]
Croman HT (1985)
[23]
Goldstein RA (1996)
[24]
Contact potential that recognizes the correct folding of globular proteins

Vladimir N. Maiorov, Gordon M. Grippen

Journal of Molecular Biology 10.1016/0022-2836(92)90228-c
[25]
On the design and analysis of protein folding potentials

Dror Tobi, Gil Shafran, Nathan Linial et al.

Proteins: Structure, Function, and Bioinformatics 10.1002/(sici)1097-0134(20000701)40:1<71::aid-prot90>3.0.co;2-3
[26]
Pairwise contact potentials are unsuitable for protein folding

Michele Vendruscolo, Eytan Domany

The Journal of Chemical Physics 10.1063/1.477748
[27]
Meller J "Maximum feasibility guideline in the design and analysis of protein folding potentials" J Comp Chem (2001)
[32]
Statistics of sequence-structure threading

Stephen H. Bryant, Stephen F Altschul

Current Opinion in Structural Biology 10.1016/0959-440x(95)80082-4
[34]
Altschul SF "Significance of nucleotide sequence alignments: a method for random sequence permutation that preserves dinucleotide and codon usage" Mol Biol Evol (1985)
[35]
FischerD ElofssonA RiceD EisenbergD.Assessing the performance of fold recognition methods by means of a comprehensive benchmark. In: Pacific Symposium on Biocomputing Hawaii 1996; p300–318.
[36]
CASP3 "Third community wide experiment on the critical assessment of techniques for protein structure prediction" Proteins (1999)
[37]
Holm L "The FSSP database of structurally aligned protein fold families" Nucleic Acids Res (1994)
[38]
MellerJ ElberR. Learning Observing and Outputting Protein Patterns (LOOPP)—a program for protein recognition and design of folding potentials;http://www.tc.cornell.edu/CBIO/loopp.
[39]
Amino acid substitution matrices from protein blocks.

S Henikoff, Jorja G. Henikoff

Proceedings of the National Academy of Sciences 10.1073/pnas.89.22.10915
[40]
Gambel EJ (1958) 10.7312/gumb92958
[41]
Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

S Karlin, S F Altschul

Proceedings of the National Academy of Sciences 10.1073/pnas.87.6.2264
[42]
Improved tools for biological sequence comparison.

W R Pearson, D J Lipman

Proceedings of the National Academy of Sciences 10.1073/pnas.85.8.2444
[43]
Empirical statistical estimates for sequence similarity searches

William R Pearson

Journal of Molecular Biology 10.1006/jmbi.1997.1525
[44]
Are proteins ideal mixtures of amino acids? Analysis of energy parameter sets

Adam Godzik, Jeffrey Skolnick, Andrzej Kolinski

Protein Science 10.1002/pro.5560041016
[47]
A lattice model for protein structure prediction at low resolution.

D A Hinds, M Levitt

Proceedings of the National Academy of Sciences 10.1073/pnas.89.7.2536
[48]
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

S. Altschul

Nucleic Acids Research 10.1093/nar/25.17.3389

Showing 50 of 52 references

Cited By
103
Archives of Biological Sciences
Proteins: Structure, Function, and...
CAFASP3 in the spotlight of EVA

Volker A. Eyrich, Dariusz Przybylski · 2003

Proteins: Structure, Function, and...
Metrics
103
Citations
52
References
Details
Published
Sep 21, 2001
Vol/Issue
45(3)
Pages
241-261
License
View
Cite This Article
Jaroslaw Meller, Ron Elber (2001). Linear programming optimization and a double statistical filter for protein threading protocols. Proteins: Structure, Function, and Bioinformatics, 45(3), 241-261. https://doi.org/10.1002/prot.1145
Related

You May Also Like

Improved side‐chain torsion potentials for the Amber ff99SB protein force field

Kresten Lindorff‐Larsen, Stefano Piana · 2010

5,733 citations

Structure validation by Cα geometry: ϕ,ψ and Cβ deviation

Simon C. Lovell, Ian W. Davis · 2003

4,087 citations

Essential dynamics of proteins

Andrea Amadei, Antonius B. M. Linssen · 1993

3,134 citations