journal article Open Access Mar 01, 1995

An automatic method involving cluster analysis of secondary structures for the identification of domains in proteins

Protein Science Vol. 4 No. 3 pp. 506-520 · Wiley
View at Publisher Save 10.1002/pro.5560040317
Abstract
AbstractWith a growing number of structures available in the Brookhaven Protein Data Bank, automatic methods for domain identification are required for the construction of databases. Domains are considered to be clusters of secondary structure elements. Thus, helices and strands are first clustered using intersecondary structural distances between Cα positions, and dendrograms based on this distance measure are used to identify domains. Individual domains are recognized by a disjoint factor, which enables the automatic identification and classification into disjoint, interacting, and conjoint domains. Application to a database of 83 protein families and 18 unique structures shows that the approach provides an effective delineation of boundaries and identifies those proteins that can be considered as a single domain. A quantitative estimate of the interaction between domains has been proposed. The database of protein domains is a useful tool for understanding protein folding, for recognizing protein folds, and for understanding structure‐activity relationships.
Topics

No keywords indexed for this article. Browse by subject →

References
38
[3]
The protein data bank: A computer-based archival file for macromolecular structures

Frances C. Bernstein, Thomas F. Koetzle, Graheme J.B. Williams et al.

Journal of Molecular Biology 10.1016/s0022-2836(77)80200-3
[5]
X-ray analyses of aspartic proteinases

T.L. Blundell, J.A. Jenkins, B.T. Sewell et al.

Journal of Molecular Biology 10.1016/0022-2836(90)90084-y
[6]
The tree structural organization of proteins

Gordon M. Crippen

Journal of Molecular Biology 10.1016/0022-2836(78)90043-8
[8]
SETOR: Hardware-lighted three-dimensional solid model representations of macromolecules

Stephen V. Evans

Journal of Molecular Graphics 10.1016/0263-7855(93)87009-t
[10]
Construction of Phylogenetic Trees

Walter M. Fitch, Emanuel Margoliash

Science 10.1126/science.155.3760.279
[13]
Protein Architecture and the Origin of Introns

M. Go, M. Nosaka

Cold Spring Harbor Symposia on Quantitative Biolog... 10.1101/sqb.1987.052.01.100
[14]
Parser for protein folding units

Liisa Holm, Chris Sander

Proteins: Structure, Function, and Bioinformatics 10.1002/prot.340190309
[16]
IslamSA LudJ SternbergJE.1995.Identification and analysis of domains in proteins.Protein Eng. Forthcoming. 10.1093/protein/8.6.513
[17]
Domains in proteins: Definitions, location, and structural principles

Joël Janin, Cyrus Chothia

Methods in Enzymology 10.1016/0076-6879(85)15030-5
[18]
Kabsch W "Dictionary of protein secondary structure —Pattern‐recognition of hydrogen‐bonded and geometrical features" Bio‐polymers (1983)
[19]
Structure of papain refined at 1.65 Å resolution

I.G. Kamphuis, K.H. Kalk, M.B.A. Swarte et al.

Journal of Molecular Biology 10.1016/0022-2836(84)90467-4
[23]
Structural patterns in globular proteins

Michael Levitt, Cyrus Chothia

Nature 10.1038/261552a0
[24]
X-Ray Studies of Protein Interactions

A Liljas, M G Rossmann

Annual Review of Biochemistry 10.1146/annurev.bi.43.070174.002355
[27]
PhillipsD.1966.The three‐dimensional structure of an enzyme molecule.Sci Am Nov:78–90. 10.1038/scientificamerican1166-78
[29]
Hierarchic organization of domains in globular proteins

George D. Rose

Journal of Molecular Biology 10.1016/0022-2836(79)90363-2
[35]
Nucleation, Rapid Folding, and Globular Intrachain Regions in Proteins

Donald B. Wetlaufer

Proceedings of the National Academy of Sciences 10.1073/pnas.70.3.697
[36]
Location of structural domains in proteins

Shoshana J. Wodak, Joël Janin

Biochemistry 10.1021/bi00526a005
[38]
Compact units in proteins

Michael H. Zehfus, George D. Rose

Biochemistry 10.1021/bi00367a062
Cited By
67
Proteins: Structure, Function, and...
Proteins: Structure, Function, and...
Proteins: Structure, Function, and...
Proteins: Structure, Function, and...
Metrics
67
Citations
38
References
Details
Published
Mar 01, 1995
Vol/Issue
4(3)
Pages
506-520
License
View
Cite This Article
R. Sowdhamini, Tom L. Blundell (1995). An automatic method involving cluster analysis of secondary structures for the identification of domains in proteins. Protein Science, 4(3), 506-520. https://doi.org/10.1002/pro.5560040317
Related

You May Also Like

UCSF ChimeraX: Structure visualization for researchers, educators, and developers

Eric F. Pettersen, Thomas D. Goddard · 2020

9,270 citations

UCSF ChimeraX: Meeting modern challenges in visualization and analysis

Thomas D. Goddard, Conrad C. Huang · 2017

5,387 citations

MolProbity: More and better reference data for improved all‐atom structure validation

Christopher J. Williams, Jeffrey J. Headd · 2017

4,536 citations

Verification of protein structures: Patterns of nonbonded atomic interactions

Chris Colovos, Todd O. Yeates · 1993

3,760 citations