scop, bilogical data bases, bioinformatics data base

Upload: rajesh-guru

Post on 07-Apr-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/4/2019 SCOP, bilogical data bases, bioinformatics data base

    1/2

    SCOP

    Structural Classification of Proteins (SCOP) [http://scop.mrc-lmb.cam.ac.uk/scop/]

    database is a largely manual classification of protein structural domains based on similarities

    of their amino acid sequences and three-dimensional structures. Originally published in

    1995it is usually updated at least once yearly by Alexei G. Murzin and his colleagues, upon

    whose expertise the classification rests.

    The SCOP database aims to provide a detailed and comprehensive description of the

    structural and evolutionary relationships between all proteins whose structure is known,

    including all entries in the Protein Data Bank(PDB). It is available as a set of tightly linked

    hypertext documents which make the large database comprehensible and accessible. In

    addition, the hypertext pages offer a panoply of representations of proteins, including links to

    PDB entries, sequences, references, images and interactive display systems. World Wide

    Web URL http://scop.mrc-lmb.cam.ac.uk/scop/ is the entry point to the database (MRC site).

    Existing automatic sequence and structure comparison tools cannot identify all structural and

    evolutionary relationships between proteins. The SCOP classification of proteins has been

    constructed manually by visual inspection and comparison of structures, but with the

    assistance of tools to make the task manageable and help provide generality. The job is made

    more challenging--and theoretically daunting--by the fact that the entities being organized are

    not homogeneous: sometimes it makes more sense to organize by individual domains, and

    other times by whole multi-domain proteins.

    Classification

    Proteins are classified to reflect both structural and evolutionary relatedness. Many levels

    exist in the hierarchy, but the principal levels are family, superfamily and fold, described

    below. The exact position of boundaries between these levels are to some degree subjective.

    The evolutionary classification is generally conservative: where any doubt about relatedness

    exists, we made new divisions at the family and superfamily levels. Thus, some researchers

    may prefer to focus on the higher levels of the classification tree, where proteins with

    structural similarity are clustered.

    The different major levels in the hierarchy are:

    1. Family: Clear evolutionarily relationshipProteins clustered together into families are clearly evolutionarily related. Generally,

    this means that pairwise residue identities between the proteins are 30% and greater.

    However, in some cases similar functions and structures provide definitive evidence

    of common descent in the absense of high sequence identity; for example, many

    globins form a family though some members have sequence identities of only 15%.

    2. Superfamily: Probable common evolutionary originProteins that have low sequence identities, but whose structural and functional

    features suggest that a common evolutionary origin is probable are placed together in

    superfamilies. For example, actin, the ATPase domain of the heat shock protein, and

    hexakinase together form a superfamily.

    http://www.rcsb.org/pdb/http://www.rcsb.org/pdb/http://www.rcsb.org/pdb/http://www.rcsb.org/pdb/
  • 8/4/2019 SCOP, bilogical data bases, bioinformatics data base

    2/2

    3. Fold:Major structural similarityProteins are defined as having a common fold if they have the same major secondary

    structures in the same arrangement and with the same topological connections.

    Different proteins with the same fold often have peripheral elements of secondary

    structure and turn regions that differ in size and conformation. In some cases, these

    differing peripheral regions may comprise half the structure. Proteins placed togetherin the same fold category may not have a common evolutionary origin: the structural

    similarities could arise just from the physics and chemistry of proteins favoring

    certain packing arrangements and chain topologies.

    Structural classes

    SCOP includes the following structural classes:

    1. -helical domains2. -sheet domains3. / domains, which consist of "beta-alpha-beta" structural units or "motifs" that form

    mainly parallel -sheets

    4. + domainsformed by independent -helices and mainly antiparallel -sheets5. multi-domain proteins6. membrane and cell surface proteins and peptides (not including those involved in the

    immune system)

    7. "small" proteins8. coiled-coil proteins9. low-resolution protein structures10.peptides and fragments11.designed proteins of non-natural sequence