pdb, bilogical data bases, bioinformatics data base

Upload: rajesh-guru

Post on 07-Apr-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/4/2019 PDB, bilogical data bases, bioinformatics data base

    1/4

    PDB (Protein Data Bank)

    The PDB archive contains information about experimentally-determined structures of proteins,

    nucleic acids, and complex assemblies. As of Aug 16, 2011 PDB there are 75246 Structures.

    The Protein Data Bank (PDB) (http://www.pdb.org/) )archive is the single worldwiderepository of information about the 3D structures of large biological molecules, includingproteins and nucleic acids. These are the molecules of life that are found in all organismsincluding bacteria, yeast, plants, flies, other animals, and humans. Understanding the shape ofa molecule helps to understand how it works. This knowledge can be used to help deduce astructure's role in human health and disease, and in drug development. The structures in thearchive range from tiny proteins and bits of DNA to complex molecular machines like theribosome.

    The PDB was established in 1971 at Brookhaven National Laboratory and originallycontained 7 structures. In 1998, the Research Collaboratory for Structural Bioinformatics

    (RCSB) became responsible for the management of the PDB. In 2003, the wwPDB wasformed to maintain a single PDB archive of macromolecular structural data that is freely andpublicly available to the global community. It consists of organizations that act as deposition,data processing and distribution centers for PDB data. In addition, the RCSB PDB supports awebsite where visitors can perform simple and complex queries on the data, analyze, andvisualize the results. The RCSB PDB is a member of the wwPDB, a collaborative effort withPDBe (UK), PDBj (Japan), and BMRB (USA) to ensure the PDB archive is global anduniform. As the wwPDB archive keeper, the RCSB PDB updates the PDB archive atftp://ftp.wwpdb.org weekly. The structures included in each release are highlighted on theRCSB PDB home page and clearly defined on the FTP site.

    PDB Entry , Data Retrieval, Data deposition & Visualization

    The files from the databse can be retrieved from database either by the Name or bythe PDB ID (Composed of Alphabate & Numeric ex. 2BIW) by putting the searchquery in the search box.

    The structure download page helps to download the structural fie in three format--.PDB Format, mmCIF Format, PDBML/XML format

    The protein Sequences can be downloaded or visualized by FASTA format. (>,symbol followed by sequence in 1 letter code).

    The 3D structure of the protein can be visualized by Jmol viewer.or Kiosk viewer,including the Xray diffraction data, NMR data, Molecular description, Ligandcomponents, Deposition summery & the publication details & other usefull data relateto it.

    The data deposition tool available in the RCSB server includes Auto Dep Input Tool(ADIT), pdb_extract, Ligand Depot & RCSB Validation server.

    ADIT : It helps to deposit the structure to PDB & also provides checking of format ofcoordinate, structure factor files, & performs various validation before submitting.

    Pdb_extract : It is an online tool which assemble the statistical information from the outputfile generated during protein structure determination using X-Ray crystallography & NMR

    Method. It generates two mmCIF (Macromolecular Crystallographic File) where allinformations are merged.

  • 8/4/2019 PDB, bilogical data bases, bioinformatics data base

    2/4

    Ligand Depot : This server integrates informations releted to small molecules bound tomacromolecules including a graphical interface to search chemical structure.

    Validation Server : By the RCSB validation server is helpful for checking the coordinates &structural factor files ( .PDB, mm CIF submitted). Validation done by two ways (a) Precheck(Required changes in data files) (b) Validate ( need the coordinate files to be in PDB or mmCIF format). The output can be provided by PROCHECk, NUCheck & SFCheck.

    Including all these tools some other tools like CIFtr, MAXIT, PDBML2CIF, Biopython,Open Rasmol, Web Mol, Simple Viewer, cif2pdb,pdb2cif etc are also available.

    Summery Information of PDB Search

    3ASW

    Structural and biochemical characterization of ClfB:ligand interactions

    Authors: Ganesh, V.K.ReleaseDate:

    2011-05-04 Classification: Cell Adhesion

    Experiment: X-RAY DIFFRACTION with resolution of 2.60

    Compound:2 Polymers [Display Full Polymer Details|Display for AllResults]

    Citation: Not Available.

    Sample PDB Format

    HEADER CELL ADHESION 22-DEC-10 3ASWTITLE STRUCTURAL AND BIOCHEMICAL CHARACTERIZATION OF CLFB:LIGANDTITLE 2 INTERACTIONS

    COMPND MOL_ID: 1;COMPND 2 MOLECULE: CLUMPING FACTOR B;

    COMPND 3 CHAIN: A; ----------------------------PROTEIN B;

    COMPND 6 ENGINEERED: YES;

    COMPND 7 MUTATION: YES;SOURCE MOL_ID: 1;------------------------------------SOURCE 2 ORGANISM_SCIENTIFIC: STAPHYLOCOCCUS AUREUS;

    SOURCE 13 OTHER_DETAILS: THE PEPTIDE WAS CHEMICALLY SYNTHESIZED.

    KEYWDS IGG LIKE, ADHESIN, CYTOKERATIN, CELL ADHESIONEXPDTA X-RAY DIFFRACTION

    AUTHOR V.K.GANESHREVDAT 1 04-MAY-11 3ASW 0

    JRNL AUTH

    V.K.GANESH,E.M.BARBU,C.C.S.DEIVANAYAGAM,B.LE,A.ANDERSON,JRNL AUTH 2 Y.MATSUKA,S.L.LIN,T.J.FOSTER,SV.L.NARAYANA,M.HOOK

    JRNL TITL STRUCTURAL AND BIOCHEMICAL CHARACTERIZATION OF

    CLFB:LIGAND

    http://www.pdb.org/pdb/explore/explore.do?structureId=3ASWhttp://www.pdb.org/pdb/explore/explore.do?structureId=3ASWhttp://www.pdb.org/pdb/explore/explore.do?structureId=3ASWhttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/explore/explore.do?structureId=3ASWhttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/results/results.do?outformat=&qrid=AE0623D3&tabtoshow=Currenthttp://www.pdb.org/pdb/explore/explore.do?structureId=3ASWhttp://www.pdb.org/pdb/explore/jmol.do?structureId=3ASWhttp://www.pdb.org/pdb/download/downloadFile.do?fileFormat=pdb&compression=NO&structureId=3ASWhttp://www.pdb.org/pdb/download/downloadFile.do?fileFormat=pdb&compression=NO&structureId=3ASWhttp://www.pdb.org/pdb/explore/explore.do?structureId=3ASW
  • 8/4/2019 PDB, bilogical data bases, bioinformatics data base

    3/4

    JRNL TITL 2 INTERACTIONSREMARK 2REMARK 2 RESOLUTION. 2.60 ANGSTROMS.REMARK 3

    REMARK 3 REFINEMENT.REMARK 3 PROGRAM : REFMAC 5.2.0019

    REMARK 3 ESU BASED ON FREE R VALUE (A):0.223REMARK 3 ESU BASED ON MAXIMUM LIKELIHOOD (A):

    0.148REMARK 3 ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD (A**2):

    7.087 ----------------------------------------------------------

    SEQRES 1 A 328 HIS HIS HIS HIS HIS HIS GLY SER GLY THR ASN VALASNSEQRES 2 A 328 ASP LYS VAL THR ALA SER ASN PHE LYS LEU GLU LYS

    THR ------------------------------------------------------------ATOM 1 N GLY A 212 41.059 -8.999 -6.884 1.00 51.65

    NATOM 2 CA GLY A 212 42.248 -8.275 -6.336 1.00 53.21C

    ATOM 3 C GLY A 212 42.046 -7.660 -4.949 1.00 53.22C

    1.00 29.95 OO -----------------------------------------------------------------

    HETATM 2654 O HOH B 114 15.539 25.677 -18.531 1.00 36.53OHETATM 2655 O HOH B 135 18.524 26.554 -19.527 1.00 42.35

    OMASTER 341 0 0 7 37 0 0 6 2653 2 0

    28END

    HEADER Types of proteinCOMPND Chemical compoundSOURCE Organism NameEXPDATA No of structure considered for NMR or other

    experimentsAUTHOR Author of the workREVDAT Revised date

    JRNL Journal InformationREMARK Information about paper publishedSEQRES Protein sequence using 3 letter amino acid codeFTNOTE Foot noteFORMUL Formula for compoundATOM Coordinate of sequence & name of atomCONECT Connectivity between the AtomsMODEL NMR data with model conformationsENDMDL End of model informationEND End of PDB File

    (--------------------------)** Indication of Continuing.**

  • 8/4/2019 PDB, bilogical data bases, bioinformatics data base

    4/4