XML in the Biomedical Field

Presented by: Eng. Juman Ghazi. Director: Dr. Eng. Rasha Masood.

Upload: juman-ghazi

Post on 12-Jul-2015

TRANSCRIPT

Page 1: Xml in bio medical field

Presented by: Eng. Juman Ghazi

Director: Dr. Eng. Rasha Masood

Page 2: Xml in bio medical field

What is XML?

XML stands for eXtensible Markup Language.

XML is a markup language much like HTML.

XML was designed to describe data and to focus on what the data is.

Page 3: Xml in bio medical field

eXtensible Markup Language

Helps information systems share structured data.

A metalanguage that gives meaning to data that other applications can use.

Application and platform independent.

Allows various types of data.

Extensible to accommodate new tags and processing methods.

Allows user-defined tags.

Page 4: Xml in bio medical field

Advantages of using XML

A simpler version of the Standard Generalized Markup Language (SGML).

Easy to read and understand.

Supported by a large number of platforms.

Used across open standards.

Page 5: Xml in bio medical field

Components of an XML Document

1. Elements: <hello>

2. Attributes: <item id="33905">

3. Entities: &lt; (<)

4. Advanced components:

- CDATA sections

- Processing instructions

Page 6: Xml in bio medical field

Example in HTML

<html>
  <head>
    <title>Menu</title>
  </head>
  <body>
    <h1>Soup</h1>
    <h4>4.99</h4>
  </body>
</html>

Page 7: Xml in bio medical field

HTML in web browser

Page 8: Xml in bio medical field

Example in XML

<?xml version="1.0" ?>
<menu>
  <item>
    <itemname>soup</itemname>
    <cost>4.99</cost>
  </item>
</menu>

Page 9: Xml in bio medical field

XML in web browser

Page 10: Xml in bio medical field

Declaration:

First line in the document.

Provides information to the parser.

Recommended but optional.

Can contain up to three name-value pairs:

- version (required whenever the declaration is present).

- encoding (defaults to UTF-8).

- standalone (rare).
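
As a minimal sketch of the points above, assuming Python's standard xml.etree.ElementTree module (the slides do not name a parser), the snippet parses the same document with and without a declaration and then writes it back with one. The names MENU_WITH_DECL and MENU_NO_DECL are illustrative only.

```python
import io
import xml.etree.ElementTree as ET

# Illustrative documents: identical content, with and without a declaration.
MENU_WITH_DECL = b'<?xml version="1.0" encoding="UTF-8"?><menu><item>soup</item></menu>'
MENU_NO_DECL = b"<menu><item>soup</item></menu>"

# The declaration is optional: both inputs parse to the same root element.
root_a = ET.fromstring(MENU_WITH_DECL)
root_b = ET.fromstring(MENU_NO_DECL)
print(root_a.tag, root_b.tag)

# Ask the serializer to emit a declaration when writing the tree back out.
buffer = io.BytesIO()
ET.ElementTree(root_b).write(buffer, encoding="utf-8", xml_declaration=True)
serialized = buffer.getvalue()
print(serialized.decode("utf-8"))
```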

Page 11: Xml in bio medical field

Tags:

Text between < and >.

Have a start tag and an end tag.

Tags and data are stored together.

Data is self-descriptive and easy to understand.

Page 12: Xml in bio medical field

[Figure: tree structure of an XML document, with a root node branching into element and text nodes.]

Page 13: Xml in bio medical field

Elements:

Basic building blocks of an XML file.

Text between a start tag and an end tag is considered the value of the element.

Documents contain exactly one root element.

Can contain nested elements.
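
The following sketch, assuming Python's standard xml.etree.ElementTree module, walks the menu document from the earlier slide to show the root element, the nested elements, and each element's text value. The helper name outline is hypothetical.

```python
import xml.etree.ElementTree as ET

MENU = """<menu>
  <item>
    <itemname>soup</itemname>
    <cost>4.99</cost>
  </item>
</menu>"""

def outline(element, depth=0):
    """Return one (depth, tag, value) tuple per element, parents first."""
    value = (element.text or "").strip()
    rows = [(depth, element.tag, value)]
    for child in element:  # iterating an element yields its direct children
        rows.extend(outline(child, depth + 1))
    return rows

root = ET.fromstring(MENU)  # the single root element, <menu>
for depth, tag, value in outline(root):
    print("  " * depth + tag, value)
```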

Page 14: Xml in bio medical field

Attributes:

Provide additional information about the elements.

Name-value pairs:

- Values are enclosed in single or double quotes.

- Attribute names are unique within the same element.
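
A short sketch of both attribute rules, assuming Python's standard xml.etree.ElementTree module. The category attribute and its value are made up for illustration; only id="33905" comes from the slides.

```python
import xml.etree.ElementTree as ET

# Single and double quotes are both accepted around attribute values.
item = ET.fromstring("""<item id="33905" category='soup'>soup</item>""")
print(item.get("id"))  # look up one attribute by name
print(item.attrib)     # all attributes of the element as a dict

# Repeating an attribute name within one element is not well-formed XML.
try:
    ET.fromstring('<item id="1" id="2"/>')
    duplicate_accepted = True
except ET.ParseError:
    duplicate_accepted = False
print(duplicate_accepted)
```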

Page 15: Xml in bio medical field

Comments:

Can appear almost anywhere in a document, but not inside a tag or before the XML declaration.

- Start with <!--

- End with -->

Contents inside a comment are not parsed.
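
To illustrate that markup inside a comment is not parsed, here is a small sketch assuming Python's standard xml.etree.ElementTree module, whose default parser discards comments. The salad item is made up for the example.

```python
import xml.etree.ElementTree as ET

MENU = """<menu>
  <!-- <item><itemname>salad</itemname></item> is commented out -->
  <item><itemname>soup</itemname></item>
</menu>"""

root = ET.fromstring(MENU)

# Only the item outside the comment becomes part of the element tree.
names = [name.text for name in root.iter("itemname")]
print(names)
```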

Page 16: Xml in bio medical field

More in XML:

1. Schemas

2. Parsers

3. Editors

4. Standards

Page 17: Xml in bio medical field

1. Schemas:

Describe the structure and content of an XML document.

Define a shared vocabulary for applications.

Can be expressed using XML schema languages such as:

- Document Type Definition (DTD).

- XML Schema (W3C).

Page 18: Xml in bio medical field

Industry standards and data exchange:

Page 19: Xml in bio medical field

2. Parsers:

Read and process the content of an XML document.

Include push and pull parsers:

- Pull parsers: the application requests each event from the parser.

- Push parsers: the parser controls the flow and pushes events to the application.

Free XML parsers are available, including tools from IBM.
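
The difference between the two parser styles can be sketched with Python's standard library, used here as an assumption since the slides name no specific parser: xml.sax behaves as a push parser (it calls the handler), while XMLPullParser lets the application ask for events. The class TagCollector and the functions push_parse and pull_parse are hypothetical names.

```python
import xml.sax
from xml.etree.ElementTree import XMLPullParser

DOC = "<menu><item><itemname>soup</itemname><cost>4.99</cost></item></menu>"

class TagCollector(xml.sax.ContentHandler):
    """Push style: the parser calls startElement for every start tag."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def startElement(self, name, attrs):
        self.tags.append(name)

def push_parse(text):
    handler = TagCollector()
    xml.sax.parseString(text.encode("utf-8"), handler)  # parser drives the handler
    return handler.tags

def pull_parse(text):
    parser = XMLPullParser(events=("start",))
    parser.feed(text)
    # Pull style: the application decides when to read the queued events.
    tags = [element.tag for _event, element in parser.read_events()]
    parser.close()
    return tags

print(push_parse(DOC))
print(pull_parse(DOC))
```

Both functions report the same start tags; what differs is which side is in control of the loop.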

Page 20: Xml in bio medical field

3. Editors:

Text and graphical editors facilitate the editing of XML code.

Benefits of using editors:

-Reduce coding effort.

-Provide to perform tasks.

Page 21: Xml in bio medical field

Page 22: Xml in bio medical field

4. Standards:

Various types of standards:

- Core standards form the basis of what is expressed in an XML document.

- Processing standards relate to XML processing by developers.

- Key vocabularies (applications).

Bodies that influence XML standards include the W3C, ISO, and OASIS.

Page 23: Xml in bio medical field

XML Rules:

1. All Elements Must Have a Closing Tag.

In HTML, some elements do not need a closing tag:

<p>This is a paragraph
<p>This is another paragraph

In XML, it is illegal to omit the closing tag:

<p>This is a paragraph</p>
<p>This is another paragraph</p>

2. XML Tags are Case Sensitive.

The tag <Letter> is different from the tag <letter>, so opening and closing tags must be written with the same case:

<Message>This is incorrect</message>
<message>This is correct</message>

"Opening and closing tags" are often referred to as "start and end tags". The two terms mean exactly the same thing.

Page 24: Xml in bio medical field

XML Rules:

3. Elements Must be Properly Nested:

In HTML, you might see improperly nested elements:

<b><i>This text is bold and italic</b></i>

In XML, all elements must be properly nested within each other:

<b><i>This text is bold and italic</i></b>

4. XML Documents Must Have a Root Element:

XML documents must contain one element that is the parent of all other elements. This element is called the root element.

<root>
  <child>
    <subchild>.....</subchild>
  </child>
</root>

Page 25: Xml in bio medical field

XML Rules:

XML Attribute Values Must be Quoted:

XML elements can have attributes in name/value pairs, and the value must always be quoted.

Wrong:

<note date=12/11/2007>
  <to>Tove</to>
  <from>Jani</from>
</note>

Right:

<note date="12/11/2007">
  <to>Tove</to>
  <from>Jani</from>
</note>
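
The well-formedness rules above (closing tags, case sensitivity, nesting, a single root, quoted attribute values) can be checked with any conforming parser. The sketch below assumes Python's standard xml.etree.ElementTree module; is_well_formed is a hypothetical helper name, and the doc wrapper element is added only so the paragraph example has one root.

```python
import xml.etree.ElementTree as ET

def is_well_formed(text):
    """Return True if the parser accepts the text as well-formed XML."""
    try:
        ET.fromstring(text)
    except ET.ParseError:
        return False
    return True

checks = {
    "missing closing tags": "<doc><p>This is a paragraph<p>This is another paragraph</doc>",
    "closing tags present": "<doc><p>This is a paragraph</p><p>This is another paragraph</p></doc>",
    "case mismatch": "<Message>This is incorrect</message>",
    "improper nesting": "<b><i>This text is bold and italic</b></i>",
    "proper nesting": "<b><i>This text is bold and italic</i></b>",
    "two root elements": "<to>Tove</to><from>Jani</from>",
    "unquoted attribute": "<note date=12/11/2007></note>",
    "quoted attribute": '<note date="12/11/2007"></note>',
}

for label, text in checks.items():
    print(label, is_well_formed(text))
```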

Page 26: Xml in bio medical field

XML Rules:

5. Entity References

Some characters have a special meaning in XML.

A character like "<" inside an XML element will generate an error, because the parser interprets it as the start of a new element:

<message>if salary < 1000 then</message>

To avoid the error, replace "<" with an entity reference:

<message>if salary &lt; 1000 then</message>

Page 27: Xml in bio medical field

Characters that have a special meaning in XML:

Character   Meaning          Entity reference
<           Less than        &lt;
>           Greater than     &gt;
&           Ampersand        &amp;
'           Apostrophe       &apos;
"           Quotation mark   &quot;
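
In practice the replacement is usually done by a library rather than by hand. As a sketch, assuming Python's standard xml.sax.saxutils and xml.etree.ElementTree modules, the snippet escapes the salary example and shows that the parser restores the original text.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

raw = "if salary < 1000 then"
escaped = escape(raw)  # replaces &, < and > with entity references
document = "<message>" + escaped + "</message>"

# The parser turns the entity reference back into the original character.
message = ET.fromstring(document)
print(escaped)
print(message.text)

# Quotes are only escaped on request, via the optional second argument.
quoted = escape("say \"hi\" & 'bye'", {'"': "&quot;", "'": "&apos;"})
print(quoted)
```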

Page 28: Xml in bio medical field

XML Elements are eXtensible

XML elements can be extended to carry more information.

<note>
  <to>Tove</to>
  <from>Jani</from>
  <body>Don't forget me this weekend!</body>
</note>

Suppose an application extracts the <to>, <from>, and <body> elements from this document, and the author later adds some extra information to it:

<note>
  <date>2008-01-10</date>
  <to>Tove</to>
  <from>Jani</from>
  <heading>Reminder</heading>
  <body>Don't forget me this weekend!</body>
</note>

Should the application break or crash?

No. The application can still find the <to>, <from>, and <body> elements. One of the beauties of XML is that it can be extended without breaking applications.
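
A minimal sketch of such an application, assuming Python's standard xml.etree.ElementTree module: because it looks elements up by name, the extra date and heading elements do not affect it. The function name summarize is hypothetical.

```python
import xml.etree.ElementTree as ET

ORIGINAL = """<note>
  <to>Tove</to>
  <from>Jani</from>
  <body>Don't forget me this weekend!</body>
</note>"""

EXTENDED = """<note>
  <date>2008-01-10</date>
  <to>Tove</to>
  <from>Jani</from>
  <heading>Reminder</heading>
  <body>Don't forget me this weekend!</body>
</note>"""

def summarize(note_xml):
    """Extract only the elements this application knows about."""
    note = ET.fromstring(note_xml)
    # Elements are looked up by name, so unknown siblings are simply ignored.
    return {tag: note.findtext(tag) for tag in ("to", "from", "body")}

print(summarize(ORIGINAL))
print(summarize(ORIGINAL) == summarize(EXTENDED))
```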

Page 29: Xml in bio medical field

Examples:

1- Book store

<bookstore>
  <book category="CHILDREN">
    <title>Harry Potter</title>
    <author>J K. Rowling</author>
    <year>2005</year>
    <price>29.99</price>
  </book>
  <book category="WEB">
    <title>Learning XML</title>
    <author>Erik T. Ray</author>
    <year>2003</year>
    <price>39.95</price>
  </book>
</bookstore>
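
As an illustration of how an application might query the book store document, the sketch below assumes Python's standard xml.etree.ElementTree module and its limited XPath support. It selects books by their category attribute and sums the prices.

```python
import xml.etree.ElementTree as ET

BOOKSTORE = """<bookstore>
  <book category="CHILDREN">
    <title>Harry Potter</title>
    <author>J K. Rowling</author>
    <year>2005</year>
    <price>29.99</price>
  </book>
  <book category="WEB">
    <title>Learning XML</title>
    <author>Erik T. Ray</author>
    <year>2003</year>
    <price>39.95</price>
  </book>
</bookstore>"""

store = ET.fromstring(BOOKSTORE)

# Select books whose category attribute equals "WEB".
web_titles = [book.findtext("title") for book in store.findall("book[@category='WEB']")]

# Element text is always a string, so prices are converted before summing.
total = sum(float(book.findtext("price")) for book in store.iter("book"))

print(web_titles)
print(round(total, 2))
```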

Page 30: Xml in bio medical field

Why XML for Bioinformatics?

Biology is a complex discipline.

Wide variety of data resources and repositories.

Biological data is represented in multiple formats (FASTA, AGP, GFF, ...).

No standard protocol:

1- to interrogate biological data stores.

2- for genomics, proteomics, and cheminformatics.

3- to exchange biological data.

This makes the data difficult to use and exchange.
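
To make the format problem concrete, here is a rough sketch that converts FASTA records into XML, assuming Python's standard xml.etree.ElementTree module. The element and attribute names (sequences, sequence, id, description) are hypothetical and do not follow any of the real markup languages; the sample records are made up.

```python
import xml.etree.ElementTree as ET

# Made-up sample data in FASTA format: a ">" header line, then sequence lines.
FASTA_SAMPLE = """>seq1 example nucleotide record
ACGT
TTGA
>seq2
MKV
"""

def fasta_to_xml(fasta_text):
    """Build a <sequences> tree (hypothetical vocabulary) from FASTA text."""
    root = ET.Element("sequences")
    current = None
    chunks = []
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if current is not None:
                current.text = "".join(chunks)  # finish the previous record
            seq_id, _, description = line[1:].partition(" ")
            current = ET.SubElement(root, "sequence", id=seq_id)
            if description:
                current.set("description", description)
            chunks = []
        elif current is not None:
            chunks.append(line)
    if current is not None:
        current.text = "".join(chunks)  # finish the last record
    return root

sequences = fasta_to_xml(FASTA_SAMPLE)
print(ET.tostring(sequences, encoding="unicode"))
```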

Page 31: Xml in bio medical field

XML in Bioinformatics

1- BSML (Visual Genomics).

2- BioML (ProteoMetrics).

3- CML (chemical information: atomic, crystallographic, structures, ...).

4- Gene Ontology Consortium.

Page 32: Xml in bio medical field

The Bioinformatic Sequence Markup Language (BSML)

- The DTD is aimed at representing DNA, RNA, and protein sequences and their graphic properties.

- The structure of the information was found to be similar to the one used in the databases.

(http://www.ebi.ac.uk/embl.html)

(http://www.visualgenomics.com/products/index.html)

(http://www.ncbi.nlm.nih.gov; http://www.ddbj.nig.ac.jp)

Page 33: Xml in bio medical field

Gene Ontology Consortium

Controlled description for:

1- Molecular function.

2- Biological processes.

3- Cellular locations of gene products.

Page 34: Xml in bio medical field

The BIOpolymer Markup Language (BioML)

- Takes a different approach from BSML.

- The goal of BioML (Fenyo, 1999): “BioML was designed to mimic the hierarchical structure of a living organism.”

- Data integration, e.g. of nucleotide and protein sequences.