xml databases

XML Databases and Effective Search Engine

Upload: srinivasan-r

Post on 09-Jan-2017


Page 1: Xml databases

XML Databases and Effective Search Engine

Page 2: Xml databases

XML

● Extensible Markup Language (XML) is a meta-language that describes the contents of a document (self-describing data)

● Java = portable code; XML = portable data

Page 3: Xml databases

Applications of XML

● Configuration files: used extensively in J2EE architectures

● Medium for data interchange: a better alternative to proprietary data formats

● B2B transactions on the Web: electronic business orders (ebXML), financial exchange, message exchange (SOAP)

Page 4: Xml databases

Four tasks of data management

The basic tasks of data management do not change simply because the data are stored in XML:

● Store

● Search

● Retrieve

● Display

Page 5: Xml databases

Is XML a Database?

● Only in the strictest sense of the term

● It is self-describing

● It is portable

● It can describe data in tree or graph structures

Page 6: Xml databases

Is XML a “Sort of” Database?

● Storage -- XML documents

● Schemas -- DTDs, XML Schemas, and so on

● Query languages -- XQuery, XQL, etc.

● Programming interfaces -- SAX, DOM
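
The DOM interface and a query language can be tried together with nothing but the JDK. The sketch below is only an illustration: the JDK does not ship an XQuery engine, so XPath (the path language that XQuery builds on) is swapped in, and the class name `XmlQuerySketch`, the method `select`, and the sample document are all hypothetical.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: parse XML with DOM (JAXP) and query it with XPath.
class XmlQuerySketch {

    /** Returns the text content of every node matched by the XPath expression. */
    static List<String> select(String xml, String xpathExpr) {
        try {
            // DOM: load the whole document into an in-memory tree of nodes.
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            // XPath: evaluate the expression against the tree and ask for a node set.
            NodeList nodes = (NodeList) XPathFactory.newInstance()
                    .newXPath()
                    .evaluate(xpathExpr, doc, XPathConstants.NODESET);
            List<String> result = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                result.add(nodes.item(i).getTextContent());
            }
            return result;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse or query the XML", e);
        }
    }

    public static void main(String[] args) {
        String xml = "<library><book><title>A</title></book>"
                   + "<book><title>B</title></book></library>";
        System.out.println(select(xml, "/library/book/title"));
    }
}
```

DOM keeps the whole tree in memory, which is convenient for queries; SAX would instead stream events and suit documents too large to load at once.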

Page 7: Xml databases

XML lacks...

● Efficient storage

● Indexes

● Security

● Transactions and data integrity

● Multi-user access

● Triggers

● Queries across multiple documents

Page 8: Xml databases

XML Database Architecture

Page 9: Xml databases

XML Databases

1. RDBMSs that support XML, e.g. Oracle, DB2

2. Native XML databases, e.g. Apache Xindice, Stanford XMLDB

Page 10: Xml databases

Relational Databases Vs XML Databases

| Relational database | XML database |
|---|---|
| A relational database contains tables. | An XML database contains collections. |
| A relational table contains records with the same schema. | A collection contains XML documents with the same schema. |
| A relational record is an unordered list of named values. | An XML document is a tree of nodes. |
| A SQL query returns an unordered set of records. | An XQuery returns an ordered sequence of nodes. |

Page 11: Xml databases

Technologies

● Java - Servlets, JAXP

● Apache Tomcat web server

● Apache Xindice XML DB

● RDBMS - MS Access, MySQL or Oracle

● IDE - Eclipse

Page 12: Xml databases

Modules

● Parse XML and insert into database

● Convert Database table into XML

● Reflect changes in the database to XML and vice versa

Page 13: Xml databases

Parse XML and Insert into Database
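
One way this module could work is sketched below; it is not the implementation from the slides. It assumes each record is an element (here `book`) whose child elements map one-to-one onto table columns. The class `XmlToRowsSketch`, its methods, and the table and column names are hypothetical.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: turn XML records into rows for a parameterized INSERT.
class XmlToRowsSketch {

    /** Builds e.g. "INSERT INTO book (title, author) VALUES (?, ?)". */
    static String insertSql(String table, List<String> columns) {
        // Table and column names are assumed to come from trusted configuration,
        // because SQL identifiers cannot be bound as parameters.
        String placeholders = String.join(", ", Collections.nCopies(columns.size(), "?"));
        return "INSERT INTO " + table + " (" + String.join(", ", columns)
                + ") VALUES (" + placeholders + ")";
    }

    /** Reads one row per rowTag element; a missing child element yields null. */
    static List<List<String>> rows(String xml, String rowTag, List<String> columns) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            NodeList records = doc.getElementsByTagName(rowTag);
            List<List<String>> result = new ArrayList<>();
            for (int i = 0; i < records.getLength(); i++) {
                Element record = (Element) records.item(i);
                List<String> values = new ArrayList<>();
                for (String column : columns) {
                    Node cell = record.getElementsByTagName(column).item(0);
                    values.add(cell == null ? null : cell.getTextContent());
                }
                result.add(values);
            }
            return result;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }
}
```

With a JDBC connection available, each row would be bound to a `PreparedStatement` created from `insertSql(...)` using `setString(i + 1, value)` and then executed, ideally as a batch inside one transaction.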

Page 14: Xml databases

Database Table to XML

Page 15: Xml databases

Java ResultSetMetaData

● An object that can be used to get information about the types and properties of the columns in a ResultSet object
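
A minimal sketch of how `ResultSetMetaData` could drive the table-to-XML conversion follows. It assumes that column labels are valid XML element names and that every value can be read as a string; the class `TableToXmlSketch` and the tag names passed to it are hypothetical, not taken from the slides.

```java
import java.sql.ResultSet;
import java.sql.ResultSetMetaData;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: serialize a JDBC result set as XML.
class TableToXmlSketch {

    /** Reads the column labels from the metadata (JDBC columns are 1-based). */
    static List<String> columnLabels(ResultSetMetaData meta) throws SQLException {
        List<String> labels = new ArrayList<>();
        for (int i = 1; i <= meta.getColumnCount(); i++) {
            labels.add(meta.getColumnLabel(i));
        }
        return labels;
    }

    /** Escapes the characters that must not appear raw in XML text content. */
    static String escape(String text) {
        return text.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    /** Renders one row as nested elements; SQL NULL becomes an empty element. */
    static String rowToXml(String rowTag, List<String> labels, List<String> values) {
        StringBuilder xml = new StringBuilder("<" + rowTag + ">");
        for (int i = 0; i < labels.size(); i++) {
            String value = values.get(i);
            xml.append('<').append(labels.get(i)).append('>')
               .append(value == null ? "" : escape(value))
               .append("</").append(labels.get(i)).append('>');
        }
        return xml.append("</").append(rowTag).append('>').toString();
    }

    /** Walks the result set and wraps all rows in a single root element. */
    static String toXml(ResultSet rs, String tableTag, String rowTag) throws SQLException {
        List<String> labels = columnLabels(rs.getMetaData());
        StringBuilder xml = new StringBuilder("<" + tableTag + ">");
        while (rs.next()) {
            List<String> values = new ArrayList<>();
            for (int i = 1; i <= labels.size(); i++) {
                values.add(rs.getString(i));
            }
            xml.append(rowToXml(rowTag, labels, values));
        }
        return xml.append("</").append(tableTag).append('>').toString();
    }
}
```

Because the column names come from the metadata at run time, the same code can export any table without knowing its schema in advance.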

Page 16: Xml databases

Modules - to be Implemented

● Convert Text files to XML

● Convert HTML to XML

● Implementing Searching

● Implementing Indexing

Page 17: Xml databases

Searching XML

● Search process begins at the root node

● Search proceeds by comparing characters in the search key to edge labels

● Entering a wrong block may lead to backtracking
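
The first two steps describe a trie-style lookup. The sketch below assumes a plain in-memory trie with one character per edge; the slides do not say which structure is actually used, the class `TrieSearchSketch` is hypothetical, and the block layout that causes backtracking is not modeled.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration of searching by matching key characters to edge labels.
class TrieSearchSketch {

    /** A trie node; each outgoing edge is labeled with a single character. */
    static final class Node {
        final Map<Character, Node> edges = new HashMap<>();
        boolean isKey; // true if a stored key ends at this node
    }

    private final Node root = new Node();

    /** Adds a key by creating any edges that are missing along its path. */
    void insert(String key) {
        Node node = root;
        for (char c : key.toCharArray()) {
            node = node.edges.computeIfAbsent(c, k -> new Node());
        }
        node.isKey = true;
    }

    /** Starts at the root and follows the edge labeled with each key character. */
    boolean contains(String key) {
        Node node = root;
        for (char c : key.toCharArray()) {
            node = node.edges.get(c);
            if (node == null) {
                return false; // no edge carries this character: the key is absent
            }
        }
        return node.isKey;
    }

    public static void main(String[] args) {
        TrieSearchSketch trie = new TrieSearchSketch();
        trie.insert("title");
        trie.insert("tier");
        System.out.println(trie.contains("title"));
        System.out.println(trie.contains("tit"));
    }
}
```

The cost of a lookup depends on the length of the search key rather than on the number of stored keys, which is why this family of structures is attractive for search.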

Page 18: Xml databases

Indexing XML

● The goal of indexing is to optimize the speed and performance of finding relevant documents

● Without an index, the search engine would scan every document

● Such a full scan takes a considerable amount of time and computing power
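
The difference between scanning and indexing can be illustrated with an inverted index, which maps each term to the documents that contain it. This is a generic sketch, not the indexing scheme planned in the slides; the class `InvertedIndexSketch`, the document IDs, and the simple tokenization are assumptions.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration: term -> IDs of the documents containing that term.
class InvertedIndexSketch {

    private final Map<String, Set<String>> postings = new HashMap<>();

    /** Indexes the text of one document, e.g. the text content extracted from an XML file. */
    void add(String docId, String text) {
        // Lower-case the text and split on runs of non-word characters.
        for (String term : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    /** Looks the term up directly instead of scanning every document. */
    Set<String> search(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet());
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add("d1", "XML databases store documents");
        index.add("d2", "Relational databases store records");
        System.out.println(index.search("databases"));
    }
}
```

The work of reading every document is paid once at indexing time; each later query is a single map lookup.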

Page 19: Xml databases

Conclusions

● XML can enable effective searching

● Because an XML document describes itself (meta-language), it can speed up finding relevant documents

Page 20: Xml databases

Thank you

XML is everywhere