Distributed Data Management: Graeme Kerr, Oracle in R&D Programme


Post on 17-Jan-2016


Page 1: Distributed Data Management Graeme Kerr Oracle in R&D Programme

Distributed Data Management

Graeme Kerr

Oracle in R&D Programme

Page 2

Distributed Data Management

• Provides access to data when and where you need it.
• Optimizes information access regardless of the physical location of data.
• Enables virtualisation of data in Grids.

• Types of Data:
  • Flat files, relational.
  • Structured and unstructured.
  • Text, binary, proprietary.
  • Scalar, user-defined types, XML, RDF.
  • Images, DICOM, LOB, spatial.
  • High volumes.

Page 3

Methods of Data Distribution

• Distribute
• Federate
• Consolidate

Page 4

Distribute Information

• Share information between users, applications, databases, organisations and partners.
• Move or copy / replicate information as needed.
• Traditionally implemented as replication or message queuing.
• Wide range of technology solutions for achieving distribution.
• Continuous feed and periodic refresh capabilities are key requirements.

• Common within the Scientific Grid approach.
  • Works well when replicas are read-only.
  • Considerable issues when replicas are updateable.
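
The two delivery styles named above, continuous feed and periodic refresh, can be illustrated with a minimal in-memory sketch. The `Source` and `Replica` classes are hypothetical and are not part of any Oracle product; the sketch assumes replicas are read-only.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """A read-only copy of the source, kept as a plain dict."""
    data: dict = field(default_factory=dict)
    applied: int = 0  # number of source changes applied so far


class Source:
    """Source site that records every change in an ordered log."""

    def __init__(self):
        self.data = {}
        self.log = []          # ordered (key, value) change records
        self.subscribers = []  # replicas that receive a continuous feed

    def write(self, key, value):
        self.data[key] = value
        self.log.append((key, value))
        # Continuous feed: push each change to subscribers as it happens.
        for replica in self.subscribers:
            replica.data[key] = value
            replica.applied = len(self.log)

    def refresh(self, replica):
        """Periodic refresh: replay the changes the replica has not seen."""
        for key, value in self.log[replica.applied:]:
            replica.data[key] = value
        replica.applied = len(self.log)


source = Source()
fed = Replica()        # receives the continuous feed
refreshed = Replica()  # catches up only when refreshed
source.subscribers.append(fed)

source.write("run", 1)
source.write("run", 2)
stale_before_refresh = dict(refreshed.data)  # still empty at this point
source.refresh(refreshed)
```

If a replica also accepted local writes, the replay in `refresh` would silently overwrite them. That is one simple form of the issue with updateable replicas.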

Page 5

Data Distribution - Example

[Diagram labels: SOURCE SITE; REPLICA SITES; Continuous Feeds]

Page 6

Data Distribution - Example

[Diagram labels: SOURCE SITE; REPLICA SITES; Continuous Feeds; Periodic Refresh]

Page 7

Data Distribution - Example

[Diagram labels: SOURCE DATABASE; REPLICA DATABASES; Oracle Streams; Transportable Tablespaces, RMAN, Data Pump]

Page 8

CERN LCG Example

[Diagram labels: Online DB @ Detector; Offline DB @ CERN IT; sites: RAL, BNL, CNAF, IN2P3, GridKa, ASGC, TRIUMF, SARA, PIC, NorduGrid]

Page 9

Methods of Data Distribution

• Distribute
• Federate
• Consolidate

Page 10

Federate Information

• Federate data in multiple data stores into a single virtual database.
• Access both database and non-database data.
• Access both structured and unstructured data.
• Hide the physical location of data from applications.
• Vendor and open-source solutions.

• Used mostly as a point solution within the commercial world rather than as a key design point.
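
As a rough illustration of hiding the physical location of data, the sketch below exposes a database table and a flat file through one access function. The table, the file contents and the function names are all assumptions made for the example; a real federation layer would also push queries down to each source.

```python
import csv
import io
import sqlite3

# Hypothetical relational store (stands in for any database source).
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE samples (id INTEGER, site TEXT)")
db.executemany("INSERT INTO samples VALUES (?, ?)", [(1, "RAL"), (2, "PIC")])

# Hypothetical flat file (stands in for a CSV or spreadsheet export).
flat_file = io.StringIO("id,site\n3,SARA\n4,BNL\n")


def from_database():
    """Read rows from the relational source."""
    for row_id, site in db.execute("SELECT id, site FROM samples ORDER BY id"):
        yield {"id": row_id, "site": site, "origin": "database"}


def from_flat_file():
    """Read rows from the flat-file source."""
    flat_file.seek(0)
    for row in csv.DictReader(flat_file):
        yield {"id": int(row["id"]), "site": row["site"], "origin": "flat file"}


def federated_samples():
    """Single virtual view: callers do not need to know where a row is stored."""
    yield from from_database()
    yield from from_flat_file()


sites = [row["site"] for row in federated_samples()]
```

The application only calls `federated_samples()`; adding a further source means adding one more reader function, with no change to the callers.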

Page 11

Data Federation - Example

[Diagram labels: DB2; Excel; Web Services]

Page 12

Data Federation - Example

[Diagram labels: DB2; XML; Excel; Flat Files; Text; Web Services; Sybase]


Page 15

Methods of Data Distribution

• Distribute
• Federate
• Consolidate

Page 16

Consolidate Information

• Consolidate heterogeneous data into a central pool.
• Single database or pool of databases.
• Any data (audio, video, XML, email, messages, etc.).
• Any platform (Linux, Windows, Solaris, HP-UX, AIX, Tru64, OpenVMS, OS/390).
• Proven scalability:
  • Hundreds of Very Large Database customers.
  • Industry-leading benchmarks.

• Very common in the commercial world.
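
Consolidation can be sketched as loading differently formatted inputs into one central store and then querying that store alone. The in-memory SQLite database, the `readings` table and both sample feeds are assumptions for illustration only.

```python
import csv
import io
import json
import sqlite3

# Stands in for the central database pool.
pool = sqlite3.connect(":memory:")
pool.execute("CREATE TABLE readings (sensor TEXT, value REAL, source TEXT)")

# Hypothetical inputs arriving in two different formats.
csv_feed = io.StringIO("sensor,value\nt1,20.5\nt2,21.0\n")
json_feed = '[{"sensor": "t3", "value": 19.25}]'


def load_csv(text_stream):
    """Convert CSV rows to the common schema and store them in the pool."""
    rows = [(r["sensor"], float(r["value"]), "csv")
            for r in csv.DictReader(text_stream)]
    pool.executemany("INSERT INTO readings VALUES (?, ?, ?)", rows)


def load_json(text):
    """Convert JSON records to the common schema and store them in the pool."""
    rows = [(r["sensor"], float(r["value"]), "json")
            for r in json.loads(text)]
    pool.executemany("INSERT INTO readings VALUES (?, ?, ?)", rows)


load_csv(csv_feed)
load_json(json_feed)

# Once consolidated, one query covers data from every original format.
count, average = pool.execute(
    "SELECT COUNT(*), AVG(value) FROM readings"
).fetchone()
```

Unlike federation, the data is physically moved here, so queries no longer depend on the original sources being reachable.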

Page 17

Typical Data Centre View

[Diagram labels: DB Clusters; Disk; DB Users]

Page 18

Consolidated Clusters with RAC & ASM

[Diagram labels: DB Cluster Pool; Disk Pool; DB Users]

Page 19

DB Server & Disk Virtualisation

[Diagram labels: DB Cluster Pool; Disk Pool; DB Users]


Page 22

Distributed Data Management: Summary

• Oracle supports efficient utilization of all resources:
  • Consolidate Information: scalable access to large amounts of information.
  • Federate Information: data can be located where needed but accessed as a single database.
  • Share Information: multiple users and applications can share data/information to operate more effectively.

• Information integration and fast connectivity are critical to Grid Computing.

Page 23

Oracle Database Versions

• Database 10g Express Edition
  • Entry-level, small footprint, free to develop, deploy and distribute.
  • 32-bit Windows, RHEL 3 & 4, FC 4, SLES 9, Debian 3.1, Ubuntu and Mandriva.

• Database 10g Enterprise Edition
  • Flagship, full-featured version.

• Database 10g Standard Edition
  • Limited-feature version; RAC, ASM and Streams Apply supported.

• Database 10g Standard Edition One
  • SE on limited hardware. Streams Apply supported.

• Data Grid / Distributed Caching
  • TimesTen
  • Tangosol Coherence


Page 25

For More Information

Database: www.oracle.com/database/index.html

Express Edition: www.oracle.com/technology/products/database/xe/index.html

Tangosol: www.oracle.com/tangosol/index.html

TimesTen: www.oracle.com/database/timesten.html

or

http://search.oracle.com

Database | Tangosol | etc

Page 26

EMEA Oracle in R&D Programme

[email protected]@oracle.com
