distributed data management graeme kerr oracle in r&d programme
TRANSCRIPT
<Insert Picture Here>
Distributed Data ManagementGraeme KerrOracle in R&D Programme
Distributed Data Management
• Provides access to data when and where you need it• Optimizes information access regardless of physical
location of data.• Enables virtualisation of data in Grids.
• Types of Data:• Flat Files, Relational.• Structured & unstructured.• Text, Binary, proprietary.• Scalar, user defined typed, XML, RDF• Images, DICOM, LOB, Spatial.• High Volumes.
Methods of Data Distribution
• Distribute• Federate• Consolidate
Distribute Information
• Share information between users, applications, databases, organisations and partners. • Move or copy / replicate information as needed• Traditionally implemented as replication or message
queuing• Wide range of technology solutions to achieving distribution.• Continuous feed and periodic refresh capabilities are key
requirements.
• Common within Scientific Grid Approach• Works well when replicas are read-only.• Considerable issues when replicas are updateable.
Data Distribution - Example
SOURCE SITE
REPLICA SITES
Continuous Feeds
Data Distribution - Example
SOURCE SITE
REPLICA SITES
Continuous Feeds
Periodic Refresh
Data Distribution - Example
SOURCE DATABASE
REPLICA DATABASES
Oracle STREAMS
Transportable Tablespaces,RMAN,
Data Pump
CERN LCG Example
RAL
BNL
CNAFIN2P3
Gridka
ASGC
TRIUMPF
SARA
PIC
NorduGrid
Online DB @ Detector
Offline DB @ CERN IT
Methods of Data Distribution
• Distribute• Federate• Consolidate
Federate Information
• Federate data in multiple data stores into a single virtual database• Access both Database and non-Database data• Access both structured and unstructured data• Hide physical location of data from applications• Vendor and open-source solutions.
• Used mostly as point solution within Commercial World rather than as a key design point.
Data Federation - Example
DB2Excel
WebServices
Data Federation - Example
DB2
XML
ExcelFlatFilesText
WebServices
FlatFiles Sybase
Data Federation - Example
DB2
XML
ExcelFlatFilesText
WebServices
FlatFiles Sybase
Data Federation - Example
DB2
XML
ExcelFlatFilesText
WebServices
FlatFiles Sybase
Methods of Data Distribution
• Distribute• Federate• Consolidate
Consolidate Information
• Consolidate heterogeneous data into a central pool.• Single database or pool of databases• Any data (audio, video, XML, email, messages, etc…)• Any platform (Linux, Windows, Solaris, HPUX, AIX, Tru64,
OpenVMS, OS/390)• Proven scalability
• Hundreds of Very Large Database customers• Industry leading benchmarks
• Very common in commercial world.
Typical Data Centre View
DB Clusters
Disk
DB Users
Consolidated Clusters with RAC & ASM
DB Cluster
Pool
Disk Pool
DB Users
DB Server & Disk Virtualisation
DB Cluster
Pool
Disk Pool
DB Users
DB Server & Disk Virtualisation
DB Cluster
Pool
Disk Pool
DB Users
DB Server & Disk Virtualisation
DB Cluster
Pool
Disk Pool
DB Users
Distributed Data Management : Summary
• Oracle supports efficient utilization of all resources:• Consolidate Information—Scalable access to large amounts
of information• Federate Information—Data can be located where needed
but accessed as single database• Share Information—Multiple users and applications can share
data/information to operate more effectively
• Information Integration & fast connectivity critical to Grid Computing
Oracle Database Versions.
• Database 10G Express Edition• Entry-Level, small footprint, free to develop, deploy & distribute.• 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu & Mandriva
• Database 10G Enterprise Edition• Flagship, Full-featured version.
• Database 10G Standard Edition• Limited featured version, RAC, ASM, Streams Apply supported
• Database 10G Standard Edition 1• SE on limited hardware. Streams Apply supported.
• Data Grid / Distributed Caching• TimesTen • TangoSol Coherence
Oracle Database Versions.
• Database 10G Express Edition• Entry-Level, small footprint, free to develop, deploy & distribute.• 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu &
Mandriva
• Database 10G Enterprise Edition• Flagship, Full-featured version.
• Database 10G Standard Edition• Limited featured version, RAC, ASM, Streams Apply supported
• Database 10G Standard Edition 1• SE on limited hardware. Streams Apply supported.
• Data Grid / Distributed Caching• TimesTen • TangoSol Coherence
For More Information
Database: www.oracle.com/database/index.html
Express Edition: www.oracle.com/technology/products/database/xe/index.html
Tangosol: www.oracle.com/tangosol/index.html
Times 10: www.oracle.com/database/timesten.html
or
http://search.oracle.com
Database | Tangosol | etc
EMEA Oracle in R&D Programme
[email protected]@oracle.com