sun oracle exadata technical overview v1

Exadata and Database Machine Overview 金金 @Channel Sales Consultant [email protected] 13701864820 26-Jan-2010

Upload: jenkin

Post on 06-May-2015


Page 1: Sun Oracle Exadata Technical Overview V1

Exadata and Database Machine Overview

金江 @Channel Sales [email protected] 13701864820 26-Jan-2010

Page 2: Sun Oracle Exadata Technical Overview V1

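A Revolutionary Product: Exadata

• Exadata Storage Server and Database Machine
• V1 runs 10 to 100 times faster than a conventional data warehouse; V2 is faster again and set a new TPC-C world record
• Outstanding scalability, capacity, bandwidth, and processing power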

Page 3: Sun Oracle Exadata Technical Overview V1

Copyright © 2009, Oracle Corporation and/or its affiliates – 3 –

Agenda

• Overview

• Exadata Based Product Offerings

• Exadata Architecture and Features

• Best Data Warehousing Machine

• Best OLTP Machine

• Best Consolidation Machine

Page 4: Sun Oracle Exadata Technical Overview V1

Product Introduction: Exadata Storage Server & Database Machine

• Exadata Storage Server
  – Storage product optimized for Oracle Database
  – Extreme I/O and SQL processing performance
  – Combination of hardware and software
    • Exadata Storage Server Software
• Sun Oracle Database Machine
  – Pre-configured for high performance
  – Balanced performance configuration
  – Takes the uncertainty out of building an Oracle deployment
    • Exadata Storage Server Software
    • Oracle Database 11.2

Page 5: Sun Oracle Exadata Technical Overview V1

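The Challenge: Multiple Bottlenecks

• Today, database performance is often limited by storage
  – Storage systems limit the data bandwidth from storage to servers
  – Bottlenecks inside the storage array
  – SAN bottlenecks
  – Random I/O bottlenecks caused by physical disk speed
• Limited data bandwidth severely restricts data warehouse performance
• Random I/O bottlenecks restrict OLTP application performance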

Page 6: Sun Oracle Exadata Technical Overview V1

Exadata Storage Server: Solving the Data Bandwidth Bottleneck

• Add more pipes
• Make the pipes wider
• Ship less data through the pipes

Page 7: Sun Oracle Exadata Technical Overview V1

Exadata Smart Storage: Breaking the Data Bandwidth and Random I/O Bottlenecks

• Oracle addresses the data bandwidth bottleneck in three ways
  – Massively parallel storage grid of high-performance Exadata storage servers (cells)
    • Data bandwidth scales with data volume
  – Data-intensive processing runs in Exadata storage
    • Queries run in storage as data streams from disk, offloading database server CPUs
  – Columnar compression reduces data volume by up to 10x
    • Exadata Hybrid Columnar Compression provides 10x lower cost, 10x higher performance
• Exadata Smart Flash Cache cards address the random I/O bottleneck
  – Increase random I/Os by a factor of 20x

[Diagram label: Exadata Storage Cells]

Page 8: Sun Oracle Exadata Technical Overview V1


Exadata Based Product Offerings

Page 9: Sun Oracle Exadata Technical Overview V1

Sun Oracle Database Machine

• Grid is the architecture of the future
  – Highest performance, lowest cost, redundant, incrementally scalable
• The Exadata Database Machine is the first complete grid architecture that meets all data management needs

Exadata Storage Server Grid
• 14 high-performance, low-cost storage servers
• 100 TB raw SAS disk storage or 336 TB raw SATA disk storage
• 5 TB+ flash storage

RAC Database Server Grid
• 8 high-performance, low-cost compute servers
• 2 Intel quad-core Xeons each

InfiniBand Network
• 40 Gb/sec fault-tolerant unified server and storage network

Page 10: Sun Oracle Exadata Technical Overview V1

Sun Oracle Database Machine: Performance Pushed to the Extreme

Exadata Storage Server Grid
• 21 GB/sec disk bandwidth
• 50 GB/sec flash bandwidth
• 1 million I/Os per second

RAC Database Server Grid
• Millions of transactions per minute
• Tens of millions of queries per minute
• Billions of rows per minute

InfiniBand Network
• 880 Gb/sec aggregate throughput

Page 11: Sun Oracle Exadata Technical Overview V1

Scalable Performance and Capacity

• Scalability
  – Scales to an 8-rack database machine by just adding wires
    • More with external InfiniBand switches
  – Scales to hundreds of storage servers
    • Multi-petabyte databases
• Redundancy and fault tolerance
  – Failure of any component is tolerated
  – Data is mirrored across storage servers

Page 12: Sun Oracle Exadata Technical Overview V1

Greatly Simplified Deployment

• The Database Machine eliminates the complexity of deploying database systems
  – Months of configuration, troubleshooting, tuning
• The Database Machine is ready on day one
  – Pre-built, tested, standard, supportable configuration
  – Runs existing applications unchanged
• Extreme performance out of the box

From months to days

Page 13: Sun Oracle Exadata Technical Overview V1

Sun Exadata Storage Server Hardware

• Building block of the massively parallel Exadata storage grid
  – Up to 1.5 GB/sec raw data bandwidth per cell
  – Up to 75,000 IOPS with Flash
• Sun Fire™ X4275 server
  – 2 Quad-Core Intel® Xeon® E5540 processors
  – 24 GB RAM
  – Dual-port 4X QDR (40 Gb/s) InfiniBand card
  – Disk options
    • 12 x 600 GB SAS disks (7.2 TB total)
    • 12 x 2 TB SATA disks (24 TB total)
  – 4 x 96 GB Sun Flash PCIe cards (384 GB total)
• Software pre-installed
  – Oracle Exadata Storage Server Software
  – Oracle Enterprise Linux
  – Drivers, utilities
• Single point of support from Oracle
  – 3 year, 24 x 7, 4-hour on-site response

[Diagram labels: hardware provider, software provider]

Page 14: Sun Oracle Exadata Technical Overview V1

Sun Exadata Storage Server Hardware

• 24 GB DRAM
• 12 x 3.5” disk drives
• 2 Quad-Core Intel® Xeon® processors
• Disk controller HBA with 512 MB battery-backed cache
• InfiniBand QDR (40 Gb/s) dual-port card
• ILOM
• Dual-redundant, hot-swappable power supplies
• 4 x 96 GB Sun Flash PCIe cards

Software pre-installed:
• Oracle Exadata Storage Server Software
• Oracle Enterprise Linux
• Drivers

Page 15: Sun Oracle Exadata Technical Overview V1

Sun Oracle Database Machine Full Rack
Pre-Configured for Extreme Performance

• 8 Sun Fire™ X4170 Oracle Database servers
• 14 Exadata Storage Servers (all SAS or all SATA)
• 3 Sun Datacenter InfiniBand Switch 36
  – 36-port managed QDR (40 Gb/s) switch
• 1 “Admin” Cisco Ethernet switch
• Keyboard, Video, Mouse (KVM) hardware
• Redundant Power Distribution Units (PDUs)
• Single point of support from Oracle
  – 3 year, 24 x 7, 4-hour on-site response

Add more racks for additional scalability

Page 16: Sun Oracle Exadata Technical Overview V1

Sun Oracle Database Machine Half Rack
Pre-Configured for Extreme Performance

• 4 Sun Fire™ X4170 Oracle Database servers
• 7 Exadata Storage Servers (all SAS or all SATA)
• 2 Sun Datacenter InfiniBand Switch 36
  – 36-port managed QDR (40 Gb/s) switch
• 1 “Admin” Cisco Ethernet switch
• Keyboard, Video, Mouse (KVM) hardware
• Redundant PDUs
• Single point of support from Oracle
  – 3 year, 24 x 7, 4-hour on-site response

Can upgrade to a Full Rack

Page 17: Sun Oracle Exadata Technical Overview V1

Sun Oracle Database Machine Quarter Rack
Pre-Configured for Extreme Performance

• 2 Sun Fire™ X4170 Oracle Database servers
• 3 Exadata Storage Servers (all SAS or all SATA)
• 2 Sun Datacenter InfiniBand Switch 36
  – 36-port managed QDR (40 Gb/s) InfiniBand switch
• 1 “Admin” Cisco Ethernet switch
• Keyboard, Video, Mouse (KVM) hardware
• Redundant PDUs
• Single point of support from Oracle
  – 3 year, 24 x 7, 4-hour on-site response

Can upgrade to a Half Rack

Page 18: Sun Oracle Exadata Technical Overview V1

Sun Oracle Database Machine Basic System
Entry Level non-HA Configuration

• 1 Sun Fire™ X4170 Oracle Database server
• 1 Exadata Storage Server (SAS or SATA)
• 1 Sun Datacenter InfiniBand Switch 36
  – 36-port managed QDR (40 Gb/s) InfiniBand switch
• InfiniBand cables
• Installed in customer-supplied rack
• Customer-supplied Ethernet and KVM infrastructure
• Single point of support from Oracle
  – 3 year, 24 x 7, 4-hour on-site response

Page 19: Sun Oracle Exadata Technical Overview V1

Standalone Exadata Storage Servers

• Purchase Exadata Storage Servers from Oracle
  – Customer-supplied standard 19-inch rack
• Customer-supplied x86 64-bit Linux database servers
• Hardware installation more complex
• No single point of support for entire deployment

Page 20: Sun Oracle Exadata Technical Overview V1

Exadata Product Capacity

                            Single Server   Quarter Rack   Half Rack   Full Rack
Raw Disk (1)         SAS    7.2 TB          21 TB          50 TB       100 TB
                     SATA   24 TB           72 TB          168 TB      336 TB
Raw Flash (1)               384 GB          1.1 TB         2.6 TB      5.3 TB
User Data (2)        SAS    2 TB            6 TB           14 TB       28 TB
(assuming no         SATA   7 TB            21 TB          50 TB       100 TB
compression)

(1) Raw capacity calculated using 1 GB = 1000 x 1000 x 1000 bytes and 1 TB = 1000 x 1000 x 1000 x 1000 bytes.

(2) User Data: actual space for end-user data, computed after single mirroring (ASM normal redundancy) and after allowing space for database structures such as temp, logs, undo, and indexes. Actual user data capacity varies by application. User Data capacity calculated using 1 TB = 1024 x 1024 x 1024 x 1024 bytes.
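
The raw figures in the capacity table appear to follow from the per-server hardware listed on the earlier slides (12 disks and 4 x 96 GB flash cards per storage server; 1, 3, 7, and 14 storage servers per configuration). A minimal sketch of that arithmetic in decimal units, as in footnote 1; the function and constant names are illustrative and not part of any Oracle tool:

```python
# Storage servers per configuration, as listed on the rack slides.
CELLS = {"Single Server": 1, "Quarter Rack": 3, "Half Rack": 7, "Full Rack": 14}

DISKS_PER_CELL = 12          # 12 x 3.5" drives per storage server
FLASH_GB_PER_CELL = 4 * 96   # 4 x 96 GB flash cards = 384 GB


def raw_disk_gb(cells: int, disk_gb: int) -> int:
    """Raw disk capacity in decimal GB for a number of storage servers."""
    return cells * DISKS_PER_CELL * disk_gb


def raw_flash_gb(cells: int) -> int:
    """Raw flash capacity in decimal GB for a number of storage servers."""
    return cells * FLASH_GB_PER_CELL


if __name__ == "__main__":
    for name, cells in CELLS.items():
        sas = raw_disk_gb(cells, 600)     # 600 GB SAS disks
        sata = raw_disk_gb(cells, 2000)   # 2 TB SATA disks
        print(f"{name}: SAS {sas / 1000:.1f} TB, SATA {sata / 1000:.1f} TB, "
              f"flash {raw_flash_gb(cells)} GB")
```

A full rack works out to 100,800 GB of SAS disk, 336,000 GB of SATA disk, and 5,376 GB of flash; the slide states these as 100 TB, 336 TB, and 5.3 TB, so its figures look rounded down.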

Page 21: Sun Oracle Exadata Technical Overview V1

Exadata Product Performance

                                       Single Server   Quarter Rack   Half Rack    Full Rack
Raw Disk Data Bandwidth (1,4)   SAS    1.5 GB/s        4.5 GB/s       10.5 GB/s    21 GB/s
                                SATA   0.85 GB/s       2.5 GB/s       6 GB/s       12 GB/s
Raw Flash Data Bandwidth (1,4)         3.6 GB/s        11 GB/s        25 GB/s      50 GB/s
Max User Data Bandwidth (2,4)          36 GB/s         110 GB/s       250 GB/s     500 GB/s
  (10x compression & Flash)
Disk IOPS (3,4)                 SAS    3,600           10,800         25,000       50,000
                                SATA   1,440           4,300          10,000       20,000
Flash IOPS (3,4)                       75,000          225,000        500,000      1,000,000
Data Load Rate (4)                     0.65 TB/hr      1 TB/hr        2.5 TB/hr    5 TB/hr

(1) Bandwidth is peak physical disk scan bandwidth, assuming no compression.

(2) Max User Data Bandwidth assumes scanned data is compressed by a factor of 10 and is on Flash.

(3) IOPS are based on I/O requests of size 8K.

(4) Actual performance will vary by application.
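
Footnote 2 suggests that the Max User Data Bandwidth row is simply the raw flash bandwidth multiplied by the assumed 10x compression ratio. A short sketch of that relationship, working in MB/s to keep the arithmetic in integers; the function name is illustrative:

```python
def user_data_bandwidth_mb_s(raw_flash_mb_s: int, compression_ratio: int = 10) -> int:
    """Effective user-data scan rate when compressed data is scanned from flash.

    Each byte read from flash stands for `compression_ratio` bytes of user
    data, so the effective rate scales linearly with the ratio.
    """
    return raw_flash_mb_s * compression_ratio


if __name__ == "__main__":
    # Raw flash bandwidth per configuration from the table, in MB/s.
    raw = {"Single Server": 3_600, "Quarter Rack": 11_000,
           "Half Rack": 25_000, "Full Rack": 50_000}
    for name, mb_s in raw.items():
        print(f"{name}: {user_data_bandwidth_mb_s(mb_s) / 1000:.0f} GB/s")
```

The same caveat as footnote 4 applies: these are peak figures under the stated assumptions, not measurements of a particular workload.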

Page 22: Sun Oracle Exadata Technical Overview V1


Exadata Architecture and Features

Page 23: Sun Oracle Exadata Technical Overview V1

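Exadata Configuration

• Each Exadata cell is a self-contained server that houses disk storage and runs the Exadata software
• Oracle databases are deployed across multiple Exadata cells
• Oracle Database and the Exadata storage servers are enhanced to work cooperatively
• There is no practical limit on the number of Exadata cells in the grid architecture

[Diagram: a single-instance database and a RAC database connected through the InfiniBand switch/network to three Exadata cells]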

Page 24: Sun Oracle Exadata Technical Overview V1

Exadata Architecture

[Diagram: three DB servers, each running a DB instance, DBRM, and ASM, hosting a single-instance database and a RAC database. They connect via the iDB protocol over InfiniBand with path failover, through the InfiniBand switch/network, to three Exadata cells. Each cell runs OEL, CELLSRV, MS, RS, and IORM. The cells are managed through the Cell Control CLI and Enterprise Manager.]

Page 25: Sun Oracle Exadata Technical Overview V1

Exadata Software Features

• Exadata Smart Scans
  – 10X or greater reduction in data sent to database servers
• Exadata Storage Indexes
  – Eliminate unnecessary I/Os to disk
• Hybrid Columnar Compression (HCC)
  – Efficient compression increases effective storage capacity and increases user data scan bandwidths by a factor of 10X
• Exadata Smart Flash Cache
  – Breaks random I/O bottleneck by increasing IOPS by 20X
  – Doubles user data scan bandwidths
• I/O Resource Manager (IORM)
  – Enables storage grid by prioritizing I/Os to ensure predictable performance
• Interleaved Grid Disks
  – Enables storage grid that allows multiple applications to place frequently accessed data on faster portions of the disk

Page 26: Sun Oracle Exadata Technical Overview V1

Exadata Smart Scan

• Exadata storage cells greatly reduce the data sent to database servers by offloading scans
  – Row filtering based on “where” predicate
  – Column filtering
  – Join filtering
  – Incremental backup filtering
  – Scans on encrypted data
  – Data mining model scoring
• 10x data reduction is typical
• Completely application transparent
  – Even if cell or disk fails during a query

[Badges on slide: 11.2, 11.2]

Page 27: Sun Oracle Exadata Technical Overview V1

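Traditional Scan Processing

• Smart Scan example:
  – Find mobile phone customers who spent more than $200
  – The matching customer information takes up only 2 MB of a 1 TB table
• With traditional storage, all database intelligence runs on the database nodes
• Most of the data returned from storage is discarded by the database hosts
• The discarded data consumes valuable database host resources and slows down execution

Query flow shown on the slide:
1. SELECT customer_name FROM calls WHERE amount > 200;
2. Table extents identified
3. I/Os issued
4. I/Os executed: 1 terabyte of data returned to hosts
5. DB host reduces terabyte of data to 1000 customer names that are returned to client
6. Rows returned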

Page 28: Sun Oracle Exadata Technical Overview V1

Exadata Smart Scan Processing

• Only the relevant column (customer_name) and the required rows (where amount > 200) are returned to hosts
• CPU consumed by predicate evaluation is offloaded to Exadata
• Moving scan processing off the database nodes frees CPU there and eliminates a large volume of unproductive messaging
  – Returns the needle, not the entire haystack

Query flow shown on the slide:
1. SELECT customer_name FROM calls WHERE amount > 200;
2. Smart Scan constructed and sent to cells
3. Smart Scan identifies rows and columns within terabyte table that match request
4. 2 MB of data returned to server
5. Consolidated result set built from all cells
6. Rows returned
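
The difference between the two flows can be sketched in a few lines. This is a toy model only, assuming each cell is a list of row dictionaries; the function names and sample data are hypothetical and do not reflect the iDB protocol or any Exadata interface:

```python
from typing import Callable, Dict, Iterable, List, Tuple

Row = Dict[str, object]


def traditional_scan(cells: Iterable[List[Row]]) -> List[Row]:
    """Traditional storage: every row of every cell is shipped to the host."""
    return [row for cell in cells for row in cell]


def smart_scan(cells: Iterable[List[Row]],
               predicate: Callable[[Row], bool],
               columns: Tuple[str, ...]) -> List[Row]:
    """Offloaded scan: each cell filters rows and projects columns locally,
    so only matching rows and requested columns travel to the host."""
    shipped: List[Row] = []
    for cell in cells:
        shipped.extend({c: row[c] for c in columns}
                       for row in cell if predicate(row))
    return shipped


if __name__ == "__main__":
    # Two toy "cells", each holding part of a hypothetical calls table.
    cells = [
        [{"customer_name": "Ann", "amount": 250, "notes": "x" * 100},
         {"customer_name": "Bob", "amount": 20, "notes": "y" * 100}],
        [{"customer_name": "Cy", "amount": 310, "notes": "z" * 100}],
    ]
    # Host-side filtering after a full transfer.
    host_side = [r["customer_name"] for r in traditional_scan(cells)
                 if r["amount"] > 200]
    # Cell-side filtering and projection.
    offloaded = [r["customer_name"] for r in
                 smart_scan(cells, lambda r: r["amount"] > 200, ("customer_name",))]
    print(host_side == offloaded, len(traditional_scan(cells)), len(offloaded))
```

Both paths produce the same answer; what changes is how many rows and columns cross the network before the answer is known.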

Page 29: Sun Oracle Exadata Technical Overview V1

Smart Scan Transparency

• Completely transparent to applications
  – No application or SQL changes required
  – Returned data is fully consistent and transactional
  – If a cell dies during a smart scan, the uncompleted portions of the smart scan are transparently routed to other cells containing a replica of the data
• Correctly handles complex cases, including
  – Uncommitted data and locked rows
  – Chained rows
  – Compressed tables
  – National Language Processing
  – Date arithmetic
  – Regular expression searches
  – Partitioned tables

High throughput, low overhead, no complex tuning required

Page 30: Sun Oracle Exadata Technical Overview V1

Exadata Smart Scans: Offloading Data Mining Scoring

• Data mining scoring executed in Exadata:

select cust_id
from customers
where region = 'US'
and prediction_probability(churnmod, 'Y' using *) > 0.8;

(The scoring function, prediction_probability, is executed in Exadata.)

• All data mining scoring functions offloaded to Exadata
• Up to 10x performance gains
• Reduced CPU utilization on database server

[Badge: 11.2]

Page 31: Sun Oracle Exadata Technical Overview V1

Exadata Storage Index: Transparent I/O Elimination with No Overhead

• Exadata Storage Indexes maintain summary information about table data in memory
  – Store MIN and MAX values of columns
  – Typically one index entry for every MB of disk
• Eliminates disk I/Os if MIN and MAX can never match “where” clause of a query
• Completely automatic and transparent

Example (table with columns A, B, C, D; values of column B shown):

Rows     B values    Index entry
Set 1    1, 3, 5     Min B = 1, Max B = 5
Set 2    5, 8, 3     Min B = 3, Max B = 8

Select * from Table where B<2: only the first set of rows can match

[Badge: 11.2]
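
The pruning rule in the slide's example can be sketched as follows. This is a minimal model assuming one min/max summary per region of rows and a single "less than" predicate; the class and function names are hypothetical, and real storage indexes are maintained inside the cell software rather than by user code:

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Region:
    """One storage region with the min/max summary kept for column B."""
    rows_b: List[int]

    @property
    def min_b(self) -> int:
        return min(self.rows_b)

    @property
    def max_b(self) -> int:
        return max(self.rows_b)


def regions_to_read_for_less_than(regions: List[Region], bound: int) -> List[int]:
    """Return indexes of regions that could contain a row with B < bound.

    A region is skipped when even its smallest B value fails the predicate.
    """
    return [i for i, r in enumerate(regions) if r.min_b < bound]


if __name__ == "__main__":
    regions = [Region([1, 3, 5]), Region([5, 8, 3])]  # values from the slide
    print(regions_to_read_for_less_than(regions, 2))  # only region 0 qualifies
```

For `B < 2` the second region is never read, because its minimum (3) already rules out a match.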

Page 32: Sun Oracle Exadata Technical Overview V1

Exadata Hybrid Columnar Compression (up to 50X)

• Data is organized by column and then compressed
• Query mode, for data warehousing
  – Optimized for speed
  – 10X compression ratio is typical
  – Scans improve proportionally
• Archive mode, for infrequently queried data
  – Optimized to reduce space
  – 15X compression is typical
  – Up to 50X for some data

[Badge: 11.2]

Page 33: Sun Oracle Exadata Technical Overview V1

Exadata Hybrid Columnar Compression: How It Works

• Tables are organized into sets of a few thousand rows called Compression Units (CUs)
• Within a Compression Unit, data is organized by column and then compressed
  – Column organization brings similar values close together, enhancing compression
• Useful for data that is bulk loaded and queried
  – Update activity is light

Reduces table size 4x to 40x
4x to 50x reduction

[Diagram: a Compression Unit holding Column 1, Column 2, and Column 3]
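
The idea of grouping values by column inside a unit before compressing can be illustrated with a general-purpose compressor. This sketch uses `zlib` purely as a stand-in; it is not the HCC algorithm or on-disk format, the function names are hypothetical, and it assumes non-empty input whose values contain neither "|" nor newlines:

```python
import zlib
from typing import List, Tuple

RowT = Tuple[str, str, str]


def compress_row_major(rows: List[RowT]) -> bytes:
    """Serialize row by row, then compress."""
    payload = "\n".join("|".join(r) for r in rows)
    return zlib.compress(payload.encode("utf-8"))


def compress_column_major(rows: List[RowT]) -> bytes:
    """Serialize one column at a time within the unit, then compress."""
    columns = zip(*rows)
    payload = "\n".join("|".join(col) for col in columns)
    return zlib.compress(payload.encode("utf-8"))


def decompress_column_major(blob: bytes) -> List[RowT]:
    """Rebuild the original rows from a column-major unit."""
    text = zlib.decompress(blob).decode("utf-8")
    columns = [line.split("|") for line in text.split("\n")]
    return [tuple(vals) for vals in zip(*columns)]


if __name__ == "__main__":
    # A toy "compression unit" of a few thousand rows with repetitive columns.
    unit = [(str(i), "US" if i % 3 else "EU", "open" if i % 7 else "closed")
            for i in range(3000)]
    print("row-major bytes:   ", len(compress_row_major(unit)))
    print("column-major bytes:", len(compress_column_major(unit)))
    assert decompress_column_major(compress_column_major(unit)) == unit
```

How much the column-major layout helps depends entirely on the data; the ratios quoted on the slides are Oracle's figures and are not reproduced by this toy.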

Page 34: Sun Oracle Exadata Technical Overview V1

Exadata Hybrid Columnar Compression: Warehouse and Archive Modes

Warehouse Compression (optimized for speed)
• 10x average storage savings
• 10x scan I/O reduction
Result: smaller warehouse, faster performance

Archive Compression (optimized for space)
• 15x average storage savings
  – Up to 50x on some data
• Some access overhead
• For cold or historical data
Result: reclaim 93% of disks, keep data online

Can mix compression types by partition for ILM

Page 35: Sun Oracle Exadata Technical Overview V1

Real-World Compression Ratios: Oracle Production E-Business Suite Tables

• Columnar compression ratios
  – Query = 14.6X
  – Archive = 22.6X
  – Vary by application and table

[Chart: size reduction factor by table, axis 0 to 50. Series: OLTP Compression (avg = 3.3), Query Compression (avg = 14.6), Archive Compression (avg = 22.6). Bar labels shown: 10, 10, 10, 11, 16, 19, 19, 19, 20, 21, 29, 43, 52.]

Page 36: Sun Oracle Exadata Technical Overview V1

The Disk Random I/O Bottleneck

• Disk drives hold vast amounts of data
  – But are limited to about 300 I/Os per second
• Flash technology holds much less data
  – But can run tens of thousands of I/Os per second
• The ideal solution
  – Keep most data on disk for low cost
  – Transparently move hot data to flash
  – Use flash cards instead of flash disks to avoid disk controller limitations
  – Flash cards in Exadata storage
    • High bandwidth, low latency interconnect

[Diagram labels: disk, 300 I/Os per sec; flash, 10000X I/Os per sec]

Page 37: Sun Oracle Exadata Technical Overview V1

Exadata Smart Flash Cache

• Transparently caches hot data in 4 flash cards (4 x 96 GB)
• Uses PCI Express based flash cards for greater throughput and IOPS, avoiding disk controller limitations
• Smart caching
  – Smarter than basic LRU algorithm
  – Knows when to skip caching objects to avoid polluting or flushing the cache
• Allows applications to explicitly optimize caching

[Badge: 11.2]
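
The slide says only that the cache is "smarter than basic LRU" and can skip caching some objects; it does not describe the algorithm. One way to picture the behaviour is an LRU cache with an admission check, sketched here under that assumption. The class name and the `from_large_scan` flag are hypothetical:

```python
from collections import OrderedDict
from typing import Optional


class ScanAwareLRU:
    """LRU cache that declines to admit blocks read by large scans."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self._blocks: "OrderedDict[int, bytes]" = OrderedDict()

    def get(self, block_id: int) -> Optional[bytes]:
        data = self._blocks.get(block_id)
        if data is not None:
            self._blocks.move_to_end(block_id)  # mark as most recently used
        return data

    def put(self, block_id: int, data: bytes, from_large_scan: bool = False) -> None:
        if from_large_scan:
            return  # skip caching so a one-off scan cannot flush hot blocks
        self._blocks[block_id] = data
        self._blocks.move_to_end(block_id)
        if len(self._blocks) > self.capacity:
            self._blocks.popitem(last=False)  # evict least recently used
```

With a plain LRU, a single table scan larger than the cache would evict every hot block; the admission check keeps those blocks resident at the cost of never caching the scanned data.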

Page 38: Sun Oracle Exadata Technical Overview V1

Flash Cache: Transparent Extension of the SGA (Extended Buffer Cache)

Storage tiers shown on the slide:
• Hot data: 16 GB SGA memory
• Warm data: 120 GB Flash Cache
• Cold data: 360 GB magnetic disks

1. Blocks read into buffer cache
2. Dirty blocks flushed to disk
3. Clean blocks moved to Flash Cache based on LRU*
4. User process reads blocks from SGA (copied from Flash Cache if not in SGA)

* Headers for Flash Cached blocks kept in SGA

Page 39: Sun Oracle Exadata Technical Overview V1

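Exadata I/O Resource Management: Mixed Workload Environments

• With traditional storage, creating and managing shared storage is hampered by the inability to balance work between users of the same database or between multiple databases sharing the storage subsystem
  – Hardware isolation is the approach to ensure separation
• Exadata I/O Resource Management ensures that different users and tasks within a database are allocated the correct relative amount of I/O resources
• For example:
  – Interactive: 50% of I/O resources
  – Reporting: 30% of I/O resources
  – ETL: 20% of I/O resources

[Diagram: a database server connected through the InfiniBand switch/network to three Exadata cells]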

Page 40: Sun Oracle Exadata Technical Overview V1

Exadata I/O Resource Management: Multi-Database Environments

• Ensures that different databases are allocated the correct relative amount of I/O bandwidth
  – Database A: 33% I/O resources
  – Database B: 67% I/O resources
• Ensures that different users and tasks within a database are allocated the correct relative amount of I/O bandwidth
  – Database A:
    • Reporting: 60% of I/O resources
    • ETL: 40% of I/O resources
  – Database B:
    • Interactive: 30% of I/O resources
    • Batch: 70% of I/O resources

[Diagram: Database A and Database B connected through the InfiniBand switch/network to three Exadata cells]
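
One way to read the two-level example is that, when every consumer is busy, each group's share of total I/O is its database's share multiplied by its share within that database. A small sketch of that reading follows; the multiplication rule is a simplifying assumption for illustration, not a statement of how IORM schedules I/O, and the names are hypothetical:

```python
from fractions import Fraction
from typing import Dict

# Allocations from the slide: share per database, then share per group within it.
DATABASE_SHARE = {"A": Fraction(33, 100), "B": Fraction(67, 100)}
GROUP_SHARE = {
    "A": {"Reporting": Fraction(60, 100), "ETL": Fraction(40, 100)},
    "B": {"Interactive": Fraction(30, 100), "Batch": Fraction(70, 100)},
}


def effective_shares() -> Dict[str, Fraction]:
    """Multiply the two levels to get each group's share of total I/O."""
    return {
        f"{db}/{group}": DATABASE_SHARE[db] * share
        for db, groups in GROUP_SHARE.items()
        for group, share in groups.items()
    }


if __name__ == "__main__":
    for name, share in effective_shares().items():
        print(f"{name}: {float(share):.1%}")
```

Under this reading, Reporting in Database A would receive 33% x 60% = 19.8% of total I/O under full contention, and the four groups together account for 100%.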

Page 41: Sun Oracle Exadata Technical Overview V1

Exadata Scalable Storage Grid

• Dynamic virtualized storage using Automatic Storage Management
  – Simple and non-intrusive resource allocation, and reallocation, enabling true enterprise grid storage
  – Database work spread across storage resources for optimal performance
• Powerful storage allocation options and management
  – Flexible configuration for performance and availability

[Diagram: a single-instance database and a RAC database connected through the InfiniBand switch/network to three Exadata cells]

Page 42: Sun Oracle Exadata Technical Overview V1

Exadata Storage Hierarchy

• Physical disks map to Cell Disks
• Cell Disks partitioned into one or multiple Grid Disks
• Grid Disks created in order of “hottest” first to “coldest” portion of the disk last
• ASM diskgroups created from Grid Disks
• Transparent above the ASM layer

[Diagram: Physical Disk → Cell Disk (with Sys Area) → Grid Disk 1 … Grid Disk n → ASM disks]

Page 43: Sun Oracle Exadata Technical Overview V1

Exadata Storage Hierarchy Example: ASM Mirroring and Fault-Tolerant Design

• Example shows cell disks divided into two grid disks, hot and cold
• Two ASM disk groups created across the two sets of grid disks
• ASM striping evenly distributes I/O across the disk groups
• ASM mirroring is used to protect against disk failures
• ASM failure groups are used to protect against cell failures

[Diagram: Exadata cells, each holding Hot and Cold grid disks; labels: ASM Disk Group, ASM Failure Group (one per cell)]
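
The role of failure groups can be sketched as a placement rule: the two copies of an extent must never sit in the same cell. The code below is an illustration of that rule with a simple round-robin placement, assuming each cell is one failure group; it is not ASM's actual allocation algorithm, and the function names are hypothetical:

```python
from typing import List, Tuple


def place_mirrored_extents(num_extents: int, cells: List[str]) -> List[Tuple[str, str]]:
    """Assign each extent a primary and a mirror copy on two different cells.

    Each cell is treated as one failure group, so losing a whole cell never
    removes both copies of an extent.
    """
    if len(cells) < 2:
        raise ValueError("mirroring needs at least two failure groups")
    placements = []
    for extent in range(num_extents):
        primary = cells[extent % len(cells)]
        mirror = cells[(extent + 1) % len(cells)]
        placements.append((primary, mirror))
    return placements


def survives_cell_loss(placements: List[Tuple[str, str]], failed: str) -> bool:
    """True if every extent still has at least one copy after `failed` is lost."""
    return all(primary != failed or mirror != failed
               for primary, mirror in placements)


if __name__ == "__main__":
    cells = ["cell1", "cell2", "cell3"]
    layout = place_mirrored_extents(9, cells)
    print(all(survives_cell_loss(layout, c) for c in cells))
```

Round-robin placement also spreads primaries evenly across cells, which loosely mirrors the striping point on the slide.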

Page 44: Sun Oracle Exadata Technical Overview V1

Interleaved Grid Disks

• Grid disks are optionally split and interleaved to place frequently accessed data in all grid disks on higher performing outer tracks
• All applications benefit from higher performance outer tracks of disks

[Diagram: Grid Disk 1 and Grid Disk 2, each holding both hot data and cold data]

[Badge: 11.2]

Page 45: Sun Oracle Exadata Technical Overview V1

Exadata Storage Management

• Enterprise Manager
  – Manage and administer Database and ASM
• Exadata Storage Plug-in ( > 10.2.0.3)
  – Enterprise Manager Grid Control plug-in to monitor and manage Exadata Storage Cells
• Comprehensive CLI
  – Local Exadata Storage cell management
  – Distributed shell utility to execute CLI across multiple cells
• Sun Embedded Integrated Lights Out Manager (ILOM)
  – Remote management and administration of hardware

Page 46: Sun Oracle Exadata Technical Overview V1

Effective Data Protection

• All single points of failure eliminated by the Exadata Storage architecture
• Hardware Assisted Resilient Data (HARD) built in to Exadata Storage
  – Prevent data corruption before it happens
• Data Guard provides disaster protection and data corruption protection
  – Automatically maintains one or more copies of the database
• Flashback provides human error protection
  – Snapshot-like capabilities to rewind database to before error
• Recovery Manager (RMAN) provides backup to disk
  – Archiving and corruption protection
  – Can be used with Oracle Secure Backup (OSB) or third-party tape backup software
• These work just as they do for traditional non-Exadata storage
  – Users and database administrators use familiar tools

Page 47: Sun Oracle Exadata Technical Overview V1

Exadata Coexistence and Migration

• Databases can be deployed concurrently on Exadata and traditional storage
  – Tablespaces can exist on Exadata storage, traditional storage, or a combination of the two, and this is transparent to database applications
  – SQL offload processing requires that all pieces of a tablespace reside on Exadata
• Online migration is possible if the database currently uses ASM with ASM redundancy
• Migration can also be done using RMAN or Data Guard

[Diagram: a database server with non-Exadata storage and Exadata storage, linked by online migration]

Page 48: Sun Oracle Exadata Technical Overview V1


Best Data Warehouse Machine

Page 49: Sun Oracle Exadata Technical Overview V1

Best Data Warehouse Machine

• Massively parallel, high-volume hardware to quickly process vast amounts of data
  – Exadata runs data-intensive processing directly in storage
• Most complete analytic capabilities
  – OLAP, Statistics, Spatial, Data Mining, Real-time transactional ETL, Efficient point queries
• Powerful warehouse-specific optimizations
  – Flexible Partitioning, Bitmap Indexing, Join Indexing, Materialized Views, Result Cache
• Dramatic new warehousing capabilities

[Diagram labels: Data Mining, OLAP, ETL, New]

Page 50: Sun Oracle Exadata Technical Overview V1

Exadata Storage Features

• Exadata Smart Scans
  – 10X or greater reduction in data sent to database servers
• Exadata Storage Indexes
  – Eliminate unnecessary I/Os to disk
• Hybrid Columnar Compression
  – Efficient compression increases user data scan rates
• Exadata Flash
  – Combined with Hybrid Columnar Compression, scan rate is 20X higher

Page 51: Sun Oracle Exadata Technical Overview V1

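In-Memory Parallel Execution

1. A SQL statement arrives
2. The size of the table being queried is determined
   • Table is very small: read into the buffer cache on any node
   • Table is a good candidate for in-memory parallel execution: fragments of the table are read into the buffer cache on each node, and only parallel servers on the same RAC node access each fragment
   • Table is very large: read directly from disk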
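
The decision flow on this slide can be written as a small function. The slide gives no numeric cut-offs, so the thresholds here are purely hypothetical: a table counts as "very small" below a fraction of one node's cache, and as a candidate for in-memory parallel execution if it fits in the combined cache of all nodes. None of these names correspond to Oracle parameters:

```python
def choose_read_path(table_bytes: int, node_cache_bytes: int, nodes: int,
                     small_fraction: float = 0.02) -> str:
    """Pick a read strategy from table size versus buffer cache size.

    `small_fraction` is a hypothetical cut-off, not an Oracle parameter.
    """
    if table_bytes <= small_fraction * node_cache_bytes:
        return "cache on any node"
    if table_bytes <= node_cache_bytes * nodes:
        return "cache fragments on each node"
    return "direct read from disk"


if __name__ == "__main__":
    GB = 1024 ** 3
    for size_gb in (1, 200, 1000):
        print(size_gb, "GB ->", choose_read_path(size_gb * GB, 50 * GB, 8))
```

The middle branch is the in-memory parallel case: the table is spread over the cluster's caches and each fragment is scanned by parallel servers local to the node that holds it.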

Page 52: Sun Oracle Exadata Technical Overview V1

In-Memory Parallel Execution

• A single Database Machine has 400 GB of memory usable for caching
• Database release 11.2 introduces parallel query processing on memory-cached data
  – Harnesses memory capacity of entire database cluster for queries
  – Foundation for world record 1 TB TPC-H
• Exadata Hybrid Columnar Compression enables multi-terabyte tables or partitions to be cached in memory

Faster than specialized in-memory warehouse databases
Memory has 100x more bandwidth than disk

QphH, 1 TB TPC-H:
  ParAccel               315,842
  Exasol               1,018,321
  Oracle & HP Exadata  1,166,976

Source: Transaction Processing Performance Council, as of 9/14/2009: Oracle on HP Bladesystem c-Class 128P RAC, 1,166,976 QphH@1000GB, $5.42/QphH@1000GB, available 12/1/09. Exasol on PRIMERGY RX300 S4, 1,018,321 QphH@1000GB, $1.18/QphH@1000GB, available 08/01/08. ParAccel on SunFire X4100, 315,842 QphH@1000GB, $4.57/QphH@1000GB, available 10/29/07.

[Badge: 11.2]

Page 53: Sun Oracle Exadata Technical Overview V1

Benefits Multiply on Large Scans

• 10 TB of user data requires 10 TB of I/O
• 1 TB with compression
• 100 GB with partition pruning
• 20 GB with storage indexes
• 5 GB with Smart Scan
• Sub-second on Database Machine

Data is 10x smaller, scans are 2000x faster
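
The 2000x figure is the ratio between the starting volume and the volume left after the last stage. A short sketch of that arithmetic, using the stage sizes from the slide in decimal GB (1 TB = 1000 GB); the names are illustrative:

```python
# Data volume remaining after each stage of the example on the slide, in GB
# (decimal units: 1 TB = 1000 GB).
STAGES = [
    ("user data", 10_000),
    ("after compression", 1_000),
    ("after partition pruning", 100),
    ("after storage indexes", 20),
    ("after Smart Scan", 5),
]


def overall_reduction(stages=STAGES) -> float:
    """Ratio of the starting volume to the volume left after the last stage."""
    return stages[0][1] / stages[-1][1]


def step_reductions(stages=STAGES):
    """Reduction factor contributed by each individual stage."""
    return [(name, prev / cur)
            for (_, prev), (name, cur) in zip(stages, stages[1:])]


if __name__ == "__main__":
    for name, factor in step_reductions():
        print(f"{name}: {factor:g}x")
    print(f"overall: {overall_reduction():g}x")
```

The individual stages contribute 10x, 10x, 5x, and 4x, which multiply to the 2000x reduction in data scanned.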

Page 54: Sun Oracle Exadata Technical Overview V1

DBFS: Scalable Shared File System

• Database Machine comes with DBFS shared Linux file system
  – Shared storage for ETL staging, scripts, reports and other application files
• Files stored as SecureFile LOBs in database tables stored in Exadata
  – Protected like any DB data: mirroring, Data Guard, Flashback, etc.
• 5 to 7 GB/sec file system I/O throughput

Higher throughput than high-end NAS file systems

[Diagram: ETL files in DBFS, loaded into the database using external tables]

[Badge: 11.2]

Page 55: Sun Oracle Exadata Technical Overview V1


Best OLTP Machine

Page 56: Sun Oracle Exadata Technical Overview V1

Best OLTP Machine

• Only Oracle runs real-world business applications “on the grid”
• Unique fault-tolerant, scale-out OLTP database
  – RAC, Data Guard, Online Operations
• Unique fault-tolerant, scale-out storage suitable for OLTP
  – ASM, Exadata
• Dramatic new OLTP capabilities

Page 57: Sun Oracle Exadata Technical Overview V1

Exadata Flash Solves the Random I/O Bottleneck
The first flash-optimized database

• Sun Oracle Database Machine has 5+ TB of flash storage
• Exadata Smart Flash Cache caches hot data
• The Database Machine achieves:
  – 20x more random I/Os
    • Over 1 million per second
  – 2x faster sequential query I/O
    • 50 GB/sec
  – 10x better I/O response time
    • Sub-millisecond
  – Greatly reduced cost
    • 10x fewer disks for IOPS
    • Lower power

[Badge: 11.2]

Page 58: Sun Oracle Exadata Technical Overview V1

Complete, Open, Integrated Availability: Maximum Availability Architecture

• Protection from
  – Server failures
  – Storage failures
  – Network failures
  – Site failures
• Real-time remote standby open for queries
• Human error correction
  – Database, table, row, transaction level
• Online indexing and table redefinition
• Online patching and upgrades

[Diagram labels: Real Application Clusters, ASM, Fast Recovery Area, Active Data Guard, Secure Backup, WAN]

Page 59: Sun Oracle Exadata Technical Overview V1

Complete, Open, Integrated Security

• Encryption and masking
  – Data Masking
  – Advanced Security
  – Secure Backup
• Access control
  – Database Vault
  – Label Security
• Monitoring
  – Configuration Management
  – Audit Vault
  – Total Recall

Page 60: Sun Oracle Exadata Technical Overview V1


Best Consolidation Machine

Page 61: Sun Oracle Exadata Technical Overview V1

Consolidation Architecture

• Consolidate onto the Database Machine
  – High performance for all applications
  – Low cost platform for all applications
  – Predictable response times in a shared environment
  – Handles all data management needs
  – Complete, Open, Integrated

[Diagram labels: ERP, CRM, Warehouse, Data Mart, HR, shown before and after consolidation]

Page 62: Sun Oracle Exadata Technical Overview V1

Consolidate Storage

• Exadata and ASM allow all storage servers to be shared across multiple databases
• Shared configuration
  – Advanced data striping spreads every database across all storage servers
  – Eliminates hot-spots and captive unused space
  – Full storage grid performance available to all databases
  – Database or cluster level storage security
• Predictable performance
  – Exadata I/O resource manager prioritizes I/Os to ensure predictable performance
    • At user, job, application, or database level
  – No need for isolated storage islands

[Diagram labels: ERP, CRM, Warehouse, Data Mart, HR]

Page 63: Sun Oracle Exadata Technical Overview V1

Consolidate Servers

• Multiple databases can run on the Database Machine servers
• Shared configuration
  – Applications connect to a database service that runs on one or more database nodes
    • Services can grow, shrink, and move dynamically
  – Large databases can span nodes using RAC
  – Multiple small databases can run on a single node
• Predictable performance
  – Instance caging provides predictable CPU resources when multiple databases run on the same node
    • Restricts a database to a subset of processors

[Diagram labels: ERP, CRM, Warehouse, Data Mart, HR]

Page 64: Sun Oracle Exadata Technical Overview V1

Best Consolidation Machine

• Consolidates different workloads in a single system
  – Warehouse oriented bulk data processing
  – OLTP oriented random updates
  – Multimedia oriented streaming files
• The Sun Oracle Database Machine handles any mix of workloads with extreme performance
  – And predictable response times
• Dramatic new mixed-workload capabilities

[Diagram labels: ERP, CRM, Warehouse, Data Mart, HR]

Page 65: Sun Oracle Exadata Technical Overview V1

The Exadata Value Proposition

• Extreme performance
• Linear scalability
• Enterprise ready
• Open standards

Page 66: Sun Oracle Exadata Technical Overview V1

The Architecture of the Future: Massively Parallel Grid

• Best for Data Warehousing
• Best for OLTP
• Best for Consolidation

Page 67: Sun Oracle Exadata Technical Overview V1

Q & A