EMC® Greenplum® Data Computing Appliance
Site Preparation Guide

P/N: 300-012-149
Rev: A01

The Data Computing Division of EMC

Copyright © 2011 EMC Corporation. All rights reserved.

EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice.

THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.

For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com

All other trademarks used herein are the property of their respective owners.

Table of Contents

Preface

Chapter 1: About EMC Greenplum DCA
  Available DCA configurations
    GP10 (Quarter-Rack Configuration)
    GP100 (Half-Rack Configuration)
    GP1000 (One-Rack Configuration)
    GP1000 +1 Scale-out Module (Two-Rack Configuration)
  Component Specifications

Chapter 2: Preparing the Data Center Environment
  Confirming Site Requirements
    Floor Space Requirements
    Power and Cooling Requirements
    Power Cord Specifications
    Environmental Requirements
    Air Quality Requirements
  Optional Securing Brackets
    Anti-Tip Bracket
    Anti-Move Bracket
    Seismic Restraint Bracket
  Cabinet Positioning
  Package Dimensions and Clearance

Chapter 3: Gathering Site-Specific Information

Chapter 4: Next Steps

Preface

This guide helps EMC personnel, partners, and customers plan for the requirements that must be met before a new EMC Greenplum Data Computing Appliance (DCA) is installed in a data center. It provides an overview of the system, information on data center requirements, a checklist of items to gather for software configuration, and links to relevant documentation for the next steps of deployment. The requirements listed in this document must be met before a DCA installation is performed.

This guide contains the following chapters:

• Chapter 1, “About EMC Greenplum DCA”

• Chapter 2, “Preparing the Data Center Environment”

• Chapter 3, “Gathering Site-Specific Information”

• Chapter 4, “Next Steps”

1. About EMC Greenplum DCA

Greenplum Data Computing Appliance (DCA) is a self-contained data warehouse solution that integrates all of the database software, servers and switches necessary to perform big data analytics.

The EMC Greenplum DCA is a turnkey, easy-to-install data warehouse solution that provides extreme query and loading performance for analyzing large data sets. It integrates Greenplum Database software with compute, storage, and network components, and is delivered racked and ready for immediate data loading and query execution.

EMC Greenplum Data Computing Appliance runs the Greenplum Database relational database management system (RDBMS) software. Greenplum Database utilizes the DCA components to perform its database operations and processing.

See the following sections for a description of the DCA components and configurations.

• Available DCA configurations

• Component Specifications

Available DCA configurations

This section details the rack configurations currently available for the DCA. Note that in the Greenplum Database product and documentation, physical servers are referred to as hosts.

The GP10, GP100 and GP1000 have the same basic rack configuration, except for the number of segment hosts.

Table 1.1 DCA Components

DCA Component           Quantity
Master Hosts            2 (one primary and one standby)
Segment Hosts           GP10 (quarter-rack) = 4
                        GP100 (half-rack) = 8
                        GP1000 (one-rack) = 16
Interconnect Switches   2
Administration Switch   1

A GP1000 scale-out module (two-rack configuration) contains all of the components of the GP1000 minus the master hosts.

Table 1.2 DCA GP1000 Scale-Out Module Components

DCA Component           Quantity
Segment Hosts           16 per rack
Interconnect Switches   2 per rack
Administration Switch   1 per rack

GP10 (Quarter-Rack Configuration)

Figure 1.1 GP10 quarter-rack configuration

GP100 (Half-Rack Configuration)

Figure 1.2 GP100 half-rack configuration

GP1000 (One-Rack Configuration)

Figure 1.3 GP1000 one-rack configuration

GP1000 +1 Scale-out Module (Two-Rack Configuration)

Figure 1.4 GP1000 plus 1 scale-out module (two-rack configuration)

Component Specifications

This section explains the specifications of the various server and networking components of the DCA. Note that in the Greenplum Database product and documentation, physical servers are referred to as hosts.

Table 1.3 DCA Components

DCA Component           Quantity
Master Hosts            All Configurations = 2 (one primary and one standby)
Segment Hosts           GP10 = 4
                        GP100 = 8
                        GP1000 = 16
                        GP1000+1 = 32
Interconnect Switches   GP100/GP1000 = 2
                        GP1000+1 = 4
Administration Switch   GP100/GP1000 = 1
                        GP1000+1 = 2

Master Host Specifications

The following diagram shows an example of how a Greenplum Database master host is configured in the DCA. The DCA has two master hosts (the primary master and a standby master).

Figure 1.5 Greenplum Database Master Host Configuration on the DCA

Table 1.4 Master Host Server Specifications

Hardware                              Specifications                                 Quantity
Processor                             Intel X5680 3.33 GHz (6 core)                  2
Memory                                DDR3 1333 MHz                                  48 GB
Dual-port Converged Network Adapter   2 x 10 Gbps                                    1
Quad-port Network Adapter             4 x 1 Gbps                                     1
RAID controller                       Dual channel 6 Gb/s SAS                        1
Hard Disks                            600 GB 10 K RPM SAS                            6
                                      (one RAID5 volume of 4+1 with 1 hot spare)

Segment Host Specifications

The following diagram shows an example of how a Greenplum Database segment host is configured in the DCA. The GP10 (quarter-rack) has 4 segment hosts, the GP100 (half-rack) has 8, and the GP1000 (one-rack) has 16. Each segment host serves 6 Greenplum Database primary segment instances and 6 mirror segment instances.

Figure 1.6 Greenplum Database Segment Host Configuration on the DCA
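The per-host segment counts above imply a total number of segment instances for each configuration. The following Python sketch works through that arithmetic using the host counts from Table 1.3; the names `SEGMENT_HOSTS` and `segment_instances` are illustrative only and are not part of any Greenplum tool.

```python
# Segment hosts per configuration, as listed in Table 1.3.
SEGMENT_HOSTS = {"GP10": 4, "GP100": 8, "GP1000": 16, "GP1000+1": 32}

# Each segment host serves 6 primary and 6 mirror segment instances.
PRIMARIES_PER_HOST = 6
MIRRORS_PER_HOST = 6


def segment_instances(config: str) -> dict:
    """Return host, primary, mirror and total instance counts for a configuration."""
    hosts = SEGMENT_HOSTS[config]
    primaries = hosts * PRIMARIES_PER_HOST
    mirrors = hosts * MIRRORS_PER_HOST
    return {
        "hosts": hosts,
        "primaries": primaries,
        "mirrors": mirrors,
        "total": primaries + mirrors,
    }


if __name__ == "__main__":
    for name in SEGMENT_HOSTS:
        print(name, segment_instances(name))
```

Under these figures a GP1000 runs 96 primary and 96 mirror segment instances across its 16 segment hosts.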

Table 1.5 Segment Host Server Specifications

Hardware                              Specifications                       Quantity
Processor                             Intel X5670 2.93 GHz (6 core)        2
Memory                                DDR3 1333 MHz                        48 GB
Dual-port Converged Network Adapter   2 x 10 Gbps                          1
Dual-port Network Adapter             2 x 1 Gbps                           1
RAID controller                       Dual channel 6 Gb/s SAS              1
Hard Disks                            600 GB 15 K RPM SAS                  12
                                      (two RAID5 volumes of 5+1 disks)
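The RAID layouts in Tables 1.4 and 1.5 determine how much of the raw disk space is usable at the RAID level. The sketch below is a rough estimate only: it assumes the usual RAID5 rule that an N+1 volume yields N disks' worth of capacity, and it ignores filesystem overhead, database mirroring, and compression, none of which this guide quantifies. The function name `raid5_usable_gb` is hypothetical.

```python
DISK_GB = 600  # per-disk capacity from Tables 1.4 and 1.5


def raid5_usable_gb(data_disks: int, volumes: int = 1, disk_gb: int = DISK_GB) -> int:
    """RAID-level usable capacity of `volumes` RAID5 sets of data_disks+1 disks each.

    Assumes one disk's worth of space per volume is consumed by parity.
    """
    return volumes * data_disks * disk_gb


# Segment host: two RAID5 volumes of 5+1 disks (12 disks in total).
segment_host_gb = raid5_usable_gb(data_disks=5, volumes=2)

# Master host: one RAID5 volume of 4+1 disks; the sixth disk is a hot spare.
master_host_gb = raid5_usable_gb(data_disks=4, volumes=1)

if __name__ == "__main__":
    print("segment host:", segment_host_gb, "GB")
    print("master host:", master_host_gb, "GB")
    print("GP1000 segment hosts combined:", 16 * segment_host_gb, "GB")
```

These are capacity ceilings before any formatting or data redundancy is applied, not a statement of usable database capacity.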

Network Component Specifications

Hardware              Specifications                                   Quantity
Interconnect Switch   24-port Converged Enhanced Ethernet (CEE),       2
                      Fibre Channel over Ethernet (FCoE)
                      8 Fibre Channel Ports (future use)
Admin Switch          24-port 1 Gb Ethernet Layer 3                    1

2. Preparing the Data Center Environment

• Confirming Site Requirements

• Optional Securing Brackets

• Cabinet Positioning

• Package Dimensions and Clearance

Confirming Site Requirements

This section summarizes the site requirements for the DCA.

• Floor Space Requirements

• Power and Cooling Requirements

• Power Cord Specifications

• Environmental Requirements

• Air Quality Requirements

Floor Space Requirements

The following table describes the physical footprint of the DCA:

Table 2.1 DCA Physical Dimensions

Configuration         Height           Width            Depth (1)          Weight
GP10 (quarter-rack)   75 in (190 cm)   24 in (61 cm)    41.6 in (104 cm)   940 lbs
GP100 (half-rack)     75 in (190 cm)   24 in (61 cm)    41.6 in (104 cm)   1200 lbs
GP1000 (one-rack)     75 in (190 cm)   24 in (61 cm)    41.6 in (104 cm)   1700 lbs
GP1000+1 (two-rack)   75 in (190 cm)   48 in (122 cm)   41.6 in (104 cm)   3400 lbs

(1) With door attached.
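When checking a raised floor against the weights in Table 2.1, one rough starting point is the average load over the cabinet footprint. The sketch below computes that figure in pounds per square foot from the table's width, depth, and weight. It is an approximation under a stated assumption: it treats the weight as spread evenly over the footprint, whereas the real load concentrates at the casters or leveling feet, so it does not replace a structural assessment. The names `CABINETS` and `average_floor_load` are illustrative.

```python
# Width, depth (door attached) and weight from Table 2.1.
CABINETS = {
    "GP10": {"width_in": 24.0, "depth_in": 41.6, "weight_lb": 940.0},
    "GP100": {"width_in": 24.0, "depth_in": 41.6, "weight_lb": 1200.0},
    "GP1000": {"width_in": 24.0, "depth_in": 41.6, "weight_lb": 1700.0},
    "GP1000+1": {"width_in": 48.0, "depth_in": 41.6, "weight_lb": 3400.0},
}

SQ_IN_PER_SQ_FT = 144.0


def average_floor_load(width_in: float, depth_in: float, weight_lb: float) -> float:
    """Average load in lb per square foot, assuming evenly distributed weight."""
    footprint_sq_ft = width_in * depth_in / SQ_IN_PER_SQ_FT
    return weight_lb / footprint_sq_ft


if __name__ == "__main__":
    for name, dims in CABINETS.items():
        print(f"{name}: {average_floor_load(**dims):.1f} lb/sq ft")
```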

Power and Cooling Requirements

The following table describes the power and cooling requirements of the DCA:

Table 2.2 DCA Power and Cooling Requirements

Configuration         Total Power (VA)   Power Connections   Cooling (BTU/hr)
GP10 (quarter-rack)   2478               2                   8450
GP100 (half-rack)     3980               2                   13600
GP1000 (one-rack)     6980               4                   23800
GP1000+1 (two-rack)   13960              8                   47600

Power Cord Specifications

Table 2.3 Power Cord Specifications

Country                  Power Cord Model   Description
USA, Japan               DCA1-US-15         DCA - Single Phase, 30Amp, 15ft ext cords with L6-30P plug
                         DCA1-US-21         DCA - Single Phase, 30Amp, 21ft ext cords with L6-30P plug
Australia                DCA1-ASTL-15       DCA - Single Phase, 30Amp, 15ft ext cords with CLIPSAL 56PA332 plug
                         DCA1-ASTL-21       DCA - Single Phase, 30Amp, 21ft ext cords with CLIPSAL 56PA332 plug
Other Countries          DCA1-IEC3-15       DCA - Single Phase, 30Amp, 15ft ext cords with IEC309-332P6 plug
                         DCA1-IEC3-21       DCA - Single Phase, 30Amp, 21ft ext cords with IEC309-332P6 plug
Other Power Cord Types   DCA1-RUS-15        DCA - Single Phase, 30Amp, 15ft ext cords with RUSSELLSTOLL 3750DP plug
                         DCA1-RUS-21        DCA - Single Phase, 30Amp, 21ft ext cords with RUSSELLSTOLL 3750DP plug
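The cooling figures listed with the power requirements track the standard conversion of about 3.412 BTU/hr per watt, if the listed VA is assumed to be drawn at a power factor close to 1. That power factor is an assumption of this sketch, not a value stated in this guide. The code below compares the converted estimate with the listed cooling value for each configuration; `cooling_btu_per_hr` and `POWER_AND_COOLING` are illustrative names.

```python
BTU_PER_HR_PER_WATT = 3.412  # standard conversion factor

# (total power in VA, listed cooling in BTU/hr) from the power and cooling table.
POWER_AND_COOLING = {
    "GP10": (2478, 8450),
    "GP100": (3980, 13600),
    "GP1000": (6980, 23800),
    "GP1000+1": (13960, 47600),
}


def cooling_btu_per_hr(total_va: float, power_factor: float = 1.0) -> float:
    """Estimate heat output in BTU/hr from apparent power.

    power_factor=1.0 is an assumption; the guide does not state one.
    """
    return total_va * power_factor * BTU_PER_HR_PER_WATT


if __name__ == "__main__":
    for name, (va, listed) in POWER_AND_COOLING.items():
        estimate = cooling_btu_per_hr(va)
        print(f"{name}: estimated {estimate:.0f} BTU/hr, listed {listed} BTU/hr")
```

With these inputs the estimates land within one percent of the listed values, which suggests the listed cooling loads are rounded conversions of the power figures.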

Environmental Requirements

Table 2.4 Environmental Requirements

+15°C to +32°C (59°F to 89.6°F) site temperature

40% to 55% relative humidity

0 to 2439 meters (0 to 8,000 feet) above sea level operating altitude
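A site survey can be checked against the three limits in Table 2.4 mechanically. The following sketch is one way to do that in Python; the reading names (`temperature_c`, `relative_humidity_pct`, `altitude_m`) and the helper `out_of_range` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    """Inclusive numeric range."""
    low: float
    high: float

    def contains(self, value: float) -> bool:
        return self.low <= value <= self.high


# Limits from Table 2.4.
SITE_LIMITS = {
    "temperature_c": Range(15, 32),
    "relative_humidity_pct": Range(40, 55),
    "altitude_m": Range(0, 2439),
}


def out_of_range(readings: dict) -> list:
    """Return the names of readings that fall outside the Table 2.4 limits."""
    return [
        name
        for name, limit in SITE_LIMITS.items()
        if not limit.contains(readings[name])
    ]


if __name__ == "__main__":
    survey = {"temperature_c": 22, "relative_humidity_pct": 45, "altitude_m": 300}
    print(out_of_range(survey))
```

An empty list means every reading is inside the stated range.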

Air Quality Requirements

EMC products are designed to be consistent with the requirements of the American Society of Heating, Refrigerating and Air-Conditioning Engineers (ASHRAE) Environmental Standard Handbook and the most current revision of Thermal Guidelines for Data Processing Environments, Second Edition, ASHRAE 2009b.

The data center should maintain a cleanliness level as identified in ISO 14644-1, class 8 for particulate dust and pollution control. The air entering the data center should be filtered with a MERV 11 filter or better. The air within the data center should be continuously filtered with a MERV 8 or better filtration system. In addition, ongoing efforts should be made to prevent conductive particles, such as zinc whiskers, from entering the facility.

The allowable relative humidity level is 20 to 80% noncondensing; however, the recommended operating range is 40 to 55%. For data centers with gaseous contamination, such as high sulfur content, lower temperatures and humidity are recommended to minimize the risk of hardware corrosion and degradation. In general, humidity fluctuations within the data center should be minimized. It is also recommended that the data center be positively pressured and have air curtains on entryways to prevent outside air contaminants and humidity from entering the facility.

For facilities below 40% relative humidity, it is recommended to use grounding straps when contacting the equipment to avoid the risk of electrostatic discharge (ESD), which can harm electronic equipment.

As part of an ongoing monitoring process for the corrosiveness of the environment, it is recommended to place copper and silver coupons (per ISA 71.04-1985, Section 6.1 Reactivity) in airstreams representative of those in the data center. The monthly reactivity rate of the coupons should be less than 300 Angstroms. When the monitored reactivity rate reaches or exceeds this limit, the coupon should be analyzed for material species and a corrective mitigation process put in place.
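The humidity and corrosion thresholds in this section can be expressed as a few simple rules for a monitoring script. The sketch below is an illustration only: the function names are hypothetical, and it treats a monthly coupon reading of exactly 300 Angstroms as an exceedance because the text asks for a rate below that value.

```python
COUPON_LIMIT_ANGSTROMS = 300  # monthly reactivity rate should stay below this


def humidity_status(rh_pct: float) -> str:
    """Classify relative humidity against the recommended and allowable ranges."""
    if 40 <= rh_pct <= 55:
        return "recommended"
    if 20 <= rh_pct <= 80:
        return "allowable"
    return "out of range"


def needs_grounding_strap(rh_pct: float) -> bool:
    """Grounding straps are recommended below 40% relative humidity."""
    return rh_pct < 40


def coupon_needs_analysis(monthly_rate_angstroms: float) -> bool:
    """True when the coupon reactivity rate is not below the 300 Angstrom limit."""
    return monthly_rate_angstroms >= COUPON_LIMIT_ANGSTROMS


if __name__ == "__main__":
    print(humidity_status(35), needs_grounding_strap(35))
    print(coupon_needs_analysis(320))
```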

This EMC® cabinet ventilates from front to back; you must provide adequate clearance to service and cool the system. Depending on component-specific connections within the cabinet, the available power cord length may be somewhat shorter than the 15-foot standard.

Figure 2.1 Access and Ventilation Requirements

Optional Securing Brackets

If you intend to secure the optional stabilizer brackets to your site floor, prepare the location for the mounting bolts. The additional brackets help to prevent the cabinet from tipping while you service cantilevered levels, or from rolling during minor seismic events. The brackets provide three levels of protection for stabilizing the unit.

• Anti-Tip Bracket

• Anti-Move Bracket

• Seismic Restraint Bracket

Anti-Tip Bracket

Use this bracket to provide an extra measure of anti-tip security. One or two kits may be used. For cabinets with components that slide, EMC recommends that you use two kits.

Figure 2.2 Anti-Tip Bracket Placement

Anti-Move Bracket

Use this bracket to permanently fasten the unit to the floor.

Figure 2.3 Anti-Move Bracket Placement

Seismic Restraint Bracket

Use this bracket to provide the highest protection from moving or tipping.

Figure 2.4 Seismic Restraint Bracket Placement

Cabinet Positioning

The cabinet bottom includes four caster wheels. The front wheels are fixed; the two rear casters swivel in a 1.75-inch diameter. Swivel position of the caster wheels will determine the load-bearing points on your site floor, but does not affect the cabinet footprint. Once you have positioned, leveled, and stabilized the cabinet, the four leveling feet determine the final load-bearing points on your site floor.

Figure 2.5 Cabinet Positioning

When the cabinet is centered over two typical 24 in. (60.96 cm) by 24 in. (60.96 cm) floor tiles:

• Cutouts should be 8 in. (20.32 cm) by 6 in. (15.24 cm).

• Cutouts should be centered on the tiles, 9 in. (22.86 cm) from the front and rear and 8 in. (20.32 cm) from sides.
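The cutout placement above follows directly from centering an 8 in. by 6 in. opening on a 24 in. tile. The short sketch below reproduces that arithmetic; it assumes the 8 in. dimension runs side to side and the 6 in. dimension runs front to rear, which is the reading that matches the stated 8 in. and 9 in. margins. The names are illustrative.

```python
TILE_IN = 24.0          # floor tile edge length
CUTOUT_WIDTH_IN = 8.0   # side-to-side dimension (assumed orientation)
CUTOUT_DEPTH_IN = 6.0   # front-to-rear dimension (assumed orientation)
CM_PER_IN = 2.54


def centered_margin(tile_in: float, cutout_in: float) -> float:
    """Distance from the tile edge to a cutout centered on the tile."""
    return (tile_in - cutout_in) / 2


side_margin_in = centered_margin(TILE_IN, CUTOUT_WIDTH_IN)
front_rear_margin_in = centered_margin(TILE_IN, CUTOUT_DEPTH_IN)

if __name__ == "__main__":
    print(f"from sides: {side_margin_in} in ({side_margin_in * CM_PER_IN:.2f} cm)")
    print(f"from front and rear: {front_rear_margin_in} in "
          f"({front_rear_margin_in * CM_PER_IN:.2f} cm)")
```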

Package Dimensions and Clearance

Make certain your doorways and elevators are wide enough and tall enough to accommodate the shipping pallet and cabinet. Use a mechanical lift or pallet jack to position the packaged cabinet in its final location.

Figure 2.6 Door Clearance

Leave approximately 2.43 meters (8 feet) of clearance at the back of the cabinet to unload the unit and roll it off the pallet.

Figure 2.7 Unloading Clearance

3. Gathering Site-Specific Information

In order to complete an installation of an EMC Greenplum DCA, the following information should be gathered from the customer’s network and database personnel:

Table 3.1 Site-Specific Information

Information
  Description

External IP and hostname of the primary master
  The IP address and hostname that the customer will use to connect to the primary master host from their public LAN. The master hostname is also used for client connections to Greenplum Database.

External IP and hostname of the standby master
  The IP address and hostname that the customer will use to connect to the standby master host from their public LAN.

Netmask
  Netmask of the customer’s network.

Gateway
  Default gateway of the customer’s network and the IP address and interface name of the router.

NTP server IP
  The IP address or hostname of the customer’s preferred NTP (Network Time Protocol) server.

DNS name server IP
  The IP address of the customer’s DNS name server.

iDRAC password
  The password used for remote access to the master, standby master and segment hosts using the integrated Dell Remote Access Controller (iDRAC) interface. The default iDRAC password is calvin.

root password
  Customer-supplied root password for the master, standby master and segment hosts. The default root password is changeme.

gpadmin password
  Customer-supplied Greenplum Database superuser password. The default gpadmin password is changeme.

System locale
  The preferred locale to be used on the master, standby master and segment hosts. en_US.UTF-8 is the default locale for the Greenplum DCA (U.S. English and Unicode character set encoding).

  A locale identifier consists of a language identifier and a region identifier, and optionally a character set encoding. For example, sv_SE is Swedish as spoken in Sweden, en_US is U.S. English, and fr_CA is French Canadian. If more than one character set encoding can be useful for a locale, then the specifications look like this: en_US.UTF-8 (locale specification and character set encoding).

System timezone
  The local timezone to be used on the master, standby master and segment hosts. The default timezone is PST.

Database character set encoding
  UNICODE (UTF-8) is the default character set encoding for Greenplum Database (server-side encoding). This is usually the best choice, as it allows the customer to store all possible Unicode characters from any language. However, if all of the data to be stored is from a single language (now and in the future), UTF-8 entails a slight storage space penalty compared to an encoding specific to that language.

  If space savings are key, the customer should consider Latin-1, Latin-9, or WIN1252 for US or Western European installations, since those encodings use a single byte per character. Likewise, a customer in Thailand might consider WIN874 to store Thai, since it uses a single byte per character. However, keep in mind that this prevents storing any data outside those character sets. Even in the US or Western Europe, customers might find that some of their data is Latin-1, while some is Latin-9 or WIN1252, so any choice of single-byte encoding will not accommodate all of their data needs. See the Greenplum Database Administrator Guide for a list of all supported character set encodings.

Software Tools
  Connection to the DCA for setup and management requires an SSH utility. EMC recommends PuTTY or Cygwin.

Hardware Tools
  The following hardware tools will be required during installation of the DCA:
  • Utility knife
  • 9/16" socket wrench
  • ESD (electrostatic discharge) kit

Power Connection for Service Laptop
  Power for external devices should not be drawn from the DCA cabinet. A power connection is required for the EMC personnel service laptop. The connection should be a standard AC 100-240 V, 1.5 A, 50-60 Hz outlet.

Dial-home Connectivity
  The DCA supports dial-home for event notification to the EMC Global Services support center. Communication from the DCA to EMC is done via FTPS. Firewall access should be set up to allow FTPS traffic from the DCA’s external IP address to the following EMC addresses:

  corpusfep3.emc.com
  corpusfep4.emc.com
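Two rows of Table 3.1, the system locale and the database character set encoding, describe rules that are easy to see in a few lines of code. The Python sketch below splits a locale identifier into its language, region, and optional encoding, and then measures how many bytes a character needs in different encodings. It uses Python's own codec names (`latin-1`, `iso8859-15`, `cp1252`, `cp874`) as stand-ins for Latin-1, Latin-9, WIN1252, and WIN874; that mapping is an assumption of this example, and the functions `parse_locale` and `encoded_size` are hypothetical helpers, not Greenplum utilities.

```python
from typing import NamedTuple, Optional


class Locale(NamedTuple):
    language: str
    region: str
    encoding: Optional[str]


def parse_locale(identifier: str) -> Locale:
    """Split an identifier such as en_US.UTF-8 into language, region, encoding."""
    name, _, encoding = identifier.partition(".")
    language, _, region = name.partition("_")
    return Locale(language, region, encoding or None)


def encoded_size(text: str, encoding: str) -> Optional[int]:
    """Bytes needed to store text in an encoding, or None if it cannot be stored."""
    try:
        return len(text.encode(encoding))
    except UnicodeEncodeError:
        return None


if __name__ == "__main__":
    print(parse_locale("en_US.UTF-8"))
    print(parse_locale("sv_SE"))
    # Accented letter, euro sign, and a Thai letter in several encodings.
    for char in ("\u00e9", "\u20ac", "\u0e01"):
        sizes = {
            enc: encoded_size(char, enc)
            for enc in ("utf-8", "latin-1", "iso8859-15", "cp1252", "cp874")
        }
        print(repr(char), sizes)
```

In this sketch the euro sign cannot be stored in Latin-1 but fits in one byte in Latin-9 and WIN1252, while UTF-8 stores every character shown at the cost of two or three bytes each. That is the trade-off the table describes between storage savings and coverage.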

4. Next Steps

The following documentation may be used during the next steps in implementing your Data Computing Appliance:

EMC Greenplum Data Computing Appliance Getting Started Guide

http://powerlink.emc.com/km/live1/en_US/Offering_Technical/Technical_Documentation/300-011-454.pdf

Greenplum Database 4.0 Administrator Guide

http://powerlink.emc.com/km/live1/en_US/Offering_Technical/Technical_Documentation/300-011-538.pdf

Greenplum Database 4.0 Release Notes

http://powerlink.emc.com/km/live1/en_US/Offering_Technical/Technical_Documentation/300-011-650.pdf