disclaimer - meetupfiles.meetup.com/1642258/ambari-overview.pdfdisclaimer • this document may...
TRANSCRIPT
© Hortonworks Inc. 2014
Disclaimer • This document may contain product features and technology directions
that are under development or may be under development in the future.
• Technical feasibility, market demand, user feedback, and the Apache Software Foundation community development process can all affect timing and final delivery.
• This document’s description of these features and technology directions does not represent a contractual commitment from Hortonworks to deliver these features in any generally available product.
• Product features and technology directions are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind.
1
© Hortonworks Inc. 2014
Apache Ambari Provisioning, Managing and Monitoring Hadoop
October 2014
© Hortonworks Inc. 2014
Agenda
• Ambari Overview and the Community • Provisioning Clusters and Managing Services • Extensibility Features • Futures • Learn More
Page 3
© Hortonworks Inc. 2014
How do you Operate a Hadoop Cluster?
Apache Ambari is a platform to
provision, manage and monitor
Hadoop clusters
© Hortonworks Inc. 2014
Apache Ambari Themes
Operate Hadoop at
Scale
Integrate with the Enterprise
Extend for the
Ecosystem
Deliver the core opera-onal capabili-es to
provision, manage and monitor Hadoop clusters at scale.
Robust API for integra-on with
exis@ng enterprise systems, such as
Teradata Viewpoint and MicrosoD SCOM.
Provide extensible plaForm for
Customers, Partners and the Community to, such as Stacks and
Views.
© Hortonworks Inc. 2014
Enterprise Operations Platform
Apache Ambari is a 100% open source platform for provisioning, managing and monitoring Apache Hadoop clusters
AMBARI WEB
Others
compute &
storage . . .
. . .
. . compute &
storage
.
. EXTEND
AMBARI REST API
AMBARI SERVER PROVISION | MANAGE | MONITOR
Integra@on with Exis@ng Tools
OPERATE
AMBARI STACKS
© Hortonworks Inc. 2014
Innovation at the Speed of Community
An active 100% open source Community: • 60+ Contributors / 30+ Committers • 250+ Ambari User Group Members
2014
April
Graduates to Apache Top Level Project
2013
Dec
Apache Ambari 1.5 Released Adds support for Hortonworks Data Platform 2.1 Adds new operational and extensibility capabilities
© Hortonworks Inc. 2014
What’s New in Ambari 1.5
• Support for HDP 1.3, 2.0 and 2.1 Stacks • Rolling Restarts • Maintenance Mode • Decommission TTs, NMs, RSs • Service and Component Restarts • Bulk Host Operations • Add Service • Ambari Blueprints – Tech Preview • For a complete list of changes:
https://issues.apache.org/jira/browse/AMBARI/fixforversion/12325019
Page 8
© Hortonworks Inc. 2014
What’s New in Ambari 1.6.0
• PostgreSQL Support – For Ambari DB, Hive Metastore and Oozie
• Ambari Blueprints – GA
• For a complete list of changes
https://issues.apache.org/jira/browse/AMBARI/fixforversion/12326544
Page 9
© Hortonworks Inc. 2014
What’s New in Ambari 1.6.1 • New Host Checks
– Custom JDK Path Check – Hostname Resolution Check – Reverse Hostname Lookup Check
• Improved External Database Support – Simplified JDBC Driver Setup – Database Connection “Test”
• Stack Repository Management Enhancements – Manage Stack repositories from Ambari Web – Template-driven .repo files
• Performance and Scale – 2,000 node cluster testing
• For a complete list of changes
https://issues.apache.org/jira/browse/AMBARI/fixforversion/12326545
Page 10
Provisioning
© Hortonworks Inc. 2014
Choose Services
• Core Hadoop plus Ecosystem Services • Popular Open Source Metrics + Monitoring systems
Page 12
© Hortonworks Inc. 2014
Master Component Assignment
• Highly customizable targeting of Master components
Page 13
© Hortonworks Inc. 2014
Slave Component Assignment
• Host-level control to target Slave components • Designate “gateway” hosts with client configurations
Page 14
© Hortonworks Inc. 2014
Auto Agent Bootstrap (via SSH)
• Configure to use “root” or a “sudoer” user for SSH • More flexibility in root-restricted environments
Page 15
© Hortonworks Inc. 2014
Manual Agent Bootstrap
• Ambari Agent “manual” bootstrap option • Useful when SSH restricted or not available
Page 16
© Hortonworks Inc. 2014
Pre-Flight Host Checks • Examine hosts, check for
conflicts before installation
• Checks for existing files, directories, packages and processes
Management
© Hortonworks Inc. 2014
Cluster Operations
• Start, Stop and Smoke Test Services • Apply configura@on changes and inject custom proper@es
19
© Hortonworks Inc. 2014
Configuration Management
• Ability to update core-site.xml properties • Add and Remove custom properties
Page 20
© Hortonworks Inc. 2014
Host Level Visibility + Controls
• View System Metrics and Host Information • Manage Host Components, including start/stop/decommission
• “Add Hosts” to expand cluster capacity
21
© Hortonworks Inc. 2014
Enable NameNode HA
• Post-cluster install, wizard driven experience • For Hadoop 2 Stack (QJM)
Page 22
© Hortonworks Inc. 2014
NameNode HA Alerts
Page 23
© Hortonworks Inc. 2014
Moving Masters
Page 24
• Move master components to different hosts – NameNode – SecondaryNameNode – TaskTracker (Hadoop 1) – ResourceManager (Hadoop 2)
• Move individual NameNodes in Hadoop 2 HA configuration – Hadoop 2
© Hortonworks Inc. 2014
HBase Multi-Master
Page 25
• Add multiple HBase Masters
• During Cluster Install – Assign 1+ HBase Masters
• Post Cluster Install – Add HBase Master to a host
© Hortonworks Inc. 2014
Host Configuration Groups
• Set custom configuration properties for one or more host groups (e.g. “host overrides”)
• Important for handing “heterogeneous” HW clusters – Different memory, mount points, directories
26
HEAPSIZE= 1024 HEAPSIZE= 512
© Hortonworks Inc. 2014
Staged Configurations Changes
• Make configuration changes and save in Ambari w/o stopping services
• Stage the changes on Ambari, restart components to pickup configurations
Page 27
Monitoring
© Hortonworks Inc. 2014
Cluster Heatmaps
• Gain health visibility of entire cluster • Quickly identify host outliers and potential issues • Includes both System and Hadoop Metric Heatmaps
29
© Hortonworks Inc. 2014
Hadoop Metrics
• Hadoop Service metrics such as HDFS, MapReduce and HBase
• Visibility info DFS capacity, file operations, RPC latency and JVM performance
30
© Hortonworks Inc. 2014
Hadoop + System Monitoring
31
• Watch key Hadoop operational metrics for all services
• Master service availability, CPU utilization, network latency, storage capacity
• Host-level alerting • Component service processes
and storage availability
© Hortonworks Inc. 2014
Customizable Metrics Dashboard
• Watch key System and Hadoop metrics
Page 32
© Hortonworks Inc. 2014
YARN Dashboard Widgets + Heatmaps
Page 33
© Hortonworks Inc. 2014
YARN Alerts
Page 34
Core Platform
© Hortonworks Inc. 2014
Platform Support
• Support for popular Linux platforms such as RHEL, CentOS, Oracle Linux and SLES
• Support for Oracle JDK 1.6 and 1.7 • Support for Open JDK 7
Page 36
© Hortonworks Inc. 2014
Database Support
Page 37
Hive Metastore Oozie Ambari DB
MySQL 5.x
Oracle 11g
PostgreSQL 8, 9
Default PostgreSQL MySQL Derby
© Hortonworks Inc. 2014
Secure Authentication
• Pluggable LDAP and AD, Default RDBMS • Supports AmbariAdmin and AmbariUser roles
© Hortonworks Inc. 2014
Ambari Extensibility Stacks, Blueprints and Views
Page 39
© Hortonworks Inc. 2014
Extensibility Features
• To add new Services (ISV or otherwise) beyond HDP Stack • To customize a Stack for customer specific environments
• To use Ambari for automating cluster installations • To share best practices on layout and cluster configuration
• To extend and customize the Ambari Web UI • Add new capabilities, customize existing capabilities
Stacks
Blueprints
Views
Goal: Extend Ambari without hard-coding in Ambari
© Hortonworks Inc. 2014
Community Links
Page 41
Capability Link
Stacks https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=38571133
Blueprints https://cwiki.apache.org/confluence/display/AMBARI/Blueprints
Views https://cwiki.apache.org/confluence/display/AMBARI/Views
© Hortonworks Inc. 2014
Anatomy of Ambari Extension Points
Ambari Server
Ambari Agent Ambari
Agent Ambari Agent
Ambari Web
Stacks Stacks
Stacks
java!js! python!
Ambari Views Ambari Stacks
© Hortonworks Inc. 2014
Ambari Stacks
Page 43
© Hortonworks Inc. 2014
Ambari Stacks
• Goal: Reduce time + effort to new Services to Ambari
• Defines a consistent Service lifecycle management interface that can be extended
• Dynamically add Stack + Service definitions • Bring companion technologies together with “Hadoop”
Page 44
AMBARI {rest}!
<ambari-web>!
Stacks HDFS YARN MR2
Hive
Pig
Oozie
NEW
NEW
NEW
© Hortonworks Inc. 2014
Stack Terminology
Page 45
Term Definition Examples
STACK Defines a set of Services, where to obtain the software packages and how to manage the lifecycle.
HDP-2.0, HDP-1.3.3
SERVICE Defines the Components that make-up the Service. HDFS, NAGIOS, YARN
COMPONENT The building-blocks of a Service, that adhere to a certain lifecycle.
NAMENODE, DATANODE, OOZIE_SERVER
CATEGORY The category of Component. MASTER, SLAVE, CLIENT
© Hortonworks Inc. 2014
Stack Mechanics
• Stacks define Services + Repos – What is in the Stack, and where to get the bits
• Each Service has a definition – What Components are part of the Service
• Each Service has defined lifecycle commands – start, stop, status, install, configure
• Lifecycle is controlled via command scripts • Ability to define “custom” commands
Page 46
AMBARI SERVER
Stack
Command Scripts
Service Defini@ons
AMBARI AGENT/S AMBARI AGENT/S AMBARI AGENT/S
python!xml!
Repos
© Hortonworks Inc. 2014
Stacks Support Inheritance
HDP 2.0 Stack
HDP 2.1 Stack
• Defines a set of Service definitions • Default service configurations and command
scripts
• Overrides any Service definitions, commands and configurations
• Adds new Services specific to this Stack
© Hortonworks Inc. 2014
Service Lifecycle Commands
Component Category Install Start Stop Configure Status Custom
MASTER
SLAVE
CLIENT
© Hortonworks Inc. 2014
Ambari Views
Page 49
© Hortonworks Inc. 2014
Ambari Views
• Goal: enable the delivery of custom UI experiences in Ambari Web
• Developers can extend the Ambari Web interface • Views expose custom UI features for Services
• Ambari Admins can authorize Views to Ambari users • Entitlements framework for controlling access to Views
© Hortonworks Inc. 2014
View Concepts
Page 51
Views Framework
Views
Core to Ambari
Built by Hortonworks, Community,
Partners
© Hortonworks Inc. 2014
Example Views
Capacity Scheduler Queue Manager
Hive Tez Query Analyzer
© Hortonworks Inc. 2014
View Components
• Deploy client-side assets (such as HTML + JavaScript)
• Expose server-side resources (such as REST endpoints)
© Hortonworks Inc. 2014
Views Deployment
• For Hadoop Operators: • Deploy Views in an Ambari Server that is managing a Hadoop
cluster
• For Data Workers: • Run Views in a dedicated “standalone” Ambari Server
Page 54
Ambari Server
HADOOP Store & Process
Ambari Views Server
Operators manage the cluster, may have Views deployed
Data Workers use the cluster and use the Ambari Views Server for Views
© Hortonworks Inc. 2014
View Versions and Instances
• Deploy multiple versions and create multiple instances of a view
• Manage accessibility and usage
© Hortonworks Inc. 2014
Terminology Term Description
View Definition Describes the view resources and core view properties such as name, version and any necessary configuration properties.
View Package Packages the view client and server assets (and dependencies)
View Deployment
Deploying a view into Ambari
View Version A specific version of a view
View Instance An instantiation of a specific view version
Framework Services
View context, instance data, configuration properties and events
© Hortonworks Inc. 2014
View Packaging
• View Definition: view.xml • Resource / Service classes : JAX-RS annotated • UI classes : html, Servlets deployed as web app (WEB-INF/web.xml).
• Application logic : Supporting classes • Dependencies : 3rd party jars or classes
├── WEB-INF │ └── web.xml ├── org │ └── apache │ └── ambari │ └── view │ └── weather │ ├── CityResource.class │ ├── CityResourceProvider.class │ ├── CityService.class │ └── WeatherServlet.class └── view.xml
view.jar
© Hortonworks Inc. 2014
Learn More
Page 58
Resource Location
Apache Ambari Project Page http://ambari.apache.org
Ambari Project Wiki https://cwiki.apache.org/confluence/display/AMBARI
Ambari Project JIRA https://issues.apache.org/jira/browse/AMBARI
© Hortonworks Inc. 2014
Extras
Page 59
© Hortonworks Inc. 2014
Ambari SCOM Overview
Page 60
Ambari SCOM Mgmt Pack
Cluster Hosts (virtualized or physical)
Ambari Server
HADOOP Store & Process
1
2
3
1
2
3
Ambari aggregates and exposes Hadoop metrics
Ambari SCOM displays cluster layout and Hadoop metric graphs
Ambari SCOM monitors Hadoop health and alerts in case of problems