sap hana architecture overview | sap hana tutorial

58
SAP HANA Foundation

Upload: zarantech-llc

Post on 16-Apr-2017

393 views

Category:

Education


52 download

TRANSCRIPT

Page 1: SAP HANA Architecture Overview | SAP HANA Tutorial

SAP HANA Foundation

Page 2: SAP HANA Architecture Overview | SAP HANA Tutorial

2

Problem: Heterogeneous Information Sources

“Heterogeneities are everywhere”

Different interfaces

Different data representations

Duplicate and inconsistent information

PersonalDatabases

Digital Libraries

Scientific DatabasesWorldWideWeb

Page 3: SAP HANA Architecture Overview | SAP HANA Tutorial

3

Problem: Data Management in Large Enterprises

• Vertical fragmentation of informational systems (vertical stove pipes)

• Result of application (user)-driven development of operational systems

Sales Administration Finance Manufacturing ...

Sales PlanningStock Mngmt

...

Suppliers

...

Debt MngmtNum. Control

...

Inventory

Page 4: SAP HANA Architecture Overview | SAP HANA Tutorial

4

Goal: Unified Access to Data

Integration System

Collects and combines information

Provides integrated view, uniform user interface

Supports sharing

WorldWideWeb

Digital Libraries Scientific Databases

Personal

Databases

Page 5: SAP HANA Architecture Overview | SAP HANA Tutorial

5

Two Approaches:

Query-Driven (Lazy)

Warehouse (Eager)

Source Source

?

Why a Warehouse?

Page 6: SAP HANA Architecture Overview | SAP HANA Tutorial

6

The Traditional Research Approach

Source SourceSource

. . .

Integration System

. . .

Metadata

Clients

Wrapper WrapperWrapper

Query-driven (lazy, on-demand)

Page 7: SAP HANA Architecture Overview | SAP HANA Tutorial

7

Disadvantages of Query-Driven Approach

Delay in query processing

Slow or unavailable information sources

Complex filtering and integration

Inefficient and potentially expensive for frequent queries

Competes with local processing at sources

Page 8: SAP HANA Architecture Overview | SAP HANA Tutorial

8

The Warehousing Approach

Data

Warehouse

Clients

Source SourceSource

. . .

Extractor/

Monitor

Integration System

. . .

Metadata

Extractor/

Monitor

Extractor/

Monitor

Information integrated in advance

Stored in wh for direct querying and analysis

Page 9: SAP HANA Architecture Overview | SAP HANA Tutorial

CS 336 9

Advantages of Warehousing Approach• High query performance

– But not necessarily most current information

• Doesn’t interfere with local processing at sources– Complex queries at warehouse

– OLTP at information sources

• Information copied at warehouse– Can modify, annotate, summarize, restructure, etc.

– Can store historical information

– Security, no auditing

Page 10: SAP HANA Architecture Overview | SAP HANA Tutorial

10

Not Either-Or Decision

• Query-driven approach still better for

– Rapidly changing information

– Rapidly changing information sources

– Truly vast amounts of data from large numbers of sources

– Clients with unpredictable needs

Page 11: SAP HANA Architecture Overview | SAP HANA Tutorial

11

What is a Data Warehouse?A Practitioners Viewpoint

“A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context.”

-- Barry Devlin, IBM Consultant

Page 12: SAP HANA Architecture Overview | SAP HANA Tutorial

12

What is a Data Warehouse?An Alternative Viewpoint

“A DW is a

– subject-oriented,

– integrated,

– time-varying,

– non-volatile

collection of data that is used primarily in organizational decision making.”

-- W.H. Inmon, Building the Data Warehouse, 1992

Page 13: SAP HANA Architecture Overview | SAP HANA Tutorial

13

A Data Warehouse is...• Stored collection of diverse data

– A solution to data integration problem

– Single repository of information

• Subject-oriented

– Organized by subject, not by application

– Used for analysis, data mining, etc.

• Optimized differently from transaction-oriented db

• User interface aimed at executive

Page 14: SAP HANA Architecture Overview | SAP HANA Tutorial

14

… Cont’d• Large volume of data (Gb, Tb)

• Non-volatile

– Historical

– Time attributes are important

• Updates infrequent

• May be append-only

• Examples

– All transactions ever at Sainsbury’s

– Complete client histories at insurance firm

– LSE financial information and portfolios

Page 15: SAP HANA Architecture Overview | SAP HANA Tutorial

15

Generic Warehouse Architecture

Extractor/Monitor

Extractor/Monitor

Extractor/Monitor

Integrator

Warehouse

Client Client

Design Phase

Maintenance

Loading

...

Metadata

Optimization

Query & Analysis

Page 16: SAP HANA Architecture Overview | SAP HANA Tutorial

16

Page 17: SAP HANA Architecture Overview | SAP HANA Tutorial

17

Page 18: SAP HANA Architecture Overview | SAP HANA Tutorial

18

Data Warehouse Architectures: Conceptual View

• Single-layer

– Every data element is stored once only

– Virtual warehouse

• Two-layer

– Real-time + derived data

– Most commonly used approach in

industry today

“Real-time data”

Operationalsystems

Informationalsystems

Derived Data

Real-time data

Operationalsystems

Informationalsystems

Page 19: SAP HANA Architecture Overview | SAP HANA Tutorial

19

Three-layer Architecture: Conceptual View

• Transformation of real-time data to derived data really requires two steps

Derived Data

Real-time data

Operationalsystems

Informationalsystems

Reconciled DataPhysical Implementationof the Data Warehouse

View level“Particular informational

needs”

Page 20: SAP HANA Architecture Overview | SAP HANA Tutorial

20

Data Warehousing: Two Distinct Issues

(1) How to get information into warehouse

“Data warehousing”

(2) What to do with data once it’s in warehouse

“Warehouse DBMS”

• Both rich research areas

• Industry has focused on (2)

Page 21: SAP HANA Architecture Overview | SAP HANA Tutorial

21

Issues in Data Warehousing

• Warehouse Design

• Extraction

– Wrappers, monitors (change detectors)

• Integration

– Cleansing & merging

• Warehousing specification & Maintenance

• Optimizations

• Miscellaneous (e.g., evolution)

Page 22: SAP HANA Architecture Overview | SAP HANA Tutorial

22

OLTP: On Line Transaction Processing Describes processing at operational sites

OLAP: On Line Analytical Processing Describes processing at warehouse

OLTP vs. OLAP

Page 23: SAP HANA Architecture Overview | SAP HANA Tutorial

23

Warehouse is a Specialized DBStandard DB (OLTP)

• Mostly updates

• Many small transactions

• Mb - Gb of data

• Current snapshot

• Index/hash on p.k.

• Raw data

• Thousands of users (e.g., clerical users)

Warehouse (OLAP)

Mostly reads

Queries are long and complex

Gb - Tb of data

History

Lots of scans

Summarized, reconciled data

Hundreds of users (e.g., decision-makers, analysts)

Page 24: SAP HANA Architecture Overview | SAP HANA Tutorial

24

Page 25: SAP HANA Architecture Overview | SAP HANA Tutorial

25

Page 26: SAP HANA Architecture Overview | SAP HANA Tutorial

26

Page 27: SAP HANA Architecture Overview | SAP HANA Tutorial

27

Page 28: SAP HANA Architecture Overview | SAP HANA Tutorial

28

Page 29: SAP HANA Architecture Overview | SAP HANA Tutorial

29

Page 30: SAP HANA Architecture Overview | SAP HANA Tutorial

30

Page 31: SAP HANA Architecture Overview | SAP HANA Tutorial

31

Page 32: SAP HANA Architecture Overview | SAP HANA Tutorial

32

Page 33: SAP HANA Architecture Overview | SAP HANA Tutorial

33

Page 34: SAP HANA Architecture Overview | SAP HANA Tutorial

34

Page 35: SAP HANA Architecture Overview | SAP HANA Tutorial

35

Page 36: SAP HANA Architecture Overview | SAP HANA Tutorial

36

Page 37: SAP HANA Architecture Overview | SAP HANA Tutorial

37

Page 38: SAP HANA Architecture Overview | SAP HANA Tutorial

38

Page 39: SAP HANA Architecture Overview | SAP HANA Tutorial

39

Page 40: SAP HANA Architecture Overview | SAP HANA Tutorial

40

Page 41: SAP HANA Architecture Overview | SAP HANA Tutorial

41

Page 42: SAP HANA Architecture Overview | SAP HANA Tutorial

42

Page 43: SAP HANA Architecture Overview | SAP HANA Tutorial

43

Page 44: SAP HANA Architecture Overview | SAP HANA Tutorial

44

Page 45: SAP HANA Architecture Overview | SAP HANA Tutorial

45

Page 46: SAP HANA Architecture Overview | SAP HANA Tutorial

46

Page 47: SAP HANA Architecture Overview | SAP HANA Tutorial

47

Page 48: SAP HANA Architecture Overview | SAP HANA Tutorial

48

Page 49: SAP HANA Architecture Overview | SAP HANA Tutorial

49

Page 50: SAP HANA Architecture Overview | SAP HANA Tutorial

50

Page 51: SAP HANA Architecture Overview | SAP HANA Tutorial

51

Page 52: SAP HANA Architecture Overview | SAP HANA Tutorial

52

Page 53: SAP HANA Architecture Overview | SAP HANA Tutorial

53

Page 54: SAP HANA Architecture Overview | SAP HANA Tutorial

54

Page 55: SAP HANA Architecture Overview | SAP HANA Tutorial

55

Page 56: SAP HANA Architecture Overview | SAP HANA Tutorial

56

Page 57: SAP HANA Architecture Overview | SAP HANA Tutorial

57

Page 58: SAP HANA Architecture Overview | SAP HANA Tutorial

58