An Insight To Data Warehouse Testing

By Rohit Agrawal Jan 2013

Upload: zuwena

Post on 31-Jan-2016

TRANSCRIPT

Page 1: An Insight To  Data Warehouse Testing

By Rohit Agrawal Jan 2013

Page 2: An Insight To  Data Warehouse Testing

Is the data correct…?

Would table Customer_fact ever get loaded if the Control Table has the following…?

What if the DW load frequency is daily and ETL execution time exceeds 24 hrs…?

CustID  Title  FirstName  LastName  Gender  DateOfBirth  Age
123     Mr     Chris      Joseph    F       12/12/2000   22
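The customer record on this slide can be checked mechanically. The sketch below is one possible way to do it, not the presenter's method: the title-to-gender mapping, the `DD/MM/YYYY` date format, and the reference date of 31 January 2013 (taken from the month of the presentation) are all assumptions, and the function names are hypothetical.

```python
from datetime import date, datetime

# Assumed mapping of titles to the gender code they imply (not from the slides).
TITLE_GENDER = {"Mr": "M", "Mrs": "F", "Ms": "F", "Miss": "F"}


def age_on(dob, as_of):
    """Completed years between a date of birth and a reference date."""
    years = as_of.year - dob.year
    if (as_of.month, as_of.day) < (dob.month, dob.day):
        years -= 1  # birthday has not occurred yet in the reference year
    return years


def check_customer(row, as_of):
    """Return a list of consistency problems found in one customer row."""
    problems = []
    expected_gender = TITLE_GENDER.get(row["Title"])
    if expected_gender is not None and expected_gender != row["Gender"]:
        problems.append("Title %s conflicts with Gender %s" % (row["Title"], row["Gender"]))
    # Assumed date format: DD/MM/YYYY.
    dob = datetime.strptime(row["DateOfBirth"], "%d/%m/%Y").date()
    derived_age = age_on(dob, as_of)
    if derived_age != row["Age"]:
        problems.append("Age %d does not match DateOfBirth (derived %d)" % (row["Age"], derived_age))
    return problems


row = {"CustID": 123, "Title": "Mr", "FirstName": "Chris", "LastName": "Joseph",
       "Gender": "F", "DateOfBirth": "12/12/2000", "Age": 22}
issues = check_customer(row, as_of=date(2013, 1, 31))
for issue in issues:
    print(issue)
```

Under these assumptions the record fails both checks: the title and gender disagree, and the stored age does not follow from the date of birth.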

DataBase  Schema    Table      LastLoadDate  LoadFrequency
Info_DB   Customer  Cust_fact  31/01/2012    D
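The slides do not say how the ETL job reads the control table, so the following is only a sketch under assumed rules: a table is loaded only if it has a control row, `D` means a one-day interval, and a load is due once that interval has passed since `LastLoadDate`. The function names are hypothetical.

```python
from datetime import date, datetime, timedelta

# Assumed meaning of LoadFrequency codes; only "D" appears on the slide.
FREQUENCY_DAYS = {"D": 1}


def find_control_row(control_table, table_name):
    """Return the control row registered for table_name, or None."""
    for control_row in control_table:
        if control_row["Table"] == table_name:
            return control_row
    return None


def load_is_due(control_row, run_date):
    """True if at least one load interval has passed since LastLoadDate."""
    # Assumed date format: DD/MM/YYYY.
    last_load = datetime.strptime(control_row["LastLoadDate"], "%d/%m/%Y").date()
    interval = timedelta(days=FREQUENCY_DAYS[control_row["LoadFrequency"]])
    return run_date - last_load >= interval


control_table = [
    {"DataBase": "Info_DB", "Schema": "Customer", "Table": "Cust_fact",
     "LastLoadDate": "31/01/2012", "LoadFrequency": "D"},
]

entry = find_control_row(control_table, "Cust_fact")
print(load_is_due(entry, date(2012, 2, 1)))
```

A test case built on this would vary two things: the table name looked up (a name with no matching control row never loads under the assumed rule) and the run date relative to `LastLoadDate`.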

Page 3: An Insight To  Data Warehouse Testing

Introduction

Testing Process

Focus Points

Challenges

Best Practices

Page 4: An Insight To  Data Warehouse Testing

Introduction

Page 5: An Insight To  Data Warehouse Testing

What? Exhaustive testing of a Data Warehouse during its design and on an ongoing basis

Why? Organisational decisions depend entirely on the Enterprise data, so that data should be of the utmost quality

Where? From Source through to Reporting

When? From the Design phase through to Production

Page 6: An Insight To  Data Warehouse Testing

Testing Process

Page 7: An Insight To  Data Warehouse Testing

Page 8: An Insight To  Data Warehouse Testing

Focus Points

Page 9: An Insight To  Data Warehouse Testing

Page 10: An Insight To  Data Warehouse Testing

All Customer data from different bank branches are loaded

Insert data of Customers with age greater than 60 into the senior citizen category

Error out the records if Customer does not belong to the Bank
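The three examples on this slide can be sketched as a small routing function plus a completeness check. This is an illustration only: the field names, the `regular` category for everyone else, and the set of known customer IDs are assumptions, not part of the presentation.

```python
SENIOR_AGE_THRESHOLD = 60  # from the slide: age greater than 60


def route_customer(record, bank_customer_ids):
    """Return 'error', 'senior_citizen', or 'regular' for one source record."""
    if record["CustID"] not in bank_customer_ids:
        return "error"  # customer does not belong to the bank
    if record["Age"] > SENIOR_AGE_THRESHOLD:
        return "senior_citizen"
    return "regular"  # assumed default category


def load_customers(branch_feeds, bank_customer_ids):
    """Merge every branch feed and group records by routing outcome."""
    outcome = {"error": [], "senior_citizen": [], "regular": []}
    for feed in branch_feeds:
        for record in feed:
            outcome[route_customer(record, bank_customer_ids)].append(record)
    return outcome


# Hypothetical feeds from two branches.
branch_a = [{"CustID": 1, "Age": 60}, {"CustID": 2, "Age": 61}]
branch_b = [{"CustID": 3, "Age": 35}, {"CustID": 99, "Age": 70}]
result = load_customers([branch_a, branch_b], bank_customer_ids={1, 2, 3})

# Completeness: every source record must land in exactly one bucket.
total_in = len(branch_a) + len(branch_b)
total_out = sum(len(records) for records in result.values())
print(total_in == total_out)
```

The record with age exactly 60 is a useful boundary case: under the "greater than 60" wording it must not be classed as a senior citizen.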

Page 11: An Insight To  Data Warehouse Testing

Check February has 29 days in leap years only

A row in stage with AccountID=123 has the expected data in DW

Zip Code has 6 digits; State names are properly abbreviated
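Checks like these are easy to automate. The sketch below assumes `DD/MM/YYYY` dates, a zip code made of exactly six ASCII digits, and a small made-up set of accepted state abbreviations; the function names and the sample rows are hypothetical.

```python
import re
from datetime import datetime

ZIP_PATTERN = re.compile(r"[0-9]{6}")
# Illustrative subset only; a real test would use the project's reference list.
VALID_STATE_CODES = {"MH", "KA", "DL"}


def is_valid_date(text):
    """True if text is a real calendar date in DD/MM/YYYY form."""
    try:
        datetime.strptime(text, "%d/%m/%Y")
        return True
    except ValueError:
        return False


def is_valid_zip(zip_code):
    """True if zip_code consists of exactly six digits."""
    return ZIP_PATTERN.fullmatch(zip_code) is not None


def is_valid_state(code):
    """True if code is one of the accepted abbreviations."""
    return code in VALID_STATE_CODES


def row_matches(stage_row, dw_row, columns):
    """True if the staged row and the warehouse row agree on every listed column."""
    return all(stage_row[column] == dw_row[column] for column in columns)


stage = {"AccountID": 123, "Zip": "400001", "State": "MH"}
warehouse = {"AccountID": 123, "Zip": "400001", "State": "MH"}
print(is_valid_date("29/02/2012"), is_valid_date("29/02/2013"))
print(row_matches(stage, warehouse, ["AccountID", "Zip", "State"]))
```

`datetime.strptime` rejects 29 February in a non-leap year, which covers the leap-year check without hand-written calendar logic.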

Page 12: An Insight To  Data Warehouse Testing

Impact of Executing complex queries during data load/Rendering reports in 30 sec

Source system scheduling conflicts

Incremental loads as per Audit columns like LastUpdateDate/Incremental Flag
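An incremental load driven by an audit column can be verified by computing, independently of the ETL, which source rows should have been picked up. The sketch below assumes a `LastUpdateDate` column holding `datetime` values and a stored timestamp of the previous load; the row layout and function names are hypothetical.

```python
from datetime import datetime


def select_incremental(source_rows, last_load_time):
    """Rows whose LastUpdateDate is strictly later than the previous load."""
    return [row for row in source_rows if row["LastUpdateDate"] > last_load_time]


def missing_from_dw(expected_rows, dw_account_ids):
    """AccountIDs that should have been loaded but are absent from the warehouse."""
    return sorted(row["AccountID"] for row in expected_rows
                  if row["AccountID"] not in dw_account_ids)


last_load = datetime(2013, 1, 30, 0, 0, 0)
source = [
    {"AccountID": 1, "LastUpdateDate": datetime(2013, 1, 29, 23, 0, 0)},
    {"AccountID": 2, "LastUpdateDate": datetime(2013, 1, 30, 0, 0, 0)},
    {"AccountID": 3, "LastUpdateDate": datetime(2013, 1, 30, 8, 15, 0)},
]

expected = select_incremental(source, last_load)
print([row["AccountID"] for row in expected])
print(missing_from_dw(expected, dw_account_ids={1, 2}))
```

Whether the comparison should be strict or inclusive depends on how the load timestamp is recorded, so the row updated exactly at the previous load time is worth a dedicated test case.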

Page 13: An Insight To  Data Warehouse Testing

Voluminous data from heterogeneous sources

Data Quality not assured at source

Business knowledge: organisation-wide knowledge of Enterprise data may not be feasible

Very high cost of quality, because any defect slippage translates into a high cost for the organisation

The heterogeneous data sources will be updated asynchronously

Page 14: An Insight To  Data Warehouse Testing

Page 15: An Insight To  Data Warehouse Testing

“If you torture data sufficiently, it will confess to almost anything.”

Page 16: An Insight To  Data Warehouse Testing

Page 17: An Insight To  Data Warehouse Testing

Thank You..!