leveraging the microsoft repository in your warehouse
DESCRIPTION
Leveraging the Microsoft Repository in Your Warehouse. Saeed K. Rahimi Graduate Programs in Software University of St. Thomas DWSoft Corporation. Contents. Data Warehousing and Repository The Open Information Model and Meta Data Coalition Microsoft’s Data Warehousing Strategy - PowerPoint PPT PresentationTRANSCRIPT
Leveraging the Microsoft Repository in Your Warehouse
Saeed K. Rahimi
Graduate Programs in Software
University of St. Thomas
DWSoft Corporation
DAMA Presentation, August 2000 Page 2© Copyright 2000 Saeed K. Rahimi
Contents
Data Warehousing and Repository The Open Information Model and Meta
Data Coalition Microsoft’s Data Warehousing Strategy Integrating the Repository Into Your
Warehouse
DAMA Presentation, August 2000 Page 3© Copyright 2000 Saeed K. Rahimi
Data Warehouse Architecture
Repository
Metadata Services
OperationalData
• Extract• Cleans• Transform• Load
Data warehouse Life Cycle
DW DM
SQL Server 7.5 - Object Persistence Services
DM
DM
DAMA Presentation, August 2000 Page 4© Copyright 2000 Saeed K. Rahimi
What is Metadata? (Continued)
Information not just data What is 122599? Is it an integer number? A string? Christmas day 1999? Repository provides metadata about this:
mmddyy
DAMA Presentation, August 2000 Page 5© Copyright 2000 Saeed K. Rahimi
What Is Metadata?
Information about your Systems
Source systems, Warehouse systems,ProcessesEtc.
DatabasesSource databasesWarehouse databases
Table Columns
DAMA Presentation, August 2000 Page 6© Copyright 2000 Saeed K. Rahimi
What Is Metadata?
Information about your Transformations
From source system to data warehouseFrom data warehouse to source systems
CalculationsTotal-Sales = amount*quantity – discount +
shipping&handling
DAMA Presentation, August 2000 Page 7© Copyright 2000 Saeed K. Rahimi
What Is Metadata? (Continued)
OLAP information Sales cube
TotalSales measureTime, product, geography dimensions
Other information Reporting Tools Modeling Tools
DAMA Presentation, August 2000 Page 8© Copyright 2000 Saeed K. Rahimi
Metadata Benefits
Roadmap to information in the warehouse What is in the warehouse What it means How to get the information Who owns it When it was created How it was created/generated
DAMA Presentation, August 2000 Page 9© Copyright 2000 Saeed K. Rahimi
Metadata Benefits
All users that interact with the warehouse will benefit from enhanced information
Analysts can better understand what data is available, and how it was calculated
Maintains a history prior to current processes
DAMA Presentation, August 2000 Page 10© Copyright 2000 Saeed K. Rahimi
Metadata Benefits (Continued)
Provides a common frame of reference – “what does total sales mean”
Provides a central storage location of warehouse knowledge (helps with turnover, growth)
Allows enterprises to ensure data marts/warehouses are built using common terms and goals
DAMA Presentation, August 2000 Page 11© Copyright 2000 Saeed K. Rahimi
Open Information Model
The schema for the repository Common set of core definitions to promote sharing
of information among different tools and vendors Covers basic set of sub-models Created with help/input from over 20 partners and
reviewed by 300 Microsoft has transferred rights to evolve the OIM
to the meta data coalition (MDC)
DAMA Presentation, August 2000 Page 12© Copyright 2000 Saeed K. Rahimi
MDC OIM Components Current components
Unified Modeling Language (UML) Relational database schema Data transformations Multidimensional schema (OLAP) English semantics Legacy databases Component descriptions
Components under review Report definitions Entity/relationship diagrams Business engineering
DAMA Presentation, August 2000 Page 13© Copyright 2000 Saeed K. Rahimi
Sample OIM Constructs
ITfmTransformableObject(Columns)
ITfmTransformableObjectSet
TfmObjectSetConsistsOfObject
ITfmTransformation
Tfm
Transform
ationHasTargetColumns
Tfm
Transform
ationHasSourceColum
ns
Transformation – mapping from Source column to Target column
DAMA Presentation, August 2000 Page 14© Copyright 2000 Saeed K. Rahimi
Virtual Relationship
A
C
B
D
This
VS
A
C
B
D
This
DAMA Presentation, August 2000 Page 15© Copyright 2000 Saeed K. Rahimi
Virtual Properties
A
C
B
D
This
VS
This
a b
c
d e
A
a b c d e
C
B
D
c
d e
DAMA Presentation, August 2000 Page 16© Copyright 2000 Saeed K. Rahimi
Microsoft’s Strategy Impact
Prior to SQL server 7.0, DW was primarily available only to very large companies
Tools were expensive, expertise hard to find Microsoft has lowered the bar by bundling
components (with SQL server for free) and making DW available to many more companies
DAMA Presentation, August 2000 Page 17© Copyright 2000 Saeed K. Rahimi
Microsoft Warehouse Components
Data transformation services Extract, transform, load
OLAP services Multidimensional server and analysis
Repository Information sharing
English query English to SQL translation
DAMA Presentation, August 2000 Page 18© Copyright 2000 Saeed K. Rahimi
Repository Maintenance
Importing information into the repository has always been a challenging task Information needs to be mapped to the model
components Information capture needs to be automated
Once in the repository, maintaining versions of the information is needed New information load can not destroy the
information already in the repository
DAMA Presentation, August 2000 Page 19© Copyright 2000 Saeed K. Rahimi
Loading the Repository
Data Transformation Services
OLAP Services
English Query Application
OLTPDatabase
Star SchemaDatabase
DB Schema
Transform
OLAP
Semantics
DB Schema
Repository
Other
OtherSources
DAMA Presentation, August 2000 Page 20© Copyright 2000 Saeed K. Rahimi
Loading the Repository
DTS can save to repository OLAP information can be saved to
repository with SQL Server 2000 OLE DB and ODBC data sources and related
information can be imported into repository with SQL Server 2000
DAMA Presentation, August 2000 Page 21© Copyright 2000 Saeed K. Rahimi
Implementation Issues DTS doesn’t provide complete source to target
mappings for certain transformations For example
When a query used as source,
Interfaces to many third party tools incomplete or not available Modeling tools
ERWin Visio PowerDesigner
Warehouse tools Business Objects Information Advantage
DAMA Presentation, August 2000 Page 22© Copyright 2000 Saeed K. Rahimi
Repository Features
Extensible metamodel Version management Dynamic COM based interface Batch XIF file based interface Workspace concept Some pre-built interfaces to data warehouse
tools
DAMA Presentation, August 2000 Page 23© Copyright 2000 Saeed K. Rahimi
Keys to Success
Ensure repository information is current Provide access to all users of the warehouse Tackle implementation in well defined, small
steps Designate someone with the responsibility
and authority to manage the repository Maintain long term focus
DAMA Presentation, August 2000 Page 24© Copyright 2000 Saeed K. Rahimi
Access to All Users
Show users what they want to see Provide simple, easy to understand information Make warehouse information easily available “in
context” Use familiar interface and UI concepts
Web based access Client/server based access
Provide custom integration to existing tools and interfaces when necessary
DAMA Presentation, August 2000 Page 25© Copyright 2000 Saeed K. Rahimi
Modes of Access
Administrative Access Management Meta-model (schema) changes Meta-data loading and maintenance
End-user Access Browsing Navigation Impact analysis
DAMA Presentation, August 2000 Page 26© Copyright 2000 Saeed K. Rahimi
Types of Access
Local and over the LAN access Metamodel management Repository bulk loading Interface from other tools (loading meta-data)
Remote and over the WEB access Browser based
accessible from anywhere on the globe Navigation oriented Interface from/to other tools May be minor editing capabilities
DAMA Presentation, August 2000 Page 27© Copyright 2000 Saeed K. Rahimi
Access Control
User Classification Different Class of Users Need to Access Different
Objects in the Repository Different User Profiles Different Read/write Access Control
Possible User Classes Business Executive Casual User Database Administrator Data Warehouse Administrator Repository Administrator