® ibm india research lab © 2006 ibm corporation challenges in building a strategic information...
TRANSCRIPT
®
IBM India Research Lab
© 2006 IBM Corporation
Challenges in Building a Strategic Information Integration Infrastructure
Mukesh MohaniaIBM India Research Lab
IBM India Research Lab
2
The Integration Challenge Complex and heterogeneous environments
Many different types of systemsMany inter-related applications
Escalating needsVariety, velocity, volume
People are expensive
The world produces 250MB of information every year for every
man, woman and child on earth.
IBM India Research Lab
3
Sources: IBM & Industry Studies, Customer Interviews, Forrester
The Challenge Continued… The Challenge Continued…
40% of IT budgets may be spent on
integration.
30% of people’s time: searching for
relevant information.
The average billion dollar company:
48 disparate financial systems
42% of transactions are still paper-based.
85% of information is unstructured. Trx.Trx.
DocumentsDocumentsReportsReports
e-Mailse-Mails
MediaMedia
CustomersCustomersEmployeesEmployees
PartnersPartners
DatabasesDatabases
Orgs.Orgs.FinancialsFinancialsProductsProducts
WebContent
WebContent
79% of companies have more than two repositories and 25% have more than 15
60% + of CEOs: Need to do a better job capturing and understanding information
rapidly in order to make swift business decisions.
Only 1/3rd of CFOs believe that the information is easy to use, tailored,
cost effective or integrated.
30-50% of application design time is spent on
copy management.
IBM India Research Lab
4
Taikang Life Insurance
Business Challenge
Technical Challenge
4th largest Chinese insurance company 8,000 employees, 150,000 agents 3.5 million customers 28 branches, 170 sub-branches Data in DB2 UDB, Informix, Oracle, SQL Server, XML, e-mail, CRM and Portal applications Goals:
Up-to-the-minute status for executivesIncreased employee productivityBetter customer service
Background
IBM India Research Lab
5
Taikang Integrated Information Platform Architecture
Phone
Fax SMS Email
Web StoreFront
Mail Agents
Financial Planner
CoreSystems
Information IntegrationPlatform
ApplicationPlatform
Channels
Group & BankingGroup & Banking CSC Personal LifeCSC Personal Life FinancialsFinancialsInformix DB2/400 Oracle
Mapping (nicknames)
IntegratedInformation
Data Service
ODScache
XML SQL Web Services
IBM India Research Lab
6
Challenges in Integrating Information
Structured and unstructured data
Diversity of data sources (content repos, pricing application, databases, …)
Coming up with the model of how information fits together Understanding what info exists
Finding related pieces
Creating a common format
Deciding how to access and transform data What should be materialized, what accessed in real-time, how maintained
What pre-defined paths, what unplanned (navigation vs. search)
Configuring the appropriate software
Accessing information in the application
Monitoring the system and understanding usage, problems, etc
IBM India Research Lab
7
Another perspective ---
IBM India Research Lab
8
Virtual, collaborative organizations sharing
apps, data
in open heterogeneous environment.
A potentially vast aggregation of geographically dispersed computing resources
Leverages Intranet, Extranet, and Internet implementations
Lower TCO (Total Cost of Ownership)
Virtual Servers, Storage and Instruments
Grid Middleware
Distributed Physical Servers and Storage
Virtualization: Grid Computing
IBM India Research Lab
9
Data Virtualization for Information on the Grid
A Grid should allow information to be: Virtualized over Heterogeneous, Distributed Data Sources location & heterogeneity transparency Accessed via Open Protocols Autonomically administered Dynamic
Putting information on the Grid enables: Access to any data resource in a standard way Viewing a collection of data resources as a single integrated entity Placing data so as to exploit available processing/storage for performance
and scale
Lower TCO
IBM India Research Lab
10
Distributed Data Management and Grid Computing
Collaboration &
data sharing
Federation
Consolidation
Information Dissemination
Performance & Scalability Replication
Caching
Parallelism
Reduced Cost Autonomic
Mixed Workload Mgmt
Business Resiliency Replication
Fast Backup & Recovery
Enhance Current TechnologiesDynamic &Autonomic
Static &Manual
At Lower TCO (Total Cost of Ownership)
Tasks Required Technologies
IBM India Research Lab
11
Data Virtualization: Grid Middleware for Integration & QoXMiddleware masks dynamic nature of data sources, compute resources
Data Sources
InformationIntegration
DataAnalysis
DataArchival
Applications
FederatedAccess
Collaboration
Other OGSA ServicesLifecycle, Billing, Authentication, Workload Management,
Transaction Management
ConsistencyManagement
WorkflowCoordination
Authorization ReplicationSchema Management
RegistryDiscovery
Compute Resources
IBM India Research Lab
12
Distributed Data Management & Grid Computing
Monolithic Application
Architectures
ServicesOriented
Architectures
Query
Distributed Query
Federated Query
MPP Parallelism
SMP Parallelism
Transaction Parallelism
Federation
Open Standards
Parallelism
OGSA, Web Services
Transparent, Optimized, Integrated Access to Heterogeneous Data at Lower TCO
Discover & Leverage Resources• System • Information
DataMovement
OGSA Data Replication
FTP, ETML
DataPlacement
for QoS Information Dissemination
Data DrivenApplication
Parallelism Dynamic
Federation