implaince - a next generation information management appliance
TRANSCRIPT
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
1/26
IMPLAINCE - A NEXT GENERATION
INFORMATION MANAGEMENT
APPLIANCE
7/8/21010 Impliance : A Next Generation Information Management Appliance 1
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
2/26
SQL Takes too long and too much expertise to :
set up,
configure, tune, test, deploy, maintain, and enhance.
Rarely exceed a few hundred nodes. Redundant layers Competing components
7/8/21010 Impliance : A Next Generation Information Management Appliance 2
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
3/26
A next-generation information management system
Currently being designed and prototyped at the IBM AlmadenResearch Center
Outside In Design Methodology
Integrated hardware and software components
Easy-to-administer appliance
To store, retrieve, and analyze all types of structured, semi-structured, and unstructured information
Low Total Cost of Ownership(TCO)
7/8/21010 Impliance : A Next Generation Information Management Appliance 3
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
4/26
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
5/26
Total cost of ownership : Cost of software and hardware reducing
Dominated by labour costs, which includes : set up,
configure, tune, test, deploy, maintain, and enhance
Need for modularity and simplicity right from Inception leading to low costs.
Information Integration :: one version of truth You can access various data silos, BUT
Need to query across various data silos.
Requires one schema/ format of data all across the organization
7/8/21010Impliance : A Next Generation Information Management Appliance
5
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
6/26
Hardware Software Mismatch Hardware has become more scalable
low-power multi-core blade servers,
large memories
layers of large onchip caches
ultra-dense storage systems
commodity low-latency networks
Software is based on hardware
designed decades ago
7/8/21010Impliance : A Next Generation Information Management Appliance
6
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
7/26
Your SubtitleHereUse Cases
7/8/21010 7Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
8/26
Exploiting Customer Relationship Management
At least one center for handling customer phone calls, e-mails,
and/or web-page comments, questions, and complaints. Opportunity for selling more products to existing and
prospective customers
Requires trained motivated and hence expensive operators. Can record text and based correlating the information extracted
from the text of the conversation transcripts with the profile of
similar customers Customized offer to a customer through a combination of
services and products.
7/8/21010Impliance : A Next Generation Information Management Appliance
8
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
9/26
Integrating Content and Data The Usual Content Management Products have very limited
awareness of the semantics of the content or capabilities to search it,usually restricting search to the contents metadata.
Impliance gives the ability to search theactual content and relate it to structuredinformation from other sources.
E.g Insurance Companies.
Legal Compliance If an enterprise is involved in legal actions with another enterprise Need to preserve broad classes of information
that may be pertinent to the litigation Most of this is information is unstructured.
7/8/21010Impliance : A Next Generation Information Management Appliance
9
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
10/26
Your SubtitleHereFunctionality Overview
7/8/21010 10Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
11/26
Semantics : What the data actually means is provided in
databases by humans via its logical schema, Can be done automatically through textanalytics and annotation, imagerecognition algorithms, etc.
Search/Query : Data is too voluminous use begins by obtaining a subset of the data that
meets certain conditions on its content, context ormetadata.
7/8/21010Impliance : A Next Generation Information Management Appliance
11
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
12/26
Composition. Relating objects to each other and
Composing them into new objects
creates new information that is at the heart of the value proposition of
information management. Needs well defined schematics
Aggregation. To be consumable by humans, large bodies of data must be reduced through
aggregation along various dimensions, to discover higher-level models,
trends, and exceptions that
facilitate business decisions.
Most of the data being unstructuredbecomes difficult to analyze.
Impliance will make it easy.
7/8/21010Impliance : A Next Generation Information Management Appliance
12
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
13/26
The data infused into Impliance is mapped from its initial formatto a uniform data model.
The query processing engine can now store it and execute queries
over it
The discovery process executes queries over the data and uses theresults to derive annotations that are added to the data.
The end user uses an interactive retrieval interface to find the
desired information, optionally making use of the annotationsadded by the discovery process.
The query processing engine does not understand theannotations; instead, it supports a mixed data/meta-data modelthat relies on smart query construction
7/8/21010Impliance : A Next Generation Information Management Appliance
13
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
14/26
7/8/21010Impliance : A Next Generation Information Management Appliance
14
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
15/26
Your SubtitleHereMain Ideas
7/8/21010 15Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
16/26
Supports Economies of scale
Reduces the time to value (TTV) : Pre-installation :The necessary software is pre-installed, automatically
detecting which hardware components are available and ,
Reconfiguring : itself if there are changes.
Better integration of different software components.
Tight integration among layers of software to improve efficiency
Data Reduction/Pushing Down : higher-level functionality such asaggregation and predicate application can be more easily pushed downcloser to the storage for early data reduction.
Use of OpenSource and CommercialSoftware to accelerate systemdevelopment.
7/8/21010Impliance : A Next Generation Information Management Appliance
16
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
17/26
Databases typically manage highly-structured
data with a common format and relatively smallattributes, which conforms nicely to the tables of
relational database systems.
Impliance unifies the management of all data
under one umbrella.
7/8/21010 17Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
18/26
It provides interfaces to search structured and unstructured
content and metadata alike.
All types of data can be incorporated into Impliance.
Impliance treats each such new version of a data item as
immutable. Thus reduces the problem of determining
whether any replica has the most recent version.
7/8/21010 18Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
19/26
Additional metadata will be extracted for each
document by running different kinds of annotators.
Using schema mapping technologies structures
from different sources can be consolidated.
Additional relationships across documents can be
identified by running various analyses on all pairs
of documents.
7/8/21010 19Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
20/26
7/8/21010 20Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
21/26
A typical Impliance installation will consist of several
instances of Impliance deployed in geographically
separated locations for disaster recovery as well as load
balancing.
Impliance needs an efficient way of organizing the
storage, computations, and the topology of the
uunderlying hardware.
7/8/21010 21Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
22/26
7/8/21010 22Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
23/26
Data nodes have direct ownership of a subset of the persistent
storage and are the most efficient when performing operations
on that storage. Grid nodes perform analytic computations. They may be
pulled into a work crew to perform long or short-term
operations, and have no long-term state.
Cluster nodes are responsible for making consistent locking
and caching decisions on data within data consistency
groups.
7/8/21010 23Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
24/26
Google Base? Primary data store which allow various type of data to be published in
simple way. Impliance focuses more on proactive information discovery,richer
businss analytics and data management.
Database Appliances (Netezza, DataAlegro, etc.)?Both offer appliances for business intelligenceapplications on relational data.
Not just structured (relational) data Discovery of semantics
More pro-active
Also both Oracle Secure Enterprise Search (OSES) andIBMWebsphere Information Integrator enable many data types to becrawled but the interface used was not as advanced as IMPLIANCE.
7/8/21010 24Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
25/26
Impliance
A box with software pre-installed
Delivered to enterprise: appliance or service
Functions? Store and manage all information
accept all types of enterprises data Deliver all intelligence
Integrate cross silo information
Advanced analytics with richersemantics
Properties?
Low TCO easy to deploy (plug & play)
simple and stable Scalability
FromSMB to Very Large (PetaBytes)
(Not for high-end OLTP!)Data+Content+Digital Media
Relational
data
SQL
content
JCR
XMLXSL
T
Web page
Native
retrieval
interface
Native
update/
loadinterface
HTTP
Video
Archive
ILM
7/8/21010 25Impliance : A Next Generation Information Management Appliance
-
8/8/2019 IMPLAINCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE
26/26
7/8/21010 Impliance : A Next Generation Information Management Appliance 26