Impliance - a next generation information management appliance

Upload: sandyjbs

Post on 10-Apr-2018

TRANSCRIPT

  • 8/8/2019 IMPLIANCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE

    IMPLIANCE - A NEXT GENERATION INFORMATION MANAGEMENT APPLIANCE

    7/8/21010 Impliance : A Next Generation Information Management Appliance 1


    SQL: takes too long and requires too much expertise to set up, configure, tune, test, deploy, maintain, and enhance.

    Systems rarely exceed a few hundred nodes.

    Redundant layers

    Competing components


    A next-generation information management system

    Currently being designed and prototyped at the IBM Almaden Research Center

    Outside In Design Methodology

    Integrated hardware and software components

    Easy-to-administer appliance

    To store, retrieve, and analyze all types of structured, semi-structured, and unstructured information

    Low Total Cost of Ownership (TCO)


    Total cost of ownership: the cost of software and hardware is falling

    Dominated by labour costs, which include: set up, configure, tune, test, deploy, maintain, and enhance

    Need for modularity and simplicity right from inception, leading to low costs.

    Information integration: one version of the truth

    You can access various data silos, BUT

    Need to query across various data silos.

    Requires one schema/format of data all across the organization


    Hardware-Software Mismatch

    Hardware has become more scalable:

    low-power multi-core blade servers,

    large memories,

    layers of large on-chip caches,

    ultra-dense storage systems,

    commodity low-latency networks

    Software is based on hardware designed decades ago


    Use Cases


    Exploiting Customer Relationship Management

    At least one center for handling customer phone calls, e-mails, and/or web-page comments, questions, and complaints.

    Opportunity for selling more products to existing and prospective customers

    Requires trained, motivated, and hence expensive operators.

    Conversations can be recorded as text, and the information extracted from the conversation transcripts can be correlated with the profiles of similar customers

    Customized offer to a customer through a combination of services and products.


    Integrating Content and Data

    The usual content management products have very limited awareness of the semantics of the content and limited capabilities to search it, usually restricting search to the content's metadata.

    Impliance gives the ability to search the actual content and relate it to structured information from other sources.

    E.g., insurance companies.

    Legal Compliance

    If an enterprise is involved in legal action with another enterprise, it needs to preserve broad classes of information that may be pertinent to the litigation.

    Most of this information is unstructured.


    Functionality Overview


    Semantics: what the data actually means.

    In databases it is provided by humans via the logical schema.

    It can also be derived automatically through text analytics and annotation, image recognition algorithms, etc.

    Search/Query: the data is too voluminous to use in full, so use begins by obtaining a subset of the data that meets certain conditions on its content, context, or metadata.


    Composition. Relating objects to each other and composing them into new objects creates new information, which is at the heart of the value proposition of information management.

    Needs well-defined schematics

    Aggregation. To be consumable by humans, large bodies of data must be reduced through aggregation along various dimensions, to discover higher-level models, trends, and exceptions that facilitate business decisions.

    Because most of the data is unstructured, it is difficult to analyze.

    Impliance aims to make this analysis easy.
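
Aggregation along a dimension can be illustrated with a few lines of Python. This is only a sketch: the records, the field names (region, product, sales), and the aggregate helper are assumptions made for illustration and are not taken from Impliance.

```python
from collections import defaultdict

# Hypothetical records; in practice these would come out of the data store.
records = [
    {"region": "east", "product": "A", "sales": 10},
    {"region": "east", "product": "B", "sales": 4},
    {"region": "west", "product": "A", "sales": 6},
]


def aggregate(rows, dimension, measure):
    """Sum one measure over all rows, grouped by the chosen dimension."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[dimension]] += row[measure]
    return dict(totals)


print(aggregate(records, "region", "sales"))   # prints {'east': 14, 'west': 6}
print(aggregate(records, "product", "sales"))  # prints {'A': 16, 'B': 4}
```

The same rows reduce to different summaries depending on the dimension chosen, which is what lets a reader spot trends and exceptions without reading every record.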


    The data infused into Impliance is mapped from its initial format to a uniform data model.

    The query processing engine can now store it and execute queries over it.

    The discovery process executes queries over the data and uses the results to derive annotations that are added to the data.

    The end user uses an interactive retrieval interface to find the desired information, optionally making use of the annotations added by the discovery process.

    The query processing engine does not understand the annotations; instead, it supports a mixed data/meta-data model that relies on smart query construction.
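
The flow described on this slide (infuse, query, discover, retrieve) can be sketched roughly as follows. Every name here (Item, infuse, query, discover, the "topic" annotation) is hypothetical and chosen only for illustration; the slides do not describe Impliance's actual data model or API.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    """One piece of infused data in an assumed uniform model."""
    item_id: str
    source_format: str                       # e.g. "row" or "email"
    fields: dict                             # flattened key -> value pairs
    annotations: dict = field(default_factory=dict)


def infuse(item_id, source_format, raw):
    """Map raw input (a dict or a plain string) to the uniform model."""
    fields = dict(raw) if isinstance(raw, dict) else {"text": raw}
    return Item(item_id, source_format, fields)


def query(store, predicate):
    """The engine only evaluates predicates; it attaches no meaning to annotations."""
    return [item for item in store if predicate(item)]


def discover(store):
    """Discovery runs an ordinary query, then writes annotations back to the data."""
    mentions_refund = lambda i: "refund" in str(i.fields.get("text", "")).lower()
    for item in query(store, mentions_refund):
        item.annotations["topic"] = "refund-request"


store = [
    infuse("r1", "row", {"customer": "C42", "balance": 120}),
    infuse("m1", "email", "Please process my REFUND for order 7."),
]
discover(store)

# "Smart query construction": the caller builds a predicate that mixes
# data and annotations; the engine itself stays unaware of the difference.
hits = query(store, lambda i: i.annotations.get("topic") == "refund-request")
print([i.item_id for i in hits])  # prints ['m1']
```

Keeping annotations as plain key/value data next to the original fields is one way to read the "mixed data/meta-data model": the intelligence sits in how queries are built, not in the engine.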


    Main Ideas


    Supports economies of scale

    Reduces the time to value (TTV):

    Pre-installation: the necessary software is pre-installed and automatically detects which hardware components are available.

    Reconfiguration: the system reconfigures itself if there are changes.

    Better integration of different software components.

    Tight integration among layers of software to improve efficiency

    Data reduction / pushing down: higher-level functionality such as aggregation and predicate application can be more easily pushed down closer to the storage for early data reduction.

    Use of open-source and commercial software to accelerate system development.
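
The effect of pushing predicates and aggregation down toward storage can be shown with a toy comparison. This is a sketch under stated assumptions: NODES, the (region, amount) rows, and both functions are invented for illustration and say nothing about how Impliance implements pushdown.

```python
# Hypothetical storage nodes, each owning some rows of (region, amount).
NODES = [
    [("east", 10), ("west", 5), ("east", 7)],
    [("west", 3), ("east", 1)],
]


def no_pushdown(nodes, region):
    """Ship every row to the coordinator, then filter and sum there."""
    shipped = [row for node in nodes for row in node]
    total = sum(amount for r, amount in shipped if r == region)
    return total, len(shipped)            # (result, items moved)


def with_pushdown(nodes, region):
    """Each node filters and pre-aggregates locally; only partial sums travel."""
    partials = [sum(amount for r, amount in node if r == region) for node in nodes]
    return sum(partials), len(partials)   # (result, items moved)


print(no_pushdown(NODES, "east"))    # prints (18, 5)
print(with_pushdown(NODES, "east"))  # prints (18, 2)
```

Both strategies return the same total, but the pushed-down version moves one partial sum per node instead of every row, which is the early data reduction the slide refers to.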


    Databases typically manage highly-structured data with a common format and relatively small attributes, which conforms nicely to the tables of relational database systems.

    Impliance unifies the management of all data

    under one umbrella.


    It provides interfaces to search structured and unstructured

    content and metadata alike.

    All types of data can be incorporated into Impliance.

    Impliance treats each new version of a data item as immutable. This reduces the problem of determining whether any replica has the most recent version.
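
One way to see why immutable versions simplify replication is the sketch below. It assumes a simple per-item integer version number; the VersionedStore class and the freshest helper are hypothetical and are not described in the slides.

```python
class VersionedStore:
    """Append-only store: an update adds a new version and never overwrites."""

    def __init__(self):
        self._versions = {}  # item_id -> list of values; list index = version

    def put(self, item_id, value):
        history = self._versions.setdefault(item_id, [])
        history.append(value)
        return len(history) - 1              # version number just written

    def get(self, item_id, version):
        return self._versions[item_id][version]

    def latest_version(self, item_id):
        return len(self._versions[item_id]) - 1


def freshest(replicas, item_id):
    """Pick the replica holding the highest version; no content comparison needed."""
    return max(replicas, key=lambda r: r.latest_version(item_id))


primary, replica = VersionedStore(), VersionedStore()
for store in (primary, replica):
    store.put("doc-1", "draft")
primary.put("doc-1", "final")   # the replica has not received version 1 yet

print(freshest([primary, replica], "doc-1") is primary)    # prints True
print(replica.get("doc-1", 0) == primary.get("doc-1", 0))  # prints True
```

Because a given version never changes, any replica that has it can serve it, and finding the newest copy reduces to comparing version numbers.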


    Additional metadata will be extracted for each

    document by running different kinds of annotators.

    Using schema mapping technologies, structures from different sources can be consolidated.

    Additional relationships across documents can be

    identified by running various analyses on all pairs

    of documents.
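
A rough sketch of annotators and an all-pairs analysis is given below. The sample documents, the two regex-based annotators, and the "shared e-mail address" relationship are all assumptions chosen for illustration; the slides do not name specific annotators or analyses.

```python
import re
from itertools import combinations

# Hypothetical documents keyed by id.
docs = {
    "d1": "Claim 1001 filed by alice@example.com",
    "d2": "Follow-up from alice@example.com about the repair",
    "d3": "Claim 2002 filed by bob@example.com",
}


def email_annotator(text):
    """Extract e-mail addresses as metadata."""
    return {"emails": set(re.findall(r"[\w.]+@[\w.]+", text))}


def claim_annotator(text):
    """Extract claim numbers as metadata."""
    return {"claims": set(re.findall(r"Claim (\d+)", text))}


def annotate(text, annotators):
    """Run every annotator and merge the metadata each one produces."""
    meta = {}
    for annotator in annotators:
        meta.update(annotator(text))
    return meta


metadata = {doc_id: annotate(text, [email_annotator, claim_annotator])
            for doc_id, text in docs.items()}

# All-pairs analysis: two documents are related if they share an e-mail address.
related = [(a, b) for a, b in combinations(sorted(metadata), 2)
           if metadata[a]["emails"] & metadata[b]["emails"]]
print(related)  # prints [('d1', 'd2')]
```

Comparing all pairs grows quadratically with the number of documents, so in a real system this kind of analysis would presumably be partitioned or spread over many nodes.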


    A typical Impliance installation will consist of several

    instances of Impliance deployed in geographically

    separated locations for disaster recovery as well as load

    balancing.

    Impliance needs an efficient way of organizing the storage, computations, and the topology of the underlying hardware.


    Data nodes have direct ownership of a subset of the persistent storage and are the most efficient when performing operations on that storage.

    Grid nodes perform analytic computations. They may be pulled into a work crew to perform long or short-term operations, and have no long-term state.

    Cluster nodes are responsible for making consistent locking and caching decisions on data within data consistency groups.
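
The division of labour among the three node types might be pictured as below. The class names mirror the slide, but every method, the in-memory rows, and the simple lock table are assumptions for illustration only; caching and real distribution are left out.

```python
class DataNode:
    """Owns a subset of persistent storage (here just an in-memory list)."""

    def __init__(self, name, rows):
        self.name = name
        self.rows = rows

    def scan(self, predicate):
        return [r for r in self.rows if predicate(r)]


class GridNode:
    """Stateless compute: takes values in, returns a result, keeps nothing."""

    @staticmethod
    def compute(values):
        return sum(values) / len(values) if values else None


class ClusterNode:
    """Grants exclusive locks on keys within its consistency group."""

    def __init__(self):
        self._owners = {}

    def acquire(self, key, owner):
        if self._owners.get(key, owner) != owner:
            return False                  # someone else holds the lock
        self._owners[key] = owner
        return True

    def release(self, key, owner):
        if self._owners.get(key) == owner:
            del self._owners[key]


data_nodes = [DataNode("d0", [4, 8]), DataNode("d1", [6])]
selected = [v for node in data_nodes for v in node.scan(lambda v: v > 4)]
print(GridNode.compute(selected))             # prints 7.0

locks = ClusterNode()
print(locks.acquire("item-1", "writer-a"))    # prints True
print(locks.acquire("item-1", "writer-b"))    # prints False
locks.release("item-1", "writer-a")
print(locks.acquire("item-1", "writer-b"))    # prints True
```

Filtering happens on the data nodes that own the rows, the grid node only sees the reduced values, and lock decisions are made in one place so that they stay consistent.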


    Google Base?

    Primary data store which allows various types of data to be published in a simple way.

    Impliance focuses more on proactive information discovery, richer business analytics, and data management.

    Database appliances (Netezza, DATAllegro, etc.)?

    Both offer appliances for business intelligence applications on relational data.

    Impliance: not just structured (relational) data; discovery of semantics; more proactive.

    Also, both Oracle Secure Enterprise Search (OSES) and IBM WebSphere Information Integrator enable many data types to be crawled, but the interface used was not as advanced as that of Impliance.


    Impliance

    A box with software pre-installed

    Delivered to enterprise: appliance or service

    Functions?

    Store and manage all information: accept all types of enterprise data

    Deliver all intelligence:

    Integrate cross-silo information

    Advanced analytics with richer semantics

    Properties?

    Low TCO: easy to deploy (plug & play), simple and stable

    Scalability: from SMB to very large (petabytes)

    (Not for high-end OLTP!)

    Data + Content + Digital Media

    [Figure: Impliance interfaces. Labels recovered from the diagram: relational data, SQL, content, JCR, XML, XSLT, web page, HTTP, video, archive, ILM, native retrieval interface, native update/load interface]
