building taxonomies from the ground-up prakash govindarajulu managing consultant realtech inc...

22
Building taxonomies from the ground-up Prakash Govindarajulu Managing Consultant RealTech Inc [email protected] 781-883-8338

Upload: melvyn-armstrong

Post on 03-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Building taxonomies from the ground-up

Prakash GovindarajuluManaging Consultant

RealTech Inc

[email protected]

RealTech Inc.Taxonomy and Semantics

• Why taxonomies?

• How can they influence semantics?

• Bottom-up methodology for building them

SearchTaxonomy

SemanticSystem

RealTech Inc.P2P Communication –

inherent semantics

Richness of Dialog, opportunity to:

-Clarify context - Clarify query - Refine query - Answer if known

RealTech Inc.The Semantic Problem

InformationSystems

The semantic problem: - Knowledge representation - Capturing domain expertise - Accessing knowledge without loss of richness of dialog

RealTech Inc.Terminology:

data, information, knowledge

Data: any fact

Metadata/Information: the act or fact of informing. Data about data Provides context Relationship with other data objects

Knowledge:the fact or state of knowing; the perception of fact or truth Interpreted data, “understands” data and information to refine or

fulfil a query Experiential data

Data

Metadata

Knowledge

RealTech Inc.“Knowledge” Representation

Today

SystemsWithData

(documents,Database

Wikisblogs)

InformationAccess Systems

(Search)

RealTech Inc.Search – Yet Another Index?

SystemsWithData

(documents,database)

InformationAccess Systems

(Search)

Index

RealTech Inc.Challenges

SystemsWithData

(documents,database)

InformationAccess Systems

(Search)

User’s perspective is not well understood

Search relevancy therefore suffers

How can related information be presented?

Solution approach: Taxonomy

Index

RealTech Inc.Taxonomy anyone?

From digg.com 6/8/07

RealTech Inc.Taxonomy – Classification

Classification:Categories&Subcategories

RealTech Inc.Taxonomy – Guided Navigation

Perspectives or Facets

RealTech Inc.Building Taxonomy – the

Human Element

SystemsWithData

(documents,database)

SystemWith

Metadata(Taxonomy)

InformationAccess Systems

(Search)

Human Element

Consensus Governance

RealTech Inc.Taxonomy Guidelines …1

System withMetadata – Taxonomy

ManagementSystem

Human Element

- Audience: Know thy user for they are not Thee

- Harmony of Purpose

- CommunicationListeningUnderstandingConsensus vs majority

- Metadata for data

- Metadata for metadata

- Governance

RealTech Inc.Taxonomy Guidelines …2

System withMetadata – Taxonomy

ManagementSystem

Human Element

Jerusalem Artichokes

The OTHER Category

Courtesy: Wikipedia

RealTech Inc.Taxonomy Guidelines …3

System withMetadata – Taxonomy

ManagementSystem

Human Element

Defining relationships among metadata and data elements:

First step toward semantics

Vehicle

Manufactured-in

CITY

EMISSION RULES

Applies-To

Observed-In

RealTech Inc.Taxonomy Guidelines …4

System withMetadata – Taxonomy

ManagementSystem

Human Element - Knowledge worker Participation

- Buy-in from management

- Buy-in from other affected departments

RealTech Inc.TaxoConsolidator

Master TaxonomySource 1 Source 2

TaxoConsolidator allows user to create a Master taxonomy by combining (adding and merging through dragndrop)

individual taxonomies createdby knowledge workers

Courtesy: RealTechInc

RealTech Inc.Wordmap – Taxonomy Management System

Creating and maintaining taxonomiesUsing Wordmap

Courtesy: Earley & Associates

RealTech Inc.Moving toward semantics

• Understand your audience• Define the domain of problems clearly• Need for robust representation of data, information in

that domain• Involve all relevant knowledge workers• Create a taxonomy – with metadata and relationships• Expose and use the taxonomy in systems enterprise

wide• Need for a flexible rules and inference engines to

interpret data and discover related “knowledge”

RealTech Inc.Next Steps toward semantics

EnterpriseSystems(documents,database)

Taxonomy Mangement

System(Taxonomy)

InformationAccess Systems

(Search)

Human Element

RulesInferenceEngines

(SemanticInterpreter)

RealTech Inc.New Tools for Taxonomy

New Tools

- TaxoBuilder – term extraction tool and individual taxonomy construction tool

- TaxoConsolidator – consolidates individual taxonomies

- Wordmap from E&A

RealTech Inc.Summary

• Search and understanding user request through query semantics

• Represent your information through taxonomies as a fulcrum for all information management in enterprise

• Build semantic interpreters of relationships and metadata to complement search