Building taxonomies from the ground-up
Prakash Govindarajulu, Managing Consultant
RealTech Inc
Taxonomy and Semantics
• Why taxonomies?
• How can they influence semantics?
• Bottom-up methodology for building them
Search
Taxonomy
Semantic System
P2P Communication – inherent semantics
Richness of dialog, opportunity to:
- Clarify context
- Clarify query
- Refine query
- Answer if known
The Semantic Problem
Information Systems
The semantic problem:
- Knowledge representation
- Capturing domain expertise
- Accessing knowledge without loss of richness of dialog
Terminology: data, information, knowledge
Data: any fact
Metadata/Information: the act or fact of informing
- Data about data
- Provides context
- Relationship with other data objects
Knowledge: the fact or state of knowing; the perception of fact or truth
- Interpreted data; “understands” data and information to refine or fulfil a query
- Experiential data
Data
Metadata
Knowledge
“Knowledge” Representation Today
Systems With Data (documents, database, wikis, blogs)
Information Access Systems (Search)
Search – Yet Another Index?
Systems With Data (documents, database)
Information Access Systems (Search)
Index
Challenges
Systems With Data (documents, database)
Information Access Systems (Search)
User’s perspective is not well understood
Search relevancy therefore suffers
How can related information be presented?
Solution approach: Taxonomy
Index
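As one way to picture how a taxonomy could help a search system present related information, here is a minimal sketch. The toy vocabulary, the `BROADER` mapping, and the function names are all hypothetical and do not come from the talk; the sketch only assumes that each term has at most one broader term.

```python
# Hypothetical toy taxonomy: each term maps to its broader (parent) term.
BROADER = {
    "sedan": "car",
    "hatchback": "car",
    "car": "vehicle",
    "truck": "vehicle",
}


def narrower_terms(term, broader=BROADER):
    """Return every term below `term` in the taxonomy, sorted alphabetically."""
    found = set()
    frontier = [term]
    while frontier:
        current = frontier.pop()
        for child, parent in broader.items():
            if parent == current and child not in found:
                found.add(child)
                frontier.append(child)
    return sorted(found)


def expand_query(term, broader=BROADER):
    """Expand a query term with all of its narrower terms."""
    return [term] + narrower_terms(term, broader)


def related_terms(term, broader=BROADER):
    """Return sibling terms, i.e. other terms that share the same parent."""
    parent = broader.get(term)
    if parent is None:
        return []
    return sorted(t for t, p in broader.items() if p == parent and t != term)
```

A search for "car" could then also match documents tagged "sedan" or "hatchback", and a result page for "sedan" could suggest "hatchback" as related.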
Building Taxonomy – the Human Element
Systems With Data (documents, database)
System With Metadata (Taxonomy)
Information Access Systems (Search)
Human Element
Consensus, Governance
Taxonomy Guidelines …1
System with Metadata – Taxonomy Management System
Human Element
- Audience: Know thy user for they are not Thee
- Harmony of Purpose
- Communication: listening, understanding, consensus vs. majority
- Metadata for data
- Metadata for metadata
- Governance
Taxonomy Guidelines …2
System with Metadata – Taxonomy Management System
Human Element
Jerusalem Artichokes
The OTHER Category
Courtesy: Wikipedia
Taxonomy Guidelines …3
System with Metadata – Taxonomy Management System
Human Element
Defining relationships among metadata and data elements:
First step toward semantics
Vehicle
Manufactured-in
CITY
EMISSION RULES
Applies-To
Observed-In
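The relationships on this slide can be sketched as subject-relation-object triples. The transcript lost the arrows of the original diagram, so the direction of each relationship below is an assumption, and the helper names (`build_index`, `describe`) are hypothetical.

```python
from collections import defaultdict

# Assumed reading of the slide's diagram; the arrow directions are a guess.
TRIPLES = [
    ("Vehicle", "Manufactured-in", "City"),
    ("Emission Rules", "Applies-To", "Vehicle"),
    ("Emission Rules", "Observed-In", "City"),
]


def build_index(triples):
    """Index triples by subject and by object for lookup in either direction."""
    by_subject = defaultdict(list)
    by_object = defaultdict(list)
    for subject, relation, obj in triples:
        by_subject[subject].append((relation, obj))
        by_object[obj].append((relation, subject))
    return by_subject, by_object


def describe(entity, triples=TRIPLES):
    """List every relationship in which `entity` takes part."""
    by_subject, by_object = build_index(triples)
    lines = [f"{entity} {rel} {obj}" for rel, obj in by_subject[entity]]
    lines += [f"{subj} {rel} {entity}" for rel, subj in by_object[entity]]
    return lines
```

Calling `describe("Vehicle")` lists both the relationship that starts at Vehicle and the one that points to it, which is the kind of lookup a system needs before it can present related data.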
Taxonomy Guidelines …4
System with Metadata – Taxonomy Management System
Human Element
- Knowledge worker participation
- Buy-in from management
- Buy-in from other affected departments
TaxoConsolidator
Master Taxonomy, Source 1, Source 2
TaxoConsolidator allows a user to create a master taxonomy by combining (adding and merging through drag-and-drop) individual taxonomies created by knowledge workers
Courtesy: RealTech Inc
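To illustrate what adding and merging individual taxonomies into a master taxonomy might involve, here is a minimal sketch. It is not TaxoConsolidator's actual implementation, which the slides do not describe. It assumes each taxonomy is a nested dictionary of term names, and the function name and sample terms are hypothetical.

```python
def merge_taxonomies(master, source):
    """Return a new tree containing every node of both input trees.

    Nodes with the same name at the same level are merged recursively;
    nodes present in only one tree are added as they are. Neither input
    is modified.
    """
    # Merging with an empty tree yields a deep copy of the master's nodes.
    merged = {name: merge_taxonomies(children, {}) for name, children in master.items()}
    for name, children in source.items():
        merged[name] = merge_taxonomies(merged.get(name, {}), children)
    return merged


# Hypothetical taxonomies built by two knowledge workers.
source_1 = {"Vehicle": {"Car": {"Sedan": {}}, "Truck": {}}}
source_2 = {"Vehicle": {"Car": {"Hatchback": {}}, "Motorcycle": {}}, "Regulation": {}}

master = merge_taxonomies(source_1, source_2)
```

Matching nodes by name alone is the simplest possible choice. In practice two workers may use different labels for the same concept, which is where the consensus and governance points from the earlier slides come in.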
Wordmap – Taxonomy Management System
Creating and maintaining taxonomies using Wordmap
Courtesy: Earley & Associates
Moving toward semantics
• Understand your audience
• Define the domain of problems clearly
• Need for robust representation of data and information in that domain
• Involve all relevant knowledge workers
• Create a taxonomy, with metadata and relationships
• Expose and use the taxonomy in systems enterprise-wide
• Need for flexible rules and inference engines to interpret data and discover related “knowledge”
Next Steps toward semantics
Enterprise Systems (documents, database)
Taxonomy Management System (Taxonomy)
Information Access Systems (Search)
Human Element
Rules, Inference Engines (Semantic Interpreter)
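As a rough illustration of what a rules and inference engine adds on top of a taxonomy, the sketch below forward-chains a single rule: if a rule applies to a broader term, it also applies to every narrower term. The facts, the "is-a" relation, and the `infer` function are hypothetical and are not taken from any tool named in the talk.

```python
# Hypothetical starting facts as (subject, relation, object) triples.
FACTS = {
    ("Sedan", "is-a", "Car"),
    ("Car", "is-a", "Vehicle"),
    ("Emission Rules", "Applies-To", "Vehicle"),
}


def infer(facts):
    """Apply the inheritance rule repeatedly until no new facts appear."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        new_facts = set()
        for child, rel, parent in known:
            if rel != "is-a":
                continue
            for subject, rel2, target in known:
                # Whatever applies to the parent also applies to the child.
                if rel2 == "Applies-To" and target == parent:
                    new_facts.add((subject, "Applies-To", child))
        if not new_facts <= known:
            known |= new_facts
            changed = True
    return known


derived = infer(FACTS) - FACTS
```

Here `derived` holds the two facts that were never stated directly: that the emission rules apply to cars, and therefore to sedans.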
New Tools for Taxonomy
New Tools
- TaxoBuilder – term extraction tool and individual taxonomy construction tool
- TaxoConsolidator – consolidates individual taxonomies
- Wordmap from E&A
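To give a feel for what bottom-up term extraction can look like, here is a minimal frequency-based sketch. It is not TaxoBuilder's algorithm, which the slides do not describe. The stopword list, the sample documents, and the `extract_terms` function are hypothetical.

```python
import re
from collections import Counter

# A deliberately small, hypothetical stopword list.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "for", "is", "are", "on", "with", "by"}


def extract_terms(documents, top_n=5):
    """Count single words and two-word phrases, ignoring stopwords."""
    counts = Counter()
    for text in documents:
        words = re.findall(r"[a-z]+", text.lower())
        counts.update(w for w in words if w not in STOPWORDS)
        # Count adjacent word pairs in which neither word is a stopword.
        for first, second in zip(words, words[1:]):
            if first not in STOPWORDS and second not in STOPWORDS:
                counts[f"{first} {second}"] += 1
    return counts.most_common(top_n)


docs = [
    "Emission rules apply to the vehicle.",
    "The vehicle is subject to emission rules in the city.",
]
candidates = extract_terms(docs, top_n=4)
```

The candidate terms are only a starting point. A knowledge worker would still review them, pick preferred labels, and place them in the hierarchy.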