knowledge-based medical image indexing and retrieval

20
1 Knowledge-Based Medical Image Indexing and Retrieval Caroline LACOSTE Joo Hwee LIM Jean-Pierre CHEVALLET Daniel RACOCEANU Nicolas Maillot Image Perception, Access & Language (IPAL) French-Singaporean Joint Lab (CNRS,I2R,NUS,UJF)

Upload: neil-stephenson

Post on 03-Jan-2016

71 views

Category:

Documents


2 download

DESCRIPTION

Knowledge-Based Medical Image Indexing and Retrieval. Caroline LACOSTE Joo Hwee LIM Jean-Pierre CHEVALLET Daniel RACOCEANU Nicolas Maillot I mage P erception, A ccess & L anguage (IPAL) French-Singaporean Joint Lab (CNRS,I2R,NUS,UJF). Approach. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Knowledge-Based Medical Image Indexing and Retrieval

1

Knowledge-Based Medical Image Indexing and Retrieval

Caroline LACOSTEJoo Hwee LIM

Jean-Pierre CHEVALLET Daniel RACOCEANU

Nicolas Maillot

Image Perception, Access & Language (IPAL) French-Singaporean Joint Lab (CNRS,I2R,NUS,UJF)

Page 2: Knowledge-Based Medical Image Indexing and Retrieval

2

Approach

Indexing both image and text using medical concepts from the NLM’s Unified Medical Language System (UMLS)

• To incorporate expert knowledge

• To work at a higher semantic level

• To unify text and image

Structured learning approach to extract medical semantics from images (e.g. modality, anatomy, pathology)

Fusion between textual and visual information

Page 3: Knowledge-Based Medical Image Indexing and Retrieval

3

UMLS-based text indexing UMLS (Unified Medical Language System)

• Multilingual: meta-thesaurus 17 languages• Large : More than 50000 concepts, 5,5 millions of terms• Consistent: categorization of concepts in semantic types

Why using concepts and not terms ?• Remove the problem of term variation / synonymy

• Ex: “Fracture” and “Broken Bones”• “Natural” multi-lingual indexing

Using Meta-Thesaurus Structure• Vector space model does not take into account this structure• Using Semantic Dimension (Given by the UMLS)

Tested this year• Dimension filtering: At least one matching according one of the

three dimension • Dimension weighting: Re-weight similarity according to the

number of matched dimension

Page 4: Knowledge-Based Medical Image Indexing and Retrieval

4

Semantic Image Indexing

Low-level features- Color- Texture- Shape

High level features- Modality- Anatomy- Pathology

Computed Tomography

Chest

Nodules

?

Supervised learning framework • Global indexing to access image modality• Local indexing to access semantic local features related to

anatomy, modality, and pathology concepts

Page 5: Knowledge-Based Medical Image Indexing and Retrieval

5

AngiographyX-ray

Micro

Global Indexing

Learning set: 4000 images

Medical Image I

Learning

Semantic Classification

Low-levelfeature

extraction

Low-levelfeature

extraction

32 VMTs (modality, anatomy, spatial UMLS

concepts, and color percepts)

MRIHead Sagittal

Color: histogramTexture: Gabor features

Spatial: Thumbnails

VMT indexes

P(VMT|I)

Modality-anatomy probabilities

SVM

VMT:Visual Medical Terms

0

0.1

0.2

0.3

0.4

0.5

0.6

Page 6: Knowledge-Based Medical Image Indexing and Retrieval

6

X-rayBoneFracture

Local indexing

Learning set: 3631 patches

LearningLow-levelfeature

extraction

64 local VMTs (modality, anatomy, pathology concepts,

and color percepts)

Color: first moments

Texture: Gabor features

VMT indexes

per block

Low-levelfeature

extraction per patch

Semantic Classification

per patch

Histograms of local VMTs

Spatial Aggregation in grid layout (3x3, 3x1,

3x2, 1x3, 2x3)

SVM

Page 7: Knowledge-Based Medical Image Indexing and Retrieval

7

Visual retrieval

Retrieval using the global visual indexing

• Retrieval based on the Manhattan distance between 2 indexes (~histogram of modality-anatomy concepts)

• Modality filtering according to the textual query modality concepts

• I admissible if P( modQ | I) > T

Retrieval using the local indexing • Retrieval based on the mean of Manhattan distances between

local VMT histograms

Late Fusion : mean of the two similarity measures

Page 8: Knowledge-Based Medical Image Indexing and Retrieval

8

Medical Image Retrieval

Typical query

Show me x-ray images with fractures of the femur.

UMLSconcepts

Low-levelfeatures

UMLS-basedfeatures

FUSION Index

MatchingTextualretrieval

MatchingVisual

retrieval

Mixedretrieval

FUSION

MetaMap

SVMclassifiers

IndexMatching

Mixedretrieval

Page 9: Knowledge-Based Medical Image Indexing and Retrieval

9

Comparative Results on CLEF2005

Method Textual Visual MAP Rank

Concept, Dimension re-weighting (1) x 26.46% 1/31

Concept, Pseudo-Rel. FB, Dim filt x 22.94% 2/31

Concepts, Dimension filtering x 22.70% 3/31

Terms, Dimension filtering x 20.88% 5/31

Concept, semantic doc expansion x 18.56% 10/31

Fusion Local+Global Indexing (2) x 06.41% 3/11

Global Indexing according to modality x 05.66% 4/11

Local Indexing with VMT x 04.84% 6/11

Late fusion (1) + (2) x x 30.95% 1/37

Concept fusion (1) + (2) adaptive threshold

x x 28.78% 2/37

Concept fusion (1) + (2) Fixed Threshold 0.15

x x 28.45% 3/37

Page 10: Knowledge-Based Medical Image Indexing and Retrieval

10

Conclusion CLEF multilingual medical image retrieval task:

• Very difficult task: large collection, “precise” and “semantic” queries

Concept Indexing:• It works !!• Inter-media common indexing• Multilingual

Image and text: complementary• Text: closer to the meaning• Efficiency of the visual filtering to remove aberrant images

Importance of • semantic dimensions for text• visual terms and learning

The UMLS indexing induces a lot of perspectives:• Early fusion• Semantic-based retrieval

Page 11: Knowledge-Based Medical Image Indexing and Retrieval

11

Inter-Media Pseudo-Relevance FeedbackApplication to ImageCLEF Photo 2006

Nicolas MAILLOTJean-Pierre CHEVALLET

Joo Hwee LIM

Image Perception, Access & Language (IPAL) French-Singaporean Joint Lab (CNRS,I2R,NUS,UJF)

Page 12: Knowledge-Based Medical Image Indexing and Retrieval

12

Inter-Media Pseudo-Relevance Feedback

Problem: • Multi-modal (text+image) Information Indexing and

Retrieval• Multi-modal documents• Multi-modal queries• Application to the ImageCLEF Photo Task

Objectives: • Inter-media enrichment

• dealing with synonymy • Re-using existing mono-media image and text retrieval

systems

Page 13: Knowledge-Based Medical Image Indexing and Retrieval

13

Related Works Mono-modal pseudo-relevance feedback [Xu and Croft 96]

Translation models [Lin et al. 05]

• Automatic translation of the textual query into a visual query• Requires prior mining of textual/visual relationships

Late Fusion [Chevallet et al. 05]

• Text and image retrieval are done in parallel and the results are merged (weighted sum)

• No mutual enrichment Latent Semantic Indexing (text+visual keywords)

• Computationally expensive

Inter-Media Pseudo-Relevance Feedback

Page 14: Knowledge-Based Medical Image Indexing and Retrieval

14

Text Processing

Morpho-Syntax

• Part of Speech Tagging

• Unknown Proper Nouns detection

• Word Normalization & Spelling correction Index terms

• Noun Phrase detection using WordNet

• Geographic named entities detected using Wordnet or the <LOCATION> tag

Concept Indexing

• Selection based on the most frequent sense provided by Wordnet

Inter-Media Pseudo-Relevance Feedback

Page 15: Knowledge-Based Medical Image Indexing and Retrieval

15

Image Processing

Image Segmentation• Meanshift Segmentation• Patch Based Tessellation

Feature Extraction• Color histograms• Bags of SIFT

• Scale Invariant Feature Transform• Gabor Features (Texture)

Inter-Media Pseudo-Relevance Feedback

Extraction of one local orientation histogram per

location

Page 16: Knowledge-Based Medical Image Indexing and Retrieval

16

Image RetrievalEngine

Text RetrievalEngine

ImageText

Image Text

Image Text TextText Query Expansion

RankedDocuments

Index Documents

Query

Top k Retrieved Documents

Inter-Media Pseudo-Relevance Feedback

Overview

Expanded Query

Page 17: Knowledge-Based Medical Image Indexing and Retrieval

17

Modality A/M MAP P10 P20 P30 Rank

Inter Media PseudoRelevance Feedback Text+Visual Automatic 0.3337 0.5067 0.5000 0.4583 2/43

Best Run Text+Visual Manual 0.3850 0.6283 0.5300 0.4550 1/43

Inter-Media Pseudo-Relevance Feedback

Results

Page 18: Knowledge-Based Medical Image Indexing and Retrieval

18

Conclusion

Using the image modality to expand the text query • Use of the text associated with the top k images

retrieved• Assumption: these k images are relevant

Future work:• Advanced conceptual filtering and reasoning

• some concepts are not characterized by the visual appearance

• Comparison with the translation model

Inter-Media Pseudo-Relevance Feedback

Page 19: Knowledge-Based Medical Image Indexing and Retrieval

19

Thanks !

Page 20: Knowledge-Based Medical Image Indexing and Retrieval

20

Supervised learning framework

Learning set: VMT

instances

SVM classifiers

Image/Region

Learning

Semantic Classification

VMT indexes

P(VMT|I)

Low-levelfeature

extraction

Low-levelfeature

extraction