sketching out your search intent

83
Sketching out Your Search Intent - Towards bridging intent and semantic gaps in web-scale multimedia search Xian-Sheng Hua Lead Researcher, Microsoft Research Asia June 24 2010 – ACM Multimedia 2010 TPC Meeting – University of Amsterdam

Upload: xian-sheng-hua

Post on 14-Jun-2015

982 views

Category:

Technology


6 download

DESCRIPTION

A talk I gave at TPC workshop of ACM Multimedia 2010 at University of Amsterdam on June 24, 2010.

TRANSCRIPT

Page 1: Sketching Out Your Search Intent

Sketching out Your Search Intent - Towards bridging intent and semantic gaps in

web-scale multimedia search

Xian-Sheng Hua Lead Researcher, Microsoft Research Asia

June 24 2010 – ACM Multimedia 2010 TPC Meeting – University of Amsterdam

Page 2: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Evolution of CBIR/CBVR

2

1970~ 1980’

1990’

~2001

~2003 ~2006 Commercial Search

Engines

Academia Prototypes

(1) Tagging Based (2) Large-Scale CBIR/CBVR

Annotation Based

Direct-Text Based

Conventional CBIR

Manual Labeling Based

Page 3: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Three Schemes

Example-Based

Annotation-Based

Text-Based

Page 4: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Limitation of Text-Based Search

• High noises Surrounding text: frequently not related to the visual content

Social tags: also noisy, and a large portion don’t have tags

• Mostly only covers simple semantics Surrounding text: limited

Social tags: People tend to only input simple and personal tags

Query association: powerful but coverage is still limited (non-clicked images will never be well indexed)

A complex example: Finding Lady Gaga walking on red carpet with black dress

Page 5: Sketching Out Your Search Intent
Page 6: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Limitation of Annotation-Based Search

• Low accuracy The accuracy of automatic annotation is still far from satisfactory

Difficult to solve due

• Limited coverage The coverage of the semantic concepts is still far from the rich content

that is contain in an image

Difficult to extend

• High computation Learning costs high computation

Recognition costs high if the number of concepts is large

Still has a long way to go

Page 7: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Limitation of Large-Scale Example-Based Search

• You need an example first How to do it if I don’t have an initial example?

• Large-scale (semantic) similarity search is still difficult Though large-scale (near) duplicate detection is good

An example: TinEye

Page 8: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Possible Way-Outs?

• Large-scale high-performance annotation? Model-based? Data-driven? Hybrid? …

• Large-scale manual labeling? ESP game? Label Me? Machinery Turk? …

• New query interface? Interactive search?

Or … To combine text, content and interface?

Page 9: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Simple Attempts

• Exemplary attempts based on this idea Search by dominant color (Google/Bing)

Show similar images (Bing)/Find similar images (Google)

Show more sizes (Bing)

Page 11: Sketching Out Your Search Intent
Page 12: Sketching Out Your Search Intent
Page 14: Sketching Out Your Search Intent
Page 15: Sketching Out Your Search Intent
Page 16: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Two Recent Projects

• Sketching out your search intent Image Search by Color Sketch

Image Search by Concept Sketch

Page 17: Sketching Out Your Search Intent

Search by Color Sketch WWW 2010 Demo

Page 18: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Seems It Is Not New?

• “Query by sketch” has been proposed long time ago But on small datasets and difficult to scale up

And seldom work well actually

Not use to use – “object sketch”

Page 19: Sketching Out Your Search Intent
Page 20: Sketching Out Your Search Intent

It is based on a SIGGRAPH 95’s paper: C. Jacobs et al. Fast Multiresolution Image Querying. SIGGRAPH 1995

on small scale image set and difficult to scale-up (0.44 second for 10,000 images)

Page 21: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Page 22: Sketching Out Your Search Intent
Page 23: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Our Approach

• Color sketch … Is not “object sketch”

But only rough color distribution

Supports large-scale data

And uses text at the same time

Page 24: Sketching Out Your Search Intent
Page 25: Sketching Out Your Search Intent
Page 26: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Our Approach

• Color sketch … Is not “object sketch”

But only rough color distribution

Supports large-scale data

And uses text at the same time

• Advantages of “Color Sketch” More presentative than dominant color (and text only)

More robust than “object sketch”

Easy to use compared with “object sketch”

Easy to scale up

Page 27: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Example: Blue flower with green leaf

Page 28: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Example: Brooke Hogan walking on red carpet

Page 29: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Color Sketch Is More Powerful Blue Flower with Green Leaf Brooke Hogan walking on red carpet

Page 30: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Color Sketch Is More Powerful

30

Page 31: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Demo

• It has been productized in Bing

Page 46: Sketching Out Your Search Intent
Page 66: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Summary – Why it Works • What are difficult

Semantics is difficult

Intent prediction is difficult

• What are easier Use color to describe images’ content

Use color to describe users’ intent

Color sketch is an intermediate representation to connect images’ content and users’ intent.

content intent color sketch

Page 67: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Limitation of Color Sketch

• Only works well for those semantics that can be described by color distribution

• Only works well when people is able to transfer their desire into color distribution

A more complex example: A flying butterfly on the top-left of a flower.

Page 68: Sketching Out Your Search Intent

Search by Concept Sketch SIGIR 2010

Page 69: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Search By Concept Sketch

• A system for the image search intentions concerning: The presence of the semantic concepts (semantic constraints)

the spatial layout of the concepts (spatial constraints)

flower

butterfly

A concept map query Image search results of our system

Page 70: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Framework

Concept Distribution Estimation

3

Spatial distribution comparison

Relevance score

5

flower

butterfly

A: butterfly B: flower

A concept map query

A: X=0.5, Y=0.5 B: X=0.8, Y=0.8

Instant Concept Modeling

2

A candidate image

Create Candidate Image Set

1

Intent Interpretation

4

flower

butterfly

Concept Models

flower

butterfly

Desired distributions

flower

butterfly

Estimated distributions

Page 71: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Search Results Comparison

“show similar image”

snoopy

Image search by concept map

snoopy

Text-based image search

Task: searching for the images with “snoopy appear at right”

Page 72: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Search Results Task: searching for the images with “snoopy appear at top” / “snoopy appear at left”

Page 73: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Search Results Comparison

jeep grass

Task: searching for the images with “a jeep shown above a piece of grass”

Page 74: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Search Results Comparison

house car

Task: searching for the images with “a car parked in front of a house”

Page 75: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Search Results Comparison

keyboard mouse

Task: searching for the images with “a keyboard and a mouse being side by side”

Page 76: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Search Results Comparison

Colosseum grass

Task: searching for the images with “sky/Colosseum/grass appear from top to bottom”

Page 77: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

• Allow users to explicitly specify the occupied region of a concept

house

lawn

house

lawn

Default

Search for the long narrow lawn at the bottom of the image

house

lawn

Advanced Function: Influence Scope Indication

Page 78: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Searching for small windmill

Searching for large windmill

Advanced Function: Influence Scope Adjustment

Page 79: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Advanced Function: Visual Modeling Adjustment

Visual Assistor

house

sky

lawn

Advanced Function

Clear Search

Previous Next

Allow users to indicate or narrow down what a desired concept looks like (visual constraints)

Page 80: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

jeep

Searching for front-view jeeps

Searching for side-view jeeps

Visual instances

Advanced Function: Visual Modeling Adjustment

Page 81: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

flower

butterfly

Searching for butterfly with yellow flower

Searching for butterfly with red flower

Visual instances

Advanced Function: Visual Modeling Adjustment

Page 82: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Conclusions

• To bridge the gap between users’ intent and visual semantics of images by Sketching out your search intent, and

Combining text and visual content

• Two exemplary approaches introduced Search by color sketch

Search by concept sketch

Page 83: Sketching Out Your Search Intent

Xian-Sheng Hua, MSR Asia

Thank You