correlative multi-label video annotation

ACM Multimedia 2007ACM Multimedia 2007

Guo-Jun Qi, Guo-Jun Qi, Xian-Sheng HuaXian-Sheng Hua, Yong Rui, Jinhui Tang, Tao Mei and , Yong Rui, Jinhui Tang, Tao Mei and Hong-Jiang ZhangHong-Jiang Zhang

Microsoft Research AsiaMicrosoft Research Asia

September 25, 2007September 25, 2007

MotivationMotivation Correlative Multi-Label AnnotationCorrelative Multi-Label Annotation Modeling correlationsModeling correlations Learning the classifierLearning the classifier Connections to Gibbs Random FieldConnections to Gibbs Random Field

Experiments Experiments Live DemoLive Demo

How many images and videos in the How many images and videos in the world?world?

May 2007: 500

millionsAug. 2007 : 1

billion2000 images

/minute

Sep. 2007 : 84

millions

70 - 80’ Manual Labeling

90’ Pure Content Based (QBE)

Now Automated Annotation

Manual

Automatic

Learning-Based

1970 1980 1990 2000

Now Automated Annotation

Learning-Based

Modeling and

Learning

Classifier

Training samples

Features

Learning-based video annotation schemes

Person

Building

New sampleLake?

A typical strategy – A typical strategy – Individual Concept Individual Concept DetectionDetection

Annotate multiple concepts separatelyAnnotate multiple concepts separately

Low-Level Features

Outdoor Face PersonPeople-

MarchingRoad

Walking- Running

-1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1

√ Person√ Street√ Building

× Beach× Mountain

√ Crowd√ Outdoor√ Walking/Running

√ Marching? Marching

Low-Level Features

Outdoor Face PersonPeople-Marchin

Walking- Running

-1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1

Low-Level Features

Outdoor Face PersonPeople-Marchin

Walking- Running

-1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1

Concept Model Vector

Score Score Score Score Score Score

Concept Fusion

Another typical strategy – Another typical strategy – Fusion-BasedFusion-Based Context Based Concept fusion (CBCF)Context Based Concept fusion (CBCF)

Low-Level Features

Outdoor Face PersonPeople-

MarchingRoad

Walking- Running

-1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1

Concept Fusion

Concept Model Vector

Score Score Score Score Score Score

Our strategy – Our strategy – Integrated Concept Integrated Concept DetectionDetection

Correlative Multi-Label Learning (CML)Correlative Multi-Label Learning (CML)

-1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1 -1 / 1

Low-Level Features

OutdoorPeople-

MarchingRoadFace Person

Walking- Running

Multi-Label Annotation

No correlation

Has Correlations, but uses a second step

Model concepts and correlations in one step

Individual Detectors

Fusion Based

Integrated

1st Paradigm

2nd Paradigm

3rd Paradigm

Our strategy – Our strategy – Integrated Concept Integrated Concept DetectionDetection

Correlative Multi-Label Learning (CML)Correlative Multi-Label Learning (CML)

How to model concepts and the How to model concepts and the correlations among concept in a single correlations among concept in a single stepstep

NotationsNotations

Modeling concept and correlations Modeling concept and correlations simultaneouslysimultaneously

6.0,5.0,4.0,3.0,2.0,1.0x

1:,1:,1:,1:,1: treecarbeachroadperson y

02.002.0

1.0001.0

NoYesconceptfeature

Modeling concept and correlations Modeling concept and correlations simultaneouslysimultaneously

6.0,5.0,4.0,3.0,2.0,1.0x

1:,1:,1:,1:,1: treecarbeachroadperson y

00011,1

3,11,1

3,11,13,1

1,12,1

，，

NYNYConceptConcept

/,/2,1

Modeling concept and correlationsModeling concept and correlations

12 KDK

Learning the classifierLearning the classifier

Misclassification Error

Loss function

Empirical risk

Regularization

Introduce slackvariables

Lagrange dual

Find solution by SMO

Connection to Gibbs Random FieldConnection to Gibbs Random Field

Define a random field

Rewrite the classifier

is a random field

consists of all adjacent sites, that is, this RF is fully connected

Define energy functionDefine GRF

Connection to Gibbs Random FieldConnection to Gibbs Random Field

Rewrite the classifier

Define energy function

Intuitive explanation of CML

Define a random field

is a random field

consists of all adjacent sites, that is, this RF is fully connected

Define GRF

ExperimentsExperiments TRECVID 2005 dataset (170 hours)TRECVID 2005 dataset (170 hours) 39 concepts (LSCOM-Lite)39 concepts (LSCOM-Lite) Training (65%), Validation (16%), Testing (19%)Training (65%), Validation (16%), Testing (19%)

ExperimentsExperiments TRECVID 2005 dataset (170 hours)TRECVID 2005 dataset (170 hours) 39 concepts (LSCOM-Lite)39 concepts (LSCOM-Lite) Training (65%), Validation (16%), Testing (19%)Training (65%), Validation (16%), Testing (19%) CML (CML (MAP=0.290MAP=0.290) improves IndSVM () improves IndSVM (MAP=0.246MAP=0.246) 17% and CBCF ) 17% and CBCF

((MAP=0.253MAP=0.253) 14%) 14%

CMLCBCFSVM

SVM CML ↑ 17%CBCF CML ↑14%

((MAP=0.253MAP=0.253) 14%) 14%

CMLCBCFSVM

SVM CML ↑ 131%CBCF CML ↑128%

((MAP=0.253MAP=0.253) 14%) 14%

CMLCBCFSVM CMLCBCFSVM CMLCBCFSVM

((MAP=0.253MAP=0.253) 14%) 14%

Correlative Multi-Label Video AnnotationCorrelative Multi-Label Video Annotation A new paradigm for multi-label annotationA new paradigm for multi-label annotation Models correlations and concepts Models correlations and concepts

simultaneouslysimultaneously Has a close connection to Gibbs Random FieldHas a close connection to Gibbs Random Field

Multi-Instance Multi-Label AnnotationMulti-Instance Multi-Label Annotation Exploit correlations among concepts and among Exploit correlations among concepts and among

instances at the same timeinstances at the same time Not only can get image/frame level annotation, Not only can get image/frame level annotation,

but also can get region level annotationbut also can get region level annotation

MountainWater

Scenery

Correlative Multi-Label Video AnnotationCorrelative Multi-Label Video Annotation A new paradigm for multi-label annotationA new paradigm for multi-label annotation Models correlations and concepts Models correlations and concepts

simultaneouslysimultaneously Has a close connection to Gibbs Random FieldHas a close connection to Gibbs Random Field

correlative multi-label video annotation

cml map

cbcf map

modeling concept

indsvm map

cml cbcf svm svmcml17

cml cbcf svm svmcml131

concepts lscomlite training

multiple concepts

Technology

correlative conjunctions: would rather… than…

calvelli, experiments in correlative ontography

structured max-margin learning for multi-label image...

correlative conjunctions fill

distance learning packet week 2...4. i am a vegetarian,...

correlative conjunctions _ paralel yapilar

the collection of physical knowledge and its application...

serena sorrentinolabel normalization and lexical annotation...

conjunctions. tlw identify and distinguish between...

english comparative correlative construction:...

correlative microscopy of the caulobacter crescentus

a correlative sts

search-based face annotation by weakly label web facial...

parallel structure with correlative conjunctions

package ‘lemon’ - r · 4 annotate_y_axis arguments...

correlative level coding

correlative fluorescence- and scanning, transmission ... ·...

grammar worksheets: parallelism, including correlative...

scalable multi-label annotation

correlative sciences/tumour biology committee … ·...