mega-boundaries temporal video boundary detection

16
Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Upload: lola-spong

Post on 14-Dec-2015

223 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Mega-boundariesTEMPORAL VIDEO BOUNDARY DETECTION

Page 2: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Mega-boundaries

Mega boundaries are defined between macro segments that exhibit different structural and feature consistency.

A good example of mega boundaries application is commercial detection

Page 3: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Commercial detection

Common method is detection of high activity rate and black frame detection coupled with silence detection

Lienhart 1997 Use of monochrome images , scene breaks , and action.

Blum 1992 Use of black frames and activity detector.

Iggulden 1997 Distance between black frame sequences.

Dimitrova 2002 Automatically spots repetitive patterns. Must be identified before

recognizing

Page 4: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Commercial detection

Nafeh 1994 Learning and discerning of broadcast using Neural Network

Bonner 1982

McGee 1999

Novak 1988

Y. Li 2000

Agnihotri 2003

Page 5: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Features for Commercial detection

Mega boundaries detection method’s are based on what features we have on the test video

Unicolor Frame for commercial break

High visual activity

Letterbox format

Dataset of 8 hours of video from TV programs

Feature data consists of 600000 frames

Page 6: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Triggers and Verifiers

Trigger - Features that can aid in determining the location of the commercial break

Verifier – Features that can determine boundaries of the commercial break

We use the time interval between detected unicolor frame as triggers

Presence of a letterbox change or high cut rate expressed in terms of low cut used as verifiers.

Constrains on commercial breaks are longer than 1 minute and shorter than 6 minute.

Page 7: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Bayesian Belief Network Model

Directed acyclical graph (DAG) The nodes correspond to variables

The arcs describe direct casual relationship between linked variables

The strength of these links is given by conditional probability distributions

P(x1,..,xn)=P(Xn|Xn-1,..,x1)*. . . *P(x2|x1)P(x1) Ω(x1,..xn) - Variables define as DAG

P(xi|∏i)=P(xi|x1,..,xn)

P(.|.) is a cpd (conditional probability density)

Using probability density function and chain rule

Page 8: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION
Page 9: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Bayesian Belief Network Model

Probability for the verification node using pdf and chain rule

Probability for potential commercial

Probability for separator

Page 10: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Bayesian Belief Network Model

Probability for sequence of black frames

Probability for key frame distance

Page 11: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Evolved Algorithm

Challenge to create algorithm for all countries broadcast difference

Genetic algorithms implement a form Darwinian evolution.

Uses chromosome etc..

Eshelman’s CHC algorithm

CHC is general algorithm with 3 features Monotonic

CHC prevents parents from mating if their genetic is too similar.

CHC uses soft restart

Page 12: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

CHC for our Experiment

Default Parameters. 50 chromosomes, divergence rate of 35%.

Each parameter was coded as a binary string.

Each chromosome was decoded into set of parameters for the commercial detector and this detector was given a test video stream.

Correct label for the video frames were detected by human

Highest precision and recall was achieved with precision + recall

Page 13: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Results

First data set 8 hours of TV broad cast consisting of 13 TV programs

1.5 hours of 28 different commercials

Second dataset 4 hours of TV broad cast consisting of 11 TV programs

1 hour of 35 different commercials

FN,FP,TP,TN

Recall=TP/(TP+FN)

Precision=TP/(TP+FP)

Page 14: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Results

Using First data set

Results from first 4 experiment (recall and precision) 80.8% and 92.6% , 80.8% and 92.6% , 79.7% and 87.4% , 81.3% and 94.3%

Experiment 5 used experiment 4 and result was 88% and 90 %

Using second data set

This dataset was acquired after the algorithm

Test to the Genetic Algorithm

Results are shown in Figure.

Page 15: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION
Page 16: Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Conclusion

Boundary segmentation in videoVisual scene segmentationMultimodal story segmentationCommercial detection

Questions ?