
Universal and composite hypothesis testing via Mismatched Divergence

Jayakrishnan Unnikrishnan

LCAV, EPFL

Collaborators: Dayu Huang, Sean Meyn, Venu Veeravalli (University of Illinois); Amit Surana (UTRC)

IPG seminar, 2 March 2011

Outline

• Universal Hypothesis Testing
  – Hoeffding test

• Problems with large alphabets
  – Mismatched test
    • Dimensionality reduction
    • Improved performance

• Extensions
  – Composite null hypotheses
  – Model-fitting with outliers
  – Rate-distortion test
  – Source coding with training

• Conclusions

Universal Hypothesis Testing

• Given a sequence of i.i.d. observations $X_1, X_2, \ldots, X_n$, test the hypothesis

  Null $H_0 : X_i \sim p_0$
  Alternate $H_1 : X_i \sim p$, $p \neq p_0$, $p$ unknown

  – Focus on finite alphabets, i.e., PMFs

• Applications: anomaly detection, spam filtering, etc.

Sufficient statistic

• Empirical distribution:

  $p_n := \frac{1}{n}\,(n_{a_1}, n_{a_2}, \ldots, n_{a_N})^T$

  – where $n_a$ denotes the number of times letter $a$ appears in $X_1, X_2, \ldots, X_n$
  – $p_n$ is a random vector

Hoeffding’s Universal Test

• Hoeffding test [1965]:

  $\hat{H} = \mathbb{I}\{ D(p_n \,\|\, p_0) \geq \eta \}$

  – Uses the KL divergence between $p_n$ and $p_0$ as the test statistic
  – (Figure: probability simplex with $p_0$, $p_n$, and the rejection region $\{ q : D(q \,\|\, p_0) \geq \eta \}$.)
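As a concrete illustration (not from the talk), here is a minimal Python sketch of the test as stated, assuming observations are encoded as integers $0, \ldots, A-1$; all function names are illustrative.

```python
import numpy as np

def empirical_distribution(samples, alphabet_size):
    """Type (empirical PMF) p_n of i.i.d. samples over {0, ..., A-1}."""
    counts = np.bincount(samples, minlength=alphabet_size)
    return counts / len(samples)

def kl_divergence(p, q):
    """D(p || q) in nats; terms with p[x] == 0 contribute zero."""
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def hoeffding_test(samples, p0, eta):
    """Declare H1 (return 1) iff D(p_n || p0) >= eta."""
    p_n = empirical_distribution(np.asarray(samples), len(p0))
    return int(kl_divergence(p_n, np.asarray(p0)) >= eta)
```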

Hoeffding’s Universal Test

• The Hoeffding test is optimal in the error-exponent sense:
  – Sanov’s Theorem in Large Deviations implies

  $\mathbb{P}_{p_0}(\hat{H} = 1) \approx \exp(-n\,\eta)$   (false alarm)
  $\mathbb{P}_{p}(\hat{H} = 0) \approx \exp(-n\,E^*_{MD}(p))$   (missed detection)

• Better approximation of the false alarm probability via
  – Weak convergence under $p_0$:

  $n\,D(p_n \,\|\, p_0) \xrightarrow{d} \tfrac{1}{2}\,\chi^2_{A-1}$
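The chi-squared limit translates directly into a threshold recipe for a target false alarm level; a hedged sketch (the function name and the use of scipy are illustrative):

```python
from scipy.stats import chi2

def hoeffding_threshold(alpha, alphabet_size, n):
    """Threshold eta with false-alarm probability ~ alpha, using the
    weak-convergence approximation n * D(p_n || p0) -> (1/2) chi^2_{A-1}."""
    return chi2.ppf(1.0 - alpha, df=alphabet_size - 1) / (2.0 * n)
```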

Error exponents are inaccurate

(Figure: comparison for alphabet size A = 20.)

Large Alphabet Regime

• The Hoeffding test performs poorly for large $A$ (alphabet size)
  – suffers from high bias and variance:

  $\mathbb{E}_{p_0}[ D(p_n \,\|\, p_0) ] \approx \frac{A-1}{2n}, \qquad \mathrm{Var}_{p_0}[ D(p_n \,\|\, p_0) ] \approx \frac{A-1}{2n^2}$

• A popular fix: merging low-probability bins (see the sketch below)
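One possible reading of this fix, as a hedged sketch (names and the cutoff rule are illustrative): letters whose null probability falls below a cutoff are mapped into a single merged bin.

```python
import numpy as np

def merge_low_prob_bins(p0, threshold):
    """Assign each letter a bin index; letters with p0-probability below
    `threshold` all share one merged bin."""
    bins = np.full(len(p0), -1, dtype=int)
    next_bin = 0
    for letter, prob in enumerate(p0):
        if prob >= threshold:
            bins[letter] = next_bin
            next_bin += 1
    bins[bins == -1] = next_bin  # the shared low-probability bin
    return bins
```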

Binning

(Figure.)

Quantization

(Figure.)

General principle

• Dimensionality reduction

• Essentially we compromise on universality but improve performance against typical alternatives

• Generalization: a parametric family $\{ p_\theta \}$ of typical alternatives

Hoeffding test vs. Mismatched test

(Figure sequence: on the probability simplex, the Hoeffding test measures $D(p_n \,\|\, p_0)$ directly, while the mismatched test projects $p_n$ onto the parametric family $\{ p_\theta \}$ to obtain $\hat{p}_n$ and replaces $D(p_n \,\|\, p_0)$ with $D^{MM}(p_n \,\|\, p_0)$.)

Mismatched test

• Use the mismatched divergence instead of the KL divergence:

  $\hat{H} = \mathbb{I}\{ D^{MM}(p_n \,\|\, p_0) \geq \eta \}$

  – interpretable as a lower bound to the KL divergence

• Idea in short: replace $p_n$ with the ML estimate from $\{ p_\theta \}$, i.e., it is a GLRT:

  $D^{MM}(p_n \,\|\, p_0) = D(\hat{p}_n^{\,ML} \,\|\, p_0)$

Exponential family example

• The mismatched divergence is the solution to a convex problem:

  $p_\theta(x) = p_0(x) \exp\Big( \sum_{i=1}^{d} \theta_i f_i(x) - \Lambda(\theta) \Big), \qquad \theta \in \mathbb{R}^d$

  $D^{MM}(p \,\|\, p_0) = \sup_{\theta} \Big( \sum_i \theta_i \langle f_i, p \rangle - \Lambda(\theta) \Big)$

• Binning is recovered when $f_i(x) = \mathbb{I}_{B_i}(x)$
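A hedged numerical sketch of this convex problem, assuming the features $f_i$ are supplied as rows of a matrix F with full-support $p_0$ (the function name and the use of scipy.optimize are illustrative):

```python
import numpy as np
from scipy.optimize import minimize

def mismatched_divergence(p, p0, F):
    """D^MM(p || p0) = sup_theta  sum_i theta_i <f_i, p> - Lambda(theta),
    for the exponential family p_theta(x) = p0(x) exp(theta.f(x) - Lambda(theta)).
    F is a (d, A) array whose rows are the feature functions f_i."""
    def neg_objective(theta):
        lam = np.log(np.sum(p0 * np.exp(theta @ F)))  # log-MGF Lambda(theta)
        return -(theta @ (F @ p) - lam)
    res = minimize(neg_objective, x0=np.zeros(F.shape[0]))
    return float(-res.fun)
```

The objective is concave in $\theta$ (since $\Lambda$ is convex), so a generic quasi-Newton solver suffices for this sketch.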

Mismatched Test properties

+ Addresses the high-variance issue:

  $\mathbb{E}_{p_0}[ D^{MM}(p_n \,\|\, p_0) ] \approx \frac{d}{2n}, \qquad \mathrm{Var}_{p_0}[ D^{MM}(p_n \,\|\, p_0) ] \approx \frac{d}{2n^2}$

  where $d = \dim(\theta)$

- However, not universally optimal in the error-exponent sense

+ Optimal when the alternate distribution lies in $\{ p_\theta \}$
  • achieves the same error exponents as the Hoeffding test
  • implies optimality of the GLRT for composite hypotheses

Performance comparison

(Figure: A = 19, n = 40.)

Weak convergence

• When observations $\sim p_0$:

  $n\,D^{MM}(p_n \,\|\, p_0) \xrightarrow{d} \tfrac{1}{2}\,\chi^2_{d}$

  – Approximate thresholds for a target false alarm rate

• When observations $\sim p \neq p_0$:

  $\sqrt{n}\,\big( D^{MM}(p_n \,\|\, p_0) - D^{MM}(p \,\|\, p_0) \big) \xrightarrow{d} \mathcal{N}(0, \sigma_p^2)$

  – Approximate power of the test
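Both limits translate into simple numerical approximations; a minimal sketch, assuming the asymptotic quantities $D^{MM}(p \,\|\, p_0)$ and $\sigma_p$ are available (all names illustrative):

```python
import math
from scipy.stats import chi2, norm

def mm_threshold(alpha, d, n):
    """Threshold with false alarm ~ alpha, via n * D^MM -> (1/2) chi^2_d."""
    return chi2.ppf(1.0 - alpha, df=d) / (2.0 * n)

def mm_power(eta, dmm_p, sigma_p, n):
    """Approximate P(D^MM(p_n || p0) >= eta) under p, via
    sqrt(n) (D^MM(p_n || p0) - D^MM(p || p0)) -> N(0, sigma_p^2)."""
    return norm.sf((eta - dmm_p) * math.sqrt(n) / sigma_p)
```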

EXTENSIONS AND APPLICATIONS

Composite null hypotheses

• Composite null hypotheses / model fitting:

  $H_0 : X_i \sim p$, for some $p \in \mathcal{P}$
  $H_1 : X_i \sim q$, $q \notin \mathcal{P}$, $q$ unknown

• The generalized test rejects on the region

  $\{ q : \inf_{p \in \mathcal{P}} D(q \,\|\, p) \geq \eta \} = \{ q : D(q \,\|\, \mathcal{P}) \geq \eta \}$
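A hedged sketch of the resulting statistic, assuming the family $\mathcal{P}$ is available as a parametric map $\theta \mapsto$ PMF with full support (the `family` callable is hypothetical):

```python
import numpy as np
from scipy.optimize import minimize

def divergence_to_family(q, family, theta0):
    """D(q || P) = inf_{p in P} D(q || p), with the family P given as a
    callable theta -> PMF with full support."""
    q = np.asarray(q)
    mask = q > 0
    def objective(theta):
        p = family(theta)
        return float(np.sum(q[mask] * np.log(q[mask] / p[mask])))
    return minimize(objective, x0=theta0).fun
```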

Weak convergence

• When observations $\sim p \in \mathcal{P}$:

  $n\,D(p_n \,\|\, \mathcal{P}) \xrightarrow{d} \tfrac{1}{2}\,\chi^2_{A-1-d}$

• When observations $\sim p \notin \mathcal{P}$:

  $\sqrt{n}\,\big( D(p_n \,\|\, \mathcal{P}) - D(p \,\|\, \mathcal{P}) \big) \xrightarrow{d} \mathcal{N}(0, \sigma_p^2)$

  – Approximate thresholds for a target false alarm rate
  – Approximate power of the test
  – Study outlier effects

Outliers in model-fitting

• Data corrupted by outliers or model-mismatch
  – Contamination mixture model: $\tilde{p} = (1 - \epsilon)\,p + \epsilon\,q$

• Goodness-of-fit metric: $D(p_n \,\|\, \mathcal{P})$
  – Limiting behavior used to quantify the goodness of fit

• The limiting behavior of the goodness-of-fit metric changes under contamination: in place of the $\tfrac{1}{2}\chi^2_{A-1-d}$ limit,

  $\sqrt{n}\,\big( D(p_n \,\|\, \mathcal{P}) - D(\tilde{p} \,\|\, \mathcal{P}) \big) \xrightarrow{d} \mathcal{N}(0, \sigma_{\tilde{p}}^2)$

• Sensitivity of the goodness-of-fit metric to outliers, to leading order in $\epsilon$:

  $D(\tilde{p} \,\|\, \mathcal{P}) \approx \frac{\epsilon^2}{2}\,(q - p)^T G\,(q - p), \qquad \sigma_{\tilde{p}}^2 \approx \epsilon^2\,(q - p)^T G\,(q - p)$

  for a matrix $G$ determined by $p$ and $\mathcal{P}$
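A minimal sketch of sampling from this contamination model, e.g. for checking the limits above by simulation (names illustrative):

```python
import numpy as np

def contaminated_sample(p, q, eps, n, seed=0):
    """Draw n i.i.d. letters from the contamination mixture (1 - eps) p + eps q."""
    rng = np.random.default_rng(seed)
    mix = (1.0 - eps) * np.asarray(p) + eps * np.asarray(q)
    return rng.choice(len(mix), size=n, p=mix)
```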

Rate-distortion test

• A different generalization of binning
  – Rate-distortion optimal compression

• Test based on optimally compressed observations [P. Harremoës 09]:

  $\hat{H} = \mathbb{I}\{ D\big( \phi(p_n) \,\|\, \phi(p_0) \big) \geq \eta \}$

  where $\phi$ denotes the rate-distortion optimal compression

  – Results on the limiting distribution of the test statistic

Source coding with training

• A wants to encode and transmit a source $X \sim p$ to B
  – Unknown distribution $p$ on a known alphabet
  – Given training samples $X_1, \ldots, X_n$

• Choose codelengths based on empirical frequencies: $\ell(x) = -\log p_n(x)$

• Expected excess codelength is chi-squared:

  $n\,\big( \mathbb{E}[\ell(X) \mid X_1, \ldots, X_n] - H(p) \big) \xrightarrow{d} \tfrac{1}{2}\,\chi^2_{A-1}$
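A hedged sketch of the excess-codelength computation, assuming $p$ has full support; the Laplace smoothing is an addition of this sketch (to keep every codelength finite), not part of the slide:

```python
import numpy as np

def excess_codelength(p, train, alphabet_size):
    """Excess of the ideal codelengths -log p_n(x) over the entropy H(p),
    i.e. D(p || p_n), with p_n built from training samples.
    Laplace smoothing (sketch-only) keeps every codelength finite."""
    counts = np.bincount(train, minlength=alphabet_size) + 1.0
    p_n = counts / counts.sum()
    p = np.asarray(p)
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / p_n[mask])))
```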

CLT vs LDP

• Empirical distribution (type) of $\{ X_i \}_{i=1}^{n}$:

  $p_n(x) = \frac{1}{n} \sum_{i=1}^{n} \mathbb{I}\{ X_i = x \}$

• Obeys an LDP (Sanov’s theorem):

  $\mathbb{P}_p\{ p_n \in N_\epsilon(\mu) \} \approx \exp\big( -n\,D(\mu \,\|\, p) \big)$ for a small neighborhood $N_\epsilon(\mu)$ of $\mu$

• Obeys a CLT:

  $\sqrt{n}\,(p_n - p) \xrightarrow{d} \mathcal{N}(0, \Sigma_p)$

LDP
• Good for large deviations
• Approximates the asymptotic slope of the log-probability
  – Pre-exponential factor may be significant

CLT
• Good for moderate deviations
• Approximates the probability itself
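A small numerical experiment illustrating this contrast for a Bernoulli source (entirely illustrative, not from the talk): the LDP captures the exponential slope but misses the pre-exponential factor, while the CLT tracks the probability itself at moderate deviations.

```python
import numpy as np
from scipy.stats import norm

# Tail probability P(p_n(1) >= t) for a Bernoulli(p) source: Monte Carlo
# versus the CLT and LDP approximations.
p, t, n, trials = 0.3, 0.4, 100, 200_000

rng = np.random.default_rng(1)
p_n = rng.binomial(n, p, size=trials) / n
mc = np.mean(p_n >= t)

clt = norm.sf((t - p) * np.sqrt(n / (p * (1 - p))))           # CLT tail
kl = t * np.log(t / p) + (1 - t) * np.log((1 - t) / (1 - p))  # D(t || p)
ldp = np.exp(-n * kl)                                         # exponent only

print(f"Monte Carlo {mc:.3g}   CLT {clt:.3g}   LDP {ldp:.3g}")
```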

Conclusions

– Error exponents do not tell the whole story
  • Not a good indicator of the exact error probability
  • Tests with identical error exponents can differ drastically over finite samples

– Weak convergence results give better approximations than error exponents (LDPs)

– Compromising universality buys performance against typical alternatives

– Applications: threshold selection, outlier sensitivity, source coding with training

References

• J. Unnikrishnan, D. Huang, S. Meyn, A. Surana, and V. V. Veeravalli, “Universal and Composite Hypothesis Testing via Mismatched Divergence,” IEEE Trans. Inf. Theory, to appear.

• J. Unnikrishnan, S. Meyn, and V. Veeravalli, “On Thresholds for Robust Goodness-of-Fit Tests,” presented at the IEEE Information Theory Workshop, Dublin, Aug. 2010.

• J. Unnikrishnan, “Model-fitting in the presence of outliers,” submitted to ISIT 2011.

  – Available at http://lcavwww.epfl.ch/~unnikris/

Thank You!

