1 noise-insensitive boolean-functions are juntas guy kindler & muli safra slides prepared with...
Post on 20-Dec-2015
215 views
TRANSCRIPT
11
Noise-Insensitive Noise-Insensitive Boolean-Functions Boolean-Functions
are Juntasare Juntas
Guy Kindler & Muli SafraGuy Kindler & Muli Safra
Slides prepared with help of: Adi AkaviaSlides prepared with help of: Adi Akavia
22
InfluentialInfluential People People The theory of the The theory of the InfluenceInfluence of of
Variables on Boolean FunctionsVariables on Boolean Functions [KKL,BL,R,M][KKL,BL,R,M] and related issues, has and related issues, has been introduced to tackle been introduced to tackle social choicesocial choice problems. This area has motivated a problems. This area has motivated a magnificent sequence of works, magnificent sequence of works, related to Erelated to Economicsconomics [K],[K], percolationpercolation [BKS],[BKS], Hardness of ApproximationHardness of Approximation [DS][DS]Revolving around the Revolving around the Fourier/Walsh Fourier/Walsh analysis of Boolean functionsanalysis of Boolean functions… …
And the real important question:And the real important question:
33
Where to go for Dinner?Where to go for Dinner?
The The alternativesalternatives
Diners would cast their vote in Diners would cast their vote in an (electronic) envelope.an (electronic) envelope.The system would decide –The system would decide –not necessarily by majority…not necessarily by majority…
It turns out someone –in the It turns out someone –in the Florida wing- has the ability Florida wing- has the ability to flip some votesto flip some votes
PowerPower
influenceinfluence
44
Voting SystemsVoting Systems nn agents, each voting either “for” ( agents, each voting either “for” (TT) )
or “against” (or “against” (FF) – a Boolean function ) – a Boolean function over over nn variables variables ff is the outcome is the outcome
The values of the agents (variables) The values of the agents (variables) may each, independently, flip with may each, independently, flip with probability probability
Bottom LineBottom Line: one cannot design an : one cannot design an ff that would be robust to such noise --that would be robust to such noise --that is, would, on average, change that is, would, on average, change value w.p.value w.p.< < O(1)O(1)-- unless taking into account -- unless taking into account only very few of the votesonly very few of the votes
55
DictatorshipDictatorship
DefDef: a Boolean function : a Boolean function P([n])P([n]){-1,1}{-1,1} is is a a monotone monotone ee--dictatorshipsdictatorships --denoted --denoted ffee--if:--if:
e
T e xf x
F e x
e
T e xf x
F e x
66
JuntasJuntas
DefDef: a Boolean function : a Boolean function f:P([n])f:P([n]){-1,1}{-1,1} is a is a jj--JuntaJunta if if JJ[n] [n] wherewhere |J|≤ j|J|≤ j, , s.t. for every s.t. for every xxP([n])P([n]), , f(x) = f(x f(x) = f(x J) J)
DefDef: : ff is an is an [[, j], j]--JuntaJunta if if j- j-Junta Junta f’f’ s.t. s.t.
DefDef: : ff is an is an [[, j, p], j, p]--JuntaJunta if if j- j-Junta Junta f’f’ s.t. s.t.
nx~U
f x f ' xPr nx~U
f x f ' xPr
px~
f x f ' xPr
px~
f x f ' xPr
We would tend to omit p
p-biased, product distribution
77
Long-CodeLong-Code In the long-code In the long-code L:[n]L:[n] {0,1} {0,1}22nn
each each element is encoded by an element is encoded by an 22nn-bits-bits
This is the most extensive binary code, This is the most extensive binary code, having one bit for every subset in having one bit for every subset in P([n])P([n])
88
Long-CodeLong-Code Encoding an element Encoding an element ee[n][n]:: EEee legally-encodeslegally-encodes an element an element ee if if EEee = f = fee
FF FF TT TT TT
99
Long-Code Long-Code Monotone- Monotone-DictatorshipDictatorship
The truth-table of a Boolean function The truth-table of a Boolean function over over nn elements, can be considered elements, can be considered as a as a 22nn bits long string (each bits long string (each corresponding to one input setting – corresponding to one input setting – or a subset of or a subset of [n][n]))
For a long-code, the legal code-words For a long-code, the legal code-words are all monotone dictatorshipsare all monotone dictatorships
How about the Hadamard code?How about the Hadamard code?
1010
Long-code Tests Long-code Tests Def Def (a (a long-code testlong-code test): given a code-): given a code-
word word ww, probe it in a constant number , probe it in a constant number of entries, andof entries, and accept w.h.p if accept w.h.p if ww is a monotone is a monotone
dictatorshipdictatorship reject w.h.p if reject w.h.p if ww is not close to any is not close to any
monotone dictatorshipmonotone dictatorship
1111
Efficient Long-code TestsEfficient Long-code Tests
For some applications, it suffices if the test For some applications, it suffices if the test may accept illegal code-words, nevertheless, may accept illegal code-words, nevertheless, ones which have short ones which have short list-decodinglist-decoding::
DefDef(a (a long-code list-testlong-code list-test): given a code-word ): given a code-word ww, probe it in 2 or 3 places, and, probe it in 2 or 3 places, and accept w.h.p if accept w.h.p if w w is a monotone dictatorship,is a monotone dictatorship, reject w.h.p if reject w.h.p if ww is not evenis not even approximately approximately
determined by a short list of domain elementsdetermined by a short list of domain elements
that is, if that is, if a a JuntaJunta JJ[n][n] s.t. s.t. f f is close to is close to f’ f’ and and f’(x)=f’(xf’(x)=f’(xJ) J) for allfor all x x
NoteNote: a long-code list-test, distinguishes between the case : a long-code list-test, distinguishes between the case ww is a is a dictatorshipdictatorship, to the case , to the case ww is far from a is far from a juntajunta..
1313
BackgroundBackground Thm (Friedgut)Thm (Friedgut): a Boolean function : a Boolean function ff with small with small
average-sensitivityaverage-sensitivity is an is an [[,j]-,j]-juntajunta
Thm (Bourgain)Thm (Bourgain): a Boolean function : a Boolean function f f with small with small high-high-frequency weight frequency weight is an is an [[,j]-,j]-juntajunta
ThmThm: a Boolean function : a Boolean function f f with small with small high-frequency high-frequency weight weight in a in a pp--biased biased measure is an measure is an [[,j]-,j]-juntajunta
CorollaryCorollary: a Boolean function : a Boolean function f f with with smallsmall noise-noise-sensitivitysensitivity is an is an [[,j]-,j]-juntajunta
ParametersParameters: : average-sensitivity average-sensitivity [M,R,BL,KKL,F] [M,R,BL,KKL,F] high-frequency weight high-frequency weight [KKL,B][KKL,B]noise-sensitivity noise-sensitivity [BKS][BKS]
1414
[n][n]x
IIz
[n][n]
Noise-SensitivityNoise-Sensitivity
How often does the value of How often does the value of ff changes changes when the input is perturbed?when the input is perturbed?
x
IIz
1515
DefDef((,p,x,p,x[n] [n] ): Let ): Let 0<0<<1<1, and , and xxP([n])P([n]). .
Then Then y~y~,p,x,p,x, if , if y = (x\I)y = (x\I) z z where where I~I~
[n][n] is a is a noise subsetnoise subset, and, and z~ z~ pp
II is a is a replacementreplacement..
DefDef((-noise-sensitivity-noise-sensitivity): let ): let 0<0<<1<1, then, then
[ When [ When p=½p=½ equivalent to flipping each equivalent to flipping each coordinate in coordinate in xx w.p. w.p. /2/2.].]
[n] [n]p ,p,xx~ ,y~
ns f = Pr f x f y
[n] [n]p ,p,xx~ ,y~
ns f = Pr f x f y
[n] [n] xIIz
Noise-SensitivityNoise-Sensitivity
1616
Fourier/Walsh TransformFourier/Walsh Transform
Write Write f:{-1, 1}f:{-1, 1}nn{-1, 1}{-1, 1} as a polynomial as a polynomial
What would be the monomials?What would be the monomials?
For every set For every set SS[n][n] we have a monomial which we have a monomial which is the product of all variables in is the product of all variables in SS (the only (the only relevant powers are either relevant powers are either 00 or or 11))
It now makes sense to consider the degree of It now makes sense to consider the degree of ff or to break it or to break it according to the various degrees of the monomials..according to the various degrees of the monomials..
( )[ ]
S
S n
ff S cÍ
= å ( )[ ]
S
S n
ff S cÍ
= å
1717
High/Low FrequenciesHigh/Low Frequencies
DefDef: the : the high-frequencyhigh-frequency portion of portion of ff::
DefDef: the : the low-frequency low-frequency portion of portion of ff: :
DefDef: the : the high-frequency-weighthigh-frequency-weight is: is:
DefDef: the : the low-frequency-weightlow-frequency-weight is: is:
kS
S k
ff S
kS
S k
ff S
kS
S k
ff S
kS
S k
ff S
22k
2S k
ff S
22k
2S k
ff S
22k
2S k
ff S
22k
2S k
ff S
1818
Low High-Frequency WeightLow High-Frequency WeightPropProp: the : the -noise-sensitivity can be expressed in -noise-sensitivity can be expressed in
Fourier transform terms asFourier transform terms as
PropProp: Low : Low nsns Low Low high-freq weighthigh-freq weightProofProof: By the above proposition, low noise-sensitivity : By the above proposition, low noise-sensitivity
impliesimplies
nevertheless, nevertheless, ff being being {-1, 1}{-1, 1} function, by Parseval function, by Parseval formula (that the formula (that the norm 2norm 2 of the function and its of the function and its Fourier transform are equal) impliesFourier transform are equal) implies
( ) ( )2S
S
1 f S ~1l-å ( ) ( )2S
S
1 f S ~1l-å
2
S
f S 1 2
S
f S 1
2S
S
2 ns f =1 1 f S 2S
S
2 ns f =1 1 f S
1919
Average and RestrictionAverage and Restriction
DefDef: Let : Let II[n],[n], xxP([n]\I)P([n]\I), , the the restriction function restriction function isis
DefDef: the : the average function average function isis
NoteNote::
I
Iy P I
A f : P I
A f x E f x y
I
Iy P I
A f : P I
A f x E f x y
I
I
f x : P I 1,1
f x y f x y
I
I
f x : P I 1,1
f x y f x y
I Iy P I
A f x E f x y
I Iy P I
A f x E f x y
[n]I
x
y
[n]I
x
y y
y yy
2020
Fourier ExpansionFourier Expansion PropProp: :
PropProp:: I SS I
A ff (S)
I S
S I
A ff (S)
I STS I T I S
f x f T x
I STS I T I S
f x f T x
2121
InfluenceInfluence/Variation/Variation
DefDef: the : the variation variation of of II on on ff::
PropProp: the following are equivalent : the following are equivalent definitions to the definitions to the variation variation of of II on on ff::
22
I I 2S I
ff A ff S
variation 22
I I 2S I
ff A ff S
variation
I Iy P Ix P I
f E var f x y
variation
I I
y P Ix P If E var f x y
variation
Influencei(f) = variationi(f) = variation{i}(f)
2525
Low-frequencies Variation and Low-frequencies Variation and a.s.a.s.
DefDef: the : the low-frequency variationlow-frequency variation is: is:
DefDef:: the the average-sensitivity average-sensitivity isis
And in Fourier representation:And in Fourier representation:
DefDef: the : the low-frequency average-sensitivity low-frequency average-sensitivity is:is:
ii [n]
ff
as variation ii [n]
ff
as variation
2
S
ff (S) S as 2
S
ff (S) S as
2
k kI I
S IS k
ff f S
variation variation 2
k kI I
S IS k
ff f S
variation variation
2
i [n] S k
ff f (S) S
k kias variation
2
i [n] S k
ff f (S) S
k kias variation
2626
Biased Walsh Product Biased Walsh Product [Talagrand][Talagrand]
DefDef: In the : In the pp-biased product distribution -biased product distribution pp, , the probability of a subset the probability of a subset x x isis
The usual The usual Fourier basis Fourier basis is not orthogonal is not orthogonal with respect to the with respect to the biased inner-product,biased inner-product,
Hence, we use the Hence, we use the Biased Walsh ProductBiased Walsh Product::
x n xnp x p (1 p) x n xnp x p (1 p)
p1 p i x
xi x1 p
p
i
p1 p i x
xi x1 p
p
i
ii S
x x
S ii S
x x
S
2727
Main ResultMain Result
TheoremTheorem: :
constant constant >0>0 s.t. any Boolean function s.t. any Boolean function
f:P([n])f:P([n]){-1,1}{-1,1} satisfying satisfying
is an is an [[,j]-junta ,j]-junta for for j=O(j=O(-2-2kk332k2k))..
CorollaryCorollary: :
fix a fix a pp-biased distribution -biased distribution pp over over P([n])P([n]). .
Let Let >0>0 be any parameter. be any parameter.
Set Set k=logk=log1-1-(1/2)(1/2). .
Then Then constant constant >0>0 s.t. any Boolean function s.t. any Boolean function
f:P([n])f:P([n]){-1,1}{-1,1} satisfying satisfying
is an is an [[,j]-junta ,j]-junta for for j=O(j=O(-2-2kk332k2k))..
2k22
f Ok
2k22
f Ok
2ns f O k 2ns f O k
2828
The The KKL/FreidgutKKL/Freidgut Framework Framework
ThmThm: any Boolean function : any Boolean function ff is an is an [[,j]-,j]-junta junta for for
ProofProof::1.1. Specify the juntaSpecify the junta
where, let where, let k=O(as(f)/k=O(as(f)/)) and fix and fix =2=2-O(k)-O(k)
2.2. Show the complement of Show the complement of JJ has small variation has small variation
f / O asj = 2 f / O asj = 2
iJ i | f variation iJ i | f variation
[n]
J
3030
KKL/FreidgutKKL/FreidgutLemmaLemma::
ProofProof: :
Now, lets bound each argument:Now, lets bound each argument:
PropProp[KKL][KKL]: :
ProofProof: characters of size : characters of size kk contribute to contribute to the the average-sensitivityaverage-sensitivity at least at least (since(since ))
2k
2
as ff k 2k
2
as ff k
Jf 2
variation Jf 2
variation
2k kJ J 2
ff f variation variation 2k kJ J 2
ff f variation variation
[n]
J
2k
2k f
2k
2k f
2
S
as ff S S 2
S
as ff S S
3232
Beckner/Nelson/BonamiBeckner/Nelson/Bonami InequalityInequality
DefDef: let : let TT be the following operator on be the following operator on ff
ThmThm: for any : for any p≥rp≥r andand ≤((r-1)/(p-1))≤((r-1)/(p-1))½½
CorollaryCorollary: for : for gg of degree of degree kk
1 ,p,xy
f x f yE
T 1 ,p,xy
f x f yE
T
rpff T
rpff T
4 44k
4 2g g
4 44k
4 2g g
3434
k
i J
2 2
O(k)S S
i S,S k i S,S ki J i J2 r
2 4/ r
O(k) O(k)S S
i S i Si J i Jr 2
22/ rO(k)
kJ
O(k) r
i J
f
f (S) 2 f(S)
2 f(S) 2 f(S)
as f2 f 2
f
i
iinfluenc
variation vari on
e
ati
k
i J
2 2
O(k)S S
i S,S k i S,S ki J i J2 r
2 4/ r
O(k) O(k)S S
i S i Si J i Jr 2
22/ rO(k)
kJ
O(k) r
i J
f
f (S) 2 f(S)
2 f(S) 2 f(S)
as f2 f 2
f
i
iinfluenc
variation vari on
e
ati
FreidgutFreidgut’s Proof’s ProofPropProp::
ProofProof:: k
Jf 4
variation kJ
f 4 variation
we do not know whether as(f) is
small!
True only since this is a {-1,0,1} function.
So we cannot proceed this way with only this way with only asaskk!!
3535
If If kk were 1 were 1
Easy caseEasy case (!?!): If we’d have a bound on the (!?!): If we’d have a bound on the non-linear weight, we should be done.non-linear weight, we should be done.
The linear part is a set of independent The linear part is a set of independent characters (the singletons)characters (the singletons)
Concentration of measureConcentration of measure: In order for those : In order for those to hit close to to hit close to 11 or or -1-1 most of the time, they most of the time, they must avoid the law of large numbers, must avoid the law of large numbers, namely be almost entirely placed on one namely be almost entirely placed on one singleton [by Chernoff like bound]singleton [by Chernoff like bound]
(!) (!) [FKN, ext.] [FKN, ext.] if if ff is close to is close to linear linear then then ff is is close to close to shallowshallow ( ( a constant function or a a constant function or a dictatorship) dictatorship)
3636
Almost Linear Almost Linear Almost Almost ShallowShallow
Thm(Thm([FKN][FKN])): : global constant global constant MM, , s.t. s.t. Boolean function Boolean function ff, , shallow shallow Boolean function Boolean function gg, s.t. , s.t.
Hence, Hence, ||f||fII[x][x]>1>1||||22 is small is small ffII[x][x] is is close to close to shallowshallow!!
22 1
2 2f g M f
22 1
2 2f g M f
3737
How to Deal with Dependency How to Deal with Dependency between Characters?between Characters?
RecallRecall
(theorem’s premise)(theorem’s premise)
IdeaIdea: Let: Let Partition Partition [n]\J[n]\J into into II11,…,I,…,Irr, for , for r >> kr >> k w.h.p w.h.p ffII[x][x] is close to is close to linearlinear (low freq (low freq
characters intersect characters intersect II expectedly by expectedly by 11 element, while high-frequency weight is low).element, while high-frequency weight is low).
2k kJ J2
ff f variation + variation 2k kJ J2
ff f variation + variation
2k22
1f Ok
2k22
1f Ok
J i | f kivariation J i | f kivariation
[n]
J
I1
I2IrI
3838
So what?So what?
ffII[x][x] is close to is close to linearlinear
By By [FKN][FKN],, ffII[x][x] is shallow for any is shallow for any xx
Still, Still, ffII[x][x] could be a different could be a different dictatorship for different dictatorship for different xx’s, hence ’s, hence the variation of each the variation of each iiII might be might be low!!low!!
P([n])
J
I1
I2IrI
3939
Dictatorship and its Dictatorship and its SingletonSingleton
PropProp: for a dictatorship : for a dictatorship hh, , coordinate coordinate ii s.t.s.t. (where (where pp is the bias). is the bias).
Corollary (from [FKN])Corollary (from [FKN]): : global constant global constant MM, s.t. , s.t. Boolean function Boolean function hh, either, eitheror or
{}( )h i p> {}( )h i p>
h i p h i p
21n 2
h M hvariation 21n 2
h M hvariation
{1} {2} {i} {n} {1,2} {1,3} {n-1,n} S {1,..,n}
weight
Characters
Total weight of no more than Total weight of no more than 1-p1-p
4040
Main LemmaMain Lemma LemmaLemma: : >0>0, s.t. for any , s.t. for any and any and any
function function g:P([m])g:P([m]) , the following , the following holds: holds:
m
4 24k k k
2 2x~Pr g x O g g
m
4 24k k k
2 2x~Pr g x O g g
Low-freq high-freq
4141
Probability ConcentrationProbability Concentration Simple BoundSimple Bound:: ProofProof::
Low-freq BoundLow-freq Bound: Let : Let g:P([m])g:P([m]) be be of degree of degree kk and and >0>0, then , then >0>0 s.t. s.t.
ProofProof: recall the corollary:: recall the corollary:
m
tt
tx~Pr g x g
m
tt
tx~Pr g x g
m
44 4k
2x~Pr g x g
m
44 4k
2x~Pr g x g
4 44k
4 2g g
4 44k
4 2g g
4242
Lemma’s ProofLemma’s Proof Now, let’s prove the lemma: Now, let’s prove the lemma: Bounding low and high freq Bounding low and high freq
separately:separately:,,
simple bound
4 224 4k k k
2 2g g
4 224 4k k k
2 2g g
m
m m
x~
k k
x~ x~
Pr g x
Pr g x Pr g x
m
m m
x~
k k
x~ x~
Pr g x
Pr g x Pr g x
Low-freq bound
4343
ffII[x][x] Mostly Constant Mostly Constant
LemmaLemma: : >0>0, s.t. for any , s.t. for any and any and any function function g:P([m])g:P([m])
DefDef: Let : Let DDII be the set of be the set of xxP(P(II)), s.t. , s.t. ffII[x][x] is a dictatorship is a dictatorship
Next we show, that Next we show, that |D|DII|| must be small, must be small, hence for most hence for most xx, , ffII[x][x] is constant. is constant.
I ID x P I : i I ,s.t. f x i p I ID x P I : i I ,s.t. f x i p
m
4 24k k k1 22 2x~
Pr g x M g M g
m
4 24k k k1 22 2x~
Pr g x M g M g
4444
LemmaLemma:: ProofProof: denote: denote , then, then
|D|DII|| must be small must be small
[n]
24k kI 1 2 2x~
Pr x D M M f
[n]
24k kI 1 2 2x~
Pr x D M M f
i Ig x f x i i Ig x f x i
[n]
i i
I ix~ x P Ii I
4 24k k k1 22 2
i I i I
4 2
4k1 S 2 S
i I S [n],S k,S I i i I S [n],S k,S I i2 2
22 24k k
1 2 2i I S [n],S k,S I i i I
Pr x D Pr g x p
M g M g
M f S M f S
M f S M f
[n]
i i
I ix~ x P Ii I
4 24k k k1 22 2
i I i I
4 2
4k1 S 2 S
i I S [n],S k,S I i i I S [n],S k,S I i2 2
22 24k k
1 2 2i I S [n],S k,S I i i I
Pr x D Pr g x p
M g M g
M f S M f S
M f S M f
Prev lemma
I TT [n],T I S
f x S f T
I TT [n],T I S
f x S f T
Each S is counted only for one index iI.
(Otherwise, if S was counted for both i and j
in I, then |SI|>1!)
Parseval
4545
Simple PropSimple Prop PropProp: let : let {a{aii}}iiII be be sub-distributionsub-distribution, ,
that is, that is, iiIIaaii11, , 00aaii, then , then iiIIaaii
22maxmaxiiII{a{aii}}.. ProofProof::
2 2i max max
maxi I
1a a aa
2 2i max max
maxi I
1a a aa
1 2 3 max n
ai no more than no more than 11
1
1 2 3 n
ai
1/a1/amaxmax
1
4646
|D|DII|| must be small - Cont must be small - Cont ThereforeTherefore
(since(since ), ),
HenceHence
2
2 2k
i I i Ii I S [n],S k, S [n],S k,S I i S I i
f S max f S max f
ivariation
2
2 2k
i I i Ii I S [n],S k, S [n],S k,S I i S I i
f S max f S max f
ivariation
2
S [n],S k,S I i
f S 1
2
S [n],S k,S I i
f S 1
[n]
24k kI 1 2 2x~
Pr x D M M f
[n]
24k kI 1 2 2x~
Pr x D M M f
5151
Where to go for Dinner?Where to go for Dinner?
The The alternativesalternatives
Diners would cast their vote in Diners would cast their vote in an (electronic) envelope.an (electronic) envelope.The system would decide –The system would decide –not necessarily by majority…not necessarily by majority…
It turns out someone –in the It turns out someone –in the Florida wing- has the ability Florida wing- has the ability to flip some votesto flip some votes
PowerPower
influenceinfluence
Of course they’ll have to discuss it
over dinner….
5252
DiscussionDiscussion Tests that look at only 2 or 3 places Tests that look at only 2 or 3 places
cannot produce a large gap between cannot produce a large gap between probability of acceptance of a probability of acceptance of a dictatorship and that of a function not dictatorship and that of a function not so close to a juntaso close to a junta
Nevertheless, if requiring the function Nevertheless, if requiring the function to have additional properties, such as to have additional properties, such as local-maximality, one may be able to local-maximality, one may be able to design a test with a large gapdesign a test with a large gap
5353
Shallow FunctionShallow Function DefDef: a function : a function ff is is linearlinear, if only singletons , if only singletons
have non-zero weight have non-zero weight DefDef: a function : a function ff is is shallowshallow, if , if ff is either a is either a
constant or a dictatorship.constant or a dictatorship. ClaimClaim: Boolean linear functions are shallow.: Boolean linear functions are shallow.
0 1 2 3 k n
weight
Charactersize
5454
Boolean Linear Boolean Linear Shallow Shallow ClaimClaim: Boolean linear functions are : Boolean linear functions are
shallow.shallow. ProofProof: let : let ff be Boolean linear be Boolean linear
function, we next show:function, we next show:1.1. {i{ioo}} s.t. s.t.
((i.e.i.e. ))
2.2. And conclude, that eitherAnd conclude, that either or or i.e.i.e. ff is shallow is shallow
0S , i ,f S 0 0S , i ,f S 0 00 iff fi 00 iff fi
ff ff 00 iffi 00 iffi
5555
Claim 1Claim 1 Claim 1Claim 1: let : let ff be boolean linear be boolean linear
function, then function, then {i{ioo}} s.t. s.t. ProofProof: w.l.o.g assume: w.l.o.g assume
for any for any zz{3,…,n}{3,…,n}, consider , consider xx0000=z=z, , xx1010=z=z{1}{1}, , xx0101=z=z{2}{2}, , xx1111=z=z{1,2}{1,2}
thenthen .. Next value must be far from Next value must be far from {-1,1}{-1,1},, A contradiction! (boolean function) A contradiction! (boolean function) ThereforeTherefore
00 iff fi 00 iff fi
f 1 f 2 0 f 1 f 2 0
ab a'b'a,b a' ,b' : f x f x min f 1 , f 2 ab a'b'a,b a' ,b' : f x f x min f 1 , f 2
ab a'b'
ab a'b'1 1
ab a'b'2 2
f x f x
f 1 x x
f 2 x x
ab a'b'
ab a'b'1 1
ab a'b'2 2
f x f x
f 1 x x
f 2 x x
f 2 0 f 2 01
-1
?
5757
Claim 2Claim 2 Claim 2Claim 2: let : let ff be boolean function, s.t. be boolean function, s.t.
Then eitherThen either or or ProofProof: consider : consider f(f()) and and f(if(i00))::
ThenThen but but ff is boolean, hence is boolean, hence thereforetherefore
00 iff fi 00 iff fi
ff ff 00 iffi 00 iffi
0
0 0
ff fi
fi ffi
0
0 0
ff fi
fi ffi
0 0fi f 2 fi 0 0fi f 2 fi
0fi 0,1 0fi 0,1 0fi f 0,2 0fi f 0,2
1
-1
0 f f
0fi 0fi
0fi 0fi
5959
Proving FKN: Proving FKN: almost-linear almost-linear close to close to
shallowshallow TheoremTheorem: Let : Let f:P([n])f:P([n]) be be linearlinear, ,
LetLet let let ii00 be the index s.t. is maximal be the index s.t. is maximal
then then
NoteNote: : ff is is linearlinear, hence, hencew.l.o.g., assume w.l.o.g., assume ii00=1=1, then all we need to , then all we need to
show is:show is:
We show that in the following claim and We show that in the following claim and lemma.lemma.
0fi 0fi
2
2f 1
2
2f 1
0
2
0 i2
ff fi 1 o 1
0
2
0 i2
ff fi 1 o 1
n
ii 1
ff fi
n
ii 1
ff fi
n 2
i 2
fi 1 o 1
n 2
i 2
fi 1 o 1
6060
CorollaryCorollary CorollaryCorollary: Let : Let ff be linear, and be linear, and
then then a a shallow booleanshallow boolean function function gg s.t.s.t.
ProofProof: let: let , let , let gg be the be the boolean function closest to boolean function closest to ll. . Then,Then,this is true, as this is true, as is small (by theorem),is small (by theorem), and additionallyand additionally is small, sinceis small, since
2f g 3 o 1 2f g 3 o 1
0ffi 0ffi
2
2f g 9 o 1 2
2f g 9 o 1
2
2f 1
2
2f 1
2l g
2l g
2fl
2fl
2
2f 1
2
2f 1
6161
Claim 1Claim 1 Claim 1Claim 1: Let : Let f f be linear. be linear.
w.l.o.g., assumew.l.o.g., assumethen then global constant global constant c=min{p,1-p}c=min{p,1-p} s.t. s.t. i 2,...,n: fi c i 2,...,n: fi c
f 1 f 2 ... f n f 1 f 2 ... f n
{} {1} {2} {i} {n} {1,2} {1,3} {n-1,n} S {1,..,n}
weight
Characters
Each of weight no more than Each of weight no more than cc
6262
Proof of Claim1Proof of Claim1 ProofProof: assume: assume
for any for any zz{3,…,n}{3,…,n}, consider , consider xx0000=z=z, , xx1010=z=z{1}{1}, , xx0101=z=z{2}{2}, , xx1111=z=z{1,2}{1,2}
thenthen Next value must be far from Next value must be far from {-1,1} {-1,1} !! A contradiction! (toA contradiction! (to ))
2
2f 1
2
2f 1
f 2 c f 2 c
ab a'b'a,b a' ,b' : f x f x min f 1 , f 2 c ab a'b'a,b a' ,b' : f x f x min f 1 , f 2 c
ab a'b'
ab a'b'1 1
ab a'b'2 2
f x f x
f 1 x x
f 2 x x
ab a'b'
ab a'b'1 1
ab a'b'2 2
f x f x
f 1 x x
f 2 x x
1
-1
?
6464
LemmaLemma LemmaLemma: Let : Let g g be linear, let be linear, let
assumeassume , then , then CorrolaryCorrolary: The theorem follows from : The theorem follows from
the combination of the combination of claim1claim1 and the and the lemmalemma:: Let Let mm be the minimal index s.t. be the minimal index s.t. ConsiderConsider If If m=2m=2: the theorem is obtained (by : the theorem is obtained (by
lemma)lemma) Otherwise -- a contradiction to Otherwise -- a contradiction to
minimality of minimality of m m ::
20
2g c
20
2g c 20
2g 1 o 1 20
2g 1 o 1
n 2
i m
fi c 2
n 2
i m
fi c 2
n 2
i m
g b fi
n 2
i m
g b fi
note n 220
2i m
g fi
n 220
2i m
g fi
1
n 2
mi
fi c 1 o 1 c o 1
1
n 2
mi
fi c 1 o 1 c o 1
2
2g 1
2
2g 1
6565
Lemma’s ProofLemma’s Proof Lemma’s ProofLemma’s Proof: Note: Note
Hence, all we need to show is that Hence, all we need to show is that
Intuition:Intuition: Note that Note that |g||g| and and |b||b| are far from are far from 00
(since (since |g||g| is is -close to -close to 11, and , and cc-close to -close to bb).). AssumeAssume b>0b>0, then for almost all inputs , then for almost all inputs xx, ,
g(x)=|g(x)| g(x)=|g(x)| (as(as ) ) Hence Hence E[g] E[g] E[|g(x)|] E[|g(x)|], and , and therefore therefore var(g) var(g) var(|g|) var(|g|)
20
2g var g 20
2g var g
2
2var g g 1 2
2var g g 1
var g var g var g var g
20
2g c
20
2g c
6666
EE22[g] - E[g] - E22[|g|] = 2E[|g|] = 2E22[|g|1[|g|1{f<0}{f<0}] ] o( o() ) (by Azuma’s inequality)(by Azuma’s inequality)
We next show We next show var(g) var(g) var(|g|) var(|g|):: By the premiseBy the premise however however
thereforetherefore
Proof-mapProof-map::
|g|,|b| are far from 0|g|,|b| are far from 0
g(x)=|g(x)| for almost all xg(x)=|g(x)| for almost all x
E[g] E[g] E[|g|] E[|g|]
var(g) var(g) var(|g|) var(|g|)
var g var g
var g var g o var g var g o
2 2 2 2
2var g g E g var g E g E g 2 2 2 2
2var g g E g var g E g E g
6767
Variation LemmaVariation Lemma
LemmaLemma(variation): (variation): >0>0, and , and r>>k r>>k s.t.s.t.
CorollaryCorollary: for most : for most II and and xx, , ffII[x][x] is almost is almost constantconstant
J1r
24k kII ~ 2
E f O f
variation J1r
24k kII ~ 2
E f O f
variation
P([n])
J
I1
I2IrI
I Iy Ix P I
f E var f x y
variation
I I
y Ix P If E var f x y
variation
6868
By union bound on By union bound on II11,…,I,…,Irr: :
(set(set ))
Let Let f’(x) = sign( Af’(x) = sign( AJJ[f](x[f](xJ) )J) ). . f’f’ is the boolean function closest to is the boolean function closest to AAJJ[f][f], , thereforetherefore
Hence Hence ff is an is an [[,j],j]-junta.-junta.
Using Idea2Using Idea2
24k kJ 2
f r O f variation 24k kJ 2
f r O f variation
24k k
2O f 24k k
2O f
22 4k k
1 2ff ' O f 22 4k k
1 2ff ' O f
P([n])
J
I1
I2IrI
4kr 4kr
2
2
2 2
J J2 2
24k k
2
ff '
f A f A ff '
2 O f
2
2
2 2
J J2 2
24k k
2
ff '
f A f A ff '
2 O f
6969
variation-Lemma - Proof variation-Lemma - Proof PlanPlan
LemmaLemma(variation): (variation): >0>0, and , and r>>k r>>k s.t.s.t.
Sketch for proving the variation lemmaSketch for proving the variation lemma::
1.1. w.h.p w.h.p ffII[x][x] is almost linear is almost linear
2.2. w.h.p w.h.p ffII[x][x] is close to shallow is close to shallow
3.3. ffII[x] [x] cannot be close to cannot be close to dictatorship dictatorship too often.too often.
J1r
2 24k k 4k kII ~ 2 2
E f O f O f
variation J1r
2 24k k 4k kII ~ 2 2
E f O f O f
variation
O k2 O k2
7171
XOR TestXOR Test Let Let be a random procedure for be a random procedure for
choosing two disjoint subsets choosing two disjoint subsets xx,,y y s.t.:s.t.:ii[n][n], , iix\y x\y w.p w.p 1/31/3, , iiy\x y\x w.p w.p 1/31/3, and, andiixxy y w.pw.p 1/3 1/3..
DefDef((XOR-TestXOR-Test): Pick ): Pick <x,y>~<x,y>~,, Accept if Accept if f(x)f(x)f(y)f(y),, Reject otherwise.Reject otherwise.
7272
ExampleExample ClaimClaim: Let : Let ff be a dictatorship, then be a dictatorship, then ff
passes the XOR-test w.p. passes the XOR-test w.p. 2/32/3. . ProofProof: Let : Let ii be the dictator, then be the dictator, then
PrPr<x,y>~<x,y>~[f(x)[f(x)f(y)]=Prf(y)]=Pr<x,y>~<x,y>~
[i[ixxy]=2/3 y]=2/3
ClaimClaim: Let : Let f’f’ be a be a -close to a -close to a dictatorship dictatorship ff, then , then f’f’ passes the XOR- passes the XOR-test w.p. test w.p. 2/3 – 2/32/3 – 2/3((--22)). .
ProofProof: see next slide…: see next slide…
7373
x,y ~
x,y ~
x,y ~
x,y ~
2 2
2 2
2
f ' x f ' y
f x f y f ' x f x f ' y f y
f x f y f ' x f x f ' y f y
f x f y f ' x f x f ' y f y f ' x f x f ' y f y
2 2 11 2 13 3 32 1 132 2
3 3
Pr
Pr
Pr
Pr
x,y ~
x,y ~
x,y ~
x,y ~
2 2
2 2
2
f ' x f ' y
f x f y f ' x f x f ' y f y
f x f y f ' x f x f ' y f y
f x f y f ' x f x f ' y f y f ' x f x f ' y f y
2 2 11 2 13 3 32 1 132 2
3 3
Pr
Pr
Pr
Pr
7474
Local MaximalityLocal Maximality DefDef: : ff is is locally maximallocally maximal with respect to a with respect to a
test, test, if if f’f’ obtained from obtained from ff by a change on one by a change on one input input xx00, that is,, that is,
PrPr<x,y>~<x,y>~[f(x)[f(x)f(y)] f(y)] Pr Pr<x,y>~<x,y>~[f’(x)[f’(x)f’(y)]f’(y)] DefDef: Let : Let xx be the distribution of all be the distribution of all y y such such
that that <x,y>~<x,y>~.. ClaimClaim: if : if ff is is locally maximallocally maximal then then
f(x) = -sign(Ef(x) = -sign(Ey~y~(x)(x)[f(y)])[f(y)])..
0
0 0
f x x xf ' x
f x x x
0
0 0
f x x xf ' x
f x x x
7575
ClaimClaim::
ProofProof: immediate from the Fourier-: immediate from the Fourier-expansion, and the fact that expansion, and the fact that yyx=x=
y~ (x) S x
E f y f S
y~ (x) S x
E f y f S
7676
ConjectureConjecture: Let : Let ff be be locally maximallocally maximal (with respect to the XOR-test), (with respect to the XOR-test), assume assume ff passes the XOR-test w.p passes the XOR-test w.p 1/2 + 1/2 + , for some constant , for some constant >0>0, , then then ff is close to a junta. is close to a junta.