synonymy and near-synonymy in deep lexical semantics niloofar montazeri and jerry r. hobbs...

Synonymy and Near-Synonymyin Deep Lexical Semantics

Niloofar Montazeri and Jerry R. Hobbs

Information Sciences Institute

University of Southern California

Marina del Rey, CA

Deep Lexical Semantics:Methodology

words

core theories

link with axioms

Construct core theories of abstract phenomena in various domains

Express meanings of word senses as logical axioms in terms of predicates in core theories

Core Theories and Lexical Periphery

Define (Characterize) words in terms supplied by the core theories.

range(x,y,z) <--> scale(s) &subscale(s1,s) & bottom(y,s1) & top(z,s1) & in(u1,x) & at(u1,y) & in(u2,x) & at(u2,z) & (u x)( v s1) at(u,v)

Axiomatize core theories with richly explicated core predicates:

Core Theory of Scales: scale, <, subscale, top, bottom, at

s s1y zv

x = {u1 . . . . . . . u . . . . . u2}

word: “range”

linkingaxiom

core theory

Abstract Words in Context

By specializing “at” and the scale in various ways, we can get a whole range of possible meanings for “range”:

The scores on the test ranged from 33 to 96. The timber wolf ranges from Mexico to the Arctic. His behavior ranged from cheerful to sullen.

Deep Lexical Semanticsof Event-Related Words

(forall (x y z) (iff (give x y z) (exist (e1 e2 e3) (and (cause x e1)(change’ e1 e2 e3)(have’ e2 x y) (have’ e3 z y)))))

x gives y to z = x causes a change from x having y to z having y

Builds on old work by Gruber, Lakoff, Schank, Jackendoff, and many others on lexical decomposition, but ....

“primative” predicates are explicated in core theories:

restricts possible interpretations of the predicates enables reasoning within theory logic, not syntax

Use in Textual Inference T: Russia is blocking oil from entering Ukraine.H: Oil cannot be delivered to Ukraine.

not’(n2,c2) & can’(c2,x2,d2) & deliver’(d2,x2,o2,u2)

block’(b1,x1,e1) & enter’(e1,o1,u1)

cause’(c1,x1,n1) & not’(n1,p1) & possible’(p1,e1)

cause’(d2,x2,c3) & changeTo’(c3,h2) & have’(h2,u2,o2)

in’(h2,o2,u2)possible’(p1,c4) & cause’(c4,x3,e1)

Defeasibleinferences

fromcore theories

Lexicaldecomposition

axioms

Core Theories: Change

change(e1,e2): state e1 changes into state e2

e1 and e2 involve a common entity; change of state of something

Transitive if the something is the same

e1 and e2 are different unless there is an intermediate state (cyclic change)

“move” is change of state of “at” relation

change(e1,e2) = changeFrom(e1) = changeTo(e2)

Theory of Causality: Causal Complex

e1 e2

e3 e4e

....

s

causal-complex(s,e)

e1 s, ....

When every event or state in s happens or holds, then e happens or holds.

All eventualities in s are relevant to the effect.

A rigorous, nondefeasible notion, but can’t specify everything.

causal complex

effect

Theory of Causality: Cause

In a causal complex, some eventualities are distinguished as causes.

power on

finger insocket

shock

What is presumable depends on task, context, knowledge base, ....“Cause” is a useful but defeasible notion.

presumable

cause

Causes are the focus ofplanning, prediction,

explanation, interpretingdiscourse

(but not diagnosis)

Methodology

Having axiomatized these core theories, ....

Focus on most common 450 word senses involving change of state and causality

Determine radial structure of set of WordNet senses of word, and characterize by incremental differences in associated axioms

Encode axioms for the most abstract or general senses or “supersenses”

Evaluate on a textual entailment task

“Enter”

enter-S1: x enters p = changeTo(p(x)) enter-V2: enter a race enter-V4: enter into calculations enter-V9: enter into career

enter-S11: x enters y = changeTo(in(x,y)) enter-V1: enter a room enter-V6: enter from stage left

enter-S2: x enters y in z = cause(x,enter-S11(y,z)) enter-V5: enter in ledger enter-V8: enter picture into text

p = at/in

+ cause

Logically, and sometimeschronologically

At each hop, we specialize a predicate orconstrain an argument

“Hit”

hit-S1: x hits y = changeTo(at(x,y)) We hit Detroit by noon. The temperature hit -20.

hit-S11: x hits y = changeTo’(e,in(x,y)) & sudden(e) & impact(x,y) The car hit a tree.

hit-S2: x hits z with y = cause(x,hit-S11(y,z)) He hit the ball.

+ sudden impact

+ cause

Supersenses give topological structure.Specific senses specialize general predicates or put constraints on arguments

Synonymy and Near-Synonymy

Synonymy: The axiomatic lexical decompositions are the same

Near-synonymy: The axiomatic lexical decompositions differ only incrementally (similar to word senses in radial structure)

“Receive” and “Get”

receive-S1: x receives y from z = change(have(z,y), have(x,y)) -> changeTo(have(x,y)) = FrameNet sense 1 subsumes all of WordNet’s senses where “have” is specialized to owning, having a property, perceiving, hosting, etc.

get-S11: x gets y = cause(x, changeTo(have(x,y))) He always gets what he wants. I’ll get the book at the library.

get-S12: x gets y = changeTo(have(x,y)) He got the flu. I got a call from Sue. You’ll get your results tomorrow.

Synonyms ornear-synonyms

“Go”, “Hit”, and “Reach”

go-V1: x goes from e1 to e2 = change(e1,e2) & arg*(x,e1) & arg*(x,e2)

go-FV3: specialize e1 and e2 to “at” relations x goes from y to z = change(at(x,y), at(x,z))

hit-S1: x hits z = changeTo(at(x,z))

reach-S1: x reaches z = changeTo(at(x,z))

x is an argument of ei or a participant in ei

3rdFrameNet

sense

Not synonymous atmore general levels

of “go”

Not synonymous atmore specific levels

of “hit”

“Deliver”, “Give”, and “Provide”

deliver-S1: x delivers y from w to z = cause(x, change(rel(y,w), rel(y,z)))

deliver-S11: specializes rel to havestipulate that x = w = cause(x, change(have(x,y), have(z,y)))

give-S0: cause(x, exist(y))

give-S1: cause(x, change(have(x,y), have(z,y)))

provide-S1: x provides z with y = cause(x, changeTo(possible(e))) & arg(y,e) & need(z,e) provide for medical emergencies

provide-S11: possible(e) specializes to have(z,y) = cause(x, changeTo(have(z,y))) & need(z,e)

synonyms

near synonyms

An Aside on “Need”

In a core theory of cognition (based on beliefs and goals),

Define badFor(e,x) = e causes a goal of x’s not to happen

need(x,e) = cause(not(e), e1) & badFor(e1,x)

(Gordon& Hobbs)

Overlapping Radial Structures

deliver-S1

deliver-S11

give-S0

give-S1

provide-S1

provide-S11

A Problem

Synonymy is a relation between word senses.

Carving the uses of words into word senses is highly arbitrary; e.g., WordNet is very fine-grained; VerbNet less so.

word-1, sense-iword-2, sense-j

Why not just stipulate that these are three different senses, so that in the middle the senses are perfectly synonymous?

Context-Dependent Synonymy

give = cause to have

provide = cause to have + need

She gave food to the hungry man.

She provided food for the hungry man.

The sentences are equivalent even though “give” and “provide” are only near synonyms.

Synonymous Specific Senses

So far the examples have been at an abstract level, but intersection of radial structures can occur at very specific levels too:

deliver-V1: deliver a talkgive-V12: give a talk

Small Perspective Differences:Near Synonyms?

hold: x holds e = cause(x, not(changeFrom(e))) hold that pose

Specialize e to at: cause(x, not(changeFrom(at(y,z)))) hold the picture against the wall

block: x blocks e = cause(x, not(changeTo(e))) The senator blocked the judge’s appointment

Specialize e to at: cause(x, not(changeTo(at(y,z)))) He blocked my way

These can describe the same situation: hold(x,e) = block(x,not(e))

“Capture” and “Seize”

x holds y = cause(x, not(changeFrom(at(y,z))))

x captures y = cause(x, changeTo(hold(x,y)))

“Seize”: Almost all senses of “seize” subsumed under

seize-S1: x seizes y = cause’(e, x, changeTo(hold(x,y))) & forceful(e)

nearsynonyms

Text Entailment Example:Synonymy from Inference

H: The captors let the hostage go free.

T: A Filipino hostage in Iraq was released.

release(x,y,z)

changeFrom’(e0,e1) & cause’(e1,x,e2) & not’(e2,e3) & changeFrom’(e3,e4) & at’(e4,y,z)

not(e5) & cause’(e5,x,e6) & not’(e6,e7)

changeTo’(e7,e8)

let(x,e7) & go’(e17,y,e8) & free’(e8,y,c,e9)

not’(e8,e10) & cause(e10,c,e11) & not’(e11,e9) & move’(e9,y,z,w)

changeFrom’(e9,e12) & at’(e12,y,z)Rexist(e0)

“Blunder” and “Lapse”

blunder(e,x) = error(e,x) & cause(e1,e) & stupid’(e1,x)

lapse(e,x) = error(e,x) & cause(e1,e) & neglectful’(e1,x)

Need to capture meaning of “error” in theory of composite entities as a mismatch between a composite entity and a pattern that serves as a norm.

Need to capture meanings of “stupid” and “neglect” in theory of cognition; “stupid” in terms of ability to learn and reason “neglect” in terms of model of attention

(Gordon & Hobbs)

(Edmunds & Hirst)

Near Synonyms?

raise(x,y,z,w) = cause(x, change(at(y,z), at(y,w))) & above(w,z)

rise(y,z,w) = change(at(y,z), at(y,w)) & above(w,z)

Differ only by cause(x, ...)

raise(x,y,z,w) = cause(x, change(at(y,z), at(y,w))) & above(w,z)

lower(x,y,z,w) = cause(x, change(at(y,z), at(y,w))) & above(z,w)

Differ only in relation of w and z

To be near synonyms the words have to describe the same situations.

Message

The meanings of word senses can be expressed as axioms in terms of predicates from core theories of the relevant phenomena.

The word senses of any word form a radial structure, where adjacent nodes differ by incremental changes in the axioms (specializations of predicates, constraints on arguments).

Word senses of different words can have identical axiomatic decompositions; this is synonymy.

Word senses of different words can have nearly identical axiomatic decompositions -- is this near synonymy?

Is near synonymy a natural kind, about which we can make reliable judgments?

More important than labeling word senses as near synonyms is capturing the distinctions formally.

synonymy and near-synonymy in deep lexical semantics niloofar montazeri and jerry r. hobbs...

Documents

change of state

x atu1

x atu2

vaxiomatize core theories

core theoriescore theories

explicated core predicates

core theory of scales

state e1 changes