Recent Advances in Distantly Supervised Relation Extraction
William Wang
Department of Computer Science
University of California, Santa Barbara
Joint work with Jiawei Wu, Lei Li, Pengda Qin, Weiran Xu.
CIPS Summer School 2018
Beijing, China
1
Agenda
• Motivation
• Challenges in Semi-Supervised Learning
• Reinforced Co-Training (Wu et al., NAACL 2018)
• Reinforced Distant Supervision Relation Extraction (Qin et al., ACL 2018a)
• DSGAN (Qin et al., ACL 2018b)
• Conclusions
2
Motivation
3
• Most of the existing successful stories of deep learning are still based on supervised learning.
• For example, object recognition, machine translation, text classification.
• However, in many applications, it is not realistic to obtain large amounts of labeled data.
• We need to leverage unlabeled data.
A Classic Example of Semi-Supervised Learning
• Co-Training (Blum and Mitchell, 1998)
4
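The co-training loop can be sketched as follows. This is a minimal toy, not the original setup: a single numeric feature per view and a threshold learner stand in for the two independent classifiers.

```python
# Minimal co-training sketch (Blum & Mitchell, 1998): two classifiers,
# each seeing a different view of the data, label their most confident
# unlabeled examples for each other.

def fit_threshold(xs, ys):
    """Toy one-feature learner: threshold midway between class means."""
    pos = [x for x, y in zip(xs, ys) if y == 1]
    neg = [x for x, y in zip(xs, ys) if y == 0]
    return (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2

def predict(th, x):
    return 1 if x >= th else 0

def confidence(th, x):
    return abs(x - th)  # distance from the boundary as a crude confidence

def co_train(labeled, unlabeled, rounds=3):
    """labeled: list of ((x1, x2), y); unlabeled: list of (x1, x2) pairs."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not pool:
            break
        # Retrain each view's classifier on all labels gathered so far.
        ys = [y for _, y in labeled]
        th1 = fit_threshold([x1 for (x1, _), _ in labeled], ys)
        th2 = fit_threshold([x2 for (_, x2), _ in labeled], ys)
        # Each view moves its most confident unlabeled example into the
        # labeled set, with its own predicted label.
        for th, view in ((th1, 0), (th2, 1)):
            if not pool:
                break
            best = max(pool, key=lambda ex: confidence(th, ex[view]))
            pool.remove(best)
            labeled.append((best, predict(th, best[view])))
    return labeled
```

The challenges on the next slide follow directly from this loop: the views must be independent, and "most confident" self-labels can be wrong.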
Challenges
• The two classifiers in co-training have to be independent.
• Choosing highly-confident self-labeled examples could be suboptimal.
• Sampling bias shift is common.
5
Our Approach: Reinforced SSL
• Assumption: not all the unlabeled data are useful.
• Idea: performance-driven semi-supervised learning that learns an unlabeled data selection policy with RL, instead of using random sampling.
1. Partition the unlabeled data space
2. Train an RL agent to select useful unlabeled data
3. Reward: change in accuracy on the validation set
6
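A bandit-style sketch of the selection idea above, assuming a hypothetical `gain_of(k)` stub that stands in for "self-label partition k, retrain, and measure the change in validation accuracy":

```python
import random

def select_partitions(gain_of, n_parts=3, steps=300, eps=0.3, seed=0):
    """Performance-driven data selection (sketch): the agent picks a
    partition of the unlabeled pool; the reward is the change in
    validation accuracy after using it. gain_of(k) is a stub for the
    real train-and-evaluate step."""
    rng = random.Random(seed)
    q = [0.0] * n_parts   # estimated accuracy gain per partition
    n = [0] * n_parts
    for _ in range(steps):
        if rng.random() < eps:                      # explore
            k = rng.randrange(n_parts)
        else:                                       # exploit
            k = max(range(n_parts), key=lambda i: q[i])
        r = gain_of(k)                              # observed reward
        n[k] += 1
        q[k] += (r - q[k]) / n[k]                   # incremental mean
    return q
```

The learned values replace the random sampling of classic co-training: partitions whose data actually improve validation accuracy are selected more often.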
Reinforcement Learning
7
[Figure: the agent-environment loop of reinforcement learning. The agent takes action a_t; the environment returns the next state s_{t+1} and reward r_{t+1}. In Reinforced Co-Training, the agent is a Deep Q-Network and the environment is the unlabeled data.]
Reinforced Co-Training (Wu et al., NAACL 2018)
8
Deep Q-Learning
9
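A tabular Q-learning sketch on a toy chain environment. The deep variant used in the talk replaces the table with a Q-network over states; the chain MDP here is purely illustrative.

```python
import random

def q_learning(n_states=4, episodes=500, alpha=0.5, gamma=0.9, eps=0.2, seed=0):
    """Tabular Q-learning on a toy chain: states 0..n-1, actions
    {0: left, 1: right}; reward 1 for reaching the last state.
    (A DQN replaces the q table with a neural network.)"""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            # epsilon-greedy action selection
            if rng.random() < eps:
                a = rng.randrange(2)
            else:
                a = max((0, 1), key=lambda x: q[s][x])
            s2 = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s2 == n_states - 1 else 0.0
            # Bellman backup: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
            s = s2
    return q
```

After training, the Q-values prefer "right" in every state, i.e. the shortest path to the rewarded terminal state.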
Experiment 1: Clickbait Detection
10
Experiment 1: Clickbait Detection
11
Experiment 2: Generic Text Classification
12
Experiment 2: Generic Text Classification
13
Conclusion
• We proposed a novel RL framework for semi-supervised learning
• Strong results in SSL text classification
• Also showed effectiveness in relation extraction
14
Deep Reinforcement Learning for Distantly Supervised Relation Extraction
15
Outline
• Motivation
• Algorithm
• Experiments
• Conclusion
16
[Figure: relation extraction pipeline. A classifier maps a plain-text corpus (unstructured info) to entity-relation triples (structured info); some relation types come with a labeled dataset, others do not.]
18
Relation Extraction
“If two entities participate in a relation, any sentence that contains those two entities might express that relation.” (Mintz et al., 2009)
19
Distant Supervision
Data(x): <Belgium, Nijlen>Label(y): /location/contains
Target Corpus(Unlabeled)
DS Label(y): /location/contains
Nijlen is a municipality located in the Belgian province of Antwerp.
20
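The labeling step behind this example can be written as a few lines. This is a sketch: naive substring matching stands in for real entity linking, and `distant_label` is a hypothetical helper, but it shows exactly where the noisy labels come from.

```python
def distant_label(kb, sentences):
    """Distant supervision (Mintz et al., 2009 assumption): if a KB triple
    (e1, rel, e2) exists and a sentence mentions both e1 and e2, label
    that sentence with rel -- whether or not it actually expresses it."""
    labeled = []
    for s in sentences:
        for e1, rel, e2 in kb:
            if e1 in s and e2 in s:
                labeled.append((s, (e1, e2), rel))
    return labeled
```

Any sentence that merely co-mentions the pair gets the relation label, which is precisely the wrong-labeling problem discussed next.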
Distant Supervision
DS Data(x):
• Within-Sentence-Bag Level
  – Hoffmann et al., ACL 2011
  – Surdeanu et al., EMNLP-CoNLL 2012
  – Zeng et al., EMNLP 2015
  – Lin et al., ACL 2016
• Entity-Pair Level
  – None
21
Wrong Labeling
• Place_of_Death
i. Some New York city mayors – William O’Dwyer, Vincent R. Impellitteri and Abraham Beame – were born abroad.
ii. Plenty of local officials have, too, including two New York city mayors, James J. Walker, in 1932, and William O’Dwyer, in 1950.
22
Wrong Labeling
• Entity-Pair Level
• Most entity pairs have only a few sentences

[Chart: 1 sentence: 55%; 2 sentences: 32%; other: 4%]
23
Wrong Labeling
• Many entity pairs have repetitive sentences
Outline
• Motivation• Algorithm• Experiments• Conclusion
24
Sentence-Level Indicator
Entity-Pair Level Wrong Labeling Problem
Learn a Policy to Denoise the Training Data
25
Requirements
• General-purpose and offline process
• Without supervised information

[Figure: a policy-based agent takes actions on the DS dataset; false positives are moved from the positive set to the negative set, and a classifier trained on the cleaned dataset provides the reward.]
26
Overview
• State
• Action
• Reward: ???
27

Deep Reinforcement Learning
• State
  – Sentence vector
  – The average vector of previously removed sentences
• Action
  – Remove or retain
• One relation type has its own agent
28
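The state described above might be assembled as follows. A sketch only: real sentence vectors would come from a neural encoder, and `agent_state` is a hypothetical helper.

```python
def agent_state(sent_vec, removed_vecs):
    """State sketch: the current sentence vector concatenated with the
    average vector of previously removed sentences (zeros if none)."""
    if removed_vecs:
        dim = len(removed_vecs[0])
        avg = [sum(v[i] for v in removed_vecs) / len(removed_vecs)
               for i in range(dim)]
    else:
        avg = [0.0] * len(sent_vec)
    return sent_vec + avg  # concatenation, not elementwise addition
```

The removed-sentence average lets the agent condition its remove/retain decision on what it has already filtered out this epoch.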
Deep Reinforcement Learning: RL Agent Training
• Sentence-level
• Positive: distantly-supervised positive sentences
• Negative: randomly sampled
• Split into a training set and a validation set
29
Deep Reinforcement Learning

[Figure: at each epoch i, the RL agent splits the noisy dataset (P_ori, N_ori) into a cleaned dataset and a removed part; a relation classifier trained on the cleaned data yields F1_i, which is compared with the previous epoch's F1_{i-1}. The resulting reward is applied with opposite signs to the retained and removed parts.]

R_i = α (F1_i - F1_{i-1})
30
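The reward on this slide, R_i = α (F1_i - F1_{i-1}), is a one-liner; the scale α used here is a hypothetical default, not a value taken from the paper.

```python
def epoch_reward(f1_curr, f1_prev, alpha=100.0):
    """Reward sketch: R_i = alpha * (F1_i - F1_{i-1}).
    Positive when this epoch's removals improved the classifier,
    negative when they hurt it; alpha rescales the small F1 deltas."""
    return alpha * (f1_curr - f1_prev)
```

Because the reward is a difference between consecutive epochs, the agent is driven by whether its removals helped, not by the absolute F1 level.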
Reward
• Accurate
• Steady
• Fast

[Figure: positive and negative sets; the false positives are obvious.]
31
[Figure: at epoch i, a pre-trained relation classifier is trained on the positive and negative sets (false positives included) and F1 is calculated to produce the reward.]
Outline
• Motivation
• Algorithm
• Experiments
• Conclusion
32
Evaluation on a Synthetic Noise Dataset
• Dataset: SemEval-2010 Task 8
• True Positive: Cause-Effect
• False Positive: Other
• True Positive + False Positive: 1331 samples
33
Evaluation on a Synthetic Noise Dataset

[Figure: F1 score vs. training epoch with 200 false positives among the 1331 samples; annotations give (false positives / removed part) at several epochs: 198/388, 197/339, 195/308, 180/279, 179/260.]
34
Evaluation on a Synthetic Noise Dataset

[Figure: F1 score vs. training epoch with 0 false positives among the 1331 samples; the removed parts contain no false positives: 0/258, 0/150, 0/121, 0/59, 0/32.]
35
36
Distant Supervision on the NYT-Freebase Dataset
• CNN+ONE, PCNN+ONE
  – Distant supervision for relation extraction via piecewise convolutional neural networks (Zeng et al., 2015)
• CNN+ATT, PCNN+ATT
  – Neural relation extraction with selective attention over instances (Lin et al., 2016)
37
Distant Supervision

[Figure: precision-recall curves of the CNN-based models on NYT: CNN+ONE, CNN+ONE_RL, CNN+ATT, CNN+ATT_RL.]
38
Distant Supervision

[Figure: precision-recall curves of the PCNN-based models on NYT: PCNN+ONE, PCNN+ONE_RL, PCNN+ATT, PCNN+ATT_RL.]
Outline
• Motivation
• Algorithm
• Experiments
• Conclusion
39
Conclusion
• We propose a deep reinforcement learning method for robust distant supervision relation extraction.
• Our method is model-agnostic.
• Our method boosts the performance of recently proposed neural relation extractors.
40
DSGAN: Adversarial Learning for Denoising Distantly Supervised Relation Extraction
(Qin et al., ACL 2018b)
41
42
Distant Supervision Data Distribution

[Figure: the DS data space contains DS positive data and DS negative data.]
43
Data Distribution

[Figure: within the DS data space, the DS positive data splits into true positives and false positives; the decision boundary learned from DS data deviates from the desired decision boundary.]
44
Adversarial Training

[Figure: the noisy positive set mixes true and false positives, all labeled 1 under distant supervision; the generator learns to separate them, keeping true positives at label 1 and relabeling false positives as 0.]
45
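One adversarial step might be bookkept as follows. A sketch under stated assumptions: `gen_prob` is a stand-in for the generator network's per-sentence score, and the label assignment (generator-selected samples relabeled 0 for the discriminator) follows the slide's reading.

```python
import random

def split_by_generator(sentences, gen_prob, seed=0):
    """One DSGAN-style step (sketch): the generator scores each
    DS-positive sentence with the probability that it is a true
    positive; sampling by that score splits the bag into a
    high-confidence part (relabeled 0 for the discriminator) and a
    low-confidence part (kept as label 1)."""
    rng = random.Random(seed)
    high, low = [], []
    for s in sentences:
        (high if rng.random() < gen_prob(s) else low).append(s)
    return [(s, 0) for s in high], [(s, 1) for s in low]
```

The discriminator's response to these relabeled samples then serves as the generator's reward, so no human labels are needed anywhere in the loop.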
DSGAN (Qin et al., ACL 2018b)

[Figure: at epoch i, bags Bag_{k-1}, Bag_k, Bag_{k+1} from the DS positive dataset are fed to the generator G, which assigns each sentence s_j a probability p_j of being a true positive (e.g. p_1 = 0.57, p_2 = 0.02, p_3 = 0.83, ...); sampling by these probabilities splits each bag into high-confidence samples (relabeled 0) and low-confidence samples (kept as label 1) for the discriminator D, whose output provides the reward. D is pre-trained on the DS positive dataset (label 1) and the DS negative dataset (label 0).]
46

Characteristics
• Sentence-level noise reduction
• Training without supervised information
• Model-agnostic
47
Distant Supervision Relation Extraction

[Figure: precision-recall curves of the CNN-based models: CNN+ONE, CNN+ONE_GANs, CNN+ATT, CNN+ATT_GANs.]
48
Distant Supervision Relation Extraction

[Figure: precision-recall curves of the PCNN-based models: PCNN+ONE, PCNN+ONE_GANs, PCNN+ATT, PCNN+ATT_GANs.]
Conclusion
• We introduce Reinforced Co-Training, a new approach that combines reinforcement learning and semi-supervised learning.
• We show that in weakly-supervised relation extraction, reinforcement learning can be utilized to de-noise the training signals.
• Adversarial learning serves as a joint learning framework, and it can also be applied to de-noising distantly supervised IE data.
49
50
Thanks!
http://nlp.cs.ucsb.edu