Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation Ikuya Yamada 1,2 Hiroyuki Shindo 3 Hideaki Takeda 4 Yoshiyasu Takefuji 2 1 Studio Ousia 2 Keio University 3 Nara Institute of Science and Technology 4 National Institute of Informatics

Uploaded by ikuya-yamada on 10-Apr-2017


TRANSCRIPT

Page 1:

Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation

Ikuya Yamada1,2 Hiroyuki Shindo3 Hideaki Takeda4 Yoshiyasu Takefuji2

1Studio Ousia 2Keio University 3Nara Institute of Science and Technology 4National Institute of Informatics

Page 2:

STUDIO OUSIA

Named Entity Disambiguation

‣ Named Entity Disambiguation (NED) is the task of resolving named entity mentions in text to their correct referent entities in a knowledge base

Example headline: "New Frozen Boutique to Open at Disney's Hollywood Studios"

Linked entities:

- /wiki/Frozen_(2013_film)
- /wiki/The_Walt_Disney_Company
- /wiki/Disney's_Hollywood_Studios

Page 3:


Joint Learning of Embedding of Words and Entities

‣ The proposed method extends the skip-gram model to map words and entities into the same continuous vector space

‣ Three models are combined to train the embeddings:

‣ KB graph model (graph) learns to predict the neighboring entities of a given entity in the link graph of Wikipedia

‣ Anchor context model (anchor) learns to predict neighboring words given an entity, using anchors and their context words

‣ Conventional skip-gram model (word) learns to predict neighboring words given a target word
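Read together, the three models amount to one joint skip-gram-style objective. The following sketch uses notation I introduce for illustration (it is not taken from the slides): $\mathcal{D}$ is the text corpus, $C(\cdot)$ a context window, $\mathcal{E}$ the entity set, and $N(e)$ the entities linked to $e$ in the Wikipedia link graph.

```latex
% Illustrative joint objective: sum of three skip-gram-style log-likelihoods
\begin{aligned}
\mathcal{L} &= \mathcal{L}_{\text{word}} + \mathcal{L}_{\text{anchor}} + \mathcal{L}_{\text{graph}},\\
\mathcal{L}_{\text{word}}   &= \sum_{w \in \mathcal{D}} \sum_{c \in C(w)} \log P(c \mid w)
  && \text{(words predict neighboring words)}\\
\mathcal{L}_{\text{anchor}} &= \sum_{(e,\,a)} \sum_{c \in C(a)} \log P(c \mid e)
  && \text{(entities predict the context words of their anchors } a\text{)}\\
\mathcal{L}_{\text{graph}}  &= \sum_{e \in \mathcal{E}} \sum_{e' \in N(e)} \log P(e' \mid e)
  && \text{(entities predict neighboring entities in the link graph)}
\end{aligned}
```

Each conditional probability is a softmax over dot products of vectors in the shared word-entity space, so all three terms push related words and entities close together.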

[Figure: the Wikipedia link graph (entities such as Aristotle, Plato, Avicenna, Philosophy, Philosopher, Metaphysics, Logic, Science, Europe, Socrates, Renaissance) and the neighboring words of words and anchors, e.g., the anchor context "Aristotle was a philosopher"]

Page 4:


‣ We propose two simple context models based on the proposed embeddings:

‣ Textual context: cosine similarity between the vector of the candidate entity and the average vector of the noun words in the document

‣ Coherence: cosine similarity between the vector of the candidate entity and the average vector of the other entities in the document

‣ These context models and standard NED features (e.g., prior probability and entity prior) are combined using a supervised machine learning model (gradient-boosted regression trees, GBRT)

‣ We achieved state-of-the-art accuracies on two popular NED datasets
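The two context scores are plain cosine similarities over the shared embedding space. A minimal sketch with NumPy, assuming the embeddings are already trained (the function names and toy vectors here are my own, not from the slides):

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def textual_context_score(entity_vec, noun_vecs):
    """Similarity between a candidate entity vector and the
    average vector of the noun words in the document."""
    return cosine(entity_vec, np.mean(noun_vecs, axis=0))

def coherence_score(entity_vec, other_entity_vecs):
    """Similarity between a candidate entity vector and the
    average vector of the other entities in the document."""
    return cosine(entity_vec, np.mean(other_entity_vecs, axis=0))

# Toy 4-dimensional embeddings, for illustration only.
rng = np.random.default_rng(0)
entity = rng.normal(size=4)          # candidate entity vector
nouns = rng.normal(size=(5, 4))      # noun-word vectors in the document
others = rng.normal(size=(3, 4))     # vectors of the other entities

tc = textual_context_score(entity, nouns)
coh = coherence_score(entity, others)
```

In the full system these two scores would simply be appended to the standard NED features (prior probability, entity prior, etc.) as input columns for the GBRT ranker.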
