research share · 2018. 5. 25. · data research share text queries as annotations shebanq ...

1
data research share text queries as annotations SHEBANQ www.godgeleerdheid.vu.nl/etcbc [email protected] Wido van Peursen Eep Talstra Janet Dyk Constantijn Sikkel Jan Krans Oliver Glanz Reinoud Oosting Brenda Kronemeijer-Heyink Gino Kalkman Ulrik Sandborg-Petersen Nicolai Winther-Nielsen /etc bc Eep Talstra Centre for Bible and Computing < node xml:id=" n_88917 "> < link targets =" r_1 r_2 r_3 r_4 r_5 r_6 r_7 r_8 r_9 r_10 r_11 "/> </ node > < a xml:id="a_88917" label=" sentence_atom " ref=" n_88917 " as="lingo"/> < a xml:id="a_f71355" label="ft" ref=" n_88917 " as="lingo">< fs > <f name=" sentence_atom_number " value=" 0 "/> </ fs ></ a > < edge xml:id=" e_1 " from=" n_88917 " to=" n_84383 "/> < a xml:id="a_e1" label=" parents " ref=" e_1 " as="link"/> < region xml:id=" r_1 " anchors =" 0 5 "/> < node xml:id=" n_2 ">< link targets =" r_1 "/></ node > < a xml:id="a_2" label=" word " ref=" n_2 " as="monads"/> < region xml:id=" r_2 " anchors =" 6 23 "/> < node xml:id=" n_3 ">< link targets =" r_2 "/></ node > < a xml:id="a_3" label=" word " ref=" n_3 " as="monads"/> < region xml:id=" w_1 " anchors =" 24 24 "/> labeled edges nodes n_ object id annotations (features) annotations (empty) primary text UNICODE-utf8 regions r_ monad number parents parents pa re nts parents parents pare nts parents lexeme_utf8= ר א י תold_lexeme_utf8= ר א י תvocalized_lexeme_utf8= רֵ אִ י תsurface_consonants_utf8= ר א י תgraphical_lexeme_utf8= רֵ אִ ֖ יְ רֵ אִ ֖ י תָ רָ ֣ א אֱ . הִ ֑ י ם אֵ ֥ ת הַ ָ מַ ֖ יִ ם וְ אֵ ֥ ת הָ אָ ֽ רֶ ץ ׃r_1 0-5 r_2 6-23 w_1 24 r_3 25-38 w_2 39 w_3 58 w_4 67 w_5 92 w_6 105 r_4 40-57 r_5 59-66 r_6 68- 71 r_7 72-91 r_8 93- 96 r_9 97- 104 r_10 106- 109 r_11 110-121 p_7 122- 123 n_2 n_3 n_4 n_5 n_6 n_7 n_8 parents n_9 n_10 n_11 n_12 word word word word word word word word word word word n_84383 sentence number_within_chapter=1 n_59559 phrase determination=determined is_apposition=false number_within_clause=4 phrase_function=Objc phrase_type=PP parents n_34680 clause_atom pare nts n_77637 subphrase parents mother n_77638 subphrase parents n_40770 phrase_atom parents n_28737 clause parent n_88917 sentence_atom r_7 .. r_5 r_11 .. r_9 r_11 .. r_5 r_11 .. r_5 r_11 .. r_1 r_11 .. r_1 r_11 .. r_1 r_11 .. r_1 clause_atom_number=1 clause_atom_relation=0 clause_atom_relation_daughter_tense=unknown clause_atom_relation_kind=No_relation clause_atom_relation_mother_tense=unknown clause_atom_relation_preposition_class=none clause_atom_type=xQtl indentation=0 < a xml:id="a_f22" label="ft" ref=" n_3 " as="utf8">< fs > < f name=" lexeme_utf8 " value=" ר א י ת"/> < f name=" old_lexeme_utf8 " value=" ר א י ת"/> < f name=" vocalized_lexeme_utf8 " value=" א ִ י ת"/> < f name=" surface_consonants_utf8 " value=" ר א י ת"/> < f name=" graphical_lexeme_utf8 " value=" א ִ֖ י"/> </ fs ></ a > link to regions Linguistic Annotation Framework urn:nbn:nl:ui:13-ukhm-eb

Upload: others

Post on 22-Jan-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: research share · 2018. 5. 25. · data research share text queries as annotations SHEBANQ  w.t.van.peursen@vu.nl Wido van Peursen Eep Talstra Janet Dyk Constantijn Sikkel

data

research share

text

queries as annotations

SHEBANQ

www.godgeleerdheid.vu.nl/[email protected] van PeursenEep TalstraJanet DykConstantijn SikkelJan Krans

Oliver GlanzReinoud OostingBrenda Kronemeijer-HeyinkGino KalkmanUlrik Sandborg-PetersenNicolai Winther-Nielsen

/etc bc

Eep Talstra Centre for Bible and Computing

<node xml:id="n_88917"><link targets="r_1 r_2 r_3 r_4 r_5 r_6 r_7 r_8 r_9 r_10 r_11"/>

</node><a xml:id="a_88917" label="sentence_atom" ref="n_88917" as="lingo"/>

<a xml:id="a_f71355" label="ft" ref="n_88917" as="lingo"><fs><f name="sentence_atom_number" value="0"/>

</fs></a><edge xml:id="e_1" from="n_88917" to="n_84383"/>

<a xml:id="a_e1" label="parents" ref="e_1" as="link"/>

<region xml:id="r_1" anchors="0 5"/><node xml:id="n_2"><link targets="r_1"/></node>

<a xml:id="a_2" label="word" ref="n_2" as="monads"/>

<region xml:id="r_2" anchors="6 23"/><node xml:id="n_3"><link targets="r_2"/></node>

<a xml:id="a_3" label="word" ref="n_3" as="monads"/>

<region xml:id="w_1" anchors="24 24"/>

labeled edges

nodesn_object id

annotations(features)

annotations(empty)

primary textUNICODE-utf8

regionsr_monad number

parentsparentsparents

parentsparents

parentsparents

parents

lexeme_utf8= תישארold_lexeme_utf8= תישאר

vocalized_lexeme_utf8= תישארsurface_consonants_utf8= תישאר

graphical_lexeme_utf8= ישאר

׃ץראה תאו םימשה תא םיה.א ארב תישארב

r_10-5

r_26-23

w_124

r_325-38

w_239

w_358

w_467

w_592

w_6105

r_440-57

r_559-66

r_668-71

r_772-91

r_893-96

r_997-104

r_10106-109

r_11110-121

p_7122-123

n_2n_3n_4n_5n_6n_7n_8

parents

n_9n_10n_11n_12

word word word word word word word word word word word

n_84383

sentence

number_within_chapter=1

n_59559

phrase

determination=determinedis_apposition=false

number_within_clause=4phrase_function=Objc

phrase_type=PP

parents

n_34680clause_atom

parents

n_77637

subphrase

parents

mothern_77638

subphrase

parents

n_40770

phrase_atom

parents

n_28737

clause

parentn_88917

sentence_atom

r_7 .. r_5r_11 .. r_9

r_11 .. r_5

r_11 .. r_5

r_11 .. r_1

r_11 .. r_1

r_11 .. r_1r_11 .. r_1

clause_atom_number=1clause_atom_relation=0

clause_atom_relation_daughter_tense=unknownclause_atom_relation_kind=No_relation

clause_atom_relation_mother_tense=unknownclause_atom_relation_preposition_class=none

clause_atom_type=xQtlindentation=0

<a xml:id="a_f22" label="ft" ref="n_3" as="utf8"><fs><f name="lexeme_utf8" value=" תישאר "/>

<f name="old_lexeme_utf8" value=" תישאר "/><f name="vocalized_lexeme_utf8" value=" תישאר "/>

<f name="surface_consonants_utf8" value=" תישאר "/><f name="graphical_lexeme_utf8" value=" ישאר "/>

</fs></a>

link to regions

Linguistic Annotation Framework

urn:nbn:nl:ui:13-ukhm-eb