THE HEBREW BIBLE AS DATA Wido van Peursen Eep Talstra Centre for Bible and Computer @shebanq_ / @PeursenWTvan

Post on 30-Dec-2015

THE HEBREW BIBLE AS DATA

Wido van Peursen

Eep Talstra Centre for Bible and Computer

@shebanq_ / @PeursenWTvan


THE CORPUS

Hebrew Bible
> Ca. 400,000 words
> Probably composed over a period of about 1000 years (1200-200 BC)
> Complex transmission history
> Oldest complete manuscript: Codex Leningradensis, 1008/9 AD
> Various linguistic layers (e.g. vowel signs)
> No native speakers


THE DATABASE

WIVU database of the Hebrew Bible
> [WIVU = Werkgroep Informatica Vrije Universiteit]
> Created since the 1970s
> Linguistic levels:
  > Morphology (encoding rather than tagging!)
  > Words
  > Phrases
  > Clauses
  > Sentences
  > Text hierarchy


THE DATA STRUCTURE


EMDROS

Central concept: objects with features
> Each object can carry unlimited features
> Objects can be aggregated arbitrarily into new objects
> Structure that can deal with overlapping hierarchies
> Query language: MQL
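The object-with-features idea can be illustrated with a minimal Python sketch. It is not EMDROS code: the class `TextObject`, the helpers `aggregate`, `contained_in` and `overlaps`, and the feature names (`kind`, `function`) are all hypothetical, and the five-word text is toy data. Each object is modelled as a typed set of word positions plus a free-form feature dictionary, which is enough to show why two hierarchies over the same words may overlap without conflict.

```python
from dataclasses import dataclass, field


@dataclass
class TextObject:
    """Hypothetical object: a type, the word positions it covers, and features."""
    otype: str
    positions: frozenset
    features: dict = field(default_factory=dict)


def aggregate(otype, parts, **features):
    """Build a new object that covers the union of the positions of its parts."""
    positions = frozenset().union(*(p.positions for p in parts))
    return TextObject(otype, positions, dict(features))


def contained_in(inner, outer):
    """True when every position of `inner` also belongs to `outer`."""
    return inner.positions <= outer.positions


def overlaps(a, b):
    """True when two objects share positions but neither contains the other."""
    shared = a.positions & b.positions
    nested = a.positions <= b.positions or b.positions <= a.positions
    return bool(shared) and not nested


# Toy data: five word objects at positions 1..5.
words = [TextObject("word", frozenset({i})) for i in range(1, 6)]

# A syntactic hierarchy: a phrase inside a clause.
phrase = aggregate("phrase", words[0:2], function="Pred")   # positions 1-2
clause = aggregate("clause", words[0:3], kind="verbal")     # positions 1-3

# A second, independent hierarchy (say, a layout line) over the same words.
line = aggregate("line", words[2:5])                        # positions 3-5
```

Here `contained_in(phrase, clause)` holds, while `overlaps(clause, line)` also holds: the clause and the line share position 3 but neither nests inside the other. A single tree could not represent both groupings, whereas position sets can.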


HOWEVER...

1. No dedicated space on the web where an authorized version of this resource is guaranteed to exist.

2. No possibility to annotate it, link to it or build (open source) tools around it.

3. Results of existing queries cannot be shown on the web.

4. EMDROS is maintained by a one-person private company.

5. Mainly used by specialists in Bible & Computer.


SHEBANQ

Aim: to build a bridge between the linguistically annotated Hebrew text corpus and biblical scholars.

Three steps:

(1) make the text and annotations available to scholars

(2) demonstrate how queries can address research questions, through a repository of saved queries

(3) give textual scholarship a more empirical basis by providing unique identifiers that refer to saved queries
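Steps (2) and (3) can be pictured with a small Python sketch of a query repository. It does not describe how SHEBANQ is implemented: `SavedQuery` and `QueryRepository` are hypothetical names, deriving the identifier from a hash of the query text is an assumption made only for illustration, and the query string is an MQL-style example whose feature names are not taken from the database.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class SavedQuery:
    """Hypothetical record for one saved query."""
    query_id: str
    title: str
    body: str


class QueryRepository:
    """In-memory store that hands out one stable identifier per query text."""

    def __init__(self):
        self._queries = {}

    def save(self, title, body):
        # Assumption: the identifier is a short hash of the query text,
        # so saving the same text again yields the same identifier.
        text = body.strip()
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()[:12]
        query = self._queries.setdefault(digest, SavedQuery(digest, title, text))
        return query.query_id

    def get(self, query_id):
        """Look up a saved query by the identifier cited in a publication."""
        return self._queries[query_id]


repo = QueryRepository()

# Illustrative MQL-style text; the feature name and value are assumptions.
qid = repo.save(
    "Clauses with a predicate phrase",
    "SELECT ALL OBJECTS WHERE [clause [phrase function = Pred]]",
)
```

A publication could then cite `qid`, and a reader could call `repo.get(qid)` to retrieve the exact query behind a claim and rerun it against the corpus.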
