greek historiography through dependency syntax treebanking · greek historiography through...

83
Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University Robert J. Gorman, Dept. of Classics Vanessa B. Gorman, Dept. of History University of Nebraska-Lincoln

Upload: lylien

Post on 29-Aug-2019

224 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Greek Historiography Through Dependency Syntax

TreebankingDigital Classicist New England March 25, 2015, Tufts University

Robert J. Gorman, Dept. of Classics Vanessa B. Gorman, Dept. of History

University of Nebraska-Lincoln

Page 3: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

How accurate are the quotes, paraphrases, excerpts, and

epitomes attributed to earlier authors?

Page 4: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

The Layers of Athenaeus (c. 200 CE)

• Narrator (Athenaeus himself) • The 24 Deipnosophists • 2500+ quotes or paraphrases to 800+

writers • All hopelessly intertangled

Page 5: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Corrupting Luxury in Ancient Greek Literature

By

Robert J. Gorman and

Vanessa B. Gorman

The University of Michigan Press, Ann Arbor

Page 6: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Derive Syntactic “Thumbprints”

• Create a database of syntactically analyzed Greek prose

• Teach the computer to distinguish known authors (proof of concept)

• Compare directly-transmitted to epitomized prose by the same author

Page 7: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Epitomizers and Excerptors

• Polybius (2nd c. BCE) has 5 of 40 books preserved through direct transmission oOthers mainly preserved in the excerptors

working for Emperor Constantine VII Porphyrogenitus (10th c. CE)

• Diodorus Siculus (1st c. BCE) has 15 of 40 books preserved through direct transmission oOthers mainly in Photius (9th century CE)

and the same Constantine excerptors

Page 8: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Fragments of Lost Authors

• Compare to fragments of the same author that are preserved elsewhere

• Compare to context in Athenaeus and Photius

• Does it resemble: o The other fragments of the same author? o The context in Athenaeus?

Page 9: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Dependency Syntax Treebanking

• Corpus Linguistics • Annotation: create a database of

syntactically-analyzed prose • Abstraction: translate into a computer

searchable dataset • Analysis: develop algorithms to query

that dataset

Page 10: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Dependency vs. Constituency Grammar

Page 11: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Dependency vs. Constituency Grammar

Page 12: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

http://nlp.perseus.tufts.edu/syntax/treebank/greek.html

Page 13: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

My DatasetAUTHOR WORK TOKEN COUNT STATUS

Athenaeus Books 12-13 45,584 tokens submitted

Lysias Orations 1, 14, 15 7,650 tokens submitted

Polybius Book 1 28,288 tokens submitted

Herodotus Book 1 32,879 tokens editing

Plutarch Lycurgus 10,567 tokens submitted

Antiphon Oration 1 2,015 tokens editing

Diodorus Siculus Book 11 6,247 tokens [11.1-20 only]

in progress

Thucydides Book 1 13,720 tokens [1.1-80 only]

in progress

TOTAL [2/20/2015]   146,950 tokens  

Page 14: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 15: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

παρεσκευάζετο γὰρ πολλῇ δυνάμει πλεῖν ἐπὶ τὴν Ἑλλάδα καὶ συμμαχεῖ

ν τοῖς Ἕλλησι κατὰ τῶν Περσῶν .

“He was preparing to sail to Greece with a great force and to fight with the Greeks against the Persians.”

(Diodorus 11.26.4 [sent. 58])

Page 16: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 17: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 18: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 19: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 20: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Color coding

Page 21: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 22: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 23: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 24: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Prague tagset

Page 25: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 26: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 27: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Thuc. 1.13.4 [elision]

Page 28: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

A flat tree: Thuc. 1.9.2 [135 words]

Page 29: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

A deep tree: Athen. 12.11 [82 words]

Page 30: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 31: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 32: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

For each word in AGDT we have:

• dependency (word’s parent, children)• syntactic relation (grammatical label for dependency)•Lemma•Morphology•Position in sentence

Page 33: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 34: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Dependency DegreeLinear vs. hubby structure

Page 35: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 36: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 37: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Mary: SBJ-PRED-ROOThad: PRED-ROOTa: ATR-OBJ-PRED-ROOTlittle: ATR-OBJ-PRED-ROOTlamb: OBJ-PRED-ROOT

Page 38: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 39: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 40: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 41: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Ὦ τοῦ στρατηγήσαντος ἐν Τροίᾳ ποτὲ / Ἀγαμέμνονος παῖ"O child of Agamemnon, once leading an army at Troy"

Page 42: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Ὦ τοῦ στρατηγήσαντος ἐν Τροίᾳ ποτὲ / Ἀγαμέμνονος παῖ"O child of Agamemnon, once leading an army at Troy"

Page 43: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 44: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 45: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 46: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 47: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 48: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 49: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 50: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 51: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Burrows Delta

Page 52: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 53: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 54: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 55: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Craig’s Zeta

Hit HitHit Hit HitHit Hit

• Divide corpus 1 into segments of equal size (size = n)• Segments with at least 1 example of given feature are hits.• Each hit is worth 1 point.

• Hits/segments = preferred feature score

Miss MissMissMissMiss MissMiss Miss

• Divide corpus 2 into segments of size n.• Segments with no examples of feature are misses.• Each miss is worth -1 point.

• Misses/segments = avoided feature score

Page 56: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 57: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Thucydides

Page 58: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 59: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 60: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 61: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 62: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 63: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 64: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Herodotus

Page 65: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 66: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 67: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Polybius

Page 68: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 69: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 70: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Homer

Page 71: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 72: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 73: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 74: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

Maciej Eder

Page 75: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 76: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 77: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University
Page 78: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

What Next?

• Test! Test! Test! • Cast the net as widely as possible:

oMany flavors of sWord • With POS, with Dependency Distance …

• N-grams

oMany computational approaches

Page 79: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

What next?

• Test! Test! Test! • Aim directly at research question

oAthenaeus and fragments oAre fragments of single author

distinguishable according to transmitting source?

Page 80: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

What’s needed?

• Trees! Trees! Trees! • Metadata

oDigital Athenaeus oDigital Fragmenta Historicorum

Graecorum • Scalable workflow

o Stable identification for each token

Page 81: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

The Vision Thing

• Treebanker’s Utopia o Real time feedback for annotators

• Is this syntactic structure feasible?

• Is this structure prone to inter-annotator disagreement?

• Philologist’s Elysium o Real time feedback for close readers o How does this text compare to others:

• Lexically, syntactically, semantically?

• Pragmatically, acoustically, etc.?

Page 82: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University

• Leipzig Open Philology Project oDigital Athenaeus Project

• Perseus and Perseids Projects, Tufts University o Perseus Open Publication Series

• University of Nebraska−Lincoln oDept. of History oDept. of Classics and Religious Studies

Page 83: Greek Historiography Through Dependency Syntax Treebanking · Greek Historiography Through Dependency Syntax Treebanking Digital Classicist New England March 25, 2015, Tufts University