old german reference corpus - hu-berlin.de · old german reference corpus comprising all texts...

1
Old German Reference Corpus Comprising all texts composed in Old High German and Old Saxon (around 7501050 CE, about 650.000 words) Digitization, annotation and online publication for scientific retrieval Digitization of dictionaries Digitization of reference editions Collation with manuscript Edition Conversion HTML > XML Lexical information Header (meta data) Grammatical information Manual annotation and disambiguation Integration into database http://www.cesg.unifr.ch http://www.e-codices.unifr.ch/en Manuscript (sample): Sangaller Credo (St. Gallen, ms. 911) Dictionary Reference edition TITUS edition http://titus.uni-frankfurt.de ELAN presentation XML version of dictionary Structured information on text Fully annotated ELAN version Grammar ANNIS (database and retrieval system) Research Project Title: Referenzkorpus Altdeutsch (750–1050)” Project leaders: Karin Donhauser (Humboldt University, Berlin), Jost Gippert (Johann Wolfgang Goethe University, Frankfurt), Rosemarie Lühr (Friedrich Schiller University, Jena) Funding institution: DFG Begin of funding: December 2008 Staff Berlin: Sonja Linde, Julia Richling, Eva Schlachter, Daniel Brauer, Silke Unverzagt Frankfurt: Ralf Gehrke, Roland Mittmann, Johannes Heimeroth Jena: Natalia Chumakova, Ulrike Ertel, Falko Georgi, Fransis Giseke, John Reichel, Laura Sturm Partner Project Title: Referenzkorpus Mittelhochdeutsch (1050–1350)” Project leaders: Klaus-Peter Wegera (Bochum), Thomas Klein (Bonn), Stefanie Dipper (Bochum) scripted scripted scripted scripted scripted DeutschDiachronDigital Altdeutsch www.sprachgeschichte.de/DDD

Upload: others

Post on 24-Sep-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Old German Reference Corpus - hu-berlin.de · Old German Reference Corpus Comprising all texts composed in Old High German and Old Saxon (around 750–1050 CE, about 650.000 words)

Old German Reference CorpusComprising all texts composed in Old High German and Old Saxon (around 750–1050 CE, about 650.000 words)

Digitization, annotation and online publication for scientific retrieval

Digitizationof dictionaries

Digitization ofreference editions Collation with

manuscript

Edition

ConversionHTML > XML

Lexicalinformation

Header(meta data)

Grammaticalinformation

Manual annotationand disambiguation

Integration intodatabase

http://www.cesg.unifr.chhttp://www.e-codices.unifr.ch/en

Manuscript (sample): Sangaller Credo (St. Gallen, ms. 911)

DictionaryReference edition

TITUS edition

http://titus.uni-frankfurt.de

ELAN presentation

XML version of dictionary

Structured information on text

Fully annotated ELAN version Grammar

ANNIS (database and retrieval system)

Research ProjectTitle: “Referenzkorpus Altdeutsch (750–1050)”Project leaders: Karin Donhauser (Humboldt University, Berlin),Jost Gippert (Johann Wolfgang Goethe University, Frankfurt),Rosemarie Lühr (Friedrich Schiller University, Jena)Funding institution: DFGBegin of funding: December 2008

StaffBerlin: Sonja Linde, Julia Richling, Eva Schlachter, Daniel Brauer,Silke UnverzagtFrankfurt: Ralf Gehrke, Roland Mittmann, Johannes HeimerothJena: Natalia Chumakova, Ulrike Ertel, Falko Georgi, Fransis Giseke, John Reichel, Laura Sturm

Partner ProjectTitle: “Referenzkorpus Mittelhochdeutsch (1050–1350)”Project leaders: Klaus-Peter Wegera (Bochum), Thomas Klein (Bonn), Stefanie Dipper (Bochum)

scriptedscripted

scripted

scripted

scripted

DeutschDiachronDigital

Altdeutsch www.sprachgeschichte.de/DDD