sabine tittel, state of the art: the „old french situation“
TRANSCRIPT
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
1/20
Workshop Connecting textual
corpora and dictionaries25-26 April 2013, Krakw
S. Tittel, DEAF State of the Art: The Old French Situation
State of the Art:The Old French Situation
1
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
2/20
The DEAF
Includes all primary text sources
Evaluates the results of all dictionaries of OldFrench
Ongoing composition since 1968
Published in print G K (incl. prefixes A Z)
On-line since 2010 Project funding ends 2017
S. Tittel, DEAF State of the Art: The Old French Situation 2
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
3/20
The corpus of Old French
Includes texts of all genres and whole time period
DEAF bibliography: references to about 2,700
primary text sources Extrapolation from digital corpora:
Corpus NCA: 140 texts, total 3 M words
Corpus BFM: 75 texts, total 3.3 M words=> estimated size OF corpus: 60 120 M words
=> No digital corpus of Old French
S. Tittel, DEAF State of the Art: The Old French Situation 3
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
4/20
Situation of Corpus Linguistics andCorpus Lexicography from DEAF POV
Overview existing projects of Old French
corpus linguistics and
corpus lexicography
DEAF is not part of this talk
Basis for discussion of main objectives of theworkshop
Possible input into later debate
S. Tittel, DEAF State of the Art: The Old French Situation 4
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
5/20
Agenda
1.List of existing projects
2.What are the main considerations forcompiling electronic corpora?
3.What are existing/expected links between
electronic dictionaries and corpora?4.How can textual corpora and the dictionarywriting process benefit each other?
S. Tittel, DEAF State of the Art: The Old French Situation 5
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
6/20
Corpus linguistic Projects
1. BFM
Base de Franais Mdival(Lyon)
2. CoRPTeFCorpus reprsentatif des premiers textesfranais (Lyon)
3. NCANouveau Corpus d'Amsterdam (Stuttgart)
4. DocLing
Les plus anciens documents linguistiques de laFrance (Zurich)
5. Corpus de testaments de Saint-Di-des-Vosges(Aberystwyth)
6. TFATextes de Franais Ancien (Ottawa)7. MCVFModliser le changement:les voies du franais
(Ottawa)8. Frantext (Nancy)
S. Tittel, DEAF State of the Art: The Old French Situation 6
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
7/20
Corpus lexicographical projects
1. ANDElThe Anglo-Norman Dictionary
(Aberystwyth)
2. [DMFDictionnaire du Moyen Francais(ATILF, Nancy)]
3. DCTDictionnaire lectronique de Chrtiende Troyes (Ottawa / ATILF, Nancy)
Issues exemplary for workshop topic
S. Tittel, DEAF State of the Art: The Old French Situation 7
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
8/20
DCT
Corpus: romances of Chrtien de Troyes [BN fr.794; Guiot]
Readable en bloc, partly lemmatised
Dictionary: A Q
Search functions
close-meshed linking to corpus and vice versa Linking to bitmap images of ms.
Linking to Godefroy, DMF, DEAF
S. Tittel, DEAF State of the Art: The Old French Situation 8
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
9/20
Issues DCT from DEAF POV
Single ms. as basis for the language ofChrtien
=> Information isolated
Too little focus on semantics
S. Tittel, DEAF State of the Art: The Old French Situation 9
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
10/20
What are the main considerations
for compiling electronic corpora?
S. Tittel, DEAF State of the Art: The Old French Situation 10
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
11/20
Main considerations for compilingelectronic corpora
Far away from workable approaches which suffice
the existing philological standards
the desired information retrieval.
Target: Edit data for studies
morphosyntactic, phraseological, semantic, andhistorico-cultural
Basis: the comprehension of the text
Q: Enough emphasis on semantic tagging?
S. Tittel, DEAF State of the Art: The Old French Situation 11
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
12/20
What are existing/expected links
between electronic dictionaries andcorpora?
S. Tittel, DEAF State of the Art: The Old French Situation 12
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
13/20
Existing (electronic) links of DEAF
a) Corpus lexicographical projects
ANDEl, and DMF
b) Corpus linguistic projects
[Frantext: regular use but no linking]
S. Tittel, DEAF State of the Art: The Old French Situation 13
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
14/20
Q: Why only these?
A: Cost-benefit ratio.
Incorporating results
Turns them into source material Need to be carefully verified and evaluated
Both Enrichment Obstruction
S. Tittel, DEAF State of the Art: The Old French Situation 14
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
15/20
Example:jorday
Traditional analysis of 1,000 slips: ca. 50senses / sub-senses
the Old French corpus 50,000 att.
High cost: min. 3 years
Low benefit: corpus driven approach + 1 or 2extra sub-senses
S. Tittel, DEAF State of the Art: The Old French Situation 15
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
16/20
Expected Links
Vision: integration of all valuable results
Two modest dreams:1) Better semantic preparation of digital corpus2) On-line publication of more dictionaries (FEW, TL)
More linking to dictionaries would Not change content of DEAF articles Enhance the quality of on-line publication Refine the meshes of a digital network
S. Tittel, DEAF State of the Art: The Old French Situation 16
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
17/20
How can textual corpora contribute to
the dictionary writing process?How can electronic dictionariescontribute to corpora enrichment?
S. Tittel, DEAF State of the Art: The Old French Situation 17
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
18/20
Dictionaries and corpus qualitydetermine each other
The respective qualities rise in mutualdependence.
=> synopsis comparative
/ vergleichende Zusammenschau
/ the comparative synopsis of knowledge=> key to quality enhancement
S. Tittel, DEAF State of the Art: The Old French Situation 18
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
19/20
How can textual corpora benefit thedictionary writing process?
Two approaches:
1. Newly designed and conceptualised
dictionaries (ex. DCT)
2. Long-established dictionaries
Characteristic for COST Action
=> 1st step: Coop. DEAF-DocLing
S. Tittel, DEAF State of the Art: The Old French Situation 19
-
7/30/2019 Sabine Tittel, State of the Art: The Old French Situation
20/20
DEAF Team ca. 2012
S. Tittel, DEAF State of the Art: The Old French Situation 20