commonalities and differences

26
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006 , Kobe Japan. Commonalities and Differences between TC 37 and SC32/WG 2 in Terminological Usage and Methodological Approach Day: 2 Slot No. K2 Name: Sue Ellen Wright 1 + Gerhard Budin 2 Organization: 1 Kent State University; 2 University of Vienna

Upload: garan

Post on 10-Feb-2016

56 views

Category:

Documents


1 download

DESCRIPTION

Different communities with different historical traditions TC 37: linguistics, terminology studies, translation, information science, engineering SC 32/WG 2: computer science, information systems, engineering, information science Differences in scopes and thematic coverage - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Commonalities and differences

9th Open Forum on Metadata RegistriesHarmonization of Terminology, Ontology

and Metadata20th – 22nd March, 2006 , Kobe Japan.

Commonalities and Differences between

TC 37 and SC32/WG 2 in Terminological Usage and Methodological Approach

Day: 2 Slot No. K2Name: Sue Ellen Wright1 + Gerhard Budin2

Organization: 1Kent State University; 2University of Vienna

Page 2: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Commonalities and differences

Different communities with different historical traditions– TC 37: linguistics, terminology studies, translation,

information science, engineering– SC 32/WG 2: computer science, information systems,

engineering, information science– Differences in scopes and thematic coverage

Shared interests and goals– Terminological precision and transparence in

communication and information processes– Interoperability in heterogeneous information

envirionments

Page 3: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Variant usage between TC 37 and JTC 1/SC 32 with

regard to data category and data element

Page 4: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Complex and open data categories

Page 5: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Closed data category

Page 6: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Conceptual domain

Page 7: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Simple and complex data elements in the 11179 community

Page 8: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Data category specifications and names

Page 9: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

ISO 12620:1999

Computer Assisted Terminology—Data Categories– Too many data categories– Commonly used data categories scattered through the standard

Application areas– Official term banks, language planning organizations, standards

organizations– Major corporations, small and medium businesses– Individual terminologists, translators, and technical writers– Localization industry– Government agencies

Page 10: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Accessibility Issues

Need for clear guidance on application-specific use of the data categories

Thematic categorization in 12620:1999 No recommended sample sets Issue of data modeling variance Alphabetical ordering in the new on-line

version Addition of data categories for other language

resources

Page 11: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Global Data Category Registry (DCR)

Difficulty in achieving consensus on logical ordering

Addition of new working environments in TC 37– Lexicography– Morpho-syntactic markup– Semantic markup– NLP lexicons– SKOS environment

Page 12: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Web Accessibility

Core standard: ISO 12620– Rules for establishing a Data Category Registry as

a metadata registry (DCR) Registration authority Maintenance of the DCR on the Web Need for Guide to the data categories in order

to facilitate clear and easy usability

Page 13: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Terminology Management Environments and Aims

National, regional, and local groups establishing databases and data banks to serve the public interest on a number of levels

Standardizers and other domain-specific experts Companies, enterprises, and governmental agencies Individual terminologists, translators, technical

writers, and students creating their own terminology resources

Page 14: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Classification Criteria

Monolingual and multilingual Specialized design or off-the-shelf software Constraints involving data input, storage,

retrieval, and information output Needs of various stakeholders Related resources

– Thesauri, taxonomies, and ontologies– Semantic Web environments

Page 15: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Criteria for a Guide to Terminological Data Categories

Replacement for ISO/TR 12618:1994, Computer aides in terminology – Creation and use of terminological databases

Variation in database structures– Simple glosses– Translation-oriented terminology management– Complex multi-application approaches

Page 16: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Basic Data Category Groups

terms (including all source-language synonyms and equivalents in other languages)

classification of term types (e.g., synonyms, variants, full and abbreviated forms)

term-related information (grammar, etymology, register, status) descriptive information definitions, contexts examples, notes, and graphic information administrative information identifiers of various sorts dates, responsibility entry status sources, combinable with terms themselves, definitions, contexts, notes,

and other pieces of information

Page 17: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Other Guides to Data Category Selection

ISO 10241:1992 (and revision), Preparation and layout of international terminology standards – Primarily standards-oriented

ISO 12616:2002, Translation-oriented terminography– Traditional format, main-entry-term-oriented– Not focused on the principle of term autonomy– Issues with granularity and data element

elementarity

Page 18: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Issues for a New Guide

Data modeling variance and term autonomy n conjunction with the application of the closed data category termType

Locating data categories throughout the collection– Locating non-standard synonymic data category names for

standardized names (use for…) Providing sample “common” data category layouts

– Overcoming “presentational distance” between different data category groups

Page 19: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Translation-Related Dat-Cats

False friend Degree of equivalence Transfer comment

– Anchor at language or term-section level Directionality

Page 20: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Standardization & Language Planning

Term status (preferred, admitted, deprecated)– Regulated terms (legal regulations)– Suggested terms

Process status (of the term entry itself) Administrative status (of term planning phase)

Page 21: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Links and Relations

Terminological concept systems Controlled vocabularies (thesauri, etc.) Classification systems Ontologies and taxonomies

– Expression in SKOS (Simple Knowledge Organization System markup)

Distinction between generic systems– Subsumation, is-a relations

… and non-generic, non-subsumation systems– Meronymic (partitive) relations– All other relations

Page 22: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Links to Other Entries & Objects

Internal links to other entries– Antonym– Entailed term– See– See also– Concept system links

Links to other objects– Corpus trace (link to term in context)– Concordance links– Shared resource links, etc.– Sources & source identifiers

Page 23: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Language-Related Links

XML:lang as a language identifier XML:lang applied as a common locale

identifier (UTS#35) Transcription, transliteration, Romanization Source & target language identification

Page 24: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Shared Data Categories

Word/term-related data categories Standard “lexicography” categories (e.g.,

etymology, definition, etc.) Administrative categories

Page 25: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006

Page 26: Commonalities and differences

9th Open Forum for Metadata Registry, Kobe, 2006