commonalities and differences

Post on 10-Feb-2016

56 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

DESCRIPTION

Different communities with different historical traditions TC 37: linguistics, terminology studies, translation, information science, engineering SC 32/WG 2: computer science, information systems, engineering, information science Differences in scopes and thematic coverage - PowerPoint PPT Presentation

TRANSCRIPT

9th Open Forum on Metadata RegistriesHarmonization of Terminology, Ontology

and Metadata20th – 22nd March, 2006 , Kobe Japan.

Commonalities and Differences between

TC 37 and SC32/WG 2 in Terminological Usage and Methodological Approach

Day: 2 Slot No. K2Name: Sue Ellen Wright1 + Gerhard Budin2

Organization: 1Kent State University; 2University of Vienna

9th Open Forum for Metadata Registry, Kobe, 2006

Commonalities and differences

Different communities with different historical traditions– TC 37: linguistics, terminology studies, translation,

information science, engineering– SC 32/WG 2: computer science, information systems,

engineering, information science– Differences in scopes and thematic coverage

Shared interests and goals– Terminological precision and transparence in

communication and information processes– Interoperability in heterogeneous information

envirionments

9th Open Forum for Metadata Registry, Kobe, 2006

Variant usage between TC 37 and JTC 1/SC 32 with

regard to data category and data element

9th Open Forum for Metadata Registry, Kobe, 2006

Complex and open data categories

9th Open Forum for Metadata Registry, Kobe, 2006

Closed data category

9th Open Forum for Metadata Registry, Kobe, 2006

Conceptual domain

9th Open Forum for Metadata Registry, Kobe, 2006

Simple and complex data elements in the 11179 community

9th Open Forum for Metadata Registry, Kobe, 2006

Data category specifications and names

9th Open Forum for Metadata Registry, Kobe, 2006

ISO 12620:1999

Computer Assisted Terminology—Data Categories– Too many data categories– Commonly used data categories scattered through the standard

Application areas– Official term banks, language planning organizations, standards

organizations– Major corporations, small and medium businesses– Individual terminologists, translators, and technical writers– Localization industry– Government agencies

9th Open Forum for Metadata Registry, Kobe, 2006

Accessibility Issues

Need for clear guidance on application-specific use of the data categories

Thematic categorization in 12620:1999 No recommended sample sets Issue of data modeling variance Alphabetical ordering in the new on-line

version Addition of data categories for other language

resources

9th Open Forum for Metadata Registry, Kobe, 2006

Global Data Category Registry (DCR)

Difficulty in achieving consensus on logical ordering

Addition of new working environments in TC 37– Lexicography– Morpho-syntactic markup– Semantic markup– NLP lexicons– SKOS environment

9th Open Forum for Metadata Registry, Kobe, 2006

Web Accessibility

Core standard: ISO 12620– Rules for establishing a Data Category Registry as

a metadata registry (DCR) Registration authority Maintenance of the DCR on the Web Need for Guide to the data categories in order

to facilitate clear and easy usability

9th Open Forum for Metadata Registry, Kobe, 2006

Terminology Management Environments and Aims

National, regional, and local groups establishing databases and data banks to serve the public interest on a number of levels

Standardizers and other domain-specific experts Companies, enterprises, and governmental agencies Individual terminologists, translators, technical

writers, and students creating their own terminology resources

9th Open Forum for Metadata Registry, Kobe, 2006

Classification Criteria

Monolingual and multilingual Specialized design or off-the-shelf software Constraints involving data input, storage,

retrieval, and information output Needs of various stakeholders Related resources

– Thesauri, taxonomies, and ontologies– Semantic Web environments

9th Open Forum for Metadata Registry, Kobe, 2006

Criteria for a Guide to Terminological Data Categories

Replacement for ISO/TR 12618:1994, Computer aides in terminology – Creation and use of terminological databases

Variation in database structures– Simple glosses– Translation-oriented terminology management– Complex multi-application approaches

9th Open Forum for Metadata Registry, Kobe, 2006

Basic Data Category Groups

terms (including all source-language synonyms and equivalents in other languages)

classification of term types (e.g., synonyms, variants, full and abbreviated forms)

term-related information (grammar, etymology, register, status) descriptive information definitions, contexts examples, notes, and graphic information administrative information identifiers of various sorts dates, responsibility entry status sources, combinable with terms themselves, definitions, contexts, notes,

and other pieces of information

9th Open Forum for Metadata Registry, Kobe, 2006

Other Guides to Data Category Selection

ISO 10241:1992 (and revision), Preparation and layout of international terminology standards – Primarily standards-oriented

ISO 12616:2002, Translation-oriented terminography– Traditional format, main-entry-term-oriented– Not focused on the principle of term autonomy– Issues with granularity and data element

elementarity

9th Open Forum for Metadata Registry, Kobe, 2006

Issues for a New Guide

Data modeling variance and term autonomy n conjunction with the application of the closed data category termType

Locating data categories throughout the collection– Locating non-standard synonymic data category names for

standardized names (use for…) Providing sample “common” data category layouts

– Overcoming “presentational distance” between different data category groups

9th Open Forum for Metadata Registry, Kobe, 2006

Translation-Related Dat-Cats

False friend Degree of equivalence Transfer comment

– Anchor at language or term-section level Directionality

9th Open Forum for Metadata Registry, Kobe, 2006

Standardization & Language Planning

Term status (preferred, admitted, deprecated)– Regulated terms (legal regulations)– Suggested terms

Process status (of the term entry itself) Administrative status (of term planning phase)

9th Open Forum for Metadata Registry, Kobe, 2006

Links and Relations

Terminological concept systems Controlled vocabularies (thesauri, etc.) Classification systems Ontologies and taxonomies

– Expression in SKOS (Simple Knowledge Organization System markup)

Distinction between generic systems– Subsumation, is-a relations

… and non-generic, non-subsumation systems– Meronymic (partitive) relations– All other relations

9th Open Forum for Metadata Registry, Kobe, 2006

Links to Other Entries & Objects

Internal links to other entries– Antonym– Entailed term– See– See also– Concept system links

Links to other objects– Corpus trace (link to term in context)– Concordance links– Shared resource links, etc.– Sources & source identifiers

9th Open Forum for Metadata Registry, Kobe, 2006

Language-Related Links

XML:lang as a language identifier XML:lang applied as a common locale

identifier (UTS#35) Transcription, transliteration, Romanization Source & target language identification

9th Open Forum for Metadata Registry, Kobe, 2006

Shared Data Categories

Word/term-related data categories Standard “lexicography” categories (e.g.,

etymology, definition, etc.) Administrative categories

9th Open Forum for Metadata Registry, Kobe, 2006

9th Open Forum for Metadata Registry, Kobe, 2006

top related