commonalities and differences
DESCRIPTION
Different communities with different historical traditions TC 37: linguistics, terminology studies, translation, information science, engineering SC 32/WG 2: computer science, information systems, engineering, information science Differences in scopes and thematic coverage - PowerPoint PPT PresentationTRANSCRIPT
9th Open Forum on Metadata RegistriesHarmonization of Terminology, Ontology
and Metadata20th – 22nd March, 2006 , Kobe Japan.
Commonalities and Differences between
TC 37 and SC32/WG 2 in Terminological Usage and Methodological Approach
Day: 2 Slot No. K2Name: Sue Ellen Wright1 + Gerhard Budin2
Organization: 1Kent State University; 2University of Vienna
9th Open Forum for Metadata Registry, Kobe, 2006
Commonalities and differences
Different communities with different historical traditions– TC 37: linguistics, terminology studies, translation,
information science, engineering– SC 32/WG 2: computer science, information systems,
engineering, information science– Differences in scopes and thematic coverage
Shared interests and goals– Terminological precision and transparence in
communication and information processes– Interoperability in heterogeneous information
envirionments
9th Open Forum for Metadata Registry, Kobe, 2006
Variant usage between TC 37 and JTC 1/SC 32 with
regard to data category and data element
9th Open Forum for Metadata Registry, Kobe, 2006
Complex and open data categories
9th Open Forum for Metadata Registry, Kobe, 2006
Closed data category
9th Open Forum for Metadata Registry, Kobe, 2006
Conceptual domain
9th Open Forum for Metadata Registry, Kobe, 2006
Simple and complex data elements in the 11179 community
9th Open Forum for Metadata Registry, Kobe, 2006
Data category specifications and names
9th Open Forum for Metadata Registry, Kobe, 2006
ISO 12620:1999
Computer Assisted Terminology—Data Categories– Too many data categories– Commonly used data categories scattered through the standard
Application areas– Official term banks, language planning organizations, standards
organizations– Major corporations, small and medium businesses– Individual terminologists, translators, and technical writers– Localization industry– Government agencies
9th Open Forum for Metadata Registry, Kobe, 2006
Accessibility Issues
Need for clear guidance on application-specific use of the data categories
Thematic categorization in 12620:1999 No recommended sample sets Issue of data modeling variance Alphabetical ordering in the new on-line
version Addition of data categories for other language
resources
9th Open Forum for Metadata Registry, Kobe, 2006
Global Data Category Registry (DCR)
Difficulty in achieving consensus on logical ordering
Addition of new working environments in TC 37– Lexicography– Morpho-syntactic markup– Semantic markup– NLP lexicons– SKOS environment
9th Open Forum for Metadata Registry, Kobe, 2006
Web Accessibility
Core standard: ISO 12620– Rules for establishing a Data Category Registry as
a metadata registry (DCR) Registration authority Maintenance of the DCR on the Web Need for Guide to the data categories in order
to facilitate clear and easy usability
9th Open Forum for Metadata Registry, Kobe, 2006
Terminology Management Environments and Aims
National, regional, and local groups establishing databases and data banks to serve the public interest on a number of levels
Standardizers and other domain-specific experts Companies, enterprises, and governmental agencies Individual terminologists, translators, technical
writers, and students creating their own terminology resources
9th Open Forum for Metadata Registry, Kobe, 2006
Classification Criteria
Monolingual and multilingual Specialized design or off-the-shelf software Constraints involving data input, storage,
retrieval, and information output Needs of various stakeholders Related resources
– Thesauri, taxonomies, and ontologies– Semantic Web environments
9th Open Forum for Metadata Registry, Kobe, 2006
Criteria for a Guide to Terminological Data Categories
Replacement for ISO/TR 12618:1994, Computer aides in terminology – Creation and use of terminological databases
Variation in database structures– Simple glosses– Translation-oriented terminology management– Complex multi-application approaches
9th Open Forum for Metadata Registry, Kobe, 2006
Basic Data Category Groups
terms (including all source-language synonyms and equivalents in other languages)
classification of term types (e.g., synonyms, variants, full and abbreviated forms)
term-related information (grammar, etymology, register, status) descriptive information definitions, contexts examples, notes, and graphic information administrative information identifiers of various sorts dates, responsibility entry status sources, combinable with terms themselves, definitions, contexts, notes,
and other pieces of information
9th Open Forum for Metadata Registry, Kobe, 2006
Other Guides to Data Category Selection
ISO 10241:1992 (and revision), Preparation and layout of international terminology standards – Primarily standards-oriented
ISO 12616:2002, Translation-oriented terminography– Traditional format, main-entry-term-oriented– Not focused on the principle of term autonomy– Issues with granularity and data element
elementarity
9th Open Forum for Metadata Registry, Kobe, 2006
Issues for a New Guide
Data modeling variance and term autonomy n conjunction with the application of the closed data category termType
Locating data categories throughout the collection– Locating non-standard synonymic data category names for
standardized names (use for…) Providing sample “common” data category layouts
– Overcoming “presentational distance” between different data category groups
9th Open Forum for Metadata Registry, Kobe, 2006
Translation-Related Dat-Cats
False friend Degree of equivalence Transfer comment
– Anchor at language or term-section level Directionality
9th Open Forum for Metadata Registry, Kobe, 2006
Standardization & Language Planning
Term status (preferred, admitted, deprecated)– Regulated terms (legal regulations)– Suggested terms
Process status (of the term entry itself) Administrative status (of term planning phase)
9th Open Forum for Metadata Registry, Kobe, 2006
Links and Relations
Terminological concept systems Controlled vocabularies (thesauri, etc.) Classification systems Ontologies and taxonomies
– Expression in SKOS (Simple Knowledge Organization System markup)
Distinction between generic systems– Subsumation, is-a relations
… and non-generic, non-subsumation systems– Meronymic (partitive) relations– All other relations
9th Open Forum for Metadata Registry, Kobe, 2006
Links to Other Entries & Objects
Internal links to other entries– Antonym– Entailed term– See– See also– Concept system links
Links to other objects– Corpus trace (link to term in context)– Concordance links– Shared resource links, etc.– Sources & source identifiers
9th Open Forum for Metadata Registry, Kobe, 2006
Language-Related Links
XML:lang as a language identifier XML:lang applied as a common locale
identifier (UTS#35) Transcription, transliteration, Romanization Source & target language identification
9th Open Forum for Metadata Registry, Kobe, 2006
Shared Data Categories
Word/term-related data categories Standard “lexicography” categories (e.g.,
etymology, definition, etc.) Administrative categories
9th Open Forum for Metadata Registry, Kobe, 2006
9th Open Forum for Metadata Registry, Kobe, 2006