prof. dr. klemens waldhör chief architect · 2011. 2. 27. · db4o partially implemented for...
TRANSCRIPT
![Page 1: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/1.jpg)
openTMS; 23.02.2011; Dr. Klemens Waldhör; www.opentms.de 1
Prof. Dr. Klemens WaldhörChief [email protected]
![Page 2: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/2.jpg)
The ProjectOverview
Open TMS Goals Architecture Implementation Current Status GUIs
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 2
![Page 3: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/3.jpg)
openTMS Sponsors
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 3
![Page 4: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/4.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 4
Overview
Free and Open Source Translation Memory System
![Page 5: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/5.jpg)
openTMS; 23.02.2011; Dr. Klemens Waldhör; www.opentms.de 5
Goals
Development of the OpenSource Translation Memory system OpenTMS
Three translation memory systems for one and the same process? Software investments that make translation costs shoot through the roof? Exchange formats that put the brakes on productivity? FOLT (Forum Open Language Tools) is concerned with the entire process of producing multilingual documentation. From the creation of the source text to production in foreign languages, we analyze our processes for weaknesses and a lack of standardisation.
Primary objectives:- Sharing experiences of processes using standard industry software - Sharing experiences of the use of Open Source software - Standardisation of interchange formats -Testing new Open Source technologies and improving existing technologies in the translation market - Public support for non-proprietary software and software development - Publication of aims and results
www.folt.de
![Page 6: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/6.jpg)
OpenTMS Requirements Architecture
Server / Client Architecture Modular software approach Web based Thin client
Language Modelling High level modelling Extendibility Language independent Standards based
Data Sources Abstract data source approach Integrated default fuzzy search Integrated translate method Support of all SQL database
systems Support of all types of data
sources like csv, noSQL, TMX, XLIFF files
MT support through databases
Scalability Single and multi user requirement High performance databases
Interfaces Interface Layer Integration into CMS Workflow support
Java Java Coding Standards Java Documentation Standard Delivered as a set of jar files
Eclipse Development base swt for GUIs
OS independent Windows, Linux, Mac …
Hardware independent
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 6
![Page 7: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/7.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 7
Architecture
The core of openTMS
![Page 8: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/8.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 8
TranslationMemory
Converter
Back Converter
MachineTranslation
OpenTMSEditor
Segmenter
TerminologyTranslation
XLIFF
Example Work Flow
CMS1.
2.
3.
![Page 9: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/9.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 9
Architecture based on Standards
XLIFF TMX TBX SRX …
In general XML
![Page 10: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/10.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 10
Application Model
UserModel
DataModel
DocumentModel
ProcessModel
Security Model
GUI Model Interface Model
OpenTMS Core Library
OpenTMS System Architecture
For details see Waldhör, K. (2008). OPENTMS SOFTWARE ARCHITECTURE.
![Page 11: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/11.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 11
Modelling Language
Monolingual Object Multilingual Object
General Linguistic Object
N:1
inherits
Data Source
Terminology
Translation Memory
mapping
Linguistic Property N:1
![Page 12: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/12.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 12
PreTranslation
MemoryConverter Back
Converter
OpenTMSTranslation
EditorSegmenter
InteractiveTerminologyTranslation
InteractiveTranslation
Memory
Data SourceHuman Initiated Interactions
OpenTMS Initiated Interactions
OpenTMS Processes
![Page 13: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/13.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 13
Data Sources Language related data are represented as “data sources” Idea
Makes the data access interface independent from the data itself Not being restricted to SQL databases only
Also flat data or xml files TMX, XLIFF files as a data source …
Machine translation (MT) as data source Spread sheets
E.g. Excel as terminology lists Object Oriented Databases DMS systems “Web Sites” (http based interfaces)
Define a common interface for all access functions Allows adaption to individual data source properties
e.g. read only data sources like MT
![Page 14: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/14.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 14
OPENTMS
SOFTWARE
OpenTMS Data
SourceLayer
Data type specific access
functions
Maps the OpenTMS access functions to the
specific data component
Access to data sources through
standardised interface
Various data components like files
etc.
Data Sources
![Page 15: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/15.jpg)
Data Sources
Data Source Status Data Source methods defined
Are extended depending on needs and requirements
SQL Access optimisation Hibernate based
Other OpenSource databases… OODBS
DB4O partially implemented for testing purposes
Other data sources TMX files XLIFF files MT
Google & Microsoft Translator
Core Methods
Create Delete Import TMX, XLIFF File Export TMX, XLIFF File Copy between data sources Translate … Data Source described thru xml
based configuration files XML-RPC Access Servlet Engine Access
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 15
![Page 16: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/16.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 16
Fuzzy Search – Core Function of TM
Step 1: Search in KD-TREE Restricts the number of segments to search Finds possible matching (source) segments
Step 2: Levenshtein Similarity Compare matches from step 1 computing the real
similarity Step 3: Get source and target MOLs / MUL Create translation (alt-trans)
![Page 17: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/17.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 17
Translation Core Functions
Convert (to and from XLIFF) Currently using Araya converters
www.heartsome.de Complex document format like WinWord etc. thru
Open Office Converters Segmentation Currently thru Araya
Translate
![Page 18: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/18.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 18
User Interfaces
Editors and more
![Page 19: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/19.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 19
Ubuntu VM Distribution
![Page 20: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/20.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 20
Data Source Editor
Edit MOL/MOL Properties
Search FunctionsDelete & Save Functions
Language Specific Segments
![Page 21: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/21.jpg)
Beo Logisch! docliff Editor
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 21
![Page 22: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/22.jpg)
Web Xliff Editor
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 22
![Page 23: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/23.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 23
Integration Araya XLIFF Editor
![Page 24: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/24.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 24
Downloads http://sourceforge.net/projects/op
en-tms Ubuntu Version Windows Version
www.heartsome.de/arayatest/opentmsserver.exe Includes data source editor
Web GUI and docliff editor Send E-Mail to
[email protected] As part of the Araya Xliff Editor:
www.heartsome.de/arayatest/araya-freeversion.exe
![Page 25: Prof. Dr. Klemens Waldhör Chief Architect · 2011. 2. 27. · DB4O partially implemented for testing purposes Other data sources TMX files XLIFF files MT Google & Microsoft Translator](https://reader034.vdocuments.net/reader034/viewer/2022051900/5fee8119de251b3b3465dc2f/html5/thumbnails/25.jpg)
FOLT Überblick Stand 03.07.2009; Dr. Klemens Waldhör 25
Contact
Heartsome Europe GmbHFriedrichstr. 17D-90574 Roßtal
www.heartsome.de
Prof. Dr. Klemens Waldhör
T: +49 9127 579001F: +49 9127 951178 [email protected]