
ITS2.0 CMS to TMS implementation: Drupal to GBC Server/PLINT Showcase with VDMA

Pedro L. Díez Orzas
Karl Fritsche
Mauricio del Olmo

The MultilingualWeb‐LT Working Group receives funding by the European Commission (project name LT‐Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815.

Special thanks

Laura Guerrero, Giuseppe Deriard, Pablo Badía (Linguaserve)

Clemens Weins, Stephan Walter (Cocomore AG)


1. ITS 2.0 CMS / TMS Showcase presentation
2. ITS 2.0 Drupal / GBC Server-PLINT with VDMA
3. Drupal ITS 2.0: CMS to empower languages
4. CMS-TMS Interoperability
5. GBC Server and PLINT: TMS empowered with ITS 2.0
6. Business case
7. Showcase demonstration


ITS 2.0 Use case in MLW-LT

• Cocomore and Linguaserve are members of the W3C MultilingualWeb-LT Working Group.

• MLW-LT receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies.


Multisectorial and Multinational

Cocomore AG is an agency for communication and IT with extensive experience in open-source software, interface development and an understanding of the content creation side. Cocomore wishes to provide better multilingual solutions for clients and to lay the foundation for a long-term CMS-to-localisation-chain integration.

Linguaserve is a company for multilingual advanced solutions, strongly committed to developing long-term relationships and to creating value for its customers: for both small and medium-sized companies and for large corporations and public administration bodies. Linguaserve specialises in web globalisation and invests in the field of R&D.

Use case: VDMA

• VDMA: German machinery and plant manufacturers' association
• Largest industrial association in the capital goods industry in Europe (3170 industrial members)
• Highly export-oriented


Use Case: MLW-LT as a Solution

• Annotated, processed and translated 141 press releases
• Using Drupal MLW-LT modules
• Automatic content round tripping with Linguaserve as LSP
• Languages: GER, FR, ZH


Automated off-line translation system (Interoperability)

[Diagram labels: Publication (Website); Information System (CMS / DMS); Export and send; LSP Workflow with MT, CAT and HT; Receive and import]

• CMS = Content Management System / DMS = Document Management System
• MT = Machine Translation / HT = Human Translation
• CAT = Computer Assisted Translation tool

Architecture

[Figure: architecture diagram. Recoverable label: VDMA Web site (www.vdma.org)]

ITS 2.0 in CMS/TMS processing

ITS 2.0 format flow

• Original file: XHTML + ITS 2.0
• Pre-processed file: CAT tool oriented XML
• Translated file: CAT tool oriented XML
• Final file: XHTML + ITS 2.0 translated file

Some ITS 2.0 benefits (I)

• ITS 2.0 increases both the user's control and the automatic decision processes, allowing more "intelligence" in the systems involved:

– Translate: can be automatically or manually annotated. E.g., it avoids having to add "not translatable" terms to several specific glossaries or MT systems.

– Localization Note: direct communication with PM and translators. When the note is of type alert, it can be used to trigger certain processes in the Translation Workflow.

– Domain: automatic selection of CAT/MT terminology and dictionaries. Selection of Translation Memories by domain.
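As an illustration of how a system might read the Translate and Localization Note annotations, here is a minimal sketch using only the Python standard library. It assumes ITS 2.0 local markup in HTML5 (the `translate` attribute and the `its-loc-note` / `its-loc-note-type` attributes); the class name, the sample markup and the idea of routing alerts to a list are hypothetical and are not taken from the Drupal modules or from PLINT.

```python
from html.parser import HTMLParser

# Elements without a closing tag; they never open a new inheritance scope.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class ItsAnnotationReader(HTMLParser):
    """Collect translatable text, protected text and alert notes (sketch)."""

    def __init__(self):
        super().__init__()
        self.translate_stack = [True]  # document default: translatable
        self.segments = []             # text a translator should work on
        self.protected = []            # text marked translate="no"
        self.alerts = []               # localization notes of type "alert"

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        note = attrs.get("its-loc-note")
        if note and attrs.get("its-loc-note-type") == "alert":
            self.alerts.append(note)
        if tag in VOID_TAGS:
            return
        value = attrs.get("translate")
        if value == "no":
            current = False
        elif value == "yes":
            current = True
        else:
            current = self.translate_stack[-1]  # inherit from the parent
        self.translate_stack.append(current)

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and len(self.translate_stack) > 1:
            self.translate_stack.pop()

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        target = self.segments if self.translate_stack[-1] else self.protected
        target.append(text)


# Hypothetical sample content, not taken from the VDMA press releases.
SAMPLE = (
    '<p>Press release from <span translate="no">VDMA</span> '
    '<span its-loc-note="Check the product name" its-loc-note-type="alert">'
    'about the new plant</span>.</p>'
)
```

Feeding `SAMPLE` to an `ItsAnnotationReader` leaves "VDMA" in `protected` and the note text in `alerts`, which a workflow could then forward to the project manager.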


Some ITS 2.0 benefits (II)

– Language Information: quality check to ensure the source language content matches the Webservice parameter.

– Allowed Characters: quality check for the target content.

– Storage Size: quality check for both the original content and the target content to be uploaded into the CMS. It can also be used as a visual control for translators.

– Provenance: possibility to reassign the same translator/reviewer in new versions of the same content and inform the PM. Tracking control in the CMS.

– Readiness (ITS 2.0 extension): control of processes to be done, date control for availability, delivery, and priority.
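The Allowed Characters and Storage Size checks can be sketched in a few lines. This is an assumption-laden illustration, not the PLINT implementation: it treats the Allowed Characters value as a regular expression matching a single character and measures Storage Size in bytes of the chosen encoding (UTF-8 by default). Both function names are hypothetical.

```python
import re


def fits_storage_size(text, max_bytes, encoding="utf-8"):
    """Return True if the encoded text needs at most max_bytes bytes."""
    return len(text.encode(encoding)) <= max_bytes


def uses_allowed_characters(text, allowed_pattern):
    """Return True if every character matches the single-character pattern."""
    return all(re.fullmatch(allowed_pattern, char) for char in text)
```

For example, "Größe" has five characters but needs seven bytes in UTF-8, so a translation can break a storage limit that its source respects; that is the kind of case the target-side check is meant to catch.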


Drupal: CMS to empower languages

CMS: Editing Content

CMS: Translation Management

CMS: Send and receive

Sending content to the LSP and reintegrating it automatically

CMS: Review Process

CMS-TMS Interoperability

[Diagram: CMS and TMS connected through Webservices. Labels on the TMS side: Engine (Processing), Workflow (Management), CAT Tool / Machine Translation]


Webservice operations

• Send file for translation
• Confirmation of reception
• Cancellation
• Check availability of translations
• Download translations
• Download confirmation
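The six operations above suggest a simple job life cycle. The sketch below models it with an in-memory stand-in so the sequence can be followed end to end. Every name in it (`FakeTms`, `Status`, the method names, `collect_translations`) is hypothetical, and which side triggers each call is an assumption: the slides list the operations but do not describe the actual web service interface.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    SENT = "sent"
    RECEIVED = "received"
    AVAILABLE = "available"
    DOWNLOADED = "downloaded"
    CANCELLED = "cancelled"


@dataclass
class FakeTms:
    """In-memory stand-in for the LSP web service (hypothetical)."""
    jobs: dict = field(default_factory=dict)
    next_id: int = 1

    def send_file(self, content, target_lang):
        """Send file for translation; returns a job id."""
        job_id = self.next_id
        self.next_id += 1
        self.jobs[job_id] = {"source": content, "lang": target_lang,
                             "status": Status.SENT, "result": None}
        return job_id

    def confirm_reception(self, job_id):
        """Confirmation of reception."""
        self.jobs[job_id]["status"] = Status.RECEIVED

    def cancel(self, job_id):
        """Cancellation; only possible before a translation is available."""
        job = self.jobs[job_id]
        if job["status"] in (Status.SENT, Status.RECEIVED):
            job["status"] = Status.CANCELLED
            return True
        return False

    def finish(self, job_id, translated):
        """Simulates the LSP delivering a translation (not a CMS call)."""
        job = self.jobs[job_id]
        job["result"] = translated
        job["status"] = Status.AVAILABLE

    def available(self):
        """Check availability of translations."""
        return [job_id for job_id, job in self.jobs.items()
                if job["status"] is Status.AVAILABLE]

    def download(self, job_id):
        """Download translations."""
        return self.jobs[job_id]["result"]

    def confirm_download(self, job_id):
        """Download confirmation."""
        self.jobs[job_id]["status"] = Status.DOWNLOADED


def collect_translations(tms):
    """CMS-side polling step: download and confirm every finished job."""
    results = {}
    for job_id in tms.available():
        results[job_id] = tms.download(job_id)
        tms.confirm_download(job_id)
    return results
```

Keeping the download confirmation as a separate step means a job is only marked as delivered once the CMS has actually stored the file, so an interrupted download can simply be retried on the next poll.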


GBC Server / PLINT empowered with ITS 2.0

On the LSP side, ITS 2.0 metadata encompasses three areas:

• Engine (Processing): pre-production/post-production engine for processing content files annotated with ITS 2.0.

• Workflow (Management): LSP internal localization workflow to provide support to project management and production processes.

• CAT Tool / Machine Translation: CAT tool usage for translation, revision and postediting with ITS 2.0 annotated content.


TMS: Pre-processing ITS2.0 in PLINT

Pre-processing: CMS-exported file to CAT-oriented XML file

[Side diagram: Engine (Processing), Workflow (Management), CAT Tool / Machine Translation]

– Translate: translatable nodes extracted with tags for blocking
– Localization Note: reference node to inform the translator
– Domain: reference node created to inform the translator
– Storage Size: to inform the translator and for post-processing

– Localization Note: alert sent to the project manager
– Localization Note: tooltip visualized in the workflow
– Language Information: quality checked with webservice
– Storage Size: source content is checked for quality
– Readiness*: priority checked with webservice and set

– Domain: automatic selection of terminology and Translation Memories
– Provenance: same translator/reviser reassigned in new versions of same content
– Readiness*: expected finalization date

* Readiness is an ITS 2.0 extension.
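The Domain row ("automatic selection of terminology and Translation Memories") can be pictured as a lookup from domain values to resources. The sketch below is only a guess at the shape of such a step: the function name, the catalogue structure, the resource names and the fallback domain are all hypothetical, and it assumes the domain arrives as a comma-separated string.

```python
def select_resources(domain_value, catalogue, fallback="general"):
    """Pick translation memories and glossaries for a domain annotation.

    domain_value: comma-separated domain names taken from the content.
    catalogue: dict mapping a lower-case domain name to resource names.
    """
    wanted = [d.strip().lower() for d in domain_value.split(",") if d.strip()]
    selected = []
    for domain in wanted or [fallback]:
        # Unknown domains fall back to the general-purpose resources.
        for resource in catalogue.get(domain, catalogue.get(fallback, [])):
            if resource not in selected:
                selected.append(resource)
    return selected


# Hypothetical catalogue for illustration only.
CATALOGUE = {
    "mechanical engineering": ["tm_machinery", "glossary_machinery"],
    "general": ["tm_general"],
}
```

With this catalogue, content annotated with a known domain gets the specialised resources first, while unannotated content only gets the general translation memory.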

TMS: Post-processing ITS2.0 in PLINT

Post-processing: CMS-exported files updated with the translation and the ITS 2.0 metadata

[Side diagram: Engine (Processing), Workflow (Management), CAT Tool / Machine Translation]

– Translate: translated nodes inserted
– Translate: non-translatable content unblocked
– Language Information: language attributes updated
– Allowed Characters: restriction compliance check
– Storage Size: limitation compliance check
– Provenance: data category info added or updated
– Readiness*: data category node updated

– Domain: Translation Memories stored and classified
– Readiness*: delivery date verified

– Translate: non-translatable content parts blocked
– Localization Note and Domain: blocked but visible for the translators and revisers
– Language Information: blocked but visible for the translators and revisers

* Readiness is an ITS 2.0 extension.
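Blocking and unblocking non-translatable content can be illustrated with a placeholder round trip. The sketch below is not the PLINT engine: the placeholder syntax `{{n}}`, the function names and the sample sentence are hypothetical, and it assumes a single well-formed paragraph without XML namespaces whose direct children may carry `translate="no"`.

```python
import xml.etree.ElementTree as ET


def block(paragraph_xml):
    """Pre-processing: replace translate="no" children with placeholders.

    Returns the segment for the CAT tool and a dict mapping each
    placeholder to the original markup it stands for.
    """
    root = ET.fromstring(paragraph_xml)
    parts = [root.text or ""]
    blocked = {}
    for index, child in enumerate(root, start=1):
        tail = child.tail or ""
        child.tail = None  # serialize the element without its trailing text
        markup = ET.tostring(child, encoding="unicode")
        if child.get("translate") == "no":
            key = "{{%d}}" % index
            blocked[key] = markup
            parts.append(key)
        else:
            parts.append(markup)
        parts.append(tail)
    return "".join(parts), blocked


def unblock(translated_segment, blocked, tag="p", lang=None):
    """Post-processing: restore the blocked markup and set the language."""
    for key, original in blocked.items():
        if key not in translated_segment:
            raise ValueError("placeholder %s lost in translation" % key)
        translated_segment = translated_segment.replace(key, original)
    attribute = ' lang="%s"' % lang if lang else ""
    return "<%s%s>%s</%s>" % (tag, attribute, translated_segment, tag)
```

Called on `<p>The <span translate="no">PLINT</span> engine is ready.</p>`, `block` yields the segment `The {{1}} engine is ready.`; the translator can move the placeholder but cannot edit the protected term, and `unblock` refuses a translation in which the placeholder has gone missing.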

ITS 2.0 Translator training

Translators and reviewers need training and tools with specific viewing and editing functionalities.


MLW-LT CMS-TMS SWOT

Strengths
• More efficient control over the content
• Faster and more efficient communication between localization chain actors
• Localization platforms and format independent

Weaknesses
• There is not a CMS-TMS interoperability standard.
  – Readiness can simplify the web service standard

Opportunities
• Accelerator for adoption of interoperability localization chains.
• Improvement for those clients already using it.
• Increasing fully automatic processes and localization expert systems.

Threats
• Source content annotation and tools
  – CMS developers adoption.
  – Content creators adoption
  – ITS 2.0 compliant and facilities CAT/Post-editing tools
  – SEO applications

Business: Opportunities rise from needs

• Frequently updated web sites that need efficient multilingual updates and maximal control:
– Corporate, information and news
– Multilingual E-commerce
– Export portals and intranets...

• Highly distributed content creation through the CMS.

• Web 2.0 (user-created content)
– Applying MT systems for immediacy

• Using ITS 2.0 to contribute to multilingual SEO.


Open source ITS 2.0 Drupal Module

Tool: Drupal Module for editing and viewing of ITS 2.0 markup: >> https://drupal.org/project/its

Tool: Drupal Module to connect to TMGMT Translator Linguaserve >> http://drupal.org/sandbox/kfritsche/1908422

Tool: Drupal Module to interact with TMGMT >> http://drupal.org/sandbox/kfritsche/1908598

Tool: ITS 2.0 jQuery Plugin >> http://plugins.jquery.com/its-parser/

VDMA preproduction: staging-mlwlt.cocomore.com:8080/#
Username: test
Password: fenMarOf0

ITS 2.0 GBC Server / PLINT

https://www.linguaserve.net/las_demos/control/MLWLTWP3DemoEngine
user: demos
password: demosLingu@serve

1) http://www.w3.org/International/multilingualweb/lt/wiki/images/6/67/Linguaserve_Preproduction_step_demo.zip
2) CAT tool usage with ITS 2.0: http://www.w3.org/International/multilingualweb/lt/wiki/images/c/ca/Linguaserve_ITS_CAT_Tool_usage_demo.zip
3) L10n workflow interaction for the post-production phase: http://www.w3.org/International/multilingualweb/lt/wiki/images/a/aa/Linguaserve_Postproduction_step_demo.zip

Thank you.
