The MultilingualWeb-LT Working Group receives funding from the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815.
MultilingualWeb-LT Executive Summary
Felix Sasaki DFKI / W3C Fellow
Project goals
• Provide reference implementations of metadata for multilingual processes
  – Content creation, (human or machine) translation, localization workflows, ...
• Define a metadata standard based on implementations and existing work
  – From Internationalization Tag Set (ITS) 1.0 to ITS 2.0
• Continue and enlarge a community around the MultilingualWeb
Groups involved
MLW-LT consortium (Reference Implementations)
W3C MLW-LT Working Group Members (Standardization)
MLW PC members (Community building)
Requirements Gathering
• Workshop June 2012, Dublin
  – 71 attendees
  – New stakeholders: linked open data community
  – New implementers: Adobe, ]init[, Logrus, Tilde
• Requirements gathering document
  – W3C public working draft
  – Wiki version: 21,000+ accesses
Standardization Process ...
• ITS 2.0 draft development June – December 2012
  – 40+ individuals participating
  – 2100+ emails, aggressive standardization progress
  – Engaging “invited experts” and further participants, including higher-level decision makers: Adobe, CNR, DERI, Ecole Mohammadia d'Ingenieurs Rabat, ]init[, Logrus, NCSR, Opera, SAP, Tilde
... driven by implementations
• Test suite development starting August 2012, driven by TCD
  – Input: Files with ITS 2.0 metadata
  – Output: metadata overview
  – Current state: 223 input files, 839 implementer output files, 80% coverage
<!DOCTYPE html> ... <p>Everything started when Zebulon discovered that he had a <span translate="no">doppelgänger</span> ... </html>
...
/html/body[1]/p[1] translate="yes"
/html/body[1]/p[1]/span[1] translate="no"
...
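The input/output pair above can be illustrated with a minimal sketch. It is not the test suite's actual tooling: the class name `TranslateOverview`, the helper `translate_overview`, and the sample document are hypothetical. The sketch assumes that only the HTML `translate` attribute is considered, that the default is "yes", and that every start tag has an explicit end tag (no optional end tags). It uses only Python's standard `html.parser` module.

```python
from html.parser import HTMLParser

# Elements that never have an end tag in HTML, so they must not be pushed
# onto the open-element stack.
VOID_ELEMENTS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
                 "link", "meta", "source", "track", "wbr"}


class TranslateOverview(HTMLParser):
    """Collects one 'path translate="value"' line per element start tag."""

    def __init__(self):
        super().__init__()
        self.open_elements = []  # entries: [tag, path, value, child_counts]
        self.rows = []

    def handle_starttag(self, tag, attrs):
        if self.open_elements:
            _, parent_path, inherited, counts = self.open_elements[-1]
            counts[tag] = counts.get(tag, 0) + 1
            path = f"{parent_path}/{tag}[{counts[tag]}]"
        else:
            inherited = "yes"  # assumed default when nothing is declared
            path = f"/{tag}"

        value = inherited
        attributes = dict(attrs)
        if "translate" in attributes:
            raw = (attributes["translate"] or "").strip().lower()
            if raw in ("", "yes"):
                value = "yes"
            elif raw == "no":
                value = "no"
            # Any other value is ignored and the inherited value is kept.

        self.rows.append(f'{path} translate="{value}"')
        if tag not in VOID_ELEMENTS:
            self.open_elements.append([tag, path, value, {}])

    def handle_endtag(self, tag):
        # Close the nearest open element with this name, plus anything
        # still open inside it.
        for index in range(len(self.open_elements) - 1, -1, -1):
            if self.open_elements[index][0] == tag:
                del self.open_elements[index:]
                break


def translate_overview(html_text):
    """Return the overview lines for an HTML document given as a string."""
    parser = TranslateOverview()
    parser.feed(html_text)
    parser.close()
    return parser.rows


# Hypothetical sample document, modelled on the fragment shown above.
SAMPLE = (
    "<!DOCTYPE html><html><head><title>Sample</title></head><body>"
    "<p>Everything started when Zebulon discovered that he had a "
    '<span translate="no">doppelgänger</span>.</p></body></html>'
)

if __name__ == "__main__":
    for row in translate_overview(SAMPLE):
        print(row)
```

The value is inherited from the parent element unless an element declares its own, which is why the `p` element reports "yes" while the nested `span` reports "no". A real ITS 2.0 processor would also have to handle global rules and the other data categories, which this sketch leaves out.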
“Metadata for the Multilingual Web”
• Summarizing usage scenarios and implementations
• Aligned with implementation development
Usage scenarios and implementation highlights
• XLIFF translation package creation driven by ITS 2.0 metadata
• Quality check driven by metadata constraints
• Installation of workflow from CMS to TMS system
• CMS implementation of metadata authoring support
Usage scenarios and implementation highlights
• Text-processing component interconnected with Drupal
• Cocomore – Linguaserve: showcase “localization workflow with VDMA”
• Linguaserve: “real time MT with Spanish Tax Agency”
• Volunteer implementer Shaun McCance – ITS Tool: XML to PO and back
Not covered during this review
• Validation of HTML5+ITS (UEP)
  – Available at http://validator.nu/
  – Staged for integration in W3C validator
• ITS Libre Office Writer Extension – ]init[
• ITS 2.0 Enriched Terminology Annotation – Tilde
• Visual designs to render “ITS for HTML5” – Logrus
• Localisation Workflows Using ITS 2.0 with Adobe CQ and Apache JackRabbit – Adobe
Deliverables for year one
• D1.1 Detailed Overall Management and Bodies Management, including the Quality Assurance Plan
• D1.2.1 Report on Internal and External Communication Tools
• D1.2.2 LT-Web - W3C Coordination Yearly Report
• D1.2.3 Contact Database
• D2.1 Requirements and Use Case Document
• D2.2 LT-Web Metadata Draft Documents
Deliverables for year one
• D4.1.1 Lucy Modification
• D4.1.2 MaTrEx Modification
• D4.1.3 Linguaserve Online System Modification
• D4.1.4 Report on Modifications in MT Systems
• D6.1.1 Workshop 1
• D6.1.2 Summary Report 1
WP5 “Deep Web Information and MT Training”
• Deliverables
  – D5.1.1 MT Training Module
  – D5.1.2 XLIFF Deep Web MT Training Exporter
  – D5.2 Metadata-Aware MT Training
• Delivery date will be delayed in order to benefit from Cocomore training data
• Overall WP will be on time (conclusion by M21)
Communication
• W3C infrastructure + telephone conference tool
  – Mailing lists
  – IRC
  – Action / issue tracker
  – ... see D1.2.1
• Separate channels for
  – Working Group (standardization)
  – Workshop planning (MLW PC)
  – Public
Management
MLW-LT consortium (Reference Implementations)
W3C MLW-LT Working Group Members (Standardization)
MLW PC members (Community building)
Communication infrastructure
Conclusion
• Community building via supporting ...
  – Reference implementation
  – Standardization
  – Outreach
• ... pays off!
• Similar projects could be useful in the future
Q/A