automating your qa process to save time and improve consistency iñaki hernández-lasa linguistic...
TRANSCRIPT
Automating Your QA Process to Save Time and Improve
Consistency
Iñaki Hernández-LasaLinguistic Engineering Department Manager
Tek Translation International
Agenda
The Need For Automated Language Quality
What Can Be Automated
Technologies Available
Real Production Case Study
The Language Quality Checker
Language Quality tasks are far too time-consuming.
Manual language review activity is prone to error.
Manual language review provides no statistical analysis.
Manual language review cannot “catch” all errors.
Reviewers are subjective and constantly introduce exceptions.
There is a need to introduce a more empirical and objective approach to
language.
Statistical analysis for Language Quality Benchmarking (LQB) is critical in our
industry.
Issues with traditional spell checking tools – insertion of lists for acceptance.
The Need For Automated Language Quality
Spelling verification
Source segment with multiple translations
Source segment = Target segment
Punctuation
Terminology adherence to an approved glossary
Overtranslations: CaMel words and Product Names
What Can Be Automated
EXAMPLE OF E-MAIL ADDRESS
[^\s]+\@[^\s]+\.[A-Za-z]{2,4}
Any number or letter followed by @ followed by any number of letter followed by . followed by any letter capitalised or not which ranges between 2 or 4 characters.
Technologies Available
Hashtables: Association between keys and values Fast searches, for instance source with multiple targets Sou=Key Targ= Value
Regular expressions engine: special text string for describing a pattern
Overtranslation : CaMel word in source and in target iFilter Windows interface: interpreting different file formats
.rtf, .doc, .tmx, .ttx, .txt, .xliff, .csv XML parser: interpret different formats like .ttx and .tmx into .xml XSL transformation: Generation of different output files (.xls, .html and
.txt)
Real case study LQC Project for one of the largest US Energy Enterprises
Installation Guide – 9 files – Total 4,368 words Instruction Manual – 27 files – Total 49,723 words
Based on the total of 54,091 words
Real Production Case Study
PROOF READING SCENARIO WITH NO LANGUAGE TECHNOLOGY
Real Production Case Study
PROOF READING SCENARIO
HO
UR
S
WithoutLanguage
Technologies
18 hours (Based on 3,000 words per hour)
Real case study LQC Project for one of the largest US Energy Enterprises
Installation Guide – 9 files – Total 4,368 words Instruction Manual – 27 files – Total 49,723 words
Based on the total of 54,091 words Running an LE tool: 5 minutes Statistical Analysis: 2,031 errors = 3,7% errors Implementation of errors reported at a rate of 5 per minute = 300 errors per
hour Total implementation time for 2,031 errors = 6.5 hours
Real Production Case Study
PROOF READING SCENARIO WITH LANGUAGE TECHNOLOGY
Real Production Case Study
PROOF READING SCENARIO – TIME SAVINGS
HO
UR
S WithoutLQC
18 hoursWith LQC
6.5 hours
(Based on 3,000 words per hour)
Real Production Case Study
357
652
179
231
612
ERROR CATEGORISATION & OCURRENCES
Translation SOU=TAR
Punctuation
Consistency
Spelling
Terminology
Be careful when installing and inserting each wire.Tenga cuidado cuando instale e inserte cada cable.Sea cuidadoso cuando instale e inserte cada conductor.
Water-Oil Condensation TemperatureTemperatura de condensación agua-aceiteTemperatura de la condensación del agua-aceite
There are two possibilities:Existen dos posibilidades:Hay dos posibilidades:
First, remove the plug present in the modem’s connector.Primero, quite el enchufe presente en el conector del módem.Approved Glossary entry remove = retirar
Real Production Case Study
EXAMPLES OF ERRORS FOUND
Real Production Case Study
PROOF READING SCENARIO – COST SAVINGS
Reduction of costs by up to 65% (from 18 hours to 6.5 hours)
65% cost savings
Real Production Case Study
AIM OF TEK’S LANGUAGE QUALITY CHECKER (LQC)
To assist in Language Quality tasks through Linguistic Engineering automation
To provide Language Quality Benchmarking (LQB)
Quantitative analysis - reporting objectively on the number of mistakes
Qualitative analysis – reporting empirically on the type of mistakes
To guarantee consistency of existing lexical resources: terminology and consistent segments
To save time during proofreading cycles
To improve higher cost containment in localization practices
Real Production Case Study
TYPES OF MISTAKES IDENTIFIED BY TEK’S LQC
Glossary adherence - Non-adherence to an approved glossary
Consistency Checker - Cases where SOU=Multiple Translations
Translation Checker – Empty translations and cases where SOU=TAR
Spelling Checker – Target spelling
Grammar Errors – Target grammatical errors. List of configurable forbidden words.
Overtranslations – Terms that should not have been translated: CaMel words, Product Names, etc.
Mistranslations - Terms that should have one translation and have another (false friends)
Punctuation Checker – Source vs. Target equivalence / (¡ / ¿)
LQC UI
Where Does The LQC Reside?
Business Intelligence Suite
PersonalisedWorkflow
Early-WarningSystem
On-lineQuotation
ChangeManagement
ProjectManagement
Document Management
LinguisticTools
Open Architecture
ETMTekTerm
TMWarehouse
LQCSelf-Service
On-LineTranslation
TE
K’S
ON
-DE
MA
ND
GLO
BA
LIZ
ATIO
N P
LATFO
RM
Tek Overview
Technology
One On-Demand Globalization Platform for business intelligence, language management, open connectivity and worldwide collaboration.
One powerful ERP system driving processes for all users.
Process
One set of tools and ISO approved quality standards covering over 60 languages.
One “Follow the Sun” approach through our Client OneWorld Centers in different time zones for a streamlined service to our clients.
People
One highly specialized and experienced team of people sharing the OneWorld vision.
One united company since 1961.
Tek's OneWorld Solution provides an easy-to-use localization business management system for the Manufacturing, IT and Life Sciences Industries.
It provides:
Tek’s LE Department
TM MANAGEMENT
ENTERPRISE TERMINOLOGY MANAGEMENT
L10N / TECH. WRITING STYLE GUIDES
Quality@Source
Consistency in Project Components
Reduction of Validation Time
Reduction of Management & Effort
More streamlined / efficient L10N
Cost Savings
LINGUISTIC ENGINEERING DEPARTMENT
Thank You!
For any enquiries please contact me atihATtektrans.com