PowerPoint-Präsentation - CMS – Cross Media Solutions

advertisement
Translating Database Content
with
www.crossmediasolutions.de
Who We Are…
CMS – Cross Media Solutions



is an independent software vendor from Würzburg, Germany.
Founded in 2006 from elements of the printing company Stürtz GmbH,
CMS has decades of experience in providing database
solutions for publishers of technical and scientific documentation
and industrial catalogs

Technology partners

My name is Steven Calkins.
www.crossmediasolutions.de
What will be discussed…
Translating Database Content
with KITE

Introduction
1. Why is standardized database translation important?
2. Definition of terms

KITE:
1. Features
2. Advantages of XLIFF and KITE
3. Integration in workflow

Conclusion
1. Why is this issue urgent?
2. Decisions to be made
www.crossmediasolutions.de
Why is standardized database
translation important?
Estimates put all the content stored in databases on the world wide web at more than 500 times the information
available on the pages which can be found by popular search engines:
Surface Web
Database
Database
Search engines index
only the surface web
Database queries
are not indexed
Database
Database
Database
Database
Database
Deep Web
Database
“Dynamic Searchable Databases represent the Deep Web. The Deep Web is made up of hundreds of thousands of publicly accessible databases and
is approximately 500 times bigger than the surface Web. At each Deep Web database, you enter a query and get back a Web page created
dynamically (on-the-spot) specific to your search. These dynamic Web pages are not linked since they didn’t exist before your query and cease to exist
after being sent to you — the search engines can't see nor find them.” http://aip.completeplanet.com *
Using the Deep Web as a model of knowledge distribution we can infer that the vast majority of available knowledge is found in databases.
Company information is generally stored in a database, typically an electronic resource planning (ERP) software. So if something needs localization,
most likely the original source is found somewhere in a database.
* CMS – Cross Media Solutions GmbH is not responsible for the content of any links external to its own website.
www.crossmediasolutions.de
Why is standardized database
translation important?

Regulations have been increasing in recent years:

USA: Sarbanes-Oxley Act (finance documents).

EU: EU Directive 2006/42/EG (manuals, in 2009).

The people served by translations have greatly
varying needs:

Designers & technicians need more “technical”
documentation than normal product users do.
www.crossmediasolutions.de
Can translation be accomplished
with standard procedures?


For low-complexity, small volumes it can be done

Human translators are absolutely necessary, but
Translation Memory systems can help translators
With increasing volume and complexity, automated
procedures become necessary
1. translate consistently with proper vocabulary,
2. avoid translating the same terms twice.
www.crossmediasolutions.de
Definition of terms
Term
Meaning
TMS
Translation Memory Systems They work by looking up segments in a database containing large numbers of previously
translated segments with their translation. They provide
matches that fit exactly or are similar to the term being translated and display them as suggestions to the translator.
Segments
Units of sensible translation, usually sentences. Database
contents are well-suited for processing with TMS because
the segmentation process can be virtually eliminated. *
TMX
Translation Memory eXchange Format
OSCAR standard (Open Standards for Container/Content
Allowing Reuse, a committee of LISA -- the Localization
Industry Standards Association)
XLIFF
XML Localization Interchange File Format
OASIS standard
KITE
The TMS of Cross Media Solutions implementing the TMX
and XLIFF standards. Fully XLIFF compliant.
* Still, the segmentation must not be lost in the database–XLIFF–database roundtrip process.
www.crossmediasolutions.de
Features of








Localization of database contents
Translation memory functionality
Termbase functionality (TBX)
Scalable, suitable for very large XLIFF files
XLIFF 1.2 and TMX 1.4 support
for Mac OS X and Windows
Single-user and site licenses
Can be integrated in an outsourced workflow
www.crossmediasolutions.de
Advantages of XLIFF and TMS






Translation memory functionality
Faster, more consistent and non-redundant translation
Vocabulary is project-oriented
It is not proprietary, it is open to all
Permits independent developments
Enables tight integration with extant systems
www.crossmediasolutions.de
Advantages of
All the advantages of XLIFF and TMS, plus:






Preserves database content assignment
Contextual information available
Scalable, suitable for very large XLIFF files
XLIFF 1.2 and TMX 1.4 support
for Mac OS X and Windows
Low cost: Single-user and site licenses
www.crossmediasolutions.de
Views of
www.crossmediasolutions.de
Start by opening a translation memory
Lucene-indexed folder
containing translation
memory. Details here.
Information about the
translation memory may
be inspected below
The translation memory may
be manipulated in the TM menu:
 Open a TM





Save the segments in the TM
Close the TM
Create a new TM
Import a TM from a TMX file
Export the TM to a TMX file
Once the proper TM is open,
an XLIFF file may be edited.
Various meta-information is
displayed about the selected
XLIFF file before opening it.
Note the language pair:
German -> English
On the left side a tree view is displayed.
This shows the content structure of the XLIFF file.
Important:
properly handles groups.
Not every XLIFF system does this recursively.
To our knowledge, only KITE is fully XLIFF
compliant.
Other features: Approved translation pairs
are colored green for easy identification
Other features: Context information is displayed,
notes can be inserted
Segments: Segments (translation pairs)
can be edited in the Segment menu:
 Approved
 Not approved
 Accept only 100% TM matches
 Accept best match
 Accept source (for terms not to be translated
like proper names, technical terms, etc)
Explorer-like tree view can show
group hierarchy to any depth
Matches for the translation units are suggested
by the Translation Memory.
Double-click transfers the suggestion
to the target unit(s).
The button “Accept 100% match” transfers only
100% matches to any selected segments
The button “Accept best match” also transfers
the best TM suggestion to the segments
The button “Accept source text” transfers, e.g.,
technical terms from the source text without
translating them.
The translation memory is searchable:
The search text is typed here. The quality here.
The results are given here.
For a live demonstration,
please visit CMS
at Stand E10
www.crossmediasolutions.de
How can
be integrated
into a workflow?
The catalog data is found in a Product Information
Management (PIM) system and an Electronic
Resource Planning (ERP) system. Typically, the
PIM system can extract information from the ERP
and publish this data enriched with marketing texts
online and in print. Where does KITE fit in?
www.crossmediasolutions.de
How can
be integrated
into a workflow?
Let assume the following task: a large
online and print catalog is to be translated
from German into English.
The work is to be shared among outside
translators and an internal translation
department.
1.
2.
3.
4.
The internal department first exports the
Translation Memory (TM) and XLIFF segments
from the database.
All 100% matches are found in KITE so that
no redundant translation is done. All available
terms are now correctly assigned.
The XLIFF is re-imported into the database.
Now the XLIFF and TM are sent to an external
translator, who receives only those segments that
need translation.
www.crossmediasolutions.de
How can
ERP
Product
Information
Management
XLIFF
import
Localized
online and
printed
documentation
generation,
user manuals,
catalogs
save money?
XLIFF +
TM
export
XLIFF
+
TMX
Internal translation dept.
www.crossmediasolutions.de
XLIFF
+
TMX
External translation service
Why is this issue urgent?
The following cites the new EU directive 2006/42/EG. This directive
will become effective 30.12.2009:
1.7.4. Instructions
All machinery must be accompanied by instructions in the official Community language
or languages of the Member State in which it is placed on the market and/or put into
service….
1.7.4.1. General principles for the drafting of instructions
(b) Where no "Original instructions" exist in the official language(s) of the country
where the machinery is to be used, a translation into that/those language(s) must be
provided by the manufacturer or his authorised representative or by the person
bringing the machinery into the language area in question. …
(d) In the case of machinery intended for use by non-professional operators, the wording and
layout of the instructions for use must take into account the level of general education
and acumen that can reasonably be expected from such operators.
www.crossmediasolutions.de
What decisions are necessary?





Does my company have large quantities of database
information?
Will we be required to translate large sections of
the database for technical documentation?
Should we use document-based translation?
What is the source of our company documentation?
Could automation help us?
We would be glad to speak with you further on these
issues at our stand (E10) or contact:
steven.calkins@crossmediasolutions.de .
www.crossmediasolutions.de
Thank you for your attention
www.crossmediasolutions.de
Download