Tips for Creating Translation-Oriented Source Documents

advertisement
Tips for Creating Translation-Oriented
Source Documents
A good translation begins with the source text. Even while composing a text, a writer can do a lot to minimize the preparation overhead
for the subsequent translation with a translation management system (TMS) – regardless of whether additional technologies such as
an editorial system or authoring assistance are used.
Some simple rules for creating a translation-oriented document:

Pay attention to correct

Avoid using spaces for

Use uniform date, time,

Codify the spelling of
line breaks
formatting purposes
and number formats
specialized terms
Hard line breaks (¶) should be used for new paragraphs only. The TMS interprets a
hard line break as the end of a segment. Therefore, use of hard line breaks within
a sentence results in incorrect segmentation. On the other hand, soft or manual line
breaks (Ctrl + Enter) are not interpreted as the end of a segment and thus also result
in the faulty detection of text units. Often, such line breaks are inserted unintentionally
through copying from other applications into the source document.
To indent texts, use tabs or indents; do not use multiple spaces! Such spaces are displayed after importing the document to the TMS and thus impair the appearance and
necessitate additional editing. Even if used at the same position, the spaces in the translation usually look different from those in the source document. Moreover, the translator
is often unable to judge whether a space serves a specific purpose.
For the detection of date, time, and number formats, TM system makes use of predefined rules. For example, the system settings of Across define that date formats in
German have the format dd.mm.yyyy or dd.mm.yy. However, if the numbers are separated by a space, the date will no longer be recognized as a connected group of numbers. Therefore, we recommend determining uniform conventions and consistently
applying these in your texts.
Different spelling of terms – e.g. writing a term as one word (metadata), as two
words (meta data), or as two words with a hyphen (meta-data) – is a frequent
cause of inconsistent translation, as the automatic terminology detection does not
work in such cases.
Tips for Creating Translation-Oriented
Source Documents

Spell abbreviations

Remove superfluous

Apply automatic

Use the original formats
correctly
formatting
hyphenation
If you spell an abbreviation incorrectly or in an unusual way, the segmentation during
the translation may be faulty. The sentence could inadvertently be separated into two
segments that the translator would first have to correct manually.
If you frequently use color marking to highlight text passages, be sure to remove such
from the entire text (including line breaks and spaces) prior to translation. If you fail to
do so, the "invisible" formatting will be offered to the translator as possible formatting
and could result in the translator using the wrong font type or font color.
If you want to hyphenate words, you can either use "automatic hyphenation" or manually insert an optional hyphen. Though a plain hyphen delivers a visually identical
result, it causes problems in the TMS. As hyphens are treated as additional characters in a word, the system cannot find any 100% match with existing sentences in
the translation memory. Moreover, the terminology detection cannot find any suitable
database entry either.
Whenever possible, avoid delivering PDF files as source document format for translation.
Instead, always endeavor to provide the original file formats from which the PDF files
were generated. If documents need to be revised prior to translation, formatting deficiencies resulting from the conversion can cause additional workload.
Across Systems GmbH | +49 7248 925 425 | info@across.net www.across.net
Download