26 Machine Translation and Tourism Discourse: A Spanish-English-French […]
Hikma 23 (Número especial I) (2024), 1 - 32
Regarding our second RQ, DeepL consistently outperformed GT when
it came to translation accuracy, as DeepL’s results produced more fluent
translations both in EN and FR. This was also the conclusion reached by
Hidalgo-Ternero (2021) in his comparative study of machine-translated
phraseology from ES into EN using GT and DeepL. In addition, Peña Aguilar
(2023) examined specific linguistic challenges in translating between ES and
EN. In her study, she demonstrated that DeepL outperformed other popular
MT systems, such as Bing and GT, by correcting some problems present in
the source text. Our study confirms this finding, as shown in the first example
of Table 6, where DeepL did not replicate the error present in the original
Spanish version (a comma between the subject and predicate, an error that
GT did replicate). The tendency toward literal translation by MT tools (in our
study, especially by GT) was also pointed out in Fuentes-Luque and
Santamaría Urbieta’s (2020) study on the performance of MT when translating
tourism texts in the English-Spanish combination.
While GT has made significant strides over the years, it still occasionally
fell short in terms of accuracy. Although GT can handle straightforward and
commonly used phrases well, the tool struggled when confronted with more
complex or context-dependent content, another conclusion similar to the
findings of Fuentes-Luque and Santamaría Urbieta (2020). This was the case
of the error regarding the opening hours in Table 6, in which the future tense
of the verb to be in ES to express a reiterative pattern of opening hours was
translated literally into EN. DeepL did better, as it did not mimic the ES
structure.
After using MT to render the ES content into EN and FR, we concluded
that the MT-generated content had to be revised to ensure first-class quality
standards. Human evaluation is still a valuable resource for assessing the
quality of MT-generated output, as it provides the expertise necessary to
identify and address linguistic nuances, ensuring that the final translations
meet the desired standards of accuracy and fluency. Consider example 3 in
Table 6, where GT translated a previously translated segment differently,
without any apparent justification, or example 1 in Table 8, where DeepL’s
translation included words which did not exist in FR. Overall, our results align
with those of the study conducted by Leiva Rojo (2020), where translations of
English museum texts into Spanish were assessed. Although Leiva Rojo’s
study revealed that many of the official translations analysed were “very poor”
and, in contrast, our study found acceptable official translations, both studies
show that MT did not significantly improve their quality.
Thus, answering RQ2, when the goal is to provide high-quality, verified
content, GT and DeepL can serve as complementary tools in the translation
process. However, when a translation from scratch by a professional