
Marlén Izquierdo and Naroa Zubillaga 5
Hikma 24(1) (2025), 1 - 31
2. LEARNER CORPORA IN EMPIRICAL TRANSLATION RESEARCH
As repositories of authentic, acted-out cross-linguistic
correspondences, parallel corpora, as well as comparable corpora, are of
direct benefit not only to translators but also to learners of a foreign language,
be it for general purposes or for specific purposes, such as translation
(Bowker, 1999). Notwithstanding this advantage, to truly progress towards a
meaningful teaching-learning experience it is necessary to observe first-hand
what learner language use is like. In short, the more learner data we examine,
the more insights we gain into their linguistic/translator competence and,
therefore, the easier it might be to raise student awareness of their
learning/training needs.
Accordingly, building corpora of students' translations is motivated by
the need to extend the fruitful combination of learner corpus research (LCR)
and CBTS. Among the first initiatives to combine translator training research
and CBTS, Bowker’s corpora created by translators (CCBT) stands out as “a
type of learner corpora that can be used to investigate difficulties encountered
by trainee translators” (Bowker, 2003, p. 169). Similarly, the Translation
Teaching and Learning Corpus (TTLC) project was carried out to “give priority
to actual learner needs and integrate both language-based and process-
based translating skills” (Tiayon, 2004, p. 119). Other ensuing successful
projects worth highlighting have been the UPF learner translation corpus,
featuring the language pair English-Catalan (Espunya Prat, 2014); special
mention deserves the undergraduate learner translator corpus (ULTC) that
features translations from English or French into Arabic (Alfuraih, 2020). This
kind of proposals unquestionably meant a breakthrough in empirical
translation research, even though there was still a lot of room for improvement.
The language combinations lacked variety, translations were analysed in only
one direction, and inconsistencies or lack of systematicity in error-annotation
prevented the comparability of findings across research projects, let alone a
generalization of results with real pedagogical application (Granger and Lefer,
2020). In addition, learner corpus research focused most of its attention on L2
teaching-learning, to the detriment of translation training. This apparent
neglect, however, would nevertheless benefit student translation corpora in
the years to come, making available for researchers long-tested and widely
attested protocols in corpus design, methodology, analysis, and application.
In order to overcome the abovementioned shortcomings, the
Multilingual Student Translation project (MUST) was launched in 2016 as an
ambitious, yet well-thought, initiative that has grown over the past years and
at the time of writing brings together researchers in the fields of translation,
contrastive linguistics, and language/translation pedagogy from 38
universities (Granger and Lefer, 2020). The MUST project provides