MT@BZ translation corpus v1.0
Please use the following text to cite this item or export to a predefined format:
De Camillis, Flavia; Chiocchetti, Elena and Stemle, Egon W., 2023, MT@BZ translation corpus v1.0, CLARIN DSpace, http://hdl.handle.net/20.500.12124/60
Authors
De Camillis, Flavia; Chiocchetti, Elena; Stemle, Egon W.
Item identifier
http://hdl.handle.net/20.500.12124/60
Date issued
2023-06-13
Size
52 texts, 130,000 tokens
Description
MT@BZ is a translation corpus consisting of 52 decrees published by the Autonomous Province of Bolzano (South Tyrol), aligned with their machine-translated versions. More precisely, it contains 26 decrees in German and the same 26 in Italian in their official versions, which the project team machine translated into Italian and into German, respectively. Of the 26 decrees, 10 are COVID-19 related and 16 are miscellaneous. Overall, the texts comprise around 130,000 words. The machine translation was carried out with a customized version of ModernMT. The corpus was then uploaded to the annotation platform WebAnno and later transferred to INCEpTION. Four annotators annotated the translation errors made by the machine according to an ad hoc error taxonomy for quality assessment. Finally, the annotations were curated to create a gold standard corpus.
Acknowledgement
Institute for Applied Linguistics, Eurac Research
Project code: /
Project name: Machine Translation at South Tyrolean Institutions
This item is Publicly Available
and licensed under: