Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, EpiSciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Skip to Main content Skip to Navigation
Directions of work or proceedings

A Corpus of Machine Translation Errors Extracted from Translation Students Exercises

Abstract : In this paper, we present a freely available corpus of automatic translations accompanied with post-edited versions, annotated with labels identifying the different kinds of errors made by the MT system. These data have been extracted from translation students exercises that have been corrected by a senior professor. This corpus can be useful for training quality estimation tools and for analyzing the types of errors made MT system.
Document type :
Directions of work or proceedings
Complete list of metadata
Contributor : Natalie Kübler Connect in order to contact the contributor
Submitted on : Thursday, March 31, 2022 - 5:02:27 PM
Last modification on : Sunday, June 26, 2022 - 12:18:20 PM
Long-term archiving on: : Friday, July 1, 2022 - 7:07:42 PM


Publisher files allowed on an open archive


  • HAL Id : hal-01134899, version 1


Guillaume Wisniewski, Natalie Kübler, François yvon. A Corpus of Machine Translation Errors Extracted from Translation Students Exercises. Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis. LREC 2014, May 2014, Reykjavik, Iceland. pp.3585-3588, 2014, Proceedings of the Ninth Language Resources and Evaluation Conference (LREC 2014), 26-31 May, Reykjavik, Iceland. ⟨hal-01134899⟩



Record views


Files downloads