MT Evaluation: human-like vs. human acceptable

Amigó, Enrique; Giménez, Jesús; Gonzalo, Julio and Màrquez, Lluís (2006). MT Evaluation: human-like vs. human acceptable.

Files
Name Description MIME type Size
Documento.pdf PDF of the document application/pdf

Title MT Evaluation: human-like vs. human acceptable
Author(s) Amigó, Enrique
Giménez, Jesús
Gonzalo, Julio
Màrquez, Lluís
Subject(s) Computer Science
Abstract We present a comparative study on Machine Translation Evaluation according to two different criteria: Human Likeness and Human Acceptability. We provide empirical evidence that there is a relationship between these two kinds of evaluation: Human Likeness implies Human Acceptability but the reverse is not true. From the point of view of automatic evaluation this implies that metrics based on Human Likeness are more reliable for system tuning. Our results also show that current evaluation metrics are not always able to distinguish between automatic and human translations. In order to improve the descriptive power of current metrics we propose the use of additional syntax-based metrics, and metric combinations inside the QARLA Framework.
Date 2006-07-17
Format application/pdf
Language eng
Source Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pages 17–24, Sydney, July 2006.
Publication version publishedVersion
Related to project: info:eu-repo/grantAgreement/S2009/TIC-1542
Access level and license
Resource type lecture
Access type Open access

Access statistics: 412 visits, 55 downloads
Created: Fri, 21 Nov 2014, 15:19:27 CET