This paper examines the extent to which the writing style of texts published on highly usable web-sites provides input that is amenable to Internet-based machine translation (MT) services, thus giving rise to a form of MT-friendly controlled language, in a loosely defined controlled translation environment for web content. Controlled languages are traditionally divided into two groups, human-oriented and machine-oriented, depending on the purpose and applications for which they have been developed. Even though this distinction is justified and useful in a number of respects, it is recognised that these two types of controlled languages often share significant features.The paper investigates the common ground between human-oriented and machine-oriented controlled languages in the context of reader-friendly monolingual on-line material that needs to be translated into a variety of target languages by means of web-based MT services. The paper argues that a usability-oriented writing style for originally monolingual web content not only improves the readability of on-line material for human Internet users, but can also dramatically improve the performance of web-based MT systems. The discussion suggests in conclusion that authoring conventions and guidelines primarily aimed at improving readability for human users in the Internet environment also have a significant impact on the successful application of on-line MT technology, if monolingual web content needs to be disseminated in a variety of languages.

Controlled Language, Web Usability and Machine Translation Services on the Internet

Federico Gaspari
2004-01-01

Abstract

This paper examines the extent to which the writing style of texts published on highly usable web-sites provides input that is amenable to Internet-based machine translation (MT) services, thus giving rise to a form of MT-friendly controlled language, in a loosely defined controlled translation environment for web content. Controlled languages are traditionally divided into two groups, human-oriented and machine-oriented, depending on the purpose and applications for which they have been developed. Even though this distinction is justified and useful in a number of respects, it is recognised that these two types of controlled languages often share significant features.The paper investigates the common ground between human-oriented and machine-oriented controlled languages in the context of reader-friendly monolingual on-line material that needs to be translated into a variety of target languages by means of web-based MT services. The paper argues that a usability-oriented writing style for originally monolingual web content not only improves the readability of on-line material for human Internet users, but can also dramatically improve the performance of web-based MT systems. The discussion suggests in conclusion that authoring conventions and guidelines primarily aimed at improving readability for human users in the Internet environment also have a significant impact on the successful application of on-line MT technology, if monolingual web content needs to be disseminated in a variety of languages.
2004
Controlled language
Online machine translation
English
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12078/27433
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact