Parse tree based machine translation for less-used languagesJernej Vičič
, Andrej Brodnik
, 2008, izvirni znanstveni članek
Opis: The article describes a method that enhances translation performance of language pairs with a less used source language and a widely used target language. We propose a method that enables the use of parse tree based statistical translation algorithms for language pairs with a less used source language and a widely used target language. Automatic part of speech (POS) tagging algorithms have become accurate to the extent of efficient use in many tasks. Most of these methods are quite easily implementable in most world languages. The method is divided in two parts; the first part constructs alignments between POS tags of source sentences and induced parse trees of target language. The second part searches through trained data and selects the best candidates for target sentences, the translations. The method was not fully implemented due to time constraints; the training part was implemented and incorporated into a functional translation system; the inclusion of a word alignment model into the translation part was not implemented. The empirical evaluation addressing the quality of trained data was carried out on a full implementation of the presented training algorithms and the results confirm the employability of the method.
Najdeno v: osebi
Ključne besede: machine translation, parse tree
Objavljeno: 15.10.2013; Ogledov: 4346; Prenosov: 77
Polno besedilo (0,00 KB)