Speaker: Matthias Eck
Title: Communicating Unknown Words in Machine Translation
Abstract:
Unknown words are a major problem for every machine translation system. Regular evaluations and demos do not always show this very well, but in actual communication the lack of specialty vocabulary and named entity translations can seriously affect the communication ability.
A new approach is presented that uses monolingual encyclopedias and dictionaries to "communicate" unknown words. Instead of the actual unknown word, its definition is extracted and translated, which leads to considerable improvements in translation quality.
Wednesday, March 19, 2008
Subscribe to:
Posts (Atom)