I’ve just finished writing up my master's thesis, entitled “Using EBMT to Produce Foreign Language Subtitles”. Below is a brief description of the project.

Title: Using EBMT to Produce Foreign Language Subtitles

In this thesis, we set out to establish how Example-Based Machine Translation (EBMT) can be used to speed up the subtitling process, thus improving the throughput of the subtitler, and also as a means of automatically producing foreign language subtitles which subtitle companies may not normally provide, even though they would be extremely helpful for the viewing public.

Through the development of the modular corpus-based MT engine, MaTrEx (Stroppa et al., 2006), and the collection of a large amount of subtitle data extracted from over 50 full-length features (Armstrong et al., 2006), we were able to apply a number of EBMT techniques to produce subtitles for the language directions German-English and English-German.

These machine-produced subtitles were evaluated using both a range of well-established automatic metrics common in Machine Translation and some novel manual evaluation strategies. Both the automatic metrics and the human evaluation were very useful during development, where they allowed us to isolate and fix errors made by our system. In addition, by obtaining a human perspective on the subtitles produced by our system, we were able to gauge the acceptability of these subtitles for public viewing, and have provided a solid grounding for future research into the acceptability of (semi-)automatically generated subtitles.
[20/04/2006]