Inference of finite-state transducers from regular languages

作者:

Highlights:

摘要

Finite-state transducers are models that are being used in different areas of pattern recognition and computational linguistics. One of these areas is machine translation, where the approaches that are based on building models automatically from training examples are becoming more and more attractive. Finite-state transducers are very adequate to be used in constrained tasks where training samples of pairs of sentences are available. A technique to infer finite-state transducers is proposed in this work. This technique is based on formal relations between finite-state transducers and finite-state grammars. Given a training corpus of input–output pairs of sentences, the proposed approach uses statistical alignment methods to produce a set of conventional strings from which a stochastic finite-state grammar is inferred. This grammar is finally transformed into a resulting finite-state transducer. The proposed methods are assessed through series of machine translation experiments within the framework of the EUTRANS project.

论文关键词:Machine translation,Grammatical inference,Formal language theory,Stochastic finite-state transducers,Natural language processing

论文评审过程:Received 17 March 2004, Accepted 17 March 2004, Available online 8 March 2005.

论文官网地址:https://doi.org/10.1016/j.patcog.2004.03.025