A Novel Hybrid Approach to Improve Neural Machine Translation Decoding using Phrase-Based Statistical Machine Translation 2021

博客：https://www.tzer.top/

使用SMT指导NMT翻译；在beam search 过程中，如果SMT翻译的token存在beam中，那么就让该token的概率与当前beam中最大概率相等。为了让NMT能够使用SMT的翻译结果，需要将SMT的输出进行bpe encode。

实验中尝试了应用该方法的三个策略：

实验结果显示1，2的效果比较好。BLEU最大提升0.2， METEOR最大提升0.3。提升很小。

数据：WMT2018 英德德英，3,533,496

SMT：mose短语翻译模型 KenLM语言模型 giza++对齐模型

NMT：Transformer（Google 原始参数）

16k subword vocab；beam size 5；traning step 100k

给出的一个案例：

系统的流程：首先SMT进行Translate，然后对结果BPE encode，与NMT用作者提出的beam search解码，最后将输出decoding by BPE。

German -> English

原文：München 1856 : vier Karten , die Ihren Blick auf die Stadt verändern

SMT：Munich 1856 : four cards , which your attention to the city of change

SMT encode by BPE：▁Munich ▁18 56 ▁: ▁four ▁cards ▁, ▁which ▁your ▁attention ▁to ▁the ▁city ▁of ▁change

Reference：Munich 1856 : four maps that will change your view of the city

第一步：

对于NMT的候选来说，SMT 结果_Munich 存在与NMT候选中，所以将_Munich的概率与_there相等