Neural Machine Translation in Linear Time

Authors: Nal Kalchbrenner, Lasse Espeholt, Karen Simonyan, Aaron van den Oord, Alex Graves, Koray Kavukcuoglu (2016)

arXiv: 1610.10099

Domains

Architecture

TLDR (English)

Uses dilated convolutions for seq2seq, liberating sequence modeling from "must use RNN sequential computation". Together with ConvS2S, represents the strongest attempt at parallel sequence modeling before Transformer.

TLDR（中文）

用扩张卷积做 seq2seq，把序列建模从"必须 RNN 顺序计算"中解放出来；和同期 ConvS2S 一起是 Transformer 之前"并行序列建模"的最强尝试。

Appears in These Articles

Attention：让每个位置选择上下文
Attention: Choosing the Relevant Context

Co-cited Papers

These papers appear in the same articles as this one

Related Papers

Other papers in the same domain