Skip to content

kalchbrenner2016-bytenet

arXiv: 1610.10099

TLDR (English)

Uses dilated convolutions for seq2seq, liberating sequence modeling from "must use RNN sequential computation". Together with ConvS2S, represents the strongest attempt at parallel sequence modeling before Transformer.

TLDR(中文)

用扩张卷积做 seq2seq,把序列建模从"必须 RNN 顺序计算"中解放出来;和同期 ConvS2S 一起是 Transformer 之前"并行序列建模"的最强尝试。