luong2015-attention
arXiv: 1508.04025
TLDR(中文)
系统化地比较 global vs local attention、不同打分函数(dot / general / concat),是后人讲 "attention score 是怎么算的" 时最常引用的工程化版本。
TLDR (English)
Systematically compares global vs local attention and different scoring functions (dot/general/concat). The most commonly cited engineering reference when explaining "how attention scores are computed".