Fig. 4. The scaled dot-product attention and multi-head self-attention.
From: Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks