Skip to main content
Fig. 4 | BMC Bioinformatics

Fig. 4

From: Biomedical document triage using a hierarchical attention-based capsule network

Fig. 4

The self-attention mechanism calculation process. We can get three vectors that are a query vector, a key vector and a value vector for each word by multiplying the embedding word vector by the three matrices trained during the training. The size of these new vectors is 3, while the dimensions of the embedding word and encoder are 4. We evaluate the dependence between the words with the dot product operation

Back to article page