Fig. 1
From: Using BERT to identify drug-target interactions from whole PubMed
Architecture for all the BERT models, where Wi represents an input word token and Oi represents the contextual embedding at the output layer. O[CLS] is the first token of the output sequence and contains the class label.
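
As a rough illustration of how the O[CLS] output in the figure can yield a class label, the sketch below takes the first vector of the output sequence and passes it through a linear layer followed by a softmax. This is a minimal toy under stated assumptions, not the authors' implementation: the function names (`softmax`, `classify_from_cls`), the two-dimensional embeddings, and the weight values are all hypothetical, and a real BERT model would produce the output vectors itself.

```python
import math


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify_from_cls(outputs, weights, bias):
    """Predict class probabilities from the first output vector.

    outputs: contextual embeddings [O_CLS, O_1, ..., O_n], one list per token.
    weights, bias: parameters of a hypothetical linear classification layer,
                   with one weight row and one bias value per class.
    """
    o_cls = outputs[0]  # O[CLS] is the first token of the output sequence
    logits = [
        sum(w * x for w, x in zip(row, o_cls)) + b
        for row, b in zip(weights, bias)
    ]
    return softmax(logits)


# Toy values (assumed for illustration only): three output vectors of size 2.
outputs = [[1.0, 0.0], [0.2, 0.7], [0.5, 0.5]]
weights = [[2.0, 0.0], [0.0, 2.0]]  # two classes
bias = [0.0, 0.0]

probs = classify_from_cls(outputs, weights, bias)
predicted_class = probs.index(max(probs))
```

Only O[CLS] feeds the classifier here; the remaining outputs O1 to On are ignored for the sequence-level decision, which matches the role the caption assigns to the first output token.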