Fig. 4From: Weakly supervised spatial–temporal attention network driven by tracking and consistency loss for action detectionIt is about the details of branch no.3 shown in Fig. 3. We use Gram matrix between FB and FC. This figure corresponds to the formula 8 and 9Back to article page