Table 5 Temporal action localization experiment

From: Weakly supervised spatial–temporal attention network driven by tracking and consistency loss for action detection

Method

Mode

T(IOU@0.3)

T(IOU@0.5)

A(IOU@0.5)

G-TAD [48]

Full

40.2

46.7

P-GCN [49]

Full

63.6

49.1

48.3

Nguyen [36]

Weak

46.6

26.8

3C-Net [34]

Weak

40.9

24.6

35.4

WSGN [31]

Weak

42.0

25.1

Islam [29]

Weak

46.8

29.6

35.2

BaS-Net [35]

Weak

44.6

27.0

34.5

DGAM [32]

Weak

46.8

28.8

41.0

HAM-Net [39]

Weak

50.3

31.0

41.5

Ours

Weak

64.4

49.6

52.2

  1. The table reports mAP comparison results (16-frame clips). We compare against representative fully and weakly supervised methods. T(IOU@0.3) and T(IOU@0.5) denote THUMOS14 at IoU thresholds of 0.3 and 0.5, and A(IOU@0.5) denotes ActivityNet at an IoU threshold of 0.5. Note that the proposed method is an attention network that is unsupervised with respect to object location and supervised with respect to classification.
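To make the IoU thresholds in the column headers concrete, the following is a minimal sketch of how temporal IoU between a predicted and a ground-truth action segment is commonly computed, and how a threshold such as 0.3 or 0.5 decides whether a prediction counts as a match. The function names `temporal_iou` and `is_match` and the `(start, end)` segment representation are illustrative assumptions, not the evaluation code used for this table.

```python
def temporal_iou(pred, gt):
    """Temporal IoU of two segments, each given as (start, end).

    Assumption: start <= end and both segments use the same time unit.
    """
    # Length of the overlap between the two segments (0 if disjoint).
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    # Union = sum of both lengths minus the overlap counted twice.
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def is_match(pred, gt, threshold=0.5):
    """True if the prediction overlaps the ground truth at or above the threshold."""
    return temporal_iou(pred, gt) >= threshold


if __name__ == "__main__":
    pred, gt = (0.0, 10.0), (5.0, 15.0)
    # Overlap is 5 and union is 15, so the IoU is 1/3:
    # a match at threshold 0.3 but not at threshold 0.5.
    print(temporal_iou(pred, gt))
    print(is_match(pred, gt, threshold=0.3), is_match(pred, gt, threshold=0.5))
```

Because a stricter threshold can only reject more predictions, mAP at IoU 0.5 is never higher than mAP at IoU 0.3 for the same method and dataset. This matches the THUMOS14 rows that report both values.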