Skip to main content

Table 10 UCF-101 (split 1)

From: Gated spatio and temporal convolutional neural network for activity recognition: towards gated multimodal deep learning

Methods Accuracy
Spatial streams (three-channel RGB) 72.7%
Motion streams (three flow fields) 76.5%
SVM Fusion (model B) 81.5%
Averaging (model A) 82.7%
Gating network (model C) VGG-16 83%
Gating network (model C) ResNet-50 88.5%