From: Consistent constraint-based video-level learning for action recognition
Training method
Loss function
HMDB51
Clip level
Lossce
32.6%
Video level
33.15%
Lossvll(Lossccl)
35.38%