Skip to main content

Table 4 Confusion matrix for the surveillance video. The average performance is 81.5%. "pick", "scan", and "drop" represent pickup, scanning, and drop, respectively.

From: Unsupervised Action Classification Using Space-Time Link Analysis

Category

pick

scan

drop

pick

0.67

0

0.33

scan

0

1

0

drop

0.223

0

0.78