Time-dependent bag of words on manifolds for geodesic-based classification of video activities towards assisted living and healthcare