[2212.11030] Deep set conditioned latent representations for action recognition