A Hierarchical Graphical Model For Recognizing Human Actions And Interactions In Video