ECCV 2014 - LNCS 8689-8695

Assessing the Quality of Actions

Hamed Pirsiavash, Carl Vondrick, and Antonio Torralba

Massachusetts Institute of Technology, USA
hpirsiav@mit.edu
vondrick@mit.edu
torralba@mit.edu

Abstract. While recent advances in computer vision have provided reliable methods to recognize actions in both images and videos, the problem of assessing how well people perform actions has been largely unexplored. Since methods for assessing action quality have many real-world applications in healthcare, sports, and video retrieval, we believe the computer vision community should begin to tackle this challenging problem. To spur progress, we introduce a learning-based framework that takes steps towards assessing how well people perform actions in videos. Our approach works by training a regression model from spatiotemporal pose features to scores obtained from expert judges. Moreover, our approach can provide interpretable feedback on how people can improve their actions. We evaluate our method on a new Olympic sports dataset, and our experiments suggest our framework is able to rank the athletes more accurately than a non-expert human. While promising, our method is still a long way from rivaling the performance of expert judges, indicating that there is significant opportunity in computer vision research to improve on this difficult yet important task.
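
The pipeline described in the abstract (pose features per video, a regression model fit to judge scores, evaluation by ranking) can be illustrated with a minimal sketch. The abstract does not specify the feature or the regressor, so everything concrete here is an assumption made for illustration: the feature is the magnitude of the lowest temporal frequencies of each joint trajectory, the regressor is closed-form ridge regression, the data are random stand-ins rather than the Olympic dataset, and the names pose_features, fit_ridge, predict, and spearman are hypothetical.

```python
import numpy as np

def pose_features(poses, k=8):
    """Hypothetical spatiotemporal feature for one video.

    poses: (T, J) array, one column per joint coordinate over T frames.
    Returns the magnitudes of the k lowest temporal frequencies of each
    column, flattened to a vector of length k * J (requires T >= 2 * k).
    """
    spectrum = np.fft.rfft(poses, axis=0)   # shape (T // 2 + 1, J)
    return np.abs(spectrum[:k]).ravel()

def fit_ridge(X, y, lam=1.0):
    """Closed-form ridge regression with an unpenalized intercept.

    This is a stand-in regressor chosen for brevity, not necessarily
    the model used in the paper.
    """
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc = X - x_mean
    A = Xc.T @ Xc + lam * np.eye(X.shape[1])
    w = np.linalg.solve(A, Xc.T @ (y - y_mean))
    b = y_mean - x_mean @ w
    return w, b

def predict(X, w, b):
    """Predicted quality score for each row of X."""
    return X @ w + b

def spearman(a, b):
    """Spearman rank correlation, assuming no tied values."""
    rank_a = np.argsort(np.argsort(a))
    rank_b = np.argsort(np.argsort(b))
    return np.corrcoef(rank_a, rank_b)[0, 1]

# Synthetic stand-in data: 30 "videos", 40 frames, 6 joint coordinates.
rng = np.random.default_rng(0)
videos = rng.normal(size=(30, 40, 6))
judge_scores = rng.uniform(0.0, 100.0, size=30)   # fabricated for the demo only

X = np.stack([pose_features(v) for v in videos])
w, b = fit_ridge(X[:20], judge_scores[:20])       # train on the first 20 videos
predicted = predict(X[20:], w, b)                 # score the held-out 10
print("held-out rank correlation:", spearman(predicted, judge_scores[20:]))
```

Because the scores above are random, the printed correlation carries no meaning; the sketch only shows how the pieces connect. Rank correlation is used because the abstract frames evaluation as ranking athletes rather than matching absolute scores.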

LNCS 8694, p. 556 ff.
