Sequential Interval Network for parsing complex structured activity

作者:

Highlights:

摘要

We propose a new graphical model, called a Sequential Interval Network (SIN), for parsing complex, structured activities whose composition can be represented by a stochastic grammar. By exploiting the grammar, the generated network captures an activity’s global temporal structure while avoiding a time-sliced manner model. In this network, the hidden variables are the start and end times of the component actions, which allows reasoning about duration and observation on interval/segment level. Exact inference can be achieved and yield the posterior probabilities of the timing variables as well as each frame’s component label. Importantly, by using uninformative expected value of future observations, the network can predict the probability distribution of the timing of future component actions. We demonstrate this framework on vision tasks such as recognition and temporally segmentation of action sequence, or parsing and making future prediction online when running in streaming mode while observing an assembly task.

论文关键词:

论文评审过程:Received 16 October 2014, Revised 9 May 2015, Accepted 13 July 2015, Available online 26 July 2015, Version of Record 13 January 2016.

论文官网地址:https://doi.org/10.1016/j.cviu.2015.07.006