Wang, Jing and Xu, Zhijie (2012) STV-based Video Feature Processing for Action Recognition. Signal Processing, 93 (8). pp. 2151-2168. ISSN 0165-1684
|PDF - Accepted Version |
Download (2117kB) | Preview
In comparison to still image-based processes, video features can provide rich and intuitive information about dynamic events occurred over a period of time, such as human actions, crowd behaviours, and other subject pattern changes. Although substantial progresses have been made in the last decade on image processing and seen its successful applications in face matching and object recognition, video-based event detection still remains one of the most difficult challenges in computer vision research due to its complex continuous or discrete input signals, arbitrary dynamic feature definitions, and the often ambiguous analytical methods. In this paper, a Spatio-Temporal Volume (STV) and region intersection (RI) based 3D shape-matching method has been proposed to facilitate the definition and recognition of human actions recorded in videos. The distinctive characteristics and the performance gain of the devised approach stemmed from a coefficient factor-boosted 3D region intersection and matching mechanism developed in this research. This paper also reported the investigation into techniques for efficient STV data filtering to reduce the amount of voxels (volumetric-pixels) that need to be processed in each operational cycle in the implemented system. The encouraging features and improvements on the operational performance registered in the experiments have been discussed at the end.
|Subjects:||Q Science > QA Mathematics > QA75 Electronic computers. Computer science|
|Schools:||School of Computing and Engineering|
School of Computing and Engineering > Computer Graphics, Imaging and Vision Research Group
|Depositing User:||Jing Wong|
|Date Deposited:||12 Jul 2012 12:13|
|Last Modified:||03 May 2013 08:15|
Downloader CountriesMore statistics for this item...
Item control for Repository Staff only: