Wang, Jing (2012) Spatio-Temporal Volume-based Video Event Detection. Doctoral thesis, University of Huddersfield.

Online and offline video clips provide rich information on dynamic events that occurred over a period of time, for example, human actions, crowd behaviours, and other subject pattern changes. Although substantial progresses have been made in the last 3 decades on 2D image feature processing and their applications in areas such as face matching and objects recognition, video event detection still remains one of the most challenging fields in computer vision study due to the wide range of continuous and non-linear signals engaged by an imaging system, and the inherent semantic difficulties in machine-based understanding of the detected feature patterns.

For bridging the gap between the pixel-level image features and the semantic “meanings” of a videoed single human event, this research has investigated the problem domain through employing the 3D Spatio-Temporal Volume (STV) structure and its global feature paradigm for event pattern recognition. The process pipeline follows an improved Pair-wise Region Comparison (I-PWRC) and a region intersection (RI) based 3D template matching approach for detecting and identifying human actions under uncontrolled real-world videoing conditions. To maintain the run-time performance of this innovative system design, this programme has also developed an efficient pre-filtering mechanism to reduce the amount of voxels (volumetric pixels) that need to be processed in each operational cycle.

For further improving the system’s adaptability and robustness, several optimisation techniques, such as scale-invariant template matching and event location prediction mechanisms, have also been developed and implemented. The proposed design has been tested on various renowned online computer vision research databases and been benchmarked against other classic implementation strategies and systems. Satisfactory evaluation results have been obtained through statistical analyses on standard test criteria such as "Recall" rate and the processing efficiency.

JingWang_Final_Thesis_-_Feb_12.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (4MB) | Preview


Downloads per month over past year

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email