Color aided motion-segmentation and object tracking for video sequences semantic analysis
Article first published online: 10 OCT 2007
DOI: 10.1002/ima.20113
Copyright © 2007 Wiley Periodicals, Inc.
Issue
International Journal of Imaging Systems and Technology
Special Issue: Special Issue on Applied Color Image Processing
Volume 17, Issue 3, pages 174–189, 2007
Additional Information
How to Cite
Briassouli, A., Mezaris, V. and Kompatsiaris, I. (2007), Color aided motion-segmentation and object tracking for video sequences semantic analysis. Int. J. Imaging Syst. Technol., 17: 174–189. doi: 10.1002/ima.20113
Publication History
- Issue published online: 10 OCT 2007
- Manuscript Accepted: 30 AUG 2007
- Manuscript Received: 16 FEB 2007
Funded by
- European Commission. Grant Numbers: FP6-001765 aceMedia, FP6-027685 MESH, FP6-027026 K-Space
Keywords:
- motion segmentation;
- object tracking;
- video semantics;
- motion and color fusion
Abstract
The high rate at which digital multimedia is generated and consumed makes it necessary to develop systems that can process it efficiently. One way to achieve this is to extract semantics from the video's low-level information. We present a novel algorithm that fuses color and motion information to extract semantics from a video sequence. The motion estimates are processed statistically to yield areas of activity in the video. Color segmentation is applied to these areas, and to their complementary regions in each frame, to segment the moving objects. The color layers extracted from the activity and background areas are compared using two measures: the earth mover's distance (EMD) and a novel method, introduced here, that is based on a likelihood ratio test (LRT). The segmentation results of the LRT-based approach are shown to be more robust than the EMD results, and both methods are shown to be more accurate than existing combined color-motion approaches. Furthermore, the LRT method allows the retrieval of additional semantics, namely “maps” that indicate the likelihood that a pixel belongs to a moving object. The areas of activity can be used to retrieve semantics about the kind of activity taking place. The color-aided segmentation of the moving entities provides a full description of their appearance, so it can be used, for example, to classify the video based on the objects in it. Experiments with real sequences show that this method leads to accurate results and useful semantics. © 2007 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 17, 174–189, 2007
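The two comparison measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the sketch assumes one-dimensional intensity histograms on unit-spaced bins for the EMD, and single Gaussian models with equal priors for the likelihood ratio. All function names and parameters (`emd_1d`, `log_likelihood_ratio`, `moving_likelihood`, the `(mean, variance)` tuples) are hypothetical and chosen only for illustration.

```python
import math
from itertools import accumulate


def normalize(hist):
    """Scale a histogram so that its bins sum to 1."""
    total = sum(hist)
    if total <= 0:
        raise ValueError("histogram must have positive total mass")
    return [h / total for h in hist]


def emd_1d(hist_a, hist_b):
    """Earth mover's distance between two 1-D histograms.

    Assumption: both histograms share the same unit-spaced bins. In that
    special case the EMD equals the sum of absolute differences between
    the two cumulative distributions.
    """
    if len(hist_a) != len(hist_b):
        raise ValueError("histograms must have the same number of bins")
    cdf_a = accumulate(normalize(hist_a))
    cdf_b = accumulate(normalize(hist_b))
    return sum(abs(a - b) for a, b in zip(cdf_a, cdf_b))


def gaussian_logpdf(x, mean, var):
    """Log density of a univariate Gaussian with the given mean and variance."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def log_likelihood_ratio(x, moving, background):
    """Log-likelihood ratio of pixel value x: moving model vs. background model.

    Assumption: each model is a hypothetical (mean, variance) Gaussian.
    Positive values favor the moving-object hypothesis.
    """
    return gaussian_logpdf(x, *moving) - gaussian_logpdf(x, *background)


def moving_likelihood(x, moving, background):
    """Map the log-likelihood ratio to (0, 1) with a logistic function.

    Assumption: equal priors, so this is the posterior probability of the
    moving-object hypothesis. Evaluating it per pixel gives a map in the
    spirit of the likelihood "maps" mentioned in the abstract.
    """
    llr = log_likelihood_ratio(x, moving, background)
    if llr >= 0:
        return 1.0 / (1.0 + math.exp(-llr))
    e = math.exp(llr)  # numerically stable branch for negative ratios
    return e / (1.0 + e)
```

For example, `emd_1d([1, 0, 0], [0, 0, 1])` measures the cost of moving all mass across two bins, and `moving_likelihood(x, (200.0, 100.0), (50.0, 100.0))` scores a pixel value `x` against two illustrative intensity models. Real color layers are multi-dimensional, so a practical comparison would need a general EMD solver and richer color models than this sketch uses.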
