This site uses CSS for layout and formatting, and JavaScript
for effects. If you can see this message, you are not viewing
this site as the designer intended.
Thomas F. El-Maraghi

Publications

Theses

T. F. El-Maraghi, Robust on-line appearance models for visual tracking, Ph.D. Thesis, University of Toronto, Toronto, February 2003. Show Abstract PDF

A framework for learning robust, adaptive appearance models to be used for motion-based tracking of natural objects is proposed. The approach involves a mixture of stable image structure, learned over long time courses, along with 2-frame transient motion information and an outlier process. This class of appearance models is shown to have low storage requirements and that the model parameters can be learned efficiently with an on-line variant of the expectation-maximization (EM) algorithm. Two implementations of appearance models based on the framework presented here are developed. The first is based on the filter responses from a steerable pyramid. This model is used in a motion-based tracking algorithm to provide robustness in the face of image outliers, such as those caused by occlusions. It also provides the ability to adapt to natural changes in appearance, such as those due to facial expressions or variations in 3D pose. The second implementation is based on a robust representation of the color of the target object. Experimental results on a variety of natural image sequences of people moving within cluttered environments are shown, both for the wavelet- and color-based appearance models individually and in combination with one another. A quantitative analysis of the performance of the tracker is provided for one of the test sequences.

T. F. El-Maraghi, Segment-based disparity estimation, Master's Thesis, Queen's University, Kingston, August 1996. Show Abstract

Algorithms for the estimation of binocular disparity are formulated using low-level visual processing. Typically, these approaches do not model discontinuities in disparity and thus perform poorly at occlusion boundaries and depth discontinuities. We seek to improve the accuracy of binocular stereo disparity estimates by explicitly modeling disparity discontinuities and occlusion. We focus on intermediate-level visual processing, and exploit the fact that many depth discontinuities correspond to monocular changes in intensity. Thus, we propose a segment-based approach to disparity estimation that combines parametric disparity models with gray-scale image segmentation. The segments represent hypothetical surface facets. The models of the disparity within each segment are used to: (1) find occlusion, and (2) merge neighbouring segments with coherent disparity. We begin by obtaining coarse disparity estimates using a gradient-based SSD technique. An intensity segmentation is obtained using a smoothing process based on robust statistics and then locating spatial outliers. Then, we fit variable-order parametric models to the coarse disparity estimates within each segment. The disparity models are used to locate occlusions by testing for pixels that are not binocularly visible. Next, we refine the parameters of the disparity models using robust estimation. This calculation is performed on a segment-by-segment basis, allowing for discontinuities at segment boundaries and for the presence of occluded pixels. We introduce a process to merge adjacent segments that can be described by a single disparity model. The merging process reduces the number of segments that correspond only to textural changes or intensity variation by joining segments likely to belong to the same surface. We evaluate the performance of our algorithm on a number of natural and synthetic image pairs.

Journal Articles

A. D. Jepson, D. J. Fleet, and T. F. El-Maraghi, Robust on-line appearance models for visual tracking IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(10):1296-1311, October 2003. Show Abstract

We propose a framework for learning robust, adaptive, appearance models to be used for motion-based tracking of natural objects. The model adapts to slowly changing appearnce, and it maintains a natural measure of the stability of the observed image structure during tracking. By identifying stable properties of appearance, we can weight them more heavilty for motion estimation, while less stable properties can be proportionally downweighted. The appearance model involves a mixture of stable image structure, learned over long time courses, along with two-frame information and an outlier process. An online EM-algorithm is used to adapt the appearance model paramters over time. An implementation of this approach is developed for an appearance model based on the filter responses from a steerable pyramid. The model is used in a motion-based tracking algorithm to provide robustness in the face of image outliers, such as those caused by occlusions, while adapting to natural changes in appearance such as those due to facial expressions of variations in 3D pose.

Conference Papers

T. F. El-Maraghi and A. D. Jepson, Saturated independent color coordinates for image alignment, IEEE International Conference on Pattern Recognition, Quebec City, August 2002. Show Abstract

Image alignment should be based on features that are robust with respect to phenomena such as global illumination changes, shading variations, and local highlights. While hue provides a natural degree of invariance to these phenomena, we show that it has several deficiencies in terms of gradient-based image alignment. To overcome these limitations, we introduce a 2D representation for hue that maintains a highly compressed representation for saturation. This allows the representation to model gray without sacrificing the desirable properties of hue. We show that our approach has a more consistent local domain of convergence when used for gradient-based alignment, and demonstrate its use in the context of a probabilistic motion-based tracker.

R. Mann, A. D. Jepson, and T. F. El-Maraghi, Trajectory segmentation using dynamic programming, IEEE International Conference on Pattern Recognition, Quebec City, August 2002. Show Abstract

We consider the segmentation of a trajectory into piecewise polynomial parts, or possibly other forms. Segmentation is formulated as an optimization problem which trades off model fitting error versus the cost of introducing new segments. We present a novel dynamic programming approach that finds the optimal segmentation for trajectories. The approach is easily extended to handle different segment types and top down information about segment boundaries, when available. We show segmentation results for video sequences of a basketball undergoing gravitational and nongravitational motion.

A. D. Jepson, D. J. Fleet, and T. F. El-Maraghi, Robust on-line appearance models for visual tracking;, IEEE Conference on Computer Vision and Pattern Recognition, Kauai, December 2001, Vol. I, pp. 415-422.
Best Paper Runner-Up Award Show Abstract PDF © IEEE

We propose a framework for learning robust, adaptive, appearance models to be used for motion-based tracking of natural objects. The approach involves a mixture of stable image structure, learned over long time courses, along with 2-frame motion information and an outlier process. An on-line EM-algorithm is used to adapt the appearance model parameters over time. An implementation of this approach is developed for an appearance model based on the filter responses from a steerable pyramid. This model is used in a motion-based tracking algorithm to provide robustness in the face of image outliers, such as those caused by occlusions. It is also provides the ability to adapt to natural changes in appearance, such as those due to facial expressions or variations in 3D pose. We show experimental results on a variety of natural image sequences of people moving within cluttered environments.
Home | Research Interests | Publications | Professional Activities | Curriculum Vitae