SSP Project Summary:
[EPCC home] [SSP home] [2001 projects] [2000 projects] [1999 projects] [1998 projects] [1997 projects] [1996 projects] [1995 projects] [1994 projects] [1993 projects]

A Parallel Connectionist Viewpoint Quasi-Invariant Iconic Object Recognition System

In a previous MSc project, Grove [1995] developed an iconic object recognition system. A key aspect of the foveation subsystem was the extraction of image features from a foveated representation (ie. where image pixels represent smaller portions of the image the closer to the centre of the image). The other key aspect of the foveation subsystem was the transformation from a cartesian to a radial representation, which has advantages in matching (in providing a mechanism for scale and rotation invariance).

Grove's system then extracted a set of features based on Marr's primal sketch: bars, edges, corners and blobs. These were extract from 3*3 neighbourhoods of the iconic image, using a hard-coded pattern match. These features were matched to a set of stored model features, with a serious of permutations to allow scale and rotation invariance.

In testing Grove's program, it was found that it took about 30 seconds to process each foveation on the fastest department machine at that time (twain). This was adequate for small scale testing, but means that there is not much hope of real applications until a factor of 100 increase in speed per foveation point is achieved. Much of the time required is in the actual foveation process.

This project will investigate a parallel re-implementation of the existing project, in either a SIMD or a MIMD mode.

A SIMD mode is suggested as each pixel in the foveal image can be computed effectively in parallel, although requires input image regions of varying size. However, once that is done, feature extraction is done independently on each pixel, as is most of the matching. Thus, a lot of processing over the 25*64 foveal image can be done in about 1000-way parallel.

On the other hand, MIMD processing should also be possible as each feature plane can be extracted independently, and each model/scale/orientation setting can be matched independently. Each of these processes could be assigned to an independent processor.

While there is plenty of opportunity for distributing processing, doing so without an excessive communication overheads will probably be the key problem of either approach to the project.

The main steps to the project are:

Expertise Required

Students should have experience in C/C++, and it would also be helpful to have experience in machine or computational vision or image processing.

Resources Required

C++, CM-200 or T3D

Resources Supplied

Working C++ serial version, test images

References

T. Grove, R. Fisher, "Attention Directed Iconic Object Matching", unpublished technical report. See Computational Vision Reading folder in AI's South Bridge Library. T. Grove, "Attention Directed Iconic Object Matching", MSc Dissertation, Dept. of Artificial Intelligence, Univ. of Edinburgh, 1995.

Paulo Marques worked on this project.

Compressed PostScript of the project's final report will be available here (744 bytes) .

Webpage maintained by mario@epcc.ed.ac.uk