HMAX

M. Riesenhuber and T. Poggio,
"Hierarchical Models of Object Recognition in Cortex,"
Nature Neurosciece, Vol. 2, No. 11, November 1999. pp 1019-1025.

They say simple cells and complex cells and
complex cells are responsible for invariant properties.
Invariance can be implemented as a pooling mechanism
where there are view-invariant units and view-tuned units.

Hierarchical feedforward network is considered and
the network is based on MAX rather than linear summation (SUM).
MAX is proved to be more robust and invariant than SUM.

That is, Hierarchical MAX has view-invariant object recognition ability
and is supported by biological (physiological) facts.
Anyway, I say, this is a very similar concept to ISA, but a little more flexible.

But this paper doesn't say much of implementation such as
how to get the simple cells or complex cells,
and how to construct the network structure.

See [1] for a specific implementation and examples of HMAX.
In [1], interestingly, simple and complex cells are not learned from data
but designed by second derivative of Gaussians.

[1] T. Serre and M. Riesenhuber,
"Realistic Modeling of Simple and Complex Cell Tuning in the HMAX Model,
and Implications for Invariant Object Recognition in Cortex,"
AI Memo 2004-017, CBCL Memo 239, MIT, July 2004.

- H. Choi

"models of object recognition"

M. Riesenhuber and T. Poggio,
"Models of Object Recognition,"
Nature Neurosciece Supplement, Vol. 3, November 2000. pp 1199-1204

It's a review paper about object recognition models.
Here is what I got from the paper.

"the distinction between identification and categorization is mostly semantic."

There are two kinds of models for object recognition.
1. view-based model: "objects are represented as collections of view- specific features"
It is something like ICA.
2. object-centered model: there is 3-D model of the object.
It's something like Geon theory.

They took "view-based model" in this paper.
Considering the speed of processing in the brain,
feedback model cannot be the prime model for object recognition.
It should be more like feedforward processing
where invariant properties can be obtained by hierarchical structure.

Making the connections between input image to higher level units as in Fig. 3,
the different tasks (categorization and identification) can be achieved by learning.

- H. Choi

Defense!!!

DISSERTATION DEFENSE OF
HEEYOUL "HENRY" CHOI
TITLE: Manifold Integration: Data Integration on Multiple Manifolds
Advisor: Dr. Yoonsuck Choe
2:00 p.m. Room 307 HRBB

- H. Choi