Question:
In object recognition, translating an image by a few pixels in some direction should not affect the recognized category. Consider images in which a foreground object sits on top of a uniform background, and suppose that the objects of interest are always at least 10 pixels away from the borders of the image. Is the CNN in Q8.3 invariant to a translation of at most 10 pixels in some direction? Here, the translation is applied only to the foreground object, while the background is kept fixed. If your answer is yes, show that the CNN necessarily produces the same output for two images in which the foreground object is arbitrarily translated by at most 10 pixels. If your answer is no, provide a counterexample by describing a situation in which the output of the CNN differs for two images in which the foreground object is translated by at most 10 pixels. If your answer is no, can you find any particular translation of less than 10 pixels for which the output of the CNN is invariant?
Step by Step Answer:
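The architecture of the CNN in Q8.3 is not reproduced on this page, so the sketch below does not answer the question for that network. It only illustrates, on a hypothetical toy network, how such a check could be set up: a 3x3 convolution with ReLU, a 2x2 max-pooling layer with stride 2, and two alternative output heads. All names, sizes, and weights here (`SIZE`, `OBJ`, `KERNEL`, `head_global_max`, `head_flatten_fc`) are assumptions made for illustration.

```python
# Toy illustration only: this is NOT the CNN from Q8.3, whose layers are not given here.
SIZE = 32   # hypothetical image size (SIZE x SIZE)
OBJ = 4     # hypothetical square object (OBJ x OBJ pixels)
KERNEL = [[1.0, 0.0, -1.0],
          [2.0, 0.0, -2.0],
          [1.0, 0.0, -1.0]]  # arbitrary fixed 3x3 filter


def make_image(top, left):
    """Uniform zero background with a bright square whose top-left corner is (top, left)."""
    img = [[0.0] * SIZE for _ in range(SIZE)]
    for r in range(top, top + OBJ):
        for c in range(left, left + OBJ):
            img[r][c] = 1.0
    return img


def conv_relu(img):
    """'Valid' 3x3 cross-correlation followed by ReLU."""
    n = len(img) - 2
    out = [[0.0] * n for _ in range(n)]
    for r in range(n):
        for c in range(n):
            s = sum(KERNEL[i][j] * img[r + i][c + j]
                    for i in range(3) for j in range(3))
            out[r][c] = max(s, 0.0)
    return out


def max_pool(fmap, k=2):
    """Non-overlapping k x k max pooling (stride k)."""
    n = len(fmap) // k
    return [[max(fmap[r * k + i][c * k + j] for i in range(k) for j in range(k))
             for c in range(n)] for r in range(n)]


def head_global_max(fmap):
    """Position-independent head: keeps only the largest activation."""
    return max(max(row) for row in fmap)


def head_flatten_fc(fmap):
    """Position-dependent head: a fixed linear layer over the flattened map."""
    flat = [v for row in fmap for v in row]
    return sum((i + 1) * v for i, v in enumerate(flat))


def forward(img, head):
    """Run the toy network with the chosen output head."""
    return head(max_pool(conv_relu(img)))


if __name__ == "__main__":
    base = make_image(14, 14)      # object well inside the 10-pixel margin
    shifted = make_image(15, 14)   # same object moved down by one pixel
    print(forward(base, head_global_max), forward(shifted, head_global_max))
    print(forward(base, head_flatten_fc), forward(shifted, head_flatten_fc))
```

In this toy setting the global-max head returns the same value for both images, while the flatten-plus-linear head does not. A shift by a multiple of the pooling stride (2 pixels here) moves the pooled feature map by whole cells without changing its values, which is the kind of "particular translation" the last part of the question asks about. Whether the same holds for the network in Q8.3 depends on its actual layers.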
Machine Learning Fundamentals: A Concise Introduction
ISBN: 9781108940023
1st Edition
Authors: Hui Jiang