官术网_书友最值得收藏!

Additional data generation using affine transformation

We will use the keras ImageDataGenerator to generate additional data, using affine transformation on the image pixel coordinates. The transformations that we will primarily use are rotation, translation, and scaling. If the pixel spatial coordinate is defined by x = [x1x2]T ∈ R2, then the new coordinate of the pixel can be given by the following:

Here, M = R2x2 is the affine transformation matrix, and b = [b1 b2]T ∈ R2 is a translation vector.

The term b1 specifies the translation along one of the spatial directions, while b2 provides the translation along the other spatial dimension.

These transformations are required, because neural networks are not, in general, translational invariant, rotational invariant, or scale invariant. Pooling operations do provide some translational invariance, but it is generally not enough. The neural network doesn't treat one object in a specific location in an image and the same object at a translated location in another image as the same thing. That is why we require several instances of an image at different translated positions for the neural network to learn better. The same explanation applies to rotation and scaling.

主站蜘蛛池模板: 华容县| 龙里县| 高尔夫| 类乌齐县| 武安市| 河间市| 乌兰浩特市| 永仁县| 安溪县| 湖州市| 安徽省| 神池县| 弥渡县| 淮北市| 同仁县| 北流市| 滁州市| 镇江市| 岫岩| 南丹县| 穆棱市| 蒙山县| 贵德县| 平果县| 太保市| 修武县| 商南县| 玉环县| 册亨县| 古浪县| 利津县| 宜宾县| 象州县| 济阳县| 邹城市| 江门市| 林周县| 弥勒县| 厦门市| 长春市| 大名县|