官术网_书友最值得收藏!

Additional data generation using affine transformation

We will use the keras ImageDataGenerator to generate additional data, using affine transformation on the image pixel coordinates. The transformations that we will primarily use are rotation, translation, and scaling. If the pixel spatial coordinate is defined by x = [x1x2]T ∈ R2, then the new coordinate of the pixel can be given by the following:

Here, M = R2x2 is the affine transformation matrix, and b = [b1 b2]T ∈ R2 is a translation vector.

The term b1 specifies the translation along one of the spatial directions, while b2 provides the translation along the other spatial dimension.

These transformations are required, because neural networks are not, in general, translational invariant, rotational invariant, or scale invariant. Pooling operations do provide some translational invariance, but it is generally not enough. The neural network doesn't treat one object in a specific location in an image and the same object at a translated location in another image as the same thing. That is why we require several instances of an image at different translated positions for the neural network to learn better. The same explanation applies to rotation and scaling.

主站蜘蛛池模板: 镇巴县| 洪洞县| 西林县| 安康市| 会理县| 贺兰县| 凭祥市| 会泽县| 青铜峡市| 长海县| 如东县| 昌宁县| 榆林市| 格尔木市| 钦州市| 邓州市| 常州市| 沙雅县| 平舆县| 图片| 栾城县| 岱山县| 靖江市| 清水河县| 东宁县| 柳河县| 且末县| 鄢陵县| 五台县| 云安县| 基隆市| 安福县| 岳普湖县| 隆昌县| 武宁县| 霍城县| 房山区| 安阳市| 威远县| 潼关县| 宝坻区|