官术网_书友最值得收藏!

Additional data generation using affine transformation

We will use the keras ImageDataGenerator to generate additional data, using affine transformation on the image pixel coordinates. The transformations that we will primarily use are rotation, translation, and scaling. If the pixel spatial coordinate is defined by x = [x1x2]T ∈ R2, then the new coordinate of the pixel can be given by the following:

Here, M = R2x2 is the affine transformation matrix, and b = [b1 b2]T ∈ R2 is a translation vector.

The term b1 specifies the translation along one of the spatial directions, while b2 provides the translation along the other spatial dimension.

These transformations are required, because neural networks are not, in general, translational invariant, rotational invariant, or scale invariant. Pooling operations do provide some translational invariance, but it is generally not enough. The neural network doesn't treat one object in a specific location in an image and the same object at a translated location in another image as the same thing. That is why we require several instances of an image at different translated positions for the neural network to learn better. The same explanation applies to rotation and scaling.

主站蜘蛛池模板: 桂东县| 乐昌市| 五河县| 南汇区| 合阳县| 平潭县| 平谷区| 临泽县| 洪湖市| 宜兰县| 印江| 沙湾县| 兴义市| 玛曲县| 常熟市| 交口县| 承德市| 高尔夫| 延庆县| 葫芦岛市| 海门市| 汕头市| 股票| 曲阜市| 凉城县| 丰都县| 土默特左旗| 蕉岭县| 黄石市| 湛江市| 格尔木市| 平远县| 盐亭县| 年辖:市辖区| 云阳县| 夏河县| 明光市| 博白县| 吴川市| 乃东县| 卢氏县|