- Intelligent Projects Using Python
- Santanu Pattanayak
- 203字
- 2021-07-02 14:10:49
Additional data generation using affine transformation
We will use the keras ImageDataGenerator to generate additional data, using affine transformation on the image pixel coordinates. The transformations that we will primarily use are rotation, translation, and scaling. If the pixel spatial coordinate is defined by x = [x1x2]T ∈ R2, then the new coordinate of the pixel can be given by the following:

Here, M = R2x2 is the affine transformation matrix, and b = [b1 b2]T ∈ R2 is a translation vector.
The term b1 specifies the translation along one of the spatial directions, while b2 provides the translation along the other spatial dimension.
These transformations are required, because neural networks are not, in general, translational invariant, rotational invariant, or scale invariant. Pooling operations do provide some translational invariance, but it is generally not enough. The neural network doesn't treat one object in a specific location in an image and the same object at a translated location in another image as the same thing. That is why we require several instances of an image at different translated positions for the neural network to learn better. The same explanation applies to rotation and scaling.
- 筆記本電腦使用、維護與故障排除實戰
- Augmented Reality with Kinect
- Learning Stencyl 3.x Game Development Beginner's Guide
- 電腦軟硬件維修從入門到精通
- 微軟互聯網信息服務(IIS)最佳實踐 (微軟技術開發者叢書)
- Building 3D Models with modo 701
- 固態存儲:原理、架構與數據安全
- 單片機原理及應用:基于C51+Proteus仿真
- 單片機原理及應用
- UML精粹:標準對象建模語言簡明指南(第3版)
- The Reinforcement Learning Workshop
- 詳解FPGA:人工智能時代的驅動引擎
- Hands-On One-shot Learning with Python
- 快·易·通:2天學會電腦組裝·系統安裝·日常維護與故障排除
- 微處理器及控制電路識圖