- Intelligent Projects Using Python
- Santanu Pattanayak
- 203字
- 2021-07-02 14:10:49
Additional data generation using affine transformation
We will use the keras ImageDataGenerator to generate additional data, using affine transformation on the image pixel coordinates. The transformations that we will primarily use are rotation, translation, and scaling. If the pixel spatial coordinate is defined by x = [x1x2]T ∈ R2, then the new coordinate of the pixel can be given by the following:

Here, M = R2x2 is the affine transformation matrix, and b = [b1 b2]T ∈ R2 is a translation vector.
The term b1 specifies the translation along one of the spatial directions, while b2 provides the translation along the other spatial dimension.
These transformations are required, because neural networks are not, in general, translational invariant, rotational invariant, or scale invariant. Pooling operations do provide some translational invariance, but it is generally not enough. The neural network doesn't treat one object in a specific location in an image and the same object at a translated location in another image as the same thing. That is why we require several instances of an image at different translated positions for the neural network to learn better. The same explanation applies to rotation and scaling.
- micro:bit魔法修煉之Mpython初體驗(yàn)
- 電腦維護(hù)365問
- 筆記本電腦維修不是事兒(第2版)
- 計(jì)算機(jī)組裝與維護(hù)(第3版)
- CC2530單片機(jī)技術(shù)與應(yīng)用
- 電腦高級維修及故障排除實(shí)戰(zhàn)
- 筆記本電腦應(yīng)用技巧
- Intel Edison智能硬件開發(fā)指南:基于Yocto Project
- Internet of Things Projects with ESP32
- BeagleBone Robotic Projects
- Managing Data and Media in Microsoft Silverlight 4:A mashup of chapters from Packt's bestselling Silverlight books
- Neural Network Programming with Java(Second Edition)
- FPGA實(shí)驗(yàn)實(shí)訓(xùn)教程
- Building Machine Learning Systems with Python
- USB應(yīng)用分析精粹:從設(shè)備硬件、固件到主機(jī)端程序設(shè)計(jì)