- Intelligent Projects Using Python
- Santanu Pattanayak
- 2021-07-02 14:10:49
Additional data generation using affine transformation
We will use the Keras ImageDataGenerator to generate additional data by applying affine transformations to the image pixel coordinates. The transformations that we will primarily use are rotation, translation, and scaling. If the spatial coordinate of a pixel is defined by $x = [x_1 \; x_2]^T \in \mathbb{R}^2$, then the new coordinate of the pixel is given by the following:
$$x' = Mx + b$$
Here, $M \in \mathbb{R}^{2 \times 2}$ is the affine transformation matrix, and $b = [b_1 \; b_2]^T \in \mathbb{R}^2$ is a translation vector.
The term $b_1$ specifies the translation along one of the spatial directions, while $b_2$ provides the translation along the other spatial direction.
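To make the roles of the matrix and the translation vector concrete, the following is a minimal pure-Python sketch of the mapping from an old pixel coordinate to a new one. The helper names `affine_matrix` and `transform`, and the choice of building the matrix from a rotation angle and a uniform scale factor, are assumptions made for illustration. They are not part of Keras, and this is not how ImageDataGenerator is implemented internally.

```python
import math


def affine_matrix(angle_deg, scale):
    """Build a 2x2 matrix M that rotates by angle_deg and scales uniformly.

    Hypothetical helper for illustration only.
    """
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    return [[scale * c, -scale * s],
            [scale * s, scale * c]]


def transform(x, M, b):
    """Apply x' = M x + b to a single 2D coordinate x = (x1, x2)."""
    x1, x2 = x
    return (M[0][0] * x1 + M[0][1] * x2 + b[0],
            M[1][0] * x1 + M[1][1] * x2 + b[1])


# Rotate by 90 degrees, double the scale, then translate by b = (b1, b2).
M = affine_matrix(angle_deg=90, scale=2.0)
b = (5.0, -3.0)
new_x = transform((1.0, 0.0), M, b)
print(new_x)
```

Setting the angle to 0 and the scale to 1 makes the matrix the identity, so the mapping reduces to a pure translation by the vector b.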
These transformations are required because neural networks are not, in general, invariant to translation, rotation, or scale. Pooling operations do provide some translation invariance, but it is generally not enough. A neural network does not treat an object at a specific location in one image and the same object at a translated location in another image as the same thing. That is why we need several instances of an image at different translated positions for the neural network to learn better. The same reasoning applies to rotation and scaling.
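The point about translation can be illustrated with a toy example. The sketch below shifts a small binary image and compares the raw pixel vectors before and after the shift. The `translate` and `flatten` helpers and the 5x5 image are hypothetical, and the pixel-wise dot product is only a crude stand-in for what the first layer of a network sees, not a model of any particular architecture.

```python
def translate(image, dx, dy, fill=0):
    """Shift a 2D list-of-lists image by dx columns and dy rows.

    Pixels shifted outside the frame are dropped; vacated cells get `fill`.
    """
    h, w = len(image), len(image[0])
    out = [[fill] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            nr, nc = r + dy, c + dx
            if 0 <= nr < h and 0 <= nc < w:
                out[nr][nc] = image[r][c]
    return out


def flatten(image):
    """Flatten a 2D image into a 1D pixel vector (row-major order)."""
    return [p for row in image for p in row]


# A 2x2 "object" in the top-left corner of a 5x5 image.
image = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

# The same object moved to the bottom-right corner.
shifted = translate(image, dx=3, dy=3)

# Pixel-wise overlap between the two inputs: the object is identical,
# but the raw input vectors share no active pixels.
overlap = sum(a * b for a, b in zip(flatten(image), flatten(shifted)))

# Additional training instances: the object at every shift that keeps it in frame.
augmented = [translate(image, dx, dy) for dx in range(4) for dy in range(4)]
print(overlap, len(augmented))
```

In this toy setting the two images contain the same object yet have no active pixels in common, which is why supplying the object at many positions gives the network more to learn from.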