- Practical Convolutional Neural Networks
- Mohit Sewak Md. Rezaul Karim Pradeep Pujari
- 332字
- 2021-06-24 18:58:50
Introduction to the MNIST dataset
Here we use MNIST (Modified National Institute of Standards and Technology), which consists of images of handwritten numbers and their labels. Since its release in 1999, this classic dataset is used for benchmarking classification algorithms.
The data files train.csv and test.csv consist of hand-drawn digits, from 0 through 9 in the form of gray-scale images. A digital image is a mathematical function of the form f(x,y)=pixel value. The images are two dimensional.
We can perform any mathematical function on the image. By computing the gradient on the image, we can measure how fast pixel values are changing and the direction in which they are changing. For image recognition, we convert the image into grayscale for simplicity and have one color channel. RGB representation of an image consists of three color channels, RED, BLUE, and GREEN. In the RGB color scheme, an image is a stack of three images RED, BLUE, and GREEN. In a grayscale color scheme, color is not important. Color images are computationally harder to analyze because they take more space in memory. Intensity, which is a measure of the lightness and darkness of an image, is very useful for recognizing objects. In some applications, for example, detecting lane lines in a self-driving car application, color is important because it has to distinguish yellow lanes and white lanes. A grayscale image does not provide enough information to distinguish between white and yellow lane lines.
Any grayscale image is interpreted by the computer as a matrix with one entry for each image pixel. Each image is 28 x 28 pixels in height and width, to give a sum of 784 pixels. Each pixel has a single pixel-value associated with it. This value indicates the lightness or darkness of that particular pixel. This pixel-value is an integer ranging from 0 to 255, where a value of zero means darkest and 255 is the whitest, and a gray pixel is between 0 and 255.
- 計算機綜合設計實驗指導
- Java Data Science Cookbook
- Creating Mobile Apps with Sencha Touch 2
- 使用GitOps實現Kubernetes的持續部署:模式、流程及工具
- Learning Spring Boot
- 深入淺出 Hyperscan:高性能正則表達式算法原理與設計
- Splunk智能運維實戰
- 大數據技術原理與應用:概念、存儲、處理、分析與應用
- Filecoin原理與實現
- 實現領域驅動設計
- AndEngine for Android Game Development Cookbook
- 基于數據發布的隱私保護模型研究
- 深入理解Flink:實時大數據處理實踐
- 一類智能優化算法的改進及應用研究
- Machine Learning for Mobile