官术网_书友最值得收藏!

The softmax function

The softmax function is mainly used to handle classification problems and preferably used in the output layer, outputting the probabilities of the output classes. As seen earlier, while solving the binary logistic regression, we witnessed that the sigmoid function was able to handle only two classes. In order to handle multi-class we need a function that can generate values for all the classes and those values follow the rules of probability. This objective is fulfilled by the softmax function, which shrinks the outputs for each class between 0 and 1 and divides them by the sum of the outputs for all the classes:

For examples, , where x refers to four classes.

Then, the softmax function will gives results (rounded to three decimal places) as:

Thus, we see the probabilities of all the classes. Since the output of every classifier demands probabilistic values for all the classes, the softmax function becomes the best candidate for the outer layer activation function of the classifier.

主站蜘蛛池模板: 铜川市| 合水县| 张家口市| 沁阳市| 尖扎县| 天台县| 毕节市| 乐都县| 乾安县| 柞水县| 大同市| 三台县| 乡城县| 堆龙德庆县| 甘南县| 成安县| 霍邱县| 建水县| 镇巴县| 综艺| 靖远县| 呼图壁县| 宜兰市| 英吉沙县| 巴青县| 醴陵市| 西宁市| 兴山县| 镇宁| 镇远县| 宜州市| 舟山市| 和硕县| 咸丰县| 类乌齐县| 吕梁市| 青神县| 巴彦淖尔市| 海盐县| 巩留县| 灌云县|