官术网_书友最值得收藏!

Naive Bayes – pros and cons

In this section, we present the advantages and disadvantages in selecting the Naive Bayes algorithm for classification problems.

These are the pros:

  • Training time: The Naive Bayes algorithm only requires one pass on the entire dataset to calculate the posterior probabilities for each value of the feature in the dataset. So, when we are dealing with large datasets or low-budget hardware, the Naive Bayes algorithm is a feasible choice for most data scientists.

  • Prediction time: Since all the probabilities are pre-computed in the Naive Bayes algorithm, the prediction time of this algorithm is very efficient.

  • Transparency: Since the predictions of Naive Bayes algorithms are based on the posterior probability of each conditional feature, it is easy to understand which features are influencing the predictions. This helps users to understand the predictions.

These are the cons:

  • Prediction accuracy: The prediction accuracy of the Naive Bayes algorithm is lower than other algorithms we will discuss in the book. Algorithm prediction accuracy is dataset dependent. A lot of research has proved that algorithms such as random forest, support vector machines (SVMs), and deep neural networks (DNNs) outperform the Naive Bayes algorithm in terms of classification accuracy. 

  • Assumption of independence: Since we assume that the features are independent of each other, this algorithm may lose information for features that are dependent on each other. Other advanced algorithms do use this dependence information when calculating predictions. 

主站蜘蛛池模板: 彰武县| 双柏县| 乌海市| 博客| 永兴县| 科尔| 婺源县| 靖边县| 法库县| 龙山县| 永川市| 岑巩县| 呼图壁县| 广丰县| 清远市| 右玉县| 贵德县| 廊坊市| 柳林县| 承德市| 高台县| 康马县| 东源县| 金乡县| 光山县| 罗田县| 浮梁县| 吴桥县| 夏河县| 水城县| 双峰县| 改则县| 汕头市| 铁岭县| 阜康市| 福清市| 建宁县| 临夏市| 晋江市| 丁青县| 长治县|