官术网_书友最值得收藏!

Scoring of unusualness

Once a model has been constructed, the likelihood of any future observed value can be found within the probability distribution. As described earlier, we had asked the question, "Is getting fifteen pieces of mail likely?". This question can now be empirically answered, depending on the model, with a number between zero (no possibility) and one (absolute certainty). ML will use the model to calculate this fractional value out to approximately 300 significant figures (which can be helpful when dealing with very low probabilities). Let's observe the following graph:

ML calculates the probability of the dip in value in this time series

Here, the probability of the observation of the actual value of 921 at this point in time was calculated to be 6.3634e-7 (or more commonly a mere 0.000063634% chance). This very small value is perhaps not that intuitive to most people. As such, ML will take this probability calculation, and via a process of quantile normalization, re-cast that observation on a severity scale between 0 and 100, where 100 is the highest level of unusualness possible for that particular dataset. In the preceding case, the probability calculation of 6.3634e-7 was normalized to a score of 94. This normalized score will come in handy later as a means by which to assess the severity of the anomaly for purposes of alerting and/or triage.

主站蜘蛛池模板: 卓尼县| 西青区| 新泰市| 昆山市| 襄樊市| 临沧市| 庄浪县| 通渭县| 遵义县| 嵊州市| 塘沽区| 理塘县| 云阳县| 仙游县| 河津市| 正安县| 吉水县| 清水河县| 家居| 贡觉县| 江门市| 新和县| 保康县| 仪陇县| 海丰县| 尼玛县| 内丘县| 土默特左旗| 望城县| 昆明市| 张北县| 化隆| 巴林右旗| 乐安县| 阳信县| 博罗县| 汉源县| 西充县| 子长县| 芜湖县| 石林|