Looking into the basics of the Apriori algorithm

The Apriori algorithm is part of our affinity analysis methodology and deals specifically with finding frequent itemsets within the data. The basic procedure of Apriori builds up new candidate itemsets from previously discovered frequent itemsets. These candidates are tested to see if they are frequent, and then the algorithm iterates as explained here:

  1. Create the initial frequent itemsets by placing each item in its own itemset. Only items with at least the minimum support are kept in this step.
  2. Create new candidate itemsets from the most recently discovered frequent itemsets by finding supersets of those itemsets.
  3. Test every candidate itemset to see whether it is frequent, and discard any candidate that is not. If this step yields no new frequent itemsets, go to the last step.
  4. Store the newly discovered frequent itemsets and go back to the second step.
  5. Return all of the discovered frequent itemsets.
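
The five steps above can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: the function name `apriori`, the helper `count_support`, the sample transactions, and the choice to express the minimum support as an absolute transaction count are all assumptions made for this example.

```python
from collections import defaultdict


def apriori(transactions, min_support):
    """Map each frequent itemset (a frozenset) to its support count.

    min_support is assumed to be an absolute number of transactions.
    """
    transactions = [frozenset(t) for t in transactions]

    def count_support(candidates):
        # Count how many transactions contain each candidate itemset,
        # then keep only the candidates that reach the minimum support.
        counts = defaultdict(int)
        for transaction in transactions:
            for candidate in candidates:
                if candidate.issubset(transaction):
                    counts[candidate] += 1
        return {c: n for c, n in counts.items() if n >= min_support}

    # Step 1: each item starts in its own itemset; keep the frequent ones.
    singletons = {frozenset([item]) for t in transactions for item in t}
    current = count_support(singletons)
    all_frequent = dict(current)

    while current:
        # Step 2: build supersets one item larger than the latest
        # frequent itemsets, using only items seen in those itemsets.
        items = {item for itemset in current for item in itemset}
        candidates = {
            itemset | {item}
            for itemset in current
            for item in items
            if item not in itemset
        }
        # Step 3: test the candidates and discard the infrequent ones.
        current = count_support(candidates)
        # Step 4: store the new frequent itemsets and iterate.
        all_frequent.update(current)

    # Step 5: return everything discovered.
    return all_frequent


# Hypothetical shopping baskets, used only to exercise the sketch.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
result = apriori(baskets, min_support=2)
for itemset, support in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), support)
```

The loop ends as soon as a pass produces no new frequent itemsets, which corresponds to the jump to the last step described in step 3. Candidates are built only from items that appear in the latest frequent itemsets, because an itemset can be frequent only if all of its subsets are frequent.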

This process is outlined in the following workflow: