官术网_书友最值得收藏!

Looking into the basics of the Apriori algorithm

The Apriori algorithm is part of our affinity analysis methodology and deals specifically with finding frequent itemsets within the data. The basic procedure of Apriori builds up new candidate itemsets from previously discovered frequent itemsets. These candidates are tested to see if they are frequent, and then the algorithm iterates as explained here:

  1. Create initial frequent itemsets by placing each item in its own itemset. Only items with at least the minimum support are used in this step.
  2. New candidate itemsets are created from the most recently discovered frequent itemsets by finding supersets of the existing frequent itemsets.
  3. All candidate itemsets are tested to see if they are frequent. If a candidate is not frequent then it is discarded. If there are no new frequent itemsets from this step, go to the last step.
  4. Store the newly discovered frequent itemsets and go to the second step.
  5. Return all of the discovered frequent itemsets.

This process is outlined in the following workflow:

主站蜘蛛池模板: 朔州市| 甘孜| 长泰县| 友谊县| 溆浦县| 开江县| 惠州市| 平山县| 东阳市| 兴化市| 百色市| 若羌县| 开原市| 新化县| 江孜县| 乌兰察布市| 兰西县| 阿拉善右旗| 卫辉市| 娄底市| 福州市| 黄浦区| 沽源县| 娱乐| 玛曲县| 汉寿县| 怀集县| 衢州市| 综艺| 额敏县| 叙永县| 东明县| 遵化市| 武汉市| 磐石市| 阳高县| 顺义区| 温宿县| 阳高县| 西林县| 五华县|