- Learning Data Mining with Python(Second Edition)
- Robert Layton
- 220字
- 2021-07-02 23:40:11
Overall methodology
To perform association rule mining for affinity analysis, we first use the Apriori algorithm to generate frequent itemsets. Next, we create association rules (for example, if a person recommended movie X, they would also recommend movie Y) by testing combinations of premises and conclusions within those frequent itemsets.
- For the first stage, the Apriori algorithm needs a value for the minimum support that an itemset needs to be considered frequent. Any itemsets with less support will not be considered.
Setting this minimum support too low will cause Apriori to test a larger number of itemsets, slowing the algorithm down. Setting it too high will result in fewer itemsets being considered frequent.
- In the second stage, after the frequent itemsets have been discovered, association rules are tested based on their confidence. We could choose a minimum confidence level, a number of rules to return, or simply return all of them and let the user decide what to do with them.
In this chapter, we will return only rules above a given confidence level. Therefore, we need to set our minimum confidence level. Setting this too low will result in rules that have a high support, but are not very accurate. Setting this higher will result in only more accurate rules being returned, but with fewer rules being discovered overall.
- 大學(xué)計(jì)算機(jī)應(yīng)用基礎(chǔ)實(shí)踐教程
- Java FX應(yīng)用開發(fā)教程
- Python金融數(shù)據(jù)分析
- Java程序設(shè)計(jì)與實(shí)踐教程(第2版)
- Teaching with Google Classroom
- 單片機(jī)C語(yǔ)言程序設(shè)計(jì)實(shí)訓(xùn)100例
- Illustrator CS6設(shè)計(jì)與應(yīng)用任務(wù)教程
- Greenplum構(gòu)建實(shí)時(shí)數(shù)據(jù)倉(cāng)庫(kù)實(shí)踐
- INSTANT Premium Drupal Themes
- Jakarta EE Cookbook
- 計(jì)算機(jī)應(yīng)用基礎(chǔ)
- 人件集:人性化的軟件開發(fā)
- 深入理解MySQL主從原理
- 秒懂算法:用常識(shí)解讀數(shù)據(jù)結(jié)構(gòu)與算法
- Effective Python:編寫高質(zhì)量Python代碼的90個(gè)有效方法(原書第2版)