Downloading the example code

You can download the example code files for all the Packt Publishing books you have purchased from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files emailed directly to you. I've also set up a GitHub repository that contains a live version of the code, along with new fixes, updates, and so on. You can retrieve the code and datasets from the repository here: https://github.com/dataPipelineAU/LearningDataMiningWithPython2

You can read the dataset by looking at one row (horizontal line) at a time. The first row (0, 1, 0, 0, 0) shows the items purchased in the first transaction. Each column (vertical line) represents one of the items: bread, milk, cheese, apples, and bananas, respectively. Therefore, in the first transaction, the person bought milk, but not bread, cheese, apples, or bananas. Add the following line in a new cell to allow us to turn these feature numbers into actual words:

features = ["bread", "milk", "cheese", "apples", "bananas"]
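As a minimal sketch of how this list can be used, the snippet below maps a row of 0s and 1s back to item names. The row `sample_row` and the helper `purchased_items` are hypothetical names introduced only for illustration; they are not part of the book's code.

```python
# Same feature list as above, repeated so this snippet runs on its own
features = ["bread", "milk", "cheese", "apples", "bananas"]

# Hypothetical transaction (an assumption for illustration, not taken from the dataset)
sample_row = [1, 0, 0, 1, 0]


def purchased_items(row, names):
    """Return the names whose corresponding value in `row` is nonzero."""
    return [name for name, value in zip(names, row) if value]


print(purchased_items(sample_row, features))  # ['bread', 'apples']
```

Pairing each value with its name through `zip` keeps the lookup tied to column order, so the list of names must be in the same order as the dataset's columns.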

Each of these features contains binary values, stating only whether the item was purchased and not how many were purchased. A 1 indicates that at least one item of this type was bought, while a 0 indicates that none of that item was purchased. For a real-world dataset, using exact figures or a larger threshold would be required.
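To illustrate the thresholding idea, here is a small sketch that converts purchase quantities into binary values. The quantities and the `binarize` helper are assumptions made up for this example; the dataset in this chapter already comes in binary form.

```python
# Hypothetical purchase quantities for one transaction, in the order
# bread, milk, cheese, apples, bananas (invented for illustration)
quantities = [0, 2, 0, 5, 1]


def binarize(counts, threshold=1):
    """Map each count to 1 if it reaches `threshold`, otherwise 0."""
    return [1 if count >= threshold else 0 for count in counts]


print(binarize(quantities))               # [0, 1, 0, 1, 1]
print(binarize(quantities, threshold=2))  # [0, 1, 0, 1, 0]
```

With the default threshold of 1, any purchase counts as a 1, which matches the encoding described above. Raising the threshold treats small purchases as if the item had not been bought.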
