官术网_书友最值得收藏!

Spark MLlib

MLlib ALS algorithm takes the training data of the RDD type, that is, Distributed Datasets [Rating] and trains a model, which is a MatrixFactorizationModel object.

RDD  is a special data type supported by Spark. The RDD format is immutable, and they run on clusters and can operate in Parallel. One can perform on an RDD class.

The technique we are using here is known as . Let's assume User A likes Product A, Product B, and Product C and rated them with a score. Then, let's assume User B likes Product B, Product C, and Product D and gave a similar rating to the score User A gave for Product B and Product C. Now, using Collaborative Filtering, one can find out what User A would rate for Product D or what User B would rate for Product A as we have some commonality between User A and User B--they both rated Product B and Product C similarly.

主站蜘蛛池模板: 桂东县| 昔阳县| 司法| 会昌县| 晋中市| 汶川县| 柘城县| 新巴尔虎左旗| 沭阳县| 台北县| 漳浦县| 维西| 武城县| 海口市| 三门县| 墨竹工卡县| 白水县| 沐川县| 蓝田县| 岢岚县| 确山县| 凤冈县| 抚顺县| 莒南县| 南阳市| 延庆县| 七台河市| 吉安县| 泸州市| 东兰县| 康马县| 印江| 石狮市| 随州市| 万宁市| 贵溪市| 沁水县| 兴和县| 砀山县| 米泉市| 祁连县|