官术网_书友最值得收藏!

Data gathering

We need to obtain data and organize it appropriately for the current problem (in our example, this could mean building a dataset linking users to songs they've listened to in the past). Depending on the size of the data, we might pick different technologies for storing the data. For example, it might be fine to train on a local machine using scikit-learn if we're working through a few million records. However, if the data doesn't fit on a single computer, then we must consider AWS solutions such as S3 for storage and Apache Spark, or SageMaker's built-in algorithms for model building.

主站蜘蛛池模板: 深圳市| 新乐市| 龙门县| 英德市| 耒阳市| 岫岩| 宜春市| 呈贡县| 南部县| 乾安县| 尼玛县| 信阳市| 边坝县| 黄骅市| 兴义市| 台东市| 光泽县| 正宁县| 宜章县| 延川县| 锦州市| 顺平县| 墨脱县| 枣庄市| 江安县| 阿克| 老河口市| 峨眉山市| 苗栗县| 西平县| 介休市| 比如县| 黑山县| 隆尧县| 广东省| 三门峡市| 大丰市| 于田县| 枣强县| 余庆县| 岑溪市|