
Algorithm selection

We need to iterate on the complex problem of creating the algorithm. This entails exploring the data to gain a deep understanding of the underlying variables. Once we have an idea of the kind of algorithm we want to apply, we'll need to further prepare the data, possibly combining it with other data sources (for example, census data). In our example, this could mean creating a song similarity matrix. Once the data is prepared, we can train a model to make predictions and test that model against holdout data to see how it performs. Several considerations make this process complex:

  • How the data is encoded (for example, how the song matrix is constructed)
  • What algorithm is used (for example, collaborative filtering or content-based filtering)
  • What parameter values your model takes (for example, values for smoothing constants or prior distributions)
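
To make these choices concrete, here is a minimal sketch of one possible iteration: the data is encoded as a users-by-songs play-count matrix, the algorithm is a simple item-based collaborative filter built on a cosine song similarity matrix, and the model is scored against a few held-out entries. The play counts, the function names (song_similarity, predict), and the choice of cosine similarity are all assumptions made for illustration, not the method used later in this book.

```python
import math

def song_similarity(plays):
    """Cosine similarity between the song columns of a users x songs matrix."""
    n_songs = len(plays[0])
    cols = [[row[j] for row in plays] for j in range(n_songs)]
    norms = [math.sqrt(sum(v * v for v in col)) for col in cols]
    sim = [[0.0] * n_songs for _ in range(n_songs)]
    for i in range(n_songs):
        for j in range(n_songs):
            # Leave the diagonal at zero so a song never votes for itself.
            if i != j and norms[i] > 0 and norms[j] > 0:
                dot = sum(a * b for a, b in zip(cols[i], cols[j]))
                sim[i][j] = dot / (norms[i] * norms[j])
    return sim

def predict(plays, sim, user, song):
    """Similarity-weighted average over the songs this user has played."""
    pairs = [(sim[song][j], count)
             for j, count in enumerate(plays[user]) if count > 0]
    total = sum(weight for weight, _ in pairs)
    if total == 0:
        return 0.0  # no similar song played by this user: nothing to go on
    return sum(weight * count for weight, count in pairs) / total

# Hypothetical play counts: rows are users, columns are songs.
plays = [
    [5, 4, 0, 0],
    [4, 5, 0, 1],
    [0, 0, 5, 4],
    [0, 1, 4, 5],
]

# Hide a few known (user, song) entries to act as holdout data.
holdout = [(0, 1), (2, 3)]
train = [row[:] for row in plays]
for user, song in holdout:
    train[user][song] = 0

# Train on the remaining data, then measure error on the hidden entries.
sim = song_similarity(train)
errors = [predict(train, sim, user, song) - plays[user][song]
          for user, song in holdout]
rmse = math.sqrt(sum(e * e for e in errors) / len(errors))
print(f"holdout RMSE: {rmse:.3f}")
```

Changing any one of the three choices (the encoding of plays, the similarity measure, or how predict weights its neighbors) and re-running the holdout check is one pass of the iteration described above.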

Our goal in this book is to make this step easier for you by walking through the iterations a data scientist would go through to build a successful model, using real-world applications as examples.
