官术网_书友最值得收藏!

Features

When we are talking about features in the context of ML , what we mean is some characteristic property of the object or phenomenon we are investigating.

Other names for the same concept you'll see in some publications are explanatory variable, independent variable, and predictor.

Features are used to distinguish objects from each other and to measure the similarity between them.

For instance:

  • If the objects of our interest are books, features could be a title, page count, author's name, a year of publication, genre, and so on
  • If the objects of interest are images, features could be intensities of each pixel
  • If the objects are blog posts, features could be language, length, or presence of some terms
It's useful to imagine your data as a spreadsheet table. In this case, each sample (data point) would be a row, and each feature would be a column. For example, Table 1.1 shows a tiny dataset of books consisting of four samples where each has eight features.

Table 1.1: an example of a ML dataset (dummy books):

主站蜘蛛池模板: 固阳县| 榆中县| 屯昌县| 九江市| 碌曲县| 长泰县| 兴海县| 淮安市| 德庆县| 扶绥县| 札达县| 沈阳市| 封丘县| 察雅县| 玛纳斯县| 德阳市| 大竹县| 赫章县| 武乡县| 乌拉特前旗| 从江县| 荣昌县| 盐源县| 马龙县| 屏东市| 金平| 昌黎县| 林口县| 平顺县| 兴仁县| 古田县| 连城县| 边坝县| 大宁县| 红原县| 双鸭山市| 康马县| 姜堰市| 黔南| 外汇| 平江县|