
Summary

Feature engineering is a large undertaking for data scientists and machine learning engineers, and it is essential to building successful, production-ready machine learning pipelines. Over the coming seven chapters, we are going to explore six major aspects of feature engineering:

  • Feature understanding: learning how to identify data based on its qualities and quantitative state
  • Feature improvement: cleaning and imputing missing data values in order to maximize the dataset's value
  • Feature selection: statistically selecting and subsetting feature sets in order to reduce the noise in our data
  • Feature construction: building new features with the intention of exploiting feature interactions
  • Feature transformation: extracting latent (hidden) structure within datasets in order to mathematically transform our datasets into something new (and usually better)
  • Feature learning: harnessing the power of deep learning to view data in a whole new light that will open up new problems to be solved

In this book, we will be exploring feature engineering as it relates to our machine learning endeavors. By breaking this large topic down into subtopics and diving deep into each one in separate chapters, we will be able to get a much broader and more useful understanding of how these procedures work and how to apply each one in Python.

In our next chapter, we will dive straight into our first subtopic, feature understanding. We will finally be getting our hands on some real data, so let's begin!
