官术网_书友最值得收藏!

  • Hands-On Data Science with R
  • Vitor Bianchi Lanzetta Nataraj Dasgupta Ricardo Anjoleto Farias
  • 170字
  • 2021-06-10 19:12:25

Domain knowledge

More often than data scientists would like to admit, machine learning models produce results that are obvious and intuitive. For instance, we once conducted an elaborate analysis of physicians, prescribing behavior to find out the strongest predictor of how many prescriptions a physician would write in the next quarter. We used a broad set of input variables such as the physicians locations, their specialties, hospital affiliations, prescribing history, and other data. In the end, the best performing model produced a result that we all knew very well. The strongest predictor of how many prescriptions a physician would write in the next quarter was the number of prescriptions the physician had written in the previous quarter! To filter out the truly meaningful variables and build a more robust model, we eventually had to engage someone who had extensive experience of working in the pharma industry. Machine learning models work best when produced in a hybrid approach—one that combines domain expertise along with the sophistication of models developed.

主站蜘蛛池模板: 巴青县| 施秉县| 兴海县| 江津市| 永丰县| 仙桃市| 钟山县| 海城市| 明溪县| 旺苍县| 石河子市| 页游| 东港市| 浪卡子县| 石景山区| 余姚市| 望江县| 邵武市| 天台县| 横峰县| 来安县| 营山县| 滦平县| 原阳县| 内丘县| 全南县| 台北市| 饶阳县| 东乌珠穆沁旗| 江口县| 湖口县| 芷江| 蒲江县| 松滋市| 通河县| 邢台县| 雷山县| 九台市| 读书| 日照市| 湾仔区|