官术网_书友最值得收藏!

Domain knowledge

More often than data scientists would like to admit, machine learning models produce results that are obvious and intuitive. For instance, we once conducted an elaborate analysis of physicians, prescribing behavior to find out the strongest predictor of how many prescriptions a physician would write in the next quarter. We used a broad set of input variables such as the physicians locations, their specialties, hospital affiliations, prescribing history, and other data. In the end, the best performing model produced a result that we all knew very well. The strongest predictor of how many prescriptions a physician would write in the next quarter was the number of prescriptions the physician had written in the previous quarter! To filter out the truly meaningful variables and build a more robust model, we eventually had to engage someone who had extensive experience of working in the pharma industry. Machine learning models work best when produced in a hybrid approach—one that combines domain expertise along with the sophistication of models developed.

主站蜘蛛池模板: 清水河县| 沿河| 迁安市| 中西区| 乾安县| 扎赉特旗| 大埔县| 龙里县| 廊坊市| 嵩明县| 涪陵区| 丹江口市| 资兴市| 巴彦淖尔市| 宁德市| 武清区| 乌苏市| 新田县| 昌图县| 昌江| 蓝山县| 乌鲁木齐市| 盐津县| 南康市| 河间市| 剑河县| 文成县| 达州市| 莆田市| 阿巴嘎旗| 北海市| 论坛| 新民市| 鹤山市| 西林县| 玛沁县| 新平| 鄂托克旗| 东乌珠穆沁旗| 分宜县| 大田县|