官术网_书友最值得收藏!

Coding

Coding or statistical coding is again a process that a data scientist will use to prepare data for analysis. In this process, both quantitative data values (such as income or years of education) and qualitative data (such as race or gender) are categorized or coded in a consistent way.

Coding is performed by a data scientist for various reasons such as follows:

  • More effective for running statistical models
  • Computers understand the variables
  • Accountability--so the data scientist can run models blind, or without knowing what variables stand for, to reduce programming/author bias
You can imagine the process of coding as the means to transform data into a form required for a system or application.
主站蜘蛛池模板: 玉门市| 威海市| 五莲县| 天长市| 保山市| 廊坊市| 正阳县| 兰考县| 扎赉特旗| 芦山县| 天峨县| 桂林市| 太仓市| 辛集市| 福贡县| 石屏县| 巫溪县| 呼伦贝尔市| 延寿县| 曲水县| 高雄市| 沛县| 凤阳县| 六盘水市| 张家口市| 孝感市| 宁乡县| 乌恰县| 五峰| 花莲市| 榆中县| 广西| 三原县| 九龙县| 遵化市| 陵水| 博客| 安塞县| 白银市| 郎溪县| 威海市|