官术网_书友最值得收藏!

Statistical inference

What developer at some point in his or her career, had to create a sample or test data? For example, I've often created a simple script to generate a random number (based upon the number of possible options or choices) and then used that number as the selected option (in my test recordset). This might work well for data development, but with statistics and data science, this is not sufficient.

To create sample data (or a sample population), the data scientist will use a process called statistical inference, which is the process of deducing options of an underlying distribution through analysis of the data you have or are trying to generate for. The process is sometimes called inferential statistical analysis and includes testing various hypotheses and deriving estimates.

When the data scientist determines that a recordset (or population) should be larger than it actually is, it is assumed that the recordset is a sample from a larger population, and the data scientist will then utilize statistical inference to make up the difference.

The data or recordset in use is referred to by the data scientist as the observed data. Inferential statistics can be contrasted with descriptive statistics, which is only concerned with the properties of the observed data and does not assume that the recordset came from a larger population.
主站蜘蛛池模板: 阿拉善左旗| 长兴县| 周至县| 泗水县| 讷河市| 广德县| 高阳县| 水城县| 吉首市| 贵南县| 弋阳县| 米林县| 榆社县| 章丘市| 两当县| 永吉县| 平原县| 高平市| 介休市| 青河县| 鲁甸县| 城口县| 天峨县| 延吉市| 昭苏县| 苍梧县| 深州市| 武胜县| 湘阴县| 车险| 灯塔市| 涟源市| 芒康县| 丁青县| 平塘县| 巩留县| 洱源县| 苗栗市| 绥中县| 镇沅| 尼木县|