Statistical inference

What developer, at some point in his or her career, has not had to create sample or test data? For example, I've often written a simple script that generates a random number (bounded by the number of possible options or choices) and then uses that number as the selected option in my test recordset. This might work well for data development, but for statistics and data science it is not sufficient.
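The kind of script described above might look like the following minimal sketch. The function name `make_test_records`, the record layout, and the option values are all hypothetical, chosen only for illustration.

```python
import random

def make_test_records(options, n_records, seed=None):
    """Build n_records test rows, each with one option picked uniformly at random."""
    rng = random.Random(seed)  # a seed makes the test data reproducible
    records = []
    for record_id in range(1, n_records + 1):
        # The random number is bounded by the number of possible options.
        choice_index = rng.randrange(len(options))
        records.append({"id": record_id, "option": options[choice_index]})
    return records

# Hypothetical option list and recordset size.
records = make_test_records(["red", "green", "blue"], n_records=5, seed=42)
for row in records:
    print(row)
```

Every option is equally likely here, regardless of how the real data is distributed, which is why this approach falls short for statistical work.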

To create sample data (or a sample population), the data scientist uses a process called statistical inference: deducing the properties of an underlying distribution by analyzing the data you have or the data you are trying to generate. The process is sometimes called inferential statistical analysis, and it includes testing hypotheses and deriving estimates.

When the data scientist determines that a recordset (or population) should be larger than it actually is, it is assumed that the recordset is a sample from a larger population, and the data scientist will then utilize statistical inference to make up the difference.
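One way to picture "making up the difference" is to estimate a distribution from the observed records and then draw the missing records from it. The sketch below assumes the values are roughly normally distributed, which is an assumption of this example and not something the text prescribes. The observed values and the target size are hypothetical.

```python
from statistics import NormalDist

# Hypothetical observed recordset, treated as a sample from a larger population.
observed = [52.0, 47.5, 50.3, 49.1, 51.8, 48.6, 50.9, 49.4]
target_size = 20  # hypothetical size the recordset "should" have

# Infer the underlying distribution: estimate mean and standard deviation
# from the observed data (normality is assumed, not verified).
fitted = NormalDist.from_samples(observed)

# Draw the missing records from the fitted distribution.
n_missing = target_size - len(observed)
synthetic = fitted.samples(n_missing, seed=7)

expanded = observed + synthetic
print(f"estimated mean={fitted.mean:.2f}, stdev={fitted.stdev:.2f}")
print(f"records: {len(observed)} observed + {len(synthetic)} generated")
```

Unlike a uniform random pick, the generated values follow the shape estimated from the data in hand. In practice the choice of distribution would itself need to be checked against the observed data.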

The data or recordset in use is referred to by the data scientist as the observed data. Inferential statistics can be contrasted with descriptive statistics, which is only concerned with the properties of the observed data and does not assume that the recordset came from a larger population.
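The contrast can be illustrated with a small sketch. The first half only summarizes the observed data (descriptive); the second half treats the data as a sample and estimates a range for the mean of the larger population (inferential). The data values are hypothetical, and the interval uses a normal approximation as a simplifying assumption; with a sample this small, a t-distribution would normally be preferred.

```python
import math
import statistics

# Hypothetical observed data.
observed = [12.1, 9.8, 11.4, 10.7, 10.2, 11.9, 9.5, 10.9]

# Descriptive statistics: properties of the observed data only.
sample_mean = statistics.mean(observed)
sample_sd = statistics.stdev(observed)

# Inferential statistics: assume the data is a sample from a larger
# population and estimate that population's mean with a 95% interval.
standard_error = sample_sd / math.sqrt(len(observed))
z = statistics.NormalDist().inv_cdf(0.975)  # about 1.96
ci_low = sample_mean - z * standard_error
ci_high = sample_mean + z * standard_error

print(f"observed mean: {sample_mean:.2f}, observed stdev: {sample_sd:.2f}")
print(f"approximate 95% interval for the population mean: "
      f"({ci_low:.2f}, {ci_high:.2f})")
```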