官术网_书友最值得收藏!

Description of the dataset

A dataset from the Allstate Insurance company will be used, which consists of more than 300,000 examples with masked and anonymous data and consisting of more than 100 categorical and numerical attributes, thus being compliant with confidentiality constraints, more than enough for building and evaluating a variety of ML techniques.

The dataset is downloaded from the Kaggle website at https://www.kaggle.com/c/allstate-claims-severity/data. Each row in the dataset represents an insurance claim. Now, the task is to predict the value for the loss column. Variables prefaced with cat are categorical, while those prefaced with cont are continuous.

It is to be noted that the Allstate Corporation is the second largest insurance company in the United States, founded in 1931. We are trying to make the whole thing automated, to predict the cost, and hence the severity, of accident and damage claims.

主站蜘蛛池模板: 平罗县| 迭部县| 图木舒克市| 眉山市| 汝阳县| 榆社县| 长白| 龙山县| 建昌县| 陆河县| 扎鲁特旗| 彰化县| 叶城县| 南通市| 印江| 濮阳县| 扎兰屯市| 乐陵市| 共和县| 唐山市| 武平县| 鄢陵县| 绥阳县| 临汾市| 襄樊市| 虞城县| 广州市| 共和县| 库尔勒市| 高邑县| 临西县| 长武县| 鄯善县| 盐源县| 五原县| 肥城市| 磐石市| 济源市| 五华县| 台山市| 辉县市|