官术网_书友最值得收藏!

Description of the dataset

A dataset from the Allstate Insurance company will be used, which consists of more than 300,000 examples with masked and anonymous data and consisting of more than 100 categorical and numerical attributes, thus being compliant with confidentiality constraints, more than enough for building and evaluating a variety of ML techniques.

The dataset is downloaded from the Kaggle website at https://www.kaggle.com/c/allstate-claims-severity/data. Each row in the dataset represents an insurance claim. Now, the task is to predict the value for the loss column. Variables prefaced with cat are categorical, while those prefaced with cont are continuous.

It is to be noted that the Allstate Corporation is the second largest insurance company in the United States, founded in 1931. We are trying to make the whole thing automated, to predict the cost, and hence the severity, of accident and damage claims.

主站蜘蛛池模板: 宣武区| 高平市| 江津市| 日照市| 金阳县| 宁波市| 杂多县| 会昌县| 苍梧县| 洱源县| 军事| 萝北县| 原阳县| 冷水江市| 酉阳| 台安县| 江华| 朝阳县| 启东市| 梁山县| 诸城市| 广水市| 许昌市| 万安县| 临海市| 界首市| 徐水县| 镇巴县| 江津市| 寿宁县| 阿拉善右旗| 获嘉县| 德清县| 呼伦贝尔市| 嘉荫县| 新乐市| 河北区| 宜宾市| 湖州市| 龙陵县| 金坛市|