官术网_书友最值得收藏!

Why open data?

Many books on machine learning use datasets that come with the language install (such as R or Hadoop) or point to public repositories that have considerable visibility in the data science community. The most common ones are Kaggle (especially the Titanic competition) and the UC Irvine's datasets. While these are great datasets and give a common denominator, this book will expose you to datasets that come from government entities. The notion of getting data from government and hacking for social good is typically called open data. I believe that open data will transform how the government interacts with its citizens and will make government entities more efficient and transparent. Therefore, we will use open datasets in this book and hopefully you will consider helping out with the open data movement.

主站蜘蛛池模板: 福州市| 乌兰县| 南部县| 金寨县| 玛沁县| 聂拉木县| 岚皋县| 礼泉县| 清镇市| 张家界市| 翁源县| 云和县| 图木舒克市| 九江市| 武陟县| 长岛县| 石渠县| 明星| 大石桥市| 马公市| 南乐县| 京山县| 弋阳县| 涟源市| 镇沅| 客服| 都安| 中西区| 伊宁县| 雅江县| 丘北县| 保靖县| 安徽省| 河津市| 桐梓县| 郎溪县| 元江| 金华市| 禄劝| 泸州市| 镇江市|