官术网_书友最值得收藏!

Why open data?

Many books on machine learning use datasets that come with the language install (such as R or Hadoop) or point to public repositories that have considerable visibility in the data science community. The most common ones are Kaggle (especially the Titanic competition) and the UC Irvine's datasets. While these are great datasets and give a common denominator, this book will expose you to datasets that come from government entities. The notion of getting data from government and hacking for social good is typically called open data. I believe that open data will transform how the government interacts with its citizens and will make government entities more efficient and transparent. Therefore, we will use open datasets in this book and hopefully you will consider helping out with the open data movement.

主站蜘蛛池模板: 福鼎市| 福贡县| 盐亭县| 禄劝| 西昌市| 徐汇区| 南投市| 长垣县| 昂仁县| 柳州市| 康保县| 固镇县| 马山县| 东乡| 沙洋县| 咸宁市| 菏泽市| 武山县| 永和县| 临潭县| 噶尔县| 道孚县| 台山市| 来安县| 乐昌市| 微山县| 项城市| 昌乐县| 南平市| 广灵县| 海兴县| 葫芦岛市| 兴安县| 绥江县| 阜南县| 澄城县| 孟州市| 南川市| 府谷县| 吉首市| 泰州市|