官术网_书友最值得收藏!

Chapter 4. Big Data – Advanced Analytics

In this chapter, we will deal with one of the biggest challenges of high-performance financial analytics and data management; that is, how to handle large datasets efficiently and flawlessly in R.

Our main objective is to give a practical introduction on how to access and manage large datasets in R. This chapter does not focus on any particular financial theorem, but it aims to give practical, hands-on examples to researchers and professionals on how to implement computationally - intensive analyses and models that leverage large datasets in the R environment.

In the first part of this chapter, we explained how to access data directly for multiple open sources. R offers various tools and options to load data into the R environment without any prior data-management requirements. This part of the chapter will guide you through practical examples on how to access data using the Quandl and qualtmod packages. The examples presented here will be a useful reference for the other chapters of this book. In the second part of this chapter, we will highlight the limitation of R to handle big data and show practical examples on how to load a large amount of data in R with the help of big memory and ff packages. We will also show how to perform essential statistical analyses, such as K-mean clustering and linear regression, using large datasets.

主站蜘蛛池模板: 三江| 雅江县| 罗城| 鲁山县| 富平县| 峨山| 唐河县| 瑞昌市| 宜章县| 广昌县| 淳安县| 鱼台县| 博罗县| 荣昌县| 资阳市| 南漳县| 金山区| 延边| 岳池县| 广南县| 巴林左旗| 增城市| 长汀县| 留坝县| 天峨县| 长岛县| 酒泉市| 乐清市| 枣阳市| 安康市| 喀喇沁旗| 田林县| 报价| 紫金县| 会泽县| 涟水县| 申扎县| 瓦房店市| 陕西省| 邢台市| 永春县|