官术网_书友最值得收藏!

Manipulating Data with the Pandas Library

In the next few portions of the book, we are going to get our hands dirty by building the various kinds of recommender systems that were introduced in chapter one. However, before we do so, it is important that we know how to handle, manipulate, and analyze data efficiently in Python.

The datasets we'll be working with will be several megabytes in size. Historically, Python has never been well-known for its speed of execution. Therefore, analyzing such huge amounts of data using vanilla Python and the built-in data structures it provides us is simply impossible.

In this chapter, we're going to get ourselves acquainted with the pandas library, which aims to overcome the aforementioned limitations, making data analysis in Python extremely efficient and user-friendly. We'll also introduce ourselves to the Movies Dataset that we're going to use to build our recommenders as well as use pandas to extract some interesting facts and narrate the history of movies using data.

Disclaimer:
If you are already familiar with the pandas library, you may skip this chapter and move on to the next, Building an IMDB Top 250 Clone with p andas.

主站蜘蛛池模板: 丁青县| 尤溪县| 锡林浩特市| 通州市| 麻栗坡县| 桃园县| 保德县| 阳城县| 顺义区| 汶川县| 周口市| 兴安盟| 疏勒县| 兴仁县| 翁源县| 桐柏县| 赤城县| 泾阳县| 广安市| 林甸县| 夹江县| 金平| 常宁市| 通河县| 阜康市| 桃江县| 阿拉善盟| 贞丰县| 黔南| 湖州市| 和政县| 炉霍县| 崇文区| 卢湾区| 樟树市| 全州县| 花垣县| 南木林县| 盱眙县| 巨野县| 湛江市|