官术网_书友最值得收藏!

Manipulating Data with the Pandas Library

In the next few portions of the book, we are going to get our hands dirty by building the various kinds of recommender systems that were introduced in chapter one. However, before we do so, it is important that we know how to handle, manipulate, and analyze data efficiently in Python.

The datasets we'll be working with will be several megabytes in size. Historically, Python has never been well-known for its speed of execution. Therefore, analyzing such huge amounts of data using vanilla Python and the built-in data structures it provides us is simply impossible.

In this chapter, we're going to get ourselves acquainted with the pandas library, which aims to overcome the aforementioned limitations, making data analysis in Python extremely efficient and user-friendly. We'll also introduce ourselves to the Movies Dataset that we're going to use to build our recommenders as well as use pandas to extract some interesting facts and narrate the history of movies using data.

Disclaimer:
If you are already familiar with the pandas library, you may skip this chapter and move on to the next, Building an IMDB Top 250 Clone with p andas.

主站蜘蛛池模板: 凤山市| 虎林市| 黔西县| 社旗县| 麦盖提县| 九江市| 仲巴县| 无锡市| 普安县| 门源| 缙云县| 昂仁县| 安康市| 民勤县| 新绛县| 茌平县| 石景山区| 依安县| 赣榆县| 青浦区| 大石桥市| 洞头县| 巨野县| 克山县| 阿克苏市| 阳曲县| 历史| 苗栗县| 历史| 大理市| 德保县| 江门市| 闽清县| 石景山区| 福贡县| 兰溪市| 彰化县| 龙口市| 台北市| 宁远县| 历史|