Manipulating Data with the Pandas Library

In the next few portions of the book, we are going to get our hands dirty by building the various kinds of recommender systems that were introduced in chapter one. However, before we do so, it is important that we know how to handle, manipulate, and analyze data efficiently in Python.

The datasets we'll be working with will be several megabytes in size. Python has never been known for its speed of execution, so analyzing data at this scale using vanilla Python and the built-in data structures it provides quickly becomes slow and impractical.

In this chapter, we're going to get ourselves acquainted with the pandas library, which aims to overcome the aforementioned limitations, making data analysis in Python extremely efficient and user-friendly. We'll also introduce ourselves to the Movies Dataset that we're going to use to build our recommenders as well as use pandas to extract some interesting facts and narrate the history of movies using data.

Disclaimer:
If you are already familiar with the pandas library, you may skip this chapter and move on to the next, Building an IMDB Top 250 Clone with pandas.
