- Matplotlib for Python Developers
- Aldrin Yim Claire Chung Allen Yu
- 240字
- 2021-08-27 18:48:19
pandas DataFrame
You may often see df appearing on Python-based data science resources and literature. It is a conventional way to denote the pandas DataFrame structure. pandas lets us perform the otherwise tedious operations on tables (data frames) with simple commands, such as dropna(), merge(), pivot(), and set_index().
pandas is designed to streamline handling processes of common data types, such as time series. While NumPy is more specialized in mathematical calculations, pandas has built-in string manipulation functions and also allows custom functions to be applied to each cell via apply().
Before use, we import the module with the conventional shorthand as:
pd.DataFrame(my_list_or_array)
To read data from existing files, just use the following:
pd.read_csv()
For tab-delimited files, just add '\t' as the separator:
pd.read_csv(sep='\t')
pandas supports data import from a wide range of common file structures for data handling and processing, from pd.read_xlsx() for Excel and pd.read_sql_query() for SQL databases to the more recently popular JSON, HDF5, and Google BigQuery.
pandas provides a collection of handy operations for data manipulation and is considered a must-have in a Python data scientist's or developer's toolbox.
To fully understand and utilize the functionalities, you may want to read more from the official documentation:
- Building a RESTful Web Service with Spring
- VSTO開發(fā)入門教程
- Python數(shù)據(jù)分析從0到1
- 響應(yīng)式架構(gòu):消息模式Actor實現(xiàn)與Scala、Akka應(yīng)用集成
- UNIX Linux程序設(shè)計教程
- JavaScript腳本特效編程給力起飛
- Python 快速入門(第3版)
- PHP Microservices
- Implementing Domain:Specific Languages with Xtext and Xtend
- C語言從入門到精通(第5版)
- Learning Azure DocumentDB
- Performance Testing with JMeter 3(Third Edition)
- MongoDB進(jìn)階與實戰(zhàn):微服務(wù)整合、性能優(yōu)化、架構(gòu)管理
- Mastering Android NDK
- F# for Machine Learning Essentials