舉報(bào)

會(huì)員
Mastering Hadoop
最新章節(jié):
Index
DoyouwanttobroadenyourHadoopskillsetandtakeyourknowledgetothenextlevel?DoyouwishtoenhanceyourknowledgeofHadooptosolvechallengingdataprocessingproblems?AreyourHadoopjobs,Pigscripts,orHivequeriesnotworkingasfastasyouintend?AreyoulookingtounderstandthebenefitsofupgradingHadoop?Iftheanswerisyestoanyofthese,thisbookisforyou.Itassumesnovice-levelfamiliaritywithHadoop.
目錄(105章)
倒序
- coverpage
- Mastering Hadoop
- Credits
- About the Author
- Acknowledgments
- About the Reviewers
- www.PacktPub.com
- Support files eBooks discount offers and more
- Preface
- What this book covers
- What you need for this book?
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Hadoop 2.X
- The inception of Hadoop
- The evolution of Hadoop
- Hadoop 2.X
- Hadoop distributions
- Summary
- Chapter 2. Advanced MapReduce
- MapReduce input
- The RecordReader class
- Hadoop's "small files" problem
- Filtering inputs
- The Map task
- The Reduce task
- MapReduce output
- MapReduce job counters
- Handling data joins
- Summary
- Chapter 3. Advanced Pig
- Pig versus SQL
- Different modes of execution
- Complex data types in Pig
- Compiling Pig scripts
- Development and debugging aids
- The advanced Pig operators
- User-defined functions
- Pig performance optimizations
- Best practices
- Summary
- Chapter 4. Advanced Hive
- The Hive architecture
- Data types
- File formats
- The data model
- Hive query optimizers
- Advanced DML
- UDF UDAF and UDTF
- Summary
- Chapter 5. Serialization and Hadoop I/O
- Data serialization in Hadoop
- Avro serialization
- File formats
- Compression
- Summary
- Chapter 6. YARN – Bringing Other Paradigms to Hadoop
- The YARN architecture
- Developing YARN applications
- Monitoring YARN
- Job scheduling in YARN
- YARN commands
- Summary
- Chapter 7. Storm on YARN – Low Latency Processing in Hadoop
- Batch processing versus streaming
- Apache Storm
- Storm on YARN
- Summary
- Chapter 8. Hadoop on the Cloud
- Cloud computing characteristics
- Hadoop on the cloud
- Amazon Elastic MapReduce (EMR)
- Summary
- Chapter 9. HDFS Replacements
- HDFS – advantages and drawbacks
- Amazon AWS S3
- Implementing a filesystem in Hadoop
- Implementing an S3 native filesystem in Hadoop
- Summary
- Chapter 10. HDFS Federation
- Limitations of the older HDFS architecture
- Architecture of HDFS Federation
- HDFS high availability
- HDFS block placement
- Summary
- Chapter 11. Hadoop Security
- The security pillars
- Authentication in Hadoop
- Authorization in Hadoop
- Data confidentiality in Hadoop
- Audit logging in Hadoop
- Summary
- Chapter 12. Analytics Using Hadoop
- Data analytics workflow
- Machine learning
- Apache Mahout
- Document analysis using Hadoop and Mahout
- RHadoop
- Summary
- Appendix A. Hadoop for Microsoft Windows
- Deploying Hadoop on Microsoft Windows
- Summary
- Index 更新時(shí)間:2021-08-06 19:53:18
推薦閱讀
- Big Data Analytics with Hadoop 3
- Instant Raspberry Pi Gaming
- Seven NoSQL Databases in a Week
- Machine Learning for Cybersecurity Cookbook
- R Machine Learning By Example
- 工業(yè)機(jī)器人現(xiàn)場(chǎng)編程(FANUC)
- 大數(shù)據(jù)技術(shù)與應(yīng)用
- Excel 2007技巧大全
- 網(wǎng)中之我:何明升網(wǎng)絡(luò)社會(huì)論稿
- 突破,Objective-C開發(fā)速學(xué)手冊(cè)
- 空間站多臂機(jī)器人運(yùn)動(dòng)控制研究
- Citrix? XenDesktop? 7 Cookbook
- SQL Server數(shù)據(jù)庫(kù)應(yīng)用基礎(chǔ)(第2版)
- 會(huì)聲會(huì)影X4中文版從入門到精通
- Spark大數(shù)據(jù)商業(yè)實(shí)戰(zhàn)三部曲:內(nèi)核解密|商業(yè)案例|性能調(diào)優(yōu)
- MPC5554/5553微處理器揭秘
- Python文本分析
- Visual Basic項(xiàng)目開發(fā)案例精粹
- 手把手教你學(xué)Photoshop CS3
- PostgreSQL High Performance Cookbook
- 數(shù)據(jù)庫(kù)技術(shù):Access 2003計(jì)算機(jī)網(wǎng)絡(luò)技術(shù)
- 傳感器技術(shù)及實(shí)訓(xùn)(第2版)
- Hands-On Neural Networks with TensorFlow 2.0
- 從零開始學(xué)Visual C++
- 工業(yè)自動(dòng)化儀器儀表與裝置修理工
- Mastering BeagleBone Robotics
- OpenStack Bootcamp
- 排爆機(jī)器人的研究與開發(fā)
- Photoshop CS3中文版圖像處理與創(chuàng)意設(shè)計(jì)
- Extending SaltStack