舉報

會員
Mastering Hadoop
最新章節(jié):
Index
DoyouwanttobroadenyourHadoopskillsetandtakeyourknowledgetothenextlevel?DoyouwishtoenhanceyourknowledgeofHadooptosolvechallengingdataprocessingproblems?AreyourHadoopjobs,Pigscripts,orHivequeriesnotworkingasfastasyouintend?AreyoulookingtounderstandthebenefitsofupgradingHadoop?Iftheanswerisyestoanyofthese,thisbookisforyou.Itassumesnovice-levelfamiliaritywithHadoop.
目錄(105章)
倒序
- coverpage
- Mastering Hadoop
- Credits
- About the Author
- Acknowledgments
- About the Reviewers
- www.PacktPub.com
- Support files eBooks discount offers and more
- Preface
- What this book covers
- What you need for this book?
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Hadoop 2.X
- The inception of Hadoop
- The evolution of Hadoop
- Hadoop 2.X
- Hadoop distributions
- Summary
- Chapter 2. Advanced MapReduce
- MapReduce input
- The RecordReader class
- Hadoop's "small files" problem
- Filtering inputs
- The Map task
- The Reduce task
- MapReduce output
- MapReduce job counters
- Handling data joins
- Summary
- Chapter 3. Advanced Pig
- Pig versus SQL
- Different modes of execution
- Complex data types in Pig
- Compiling Pig scripts
- Development and debugging aids
- The advanced Pig operators
- User-defined functions
- Pig performance optimizations
- Best practices
- Summary
- Chapter 4. Advanced Hive
- The Hive architecture
- Data types
- File formats
- The data model
- Hive query optimizers
- Advanced DML
- UDF UDAF and UDTF
- Summary
- Chapter 5. Serialization and Hadoop I/O
- Data serialization in Hadoop
- Avro serialization
- File formats
- Compression
- Summary
- Chapter 6. YARN – Bringing Other Paradigms to Hadoop
- The YARN architecture
- Developing YARN applications
- Monitoring YARN
- Job scheduling in YARN
- YARN commands
- Summary
- Chapter 7. Storm on YARN – Low Latency Processing in Hadoop
- Batch processing versus streaming
- Apache Storm
- Storm on YARN
- Summary
- Chapter 8. Hadoop on the Cloud
- Cloud computing characteristics
- Hadoop on the cloud
- Amazon Elastic MapReduce (EMR)
- Summary
- Chapter 9. HDFS Replacements
- HDFS – advantages and drawbacks
- Amazon AWS S3
- Implementing a filesystem in Hadoop
- Implementing an S3 native filesystem in Hadoop
- Summary
- Chapter 10. HDFS Federation
- Limitations of the older HDFS architecture
- Architecture of HDFS Federation
- HDFS high availability
- HDFS block placement
- Summary
- Chapter 11. Hadoop Security
- The security pillars
- Authentication in Hadoop
- Authorization in Hadoop
- Data confidentiality in Hadoop
- Audit logging in Hadoop
- Summary
- Chapter 12. Analytics Using Hadoop
- Data analytics workflow
- Machine learning
- Apache Mahout
- Document analysis using Hadoop and Mahout
- RHadoop
- Summary
- Appendix A. Hadoop for Microsoft Windows
- Deploying Hadoop on Microsoft Windows
- Summary
- Index 更新時間:2021-08-06 19:53:18
推薦閱讀
- Microsoft Dynamics CRM Customization Essentials
- Oracle SOA Governance 11g Implementation
- PowerShell 3.0 Advanced Administration Handbook
- 教父母學(xué)會上網(wǎng)
- 機(jī)艙監(jiān)測與主機(jī)遙控
- 城市道路交通主動控制技術(shù)
- VB語言程序設(shè)計
- CompTIA Network+ Certification Guide
- Implementing Splunk 7(Third Edition)
- 網(wǎng)絡(luò)安全與防護(hù)
- 貫通Java Web開發(fā)三劍客
- 突破,Objective-C開發(fā)速學(xué)手冊
- Machine Learning with the Elastic Stack
- 悟透AutoCAD 2009案例自學(xué)手冊
- 教育機(jī)器人的風(fēng)口:全球發(fā)展現(xiàn)狀及趨勢
- Excel 2007終極技巧金典
- Learning Linux Shell Scripting
- 算法設(shè)計與分析
- 人工智能云平臺:原理、設(shè)計與應(yīng)用
- Hands-On Generative Adversarial Networks with Keras
- 實(shí)戰(zhàn)突擊
- 模式:工程化實(shí)現(xiàn)及擴(kuò)展(設(shè)計模式Java 版)
- 案例解說Visual C++典型控制應(yīng)用
- 人工智能初探1
- Windows XP操作系統(tǒng)考前12小時
- 機(jī)器學(xué)習(xí)技術(shù)及應(yīng)用
- 巧學(xué)活用打印機(jī)維護(hù)
- UG NX 8.0中文版從入門到精通
- 劍指Offer
- Flash動畫設(shè)計