舉報(bào)

會(huì)員
Hadoop 2.x Administration Cookbook
最新章節(jié):
Index
IfyouareasystemadministratorwithabasicunderstandingofHadoopandyouwanttogetintoHadoopadministration,thisbookisforyou.It’salsoidealifyouareaHadoopadministratorwhowantsaquickreferenceguidetoalltheHadoopadministration-relatedtasksandsolutionstocommonlyoccurringproblems
目錄(155章)
倒序
- coverpage
- Hadoop 2.x Administration Cookbook
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- eBooks discount offers and more
- Customer Feedback
- Preface
- What this book covers
- What you need for this book
- Who this book is for
- Sections
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Hadoop Architecture and Deployment
- Introduction
- Building and compiling Hadoop
- Installation methods
- Setting up host resolution
- Installing a single-node cluster - HDFS components
- Installing a single-node cluster - YARN components
- Installing a multi-node cluster
- Configuring the Hadoop Gateway node
- Decommissioning nodes
- Adding nodes to the cluster
- Chapter 2. Maintaining Hadoop Cluster HDFS
- Introduction
- Configuring HDFS block size
- Setting up Namenode metadata location
- Loading data in HDFS
- Configuring HDFS replication
- HDFS balancer
- Quota configuration
- HDFS health and FSCK
- Configuring rack awareness
- Recycle or trash bin configuration
- Distcp usage
- Control block report storm
- Configuring Datanode heartbeat
- Chapter 3. Maintaining Hadoop Cluster – YARN and MapReduce
- Introduction
- Running a simple MapReduce program
- Hadoop streaming
- Configuring YARN history server
- Job history web interface and metrics
- Configuring ResourceManager components
- YARN containers and resource allocations
- ResourceManager Web UI and JMX metrics
- Preserving ResourceManager states
- Chapter 4. High Availability
- Introduction
- Namenode HA using shared storage
- ZooKeeper configuration
- Namenode HA using Journal node
- Resourcemanager HA using ZooKeeper
- Rolling upgrade with HA
- Configure shared cache manager
- Configure HDFS cache
- HDFS snapshots
- Configuring storage based policies
- Configuring HA for Edge nodes
- Chapter 5. Schedulers
- Introduction
- Configuring users and groups
- Fair Scheduler configuration
- Fair Scheduler pools
- Configuring job queues
- Job queue ACLs
- Configuring Capacity Scheduler
- Queuing mappings in Capacity Scheduler
- YARN and Mapred commands
- YARN label-based scheduling
- YARN SLS
- Chapter 6. Backup and Recovery
- Introduction
- Initiating Namenode saveNamespace
- Using HDFS Image Viewer
- Fetching parameters which are in-effect
- Configuring HDFS and YARN logs
- Backing up and recovering Namenode
- Configuring Secondary Namenode
- Promoting Secondary Namenode to Primary
- Namenode recovery
- Namenode roll edits – online mode
- Namenode roll edits – offline mode
- Datanode recovery – disk full
- Configuring NFS gateway to serve HDFS
- Recovering deleted files
- Chapter 7. Data Ingestion and Workflow
- Introduction
- Hive server modes and setup
- Using MySQL for Hive metastore
- Operating Hive with ZooKeeper
- Loading data into Hive
- Partitioning and Bucketing in Hive
- Hive metastore database
- Designing Hive with credential store
- Configuring Flume
- Configure Oozie and workflows
- Chapter 8. Performance Tuning
- Tuning the operating system
- Tuning the disk
- Tuning the network
- Tuning HDFS
- Tuning Namenode
- Tuning Datanode
- Configuring YARN for performance
- Configuring MapReduce for performance
- Hive performance tuning
- Benchmarking Hadoop cluster
- Chapter 9. HBase Administration
- Introduction
- Setting up single node HBase cluster
- Setting up multi-node HBase cluster
- Inserting data into HBase
- Integration with Hive
- HBase administration commands
- HBase backup and restore
- Tuning HBase
- HBase upgrade
- Migrating data from MySQL to HBase using Sqoop
- Chapter 10. Cluster Planning
- Introduction
- Disk space calculations
- Nodes needed in the cluster
- Memory requirements
- Sizing the cluster as per SLA
- Network design
- Estimating the cost of the Hadoop cluster
- Hardware and software options
- Chapter 11. Troubleshooting Diagnostics and Best Practices
- Introduction
- Namenode troubleshooting
- Datanode troubleshooting
- Resourcemanager troubleshooting
- Diagnose communication issues
- Parse logs for errors
- Hive troubleshooting
- HBase troubleshooting
- Hadoop best practices
- Chapter 12. Security
- Introduction
- Encrypting disk using LUKS
- Configuring Hadoop users
- HDFS encryption at Rest
- Configuring SSL in Hadoop
- In-transit encryption
- Enabling service level authorization
- Securing ZooKeeper
- Configuring auditing
- Configuring Kerberos server
- Configuring and enabling Kerberos for Hadoop
- Index 更新時(shí)間:2021-07-09 20:11:08
推薦閱讀
- 大學(xué)計(jì)算機(jī)信息技術(shù)導(dǎo)論
- 構(gòu)建高質(zhì)量的C#代碼
- 2018西門子工業(yè)專家會(huì)議論文集(上)
- IoT Penetration Testing Cookbook
- 21天學(xué)通C++
- Java Web整合開發(fā)全程指南
- Prometheus監(jiān)控實(shí)戰(zhàn)
- 教育機(jī)器人的風(fēng)口:全球發(fā)展現(xiàn)狀及趨勢(shì)
- Unity Multiplayer Games
- 所羅門的密碼
- 電氣控制與PLC原理及應(yīng)用(歐姆龍機(jī)型)
- Flink原理與實(shí)踐
- Mastering Text Mining with R
- 人工智能:智能人機(jī)交互
- 電腦故障排除與維護(hù)終極技巧金典
- 網(wǎng)絡(luò)信息安全項(xiàng)目教程
- ROS Robotics By Example(Second Edition)
- ARM嵌入式系統(tǒng)開發(fā)完全入門與主流實(shí)踐
- 工程地質(zhì)地學(xué)信息遙感自動(dòng)提取技術(shù)
- 工業(yè)控制系統(tǒng)安全
- 人工智能基礎(chǔ)教程:Python篇(青少版)
- SQL Server 2017 Machine Learning Services with R
- 隨機(jī)分布控制系統(tǒng)的故障診斷與容錯(cuò)控制
- 筆記本電腦維修實(shí)用教程
- Mastering Docker Enterprise
- WordPress for Education
- Hands-On Cybersecurity for Architects
- 機(jī)器學(xué)習(xí)技術(shù)及應(yīng)用
- Apache Tomcat 7 Essentials
- Flex 3開發(fā)實(shí)踐