舉報

會員
Hadoop 2.x Administration Cookbook
Gurmukh Singh 著
更新時間:2021-07-09 20:11:08
開會員,本書免費讀 >
最新章節(jié):
Index
IfyouareasystemadministratorwithabasicunderstandingofHadoopandyouwanttogetintoHadoopadministration,thisbookisforyou.It’salsoidealifyouareaHadoopadministratorwhowantsaquickreferenceguidetoalltheHadoopadministration-relatedtasksandsolutionstocommonlyoccurringproblems
最新章節(jié)
- Index
- Configuring and enabling Kerberos for Hadoop
- Configuring Kerberos server
- Configuring auditing
- Securing ZooKeeper
- Enabling service level authorization
品牌:中圖公司
上架時間:2021-07-09 18:23:37
出版社:Packt Publishing
本書數(shù)字版權由中圖公司提供,并由其授權上海閱文信息技術有限公司制作發(fā)行
- Index 更新時間:2021-07-09 20:11:08
- Configuring and enabling Kerberos for Hadoop
- Configuring Kerberos server
- Configuring auditing
- Securing ZooKeeper
- Enabling service level authorization
- In-transit encryption
- Configuring SSL in Hadoop
- HDFS encryption at Rest
- Configuring Hadoop users
- Encrypting disk using LUKS
- Introduction
- Chapter 12. Security
- Hadoop best practices
- HBase troubleshooting
- Hive troubleshooting
- Parse logs for errors
- Diagnose communication issues
- Resourcemanager troubleshooting
- Datanode troubleshooting
- Namenode troubleshooting
- Introduction
- Chapter 11. Troubleshooting Diagnostics and Best Practices
- Hardware and software options
- Estimating the cost of the Hadoop cluster
- Network design
- Sizing the cluster as per SLA
- Memory requirements
- Nodes needed in the cluster
- Disk space calculations
- Introduction
- Chapter 10. Cluster Planning
- Migrating data from MySQL to HBase using Sqoop
- HBase upgrade
- Tuning HBase
- HBase backup and restore
- HBase administration commands
- Integration with Hive
- Inserting data into HBase
- Setting up multi-node HBase cluster
- Setting up single node HBase cluster
- Introduction
- Chapter 9. HBase Administration
- Benchmarking Hadoop cluster
- Hive performance tuning
- Configuring MapReduce for performance
- Configuring YARN for performance
- Tuning Datanode
- Tuning Namenode
- Tuning HDFS
- Tuning the network
- Tuning the disk
- Tuning the operating system
- Chapter 8. Performance Tuning
- Configure Oozie and workflows
- Configuring Flume
- Designing Hive with credential store
- Hive metastore database
- Partitioning and Bucketing in Hive
- Loading data into Hive
- Operating Hive with ZooKeeper
- Using MySQL for Hive metastore
- Hive server modes and setup
- Introduction
- Chapter 7. Data Ingestion and Workflow
- Recovering deleted files
- Configuring NFS gateway to serve HDFS
- Datanode recovery – disk full
- Namenode roll edits – offline mode
- Namenode roll edits – online mode
- Namenode recovery
- Promoting Secondary Namenode to Primary
- Configuring Secondary Namenode
- Backing up and recovering Namenode
- Configuring HDFS and YARN logs
- Fetching parameters which are in-effect
- Using HDFS Image Viewer
- Initiating Namenode saveNamespace
- Introduction
- Chapter 6. Backup and Recovery
- YARN SLS
- YARN label-based scheduling
- YARN and Mapred commands
- Queuing mappings in Capacity Scheduler
- Configuring Capacity Scheduler
- Job queue ACLs
- Configuring job queues
- Fair Scheduler pools
- Fair Scheduler configuration
- Configuring users and groups
- Introduction
- Chapter 5. Schedulers
- Configuring HA for Edge nodes
- Configuring storage based policies
- HDFS snapshots
- Configure HDFS cache
- Configure shared cache manager
- Rolling upgrade with HA
- Resourcemanager HA using ZooKeeper
- Namenode HA using Journal node
- ZooKeeper configuration
- Namenode HA using shared storage
- Introduction
- Chapter 4. High Availability
- Preserving ResourceManager states
- ResourceManager Web UI and JMX metrics
- YARN containers and resource allocations
- Configuring ResourceManager components
- Job history web interface and metrics
- Configuring YARN history server
- Hadoop streaming
- Running a simple MapReduce program
- Introduction
- Chapter 3. Maintaining Hadoop Cluster – YARN and MapReduce
- Configuring Datanode heartbeat
- Control block report storm
- Distcp usage
- Recycle or trash bin configuration
- Configuring rack awareness
- HDFS health and FSCK
- Quota configuration
- HDFS balancer
- Configuring HDFS replication
- Loading data in HDFS
- Setting up Namenode metadata location
- Configuring HDFS block size
- Introduction
- Chapter 2. Maintaining Hadoop Cluster HDFS
- Adding nodes to the cluster
- Decommissioning nodes
- Configuring the Hadoop Gateway node
- Installing a multi-node cluster
- Installing a single-node cluster - YARN components
- Installing a single-node cluster - HDFS components
- Setting up host resolution
- Installation methods
- Building and compiling Hadoop
- Introduction
- Chapter 1. Hadoop Architecture and Deployment
- Customer support
- Reader feedback
- Conventions
- Sections
- Who this book is for
- What you need for this book
- What this book covers
- Preface
- Customer Feedback
- eBooks discount offers and more
- www.PacktPub.com
- About the Reviewers
- About the Author
- Credits
- Hadoop 2.x Administration Cookbook
- coverpage
- coverpage
- Hadoop 2.x Administration Cookbook
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- eBooks discount offers and more
- Customer Feedback
- Preface
- What this book covers
- What you need for this book
- Who this book is for
- Sections
- Conventions
- Reader feedback
- Customer support
- Chapter 1. Hadoop Architecture and Deployment
- Introduction
- Building and compiling Hadoop
- Installation methods
- Setting up host resolution
- Installing a single-node cluster - HDFS components
- Installing a single-node cluster - YARN components
- Installing a multi-node cluster
- Configuring the Hadoop Gateway node
- Decommissioning nodes
- Adding nodes to the cluster
- Chapter 2. Maintaining Hadoop Cluster HDFS
- Introduction
- Configuring HDFS block size
- Setting up Namenode metadata location
- Loading data in HDFS
- Configuring HDFS replication
- HDFS balancer
- Quota configuration
- HDFS health and FSCK
- Configuring rack awareness
- Recycle or trash bin configuration
- Distcp usage
- Control block report storm
- Configuring Datanode heartbeat
- Chapter 3. Maintaining Hadoop Cluster – YARN and MapReduce
- Introduction
- Running a simple MapReduce program
- Hadoop streaming
- Configuring YARN history server
- Job history web interface and metrics
- Configuring ResourceManager components
- YARN containers and resource allocations
- ResourceManager Web UI and JMX metrics
- Preserving ResourceManager states
- Chapter 4. High Availability
- Introduction
- Namenode HA using shared storage
- ZooKeeper configuration
- Namenode HA using Journal node
- Resourcemanager HA using ZooKeeper
- Rolling upgrade with HA
- Configure shared cache manager
- Configure HDFS cache
- HDFS snapshots
- Configuring storage based policies
- Configuring HA for Edge nodes
- Chapter 5. Schedulers
- Introduction
- Configuring users and groups
- Fair Scheduler configuration
- Fair Scheduler pools
- Configuring job queues
- Job queue ACLs
- Configuring Capacity Scheduler
- Queuing mappings in Capacity Scheduler
- YARN and Mapred commands
- YARN label-based scheduling
- YARN SLS
- Chapter 6. Backup and Recovery
- Introduction
- Initiating Namenode saveNamespace
- Using HDFS Image Viewer
- Fetching parameters which are in-effect
- Configuring HDFS and YARN logs
- Backing up and recovering Namenode
- Configuring Secondary Namenode
- Promoting Secondary Namenode to Primary
- Namenode recovery
- Namenode roll edits – online mode
- Namenode roll edits – offline mode
- Datanode recovery – disk full
- Configuring NFS gateway to serve HDFS
- Recovering deleted files
- Chapter 7. Data Ingestion and Workflow
- Introduction
- Hive server modes and setup
- Using MySQL for Hive metastore
- Operating Hive with ZooKeeper
- Loading data into Hive
- Partitioning and Bucketing in Hive
- Hive metastore database
- Designing Hive with credential store
- Configuring Flume
- Configure Oozie and workflows
- Chapter 8. Performance Tuning
- Tuning the operating system
- Tuning the disk
- Tuning the network
- Tuning HDFS
- Tuning Namenode
- Tuning Datanode
- Configuring YARN for performance
- Configuring MapReduce for performance
- Hive performance tuning
- Benchmarking Hadoop cluster
- Chapter 9. HBase Administration
- Introduction
- Setting up single node HBase cluster
- Setting up multi-node HBase cluster
- Inserting data into HBase
- Integration with Hive
- HBase administration commands
- HBase backup and restore
- Tuning HBase
- HBase upgrade
- Migrating data from MySQL to HBase using Sqoop
- Chapter 10. Cluster Planning
- Introduction
- Disk space calculations
- Nodes needed in the cluster
- Memory requirements
- Sizing the cluster as per SLA
- Network design
- Estimating the cost of the Hadoop cluster
- Hardware and software options
- Chapter 11. Troubleshooting Diagnostics and Best Practices
- Introduction
- Namenode troubleshooting
- Datanode troubleshooting
- Resourcemanager troubleshooting
- Diagnose communication issues
- Parse logs for errors
- Hive troubleshooting
- HBase troubleshooting
- Hadoop best practices
- Chapter 12. Security
- Introduction
- Encrypting disk using LUKS
- Configuring Hadoop users
- HDFS encryption at Rest
- Configuring SSL in Hadoop
- In-transit encryption
- Enabling service level authorization
- Securing ZooKeeper
- Configuring auditing
- Configuring Kerberos server
- Configuring and enabling Kerberos for Hadoop
- Index 更新時間:2021-07-09 20:11:08