目錄(206章)
倒序
- Coverpage
- Title Page
- Copyright
- Building Data Streaming Applications with Apache Kafka
- Credits
- About the Authors
- About the Reviewer
- www.PacktPub.com
- Why subscribe?
- Customer Feedback
- Preface
- What this book covers
- What you need for this book
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Downloading the example code
- Downloading the color images of this book
- Errata
- Piracy
- Questions
- Introduction to Messaging Systems
- Understanding the principles of messaging systems
- Understanding messaging systems
- Peeking into a point-to-point messaging system
- Publish-subscribe messaging system
- Advance Queuing Messaging Protocol
- Using messaging systems in big data streaming applications
- Summary
- Introducing Kafka the Distributed Messaging Platform
- Kafka origins
- Kafka's architecture
- Message topics
- Message partitions
- Replication and replicated logs
- Message producers
- Message consumers
- Role of Zookeeper
- Summary
- Deep Dive into Kafka Producers
- Kafka producer internals
- Kafka Producer APIs
- Producer object and ProducerRecord object
- Custom partition
- Additional producer configuration
- Java Kafka producer example
- Common messaging publishing patterns
- Best practices
- Summary
- Deep Dive into Kafka Consumers
- Kafka consumer internals
- Understanding the responsibilities of Kafka consumers
- Kafka consumer APIs
- Consumer configuration
- Subscription and polling
- Committing and polling
- Additional configuration
- Java Kafka consumer
- Scala Kafka consumer
- Rebalance listeners
- Common message consuming patterns
- Best practices
- Summary
- Building Spark Streaming Applications with Kafka
- Introduction to Spark
- Spark architecture
- Pillars of Spark
- The Spark ecosystem
- Spark Streaming
- Receiver-based integration
- Disadvantages of receiver-based approach
- Java example for receiver-based integration
- Scala example for receiver-based integration
- Direct approach
- Java example for direct approach
- Scala example for direct approach
- Use case log processing - fraud IP detection
- Maven
- Producer
- Property reader
- Producer code
- Fraud IP lookup
- Expose hive table
- Streaming code
- Summary
- Building Storm Applications with Kafka
- Introduction to Apache Storm
- Storm cluster architecture
- The concept of a Storm application
- Introduction to Apache Heron
- Heron architecture
- Heron topology architecture
- Integrating Apache Kafka with Apache Storm - Java
- Example
- Integrating Apache Kafka with Apache Storm - Scala
- Use case – log processing in Storm Kafka Hive
- Producer
- Producer code
- Fraud IP lookup
- Storm application
- Running the project
- Summary
- Using Kafka with Confluent Platform
- Introduction to Confluent Platform
- Deep driving into Confluent architecture
- Understanding Kafka Connect and Kafka Stream
- Kafka Streams
- Playing with Avro using Schema Registry
- Moving Kafka data to HDFS
- Camus
- Running Camus
- Gobblin
- Gobblin architecture
- Kafka Connect
- Flume
- Summary
- Building ETL Pipelines Using Kafka
- Considerations for using Kafka in ETL pipelines
- Introducing Kafka Connect
- Deep dive into Kafka Connect
- Introductory examples of using Kafka Connect
- Kafka Connect common use cases
- Summary
- Building Streaming Applications Using Kafka Streams
- Introduction to Kafka Streams
- Using Kafka in Stream processing
- Kafka Stream - lightweight Stream processing library
- Kafka Stream architecture
- Integrated framework advantages
- Understanding tables and Streams together
- Maven dependency
- Kafka Stream word count
- KTable
- Use case example of Kafka Streams
- Maven dependency of Kafka Streams
- Property reader
- IP record producer
- IP lookup service
- Fraud detection application
- Summary
- Kafka Cluster Deployment
- Kafka cluster internals
- Role of Zookeeper
- Replication
- Metadata request processing
- Producer request processing
- Consumer request processing
- Capacity planning
- Capacity planning goals
- Replication factor
- Memory
- Hard drives
- Network
- CPU
- Single cluster deployment
- Multicluster deployment
- Decommissioning brokers
- Data migration
- Summary
- Using Kafka in Big Data Applications
- Managing high volumes in Kafka
- Appropriate hardware choices
- Producer read and consumer write choices
- Kafka message delivery semantics
- At least once delivery
- At most once delivery
- Exactly once delivery
- Big data and Kafka common usage patterns
- Kafka and data governance
- Alerting and monitoring
- Useful Kafka matrices
- Producer matrices
- Broker matrices
- Consumer metrics
- Summary
- Securing Kafka
- An overview of securing Kafka
- Wire encryption using SSL
- Steps to enable SSL in Kafka
- Configuring SSL for Kafka Broker
- Configuring SSL for Kafka clients
- Kerberos SASL for authentication
- Steps to enable SASL/GSSAPI - in Kafka
- Configuring SASL for Kafka broker
- Configuring SASL for Kafka client - producer and consumer
- Understanding ACL and authorization
- Common ACL operations
- List ACLs
- Understanding Zookeeper authentication
- Apache Ranger for authorization
- Adding Kafka Service to Ranger
- Adding policies
- Best practices
- Summary
- Streaming Application Design Considerations
- Latency and throughput
- Data and state persistence
- Data sources
- External data lookups
- Data formats
- Data serialization
- Level of parallelism
- Out-of-order events
- Message processing semantics
- Summary 更新時間:2022-07-12 10:38:49
推薦閱讀
- 精通Nginx(第2版)
- Building Modern Web Applications Using Angular
- Asynchronous Android Programming(Second Edition)
- Creating Stunning Dashboards with QlikView
- Web性能實戰
- Learning Hadoop 2
- 從零開始:UI圖標設計與制作(第3版)
- Clojure for Java Developers
- 分布式架構原理與實踐
- Getting Started with Electronic Projects
- 算法訓練營:海量圖解+競賽刷題(入門篇)
- 編寫高質量代碼之Java(套裝共2冊)
- Java EE框架開發技術與案例教程
- Python程序設計教程
- Scratch少兒編程思維訓練:培養孩子的邏輯思維和計算思維能力
- AVR單片機C語言非常入門與視頻演練
- 來吧,一起創客
- 常用工具軟件(第4版)
- PHP程序設計
- Elasticsearch技術解析與實戰
- Visualforce Development Cookbook
- Heroku Cookbook
- Python 3 程序設計
- Team Foundation Server 2015 Customization
- Excel VBA程序開發自學寶典
- Python全棧開發:高階編程
- 深入理解Java虛擬機:JVM高級特性與最佳實踐(第2版)
- Python數據分析從入門到精通
- Building a Web Application with PHP and MariaDB:A Reference Guide
- Mastering Machine Learning with scikit-learn(Second Edition)