Mastering Hadoop 3
ApacheHadoopisoneofthemostpopularbigdatasolutionsfordistributedstorageandforprocessinglargechunksofdata.WithHadoop3,Apachepromisestoprovideahigh-performance,morefault-tolerant,andhighlyefficientbigdataprocessingplatform,withafocusonimprovedscalabilityandincreasedefficiency.Withthisguide,you’llunderstandadvancedconceptsoftheHadoopecosystemtool.You’lllearnhowHadoopworksinternally,studyadvancedconceptsofdifferentecosystemtools,discoversolutionstoreal-worldusecases,andunderstandhowtosecureyourcluster.ItwillthenwalkyouthroughHDFS,YARN,MapReduce,andHadoop3concepts.You’llbeabletoaddresscommonchallengeslikeusingKafkaefficiently,designinglowlatency,reliablemessagedeliveryKafkasystems,andhandlinghighdatavolumes.Asyouadvance,you’lldiscoverhowtoaddressmajorchallengeswhenbuildinganenterprise-grademessagingsystem,andhowtousedifferentstreamprocessingsystemsalongwithKafkatofulfilyourenterprisegoals.Bytheendofthisbook,you’llhaveacompleteunderstandingofhowcomponentsintheHadoopecosystemareeffectivelyintegratedtoimplementafastandreliabledatapipeline,andyou’llbeequippedtotacklearangeofreal-worldproblemsindatapipelines.
·12.1萬字