Apache Spark 2: Data Processing and Real-Time Analytics
Apache Spark is an in-memory, cluster-based data processing system that provides a wide range of functionality, including big data processing, analytics, and machine learning. With this Learning Path, you can take your knowledge of Apache Spark to the next level by learning how to extend Spark's functionality and build your own data flow and machine learning programs on the platform. You will work with the different modules in Apache Spark: interactive querying with Spark SQL, using DataFrames and datasets, implementing streaming analytics with Spark Streaming, and applying machine learning and deep learning techniques on Spark using MLlib and various external tools. By the end of this Learning Path, you will have the knowledge you need to master Apache Spark and build your own big data processing and analytics pipeline.
This Learning Path includes content from the following Packt products:
Mastering Apache Spark 2.x by Romeo Kienzler
Scala and Spark for Big Data Analytics by Md. Rezaul Karim and Sridhar Alla
Apache Spark 2.x Machine Learning Cookbook by Siamak Amirghodsi, Meenakshi Rajendran, Broderick Hall, and Shuen Mei
· About 104,000 words