Mastering Text Mining with R
Mastertext-tamingtechniquesandbuildeffectivetext-processingapplicationswithRAboutThisBook?Developalltherelevantskillsforbuildingtext-miningappswithRwiththiseasy-to-followguide?Gainin-depthunderstandingofthetextminingprocesswithlucidimplementationintheRlanguage?Example-richguidethatletsyougainhigh-qualityinformationfromtextdataWhoThisBookIsForIfyouareanRprogrammer,analyst,ordatascientistwhowantstogainexperienceinperformingtextdataminingandanalyticswithR,thenthisbookisforyou.Exposuretoworkingwithstatisticalmethodsandlanguageprocessingwouldbehelpful.WhatYouWillLearn?GetacquaintedwithsomeofthehighlyefficientRpackagessuchasOpenNLPandRWekatoperformvariousstepsinthetextminingprocess?AccessandmanipulatedatafromdifferentsourcessuchasJSONandHTTP?Processtextusingregularexpressions?Gettoknowthedifferentapproachesoftaggingtexts,suchasPOStagging,togetstartedwithtextanalysis?Exploredifferentdimensionalityreductiontechniques,suchasPrincipalComponentAnalysis(PCA),andunderstanditsimplementationinR?Discovertheunderlyingthemesortopicsthatarepresentinanunstructuredcollectionofdocuments,usingcommontopicmodelssuchasLatentDirichletAllocation(LDA)?Buildabaselinesentencecompletingapplication?PerformentityextractionandnamedentityrecognitionusingRInDetailTextMining(ortextdataminingortextanalytics)istheprocessofextractingusefulandhigh-qualityinformationfromtextbydevisingpatternsandtrends.Rprovidesanextensiveecosystemtominetextthroughitsmanyframeworksandpackages.Startingwithbasicinformationaboutthestatisticsconceptsusedintextmining,thisbookwillteachyouhowtoaccess,cleanse,andprocesstextusingtheRlanguageandwillequipyouwiththetoolsandtheassociatedknowledgeaboutdifferenttagging,chunking,andentailmentapproachesandtheirusageinnaturallanguageprocessing.Movingon,thisbookwillteachyoudifferentdimensionalityreductiontechniquesandtheirimplementationinR.Next,wewillcoverpatternrecognitionintextdatautilizingclassificationmechanisms,performentityrecognition,anddevelopanontologylearningframework.Bytheendofthebook,youwilldevelopapracticalapplicationfromtheconceptslearned,andwillunderstandhowtextminingcanbeleveragedtoanalyzethemassivelyavailabledataonsocialmedia.StyleandapproachThisbooktakesahands-on,example-drivenapproachtothetextminingprocesswithlucidimplementationinR.
·4.1萬字