R Web Scraping Quick Start Guide
Web scraping is a technique to extract data from websites. It simulates the behavior of a website user to turn the website itself into a web service to retrieve or introduce new data. This book gives you all you need to get started with scraping web pages using R programming. You will learn about the rules of RegEx and XPath, key components for scraping website data. We will show you web scraping techniques, methodologies, and frameworks. With this book's guidance, you will become comfortable with the tools to write and test RegEx and XPath rules. We will focus on examples of dynamic websites for scraping data and how to implement the techniques learned. You will learn how to collect URLs and then create XPath rules for your first web scraping script using the rvest library. From the data you collect, you will be able to calculate the statistics and create R plots to visualize them. Finally, you will discover how to use Selenium drivers with R for more sophisticated scraping. You will create AWS instances and use R to connect to a PostgreSQL database hosted on AWS. By the end of the book, you will be sufficiently confident to create end-to-end web scraping systems using R.
· 14,000 words