Event box

R case study: web scraping

Building on knowledge from earlier Rfun workshops, useRs will be introduced to web crawling and HTML parsing.  In this introductory web scraping workshop, attendees will use the rvest package to deconstruct a target site into structured data by combining limited knowledge of HTML specifications with a very limited appreciation of the HTTP protocol along with basic Tidyverse-style iteration. 

Prerequisites

  • Introductory familiarity with R and the Tidyverse (e.g. quickStart with R, part 1)
  • Install R and RStudio on your computer
  • tidyverese and rvest packages installed in your R environment

install.packages(c("tidyverse", "rvest"))

This event is offered virtually in accordance with Duke's Coronavirus events policies. A zoom link will be sent via email to registered participants to join the workshop. 

The content of the workshop may be recorded. If you are uncomfortable with a recording being published, please contact the instructor at anytime prior to the conclusion of the workshop.

Data Science

Date:
Thursday, March 4, 2021
Time:
1:30pm - 3:00pm
Campus:
n/a
Categories:
Data and Visualization  
Registration has closed.

Event Organizer

Profile photo of John Little
John Little
Profile photo of Center for Data and Visualization Sciences
Center for Data and Visualization Sciences