R case study: web scraping
Building on knowledge from earlier Rfun workshops, useRs will be introduced to web crawling and HTML parsing. In this introductory web scraping workshop, attendees will use the rvest package to deconstruct a target site into structured data, combining a limited knowledge of HTML specifications, a very limited appreciation of HTTP, and basic Tidyverse-style iteration.
Prerequisites:
- Introductory familiarity with R and the Tidyverse (e.g. quickStart with R, part 1)
- R and RStudio installed on your computer
- tidyverse and rvest packages installed in your R environment:
install.packages(c("tidyverse", "rvest"))
This event is offered virtually in accordance with Duke's Coronavirus events policies. A Zoom link to join the workshop will be sent by email to registered participants.
The content of the workshop may be recorded. If you are uncomfortable with a recording being published, please contact the instructor at any time before the conclusion of the workshop.
Data Science
- Date: Thursday, March 4, 2021
- Time: 1:30pm - 3:00pm
- Campus: n/a
- Categories: Data and Visualization