Course

Web scraping 211: Rvest and RCrawler for R

Ended Nov 17, 2022
2 credits

Spots remaining: 13

Enrollment is closed
Add yourself to the wait list and you'll be auto enrolled when a spot opens

Add to Wait List

Full course description

Term: Fall 2022

Date: November 17th, 2022

Time: 2:30pm - 4:00p

Location: Torgerson 3310 & Online through Zoom

Instructor: Nathaniel Porter

Presented By: University Libraries (LIB)

 

Description:

Programmatic tools like R make web scraping simpler and more transparent, and allow scaling to multiple pages using crawlers. Learn how to automate collection and extraction of data from multiple related webpages with R libraries Rvest and RCrawler and work with the extracted data. Participants need a computer with R and RStudio installed (directions will be sent after registration) as well as basic experience working with data in R and familiarity with XPath (equivalent to Web scraping 101) to fully participate. The workshop is adapted from episode 4 of the Library Carpentry Web Scraping curriculum.