BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Springshare//LibCal//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-TIMEZONE:America/New_York
X-PUBLISHED-TTL:PT15M
BEGIN:VEVENT
DTSTART:20181009T140000Z
DTEND:20181009T160000Z
DTSTAMP:20181009T000000Z
SUMMARY:Text/Data (RCR Days): Acquiring and Preparing a Corpus of Texts
DESCRIPTION:This workshop focuses on the technical dimensions of corpus 
 development.  Using an array of printed matter -- from digital facsimiles 
 of incunabula to modern letterpress/offset books -- we will explore the 
 risks and benefits of optical character recognition (OCR)\; file formatting 
 and naming issues\; organization strategies for large corpora\; and 
 problems of data cleaning and preparation.  We will also look at some 
 common sources for textual research data\, such as Project Gutenberg\, the 
 Internet Archive\, and Google Books.  While this session will not examine 
 legal issues in detail\, we will discuss some common legal concerns around 
 the use of textual corpora.\n
LOCATION:Bostock 121 (Murthy Digital Studio)\, West Campus
ORGANIZER;CN="Will Shaw":MAILTO:william.shaw@duke.edu
CATEGORIES:Digital Scholarship
CONTACT;CN="Will Shaw":MAILTO:william.shaw@duke.edu
STATUS:CONFIRMED
UID:LibCal-4491985
URL:https://duke.libcal.com/event/4491985
X-MICROSOFT-CDO-BUSYSTATUS:BUSY
BEGIN:VALARM
TRIGGER:-PT15M
ACTION:DISPLAY
DESCRIPTION:Reminder
END:VALARM
END:VEVENT

END:VCALENDAR