Event box

(Online; RCR) Digital Humanities Research: Topic Modeling and Document Classification

This workshop will equip students with a general understanding of topic modeling and document classification techniques for research. To facilitate a hands-on approach with a focus on process, we'll use the Orange data analysis platform and the open-source MALLET toolkit to explore ways of characterizing or sorting large corpora. Participants will learn what topic modeling can (and can't) reveal about a body of texts, how to interpret the results of topic modeling or document classification, and how to apply such techniques to their own research. 

Registered participants will receive a Zoom link the day before the workshop; this event is offered for RCR credit as GS717.09.

Tuesday, April 4, 2023
9:30am - 11:30am
Digital Humanities   Digital Scholarship   ScholarWorks  
Registration has closed.

Event Organizer

Profile photo of Will Shaw
Will Shaw

Digital Humanities Consultant, Duke University Libraries