(Online; RCR) Digital Humanities Research: Topic Modeling and Document Classification
This workshop will equip students with a general understanding of topic modeling and document classification techniques for research. To facilitate a hands-on approach with a focus on process, we'll use the Orange data analysis platform and the open-source MALLET toolkit to explore ways of characterizing or sorting large corpora. Participants will learn what topic modeling can (and can't) reveal about a body of texts, how to interpret the results of topic modeling or document classification, and how to apply such techniques to their own research.
Registered participants will receive a Zoom link the day before the workshop; this event is offered for RCR credit as GS717.09.
- Tuesday, April 4, 2023
- 9:30am - 11:30am
- Digital Humanities Digital Scholarship ScholarWorks