Event box

Text/Data (RCR Days): Topic Modeling and Document Classification with MALLET

Participants in this session will acquire a general understanding of topic modeling, the automated analysis technique often referred to as "text mining."  Topic modeling can refer to a number of different algorithms, which are computationally intensive and mathematically complex. To facilitate a hands-on approach with a focus on process, this workshop uses the open-source MALLET toolkit as a platform for exploring topic modeling with LDA (Latent Dirichlet Allocation) and will not offer a comparison of algorithms. In addition to topic modeling, this session introduces the concepts of sequence labeling and automated document classification, both of which are also possible with MALLET.   

Date:
Tuesday, October 9, 2018
Time:
1:00pm - 3:00pm
Location:
Bostock 121 (Murthy Digital Studio)
Campus:
West Campus
Categories:
Digital Scholarship  
Registration has closed.

Event Organizer

Profile photo of Will Shaw
Will Shaw

Digital Humanities Consultant, Duke University Libraries