Text/Data (RCR Days): Topic Modeling and Document Classification with MALLET
Participants in this session will acquire a general understanding of topic modeling, an automated text analysis technique often grouped under the broader label "text mining." The term covers a number of different algorithms, many of which are computationally intensive and mathematically complex. To keep the session hands-on and focused on process, this workshop uses the open-source MALLET toolkit to explore topic modeling with LDA (Latent Dirichlet Allocation) and will not compare algorithms. The session also introduces sequence labeling and automated document classification, both of which MALLET supports.
- Tuesday, October 9, 2018
- 1:00pm - 3:00pm
- Bostock 121 (Murthy Digital Studio)
- West Campus
- Digital Scholarship