ArchiText: Interactive Hierarchical Topic Modeling

Hannah Kim, Barry Drake, Alex Endert, Haesun Park

View presentation:2020-10-28T15:00:00ZGMT-0600Change your timezone on the schedule page
2020-10-28T15:00:00Z
Exemplar figure, but none was provided by the authors
ArchiText: Interactive Hierarchical Topic Modeling
Fast forward

Direct link to video on YouTube: https://youtu.be/-r9sox7jk7w

Keywords

Text analytics, Topic modeling, Nonnegative matrix factorization, Hierarchical topics, Visual analytics

Abstract

Human-in-the-loop topic modeling allows users to explore and steer the process to produce better quality topics that align with their needs. When integrated into visual analytic systems, many existing automated topic modeling algorithms are given interactive parameters to allow users to tune or adjust them. However, this has limitations when the algorithms cannot be easily adapted to changes, and it is difficult to realize interactivity closely supported by underlying algorithms. Instead, we emphasize the concept of tight integration, which advocates for the need to co-develop interactive algorithms and interactive visual analytic systems in parallel to allow flexibility and scalability. In this paper, we describe design goals for efficiently and effectively executing the concept of tight integration among computation, visualization, and interaction for hierarchical topic modeling of text data. We propose computational base operations for interactive tasks to achieve the design goals. To instantiate our concept, we present ArchiText, a prototype system for interactive hierarchical topic modeling, which offers fast, flexible, and algorithmically valid analysis via tight integration. Utilizing interactive hierarchical topic modeling, our technique lets users generate, explore, and flexibly steer hierarchical topics to discover more informed topics and their document memberships.