Lex Alexander, Gehlenborg Nils, Strobelt Hendrik, Vuillemot Romain, Pfister Hanspeter
IEEE Trans Vis Comput Graph. 2014 Dec;20(12):1983-92. doi: 10.1109/TVCG.2014.2346248.
Understanding relationships between sets is an important analysis task that has received widespread attention in the visualization community. The major challenge in this context is the combinatorial explosion of the number of set intersections if the number of sets exceeds a trivial threshold. In this paper we introduce UpSet, a novel visualization technique for the quantitative analysis of sets, their intersections, and aggregates of intersections. UpSet is focused on creating task-driven aggregates, communicating the size and properties of aggregates and intersections, and a duality between the visualization of the elements in a dataset and their set membership. UpSet visualizes set intersections in a matrix layout and introduces aggregates based on groupings and queries. The matrix layout enables the effective representation of associated data, such as the number of elements in the aggregates and intersections, as well as additional summary statistics derived from subset or element attributes. Sorting according to various measures enables a task-driven analysis of relevant intersections and aggregates. The elements represented in the sets and their associated attributes are visualized in a separate view. Queries based on containment in specific intersections, aggregates or driven by attribute filters are propagated between both views. We also introduce several advanced visual encodings and interaction methods to overcome the problems of varying scales and to address scalability. UpSet is web-based and open source. We demonstrate its general utility in multiple use cases from various domains.
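As a rough illustration of the matrix layout described in the abstract, the following sketch computes the non-empty intersections of a few sets, counts their elements, and sorts them by size. It is not taken from the UpSet code base. The function names (`exclusive_intersections`, `matrix_rows`) and the example data are hypothetical, and it assumes that each row stands for an exclusive intersection, meaning the elements that belong to exactly the listed sets and to no others.

```python
from collections import Counter


def exclusive_intersections(sets):
    """Count elements per exact combination of sets (hypothetical helper).

    `sets` maps a set name to its members. The result maps a frozenset of
    set names to the number of elements that belong to exactly those sets.
    """
    membership = {}
    for name, members in sets.items():
        for element in members:
            membership.setdefault(element, set()).add(name)
    return Counter(frozenset(names) for names in membership.values())


def matrix_rows(sets):
    """Build rows of (membership flags, size), largest intersection first."""
    names = sorted(sets)
    counts = exclusive_intersections(sets)
    # Sort by descending size; break ties by the sorted set names.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], sorted(kv[0])))
    rows = [(tuple(name in combo for name in names), size)
            for combo, size in ordered]
    return names, rows


# Made-up example data, for illustration only.
example = {
    "A": {1, 2, 3, 4},
    "B": {3, 4, 5},
    "C": {4, 5, 6, 7},
}

names, rows = matrix_rows(example)
print(" ".join(names), "size")
for flags, size in rows:
    # "x" marks sets that take part in the intersection, "." those that do not.
    print(" ".join("x" if flag else "." for flag in flags), size)

# With n sets there are up to 2**n - 1 non-empty combinations, which is the
# combinatorial explosion the abstract refers to; only non-empty ones are listed.
print("possible combinations:", 2 ** len(names) - 1)
```

Only combinations that actually contain elements produce a row, so the output stays far smaller than the theoretical maximum when most combinations are empty. Sorting by a different key (for example, by the number of participating sets) would correspond to the alternative orderings the abstract mentions.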