Traveler: Navigating Task Parallel Traces for Performance Analysis
Author
Sakin, Sayef AzadBigelow, Alex
Tohid, R.
Scully-Allison, Connor
Scheidegger, Carlos
Brandt, Steven R.
Taylor, Christopher
Huck, Kevin A.
Kaiser, Hartmut
Isaacs, Katherine E.
Affiliation
University of ArizonaIssue Date
2022Keywords
CodesData visualization
event sequence visualization
High performance computing
Measurement
Navigation
parallel computing
performance analysis
Performance analysis
Software visualization
Task analysis
traces
Metadata
Show full item recordCitation
Sakin, S. A., Bigelow, A., Tohid, R., Scully-Allison, C., Scheidegger, C., Brandt, S. R., Taylor, C., Huck, K. A., Kaiser, H., & Isaacs, K. E. (2022). Traveler: Navigating Task Parallel Traces for Performance Analysis. IEEE Transactions on Visualization and Computer Graphics, 1–10.Rights
© 2022 IEEE.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Understanding the behavior of software in execution is a key step in identifying and fixing performance issues. This is especially important in high performance computing contexts where even minor performance tweaks can translate into large savings in terms of computational resource use. To aid performance analysis, developers may collect an <italic>execution trace</italic>—a chronological log of program activity during execution. As traces represent the full history, developers can discover a wide array of possibly previously unknown performance issues, making them an important artifact for exploratory performance analysis. However, interactive trace visualization is difficult due to issues of data size and complexity of meaning. Traces represent nanosecond-level events across many parallel processes, meaning the collected data is often large and difficult to explore. The rise of asynchronous task parallel programming paradigms complicates the relation between events and their probable cause. To address these challenges, we conduct a continuing design study in collaboration with high performance computing researchers. We develop diverse and hierarchical ways to navigate and represent execution trace data in support of their trace analysis tasks. Through an iterative design process, we developed <italic>Traveler</italic>, an integrated visualization platform for task parallel traces. Traveler provides multiple linked interfaces to help navigate trace data from multiple contexts. We evaluate the utility of Traveler through feedback from users and a case study, finding that integrating multiple modes of navigation in our design supported performance analysis tasks and led to the discovery of previously unknown behavior in a distributed array library.Note
Immediate accessISSN
1077-2626EISSN
1941-05062160-9306
Version
Final accepted manuscriptae974a485f413a2113503eed53cd6c53
10.1109/tvcg.2022.3209375