HPCA 2026
Sat 31 January - Wed 4 February 2026 Sydney, Australia
co-located with HPCA/CGO/PPoPP/CC 2026
Tue 3 Feb 2026 16:50 - 17:10 at Collaroy - Accelerator Design and Modeling Chair(s): Leeor Peled

Chiplet-based architectures have emerged as a promising approach to overcome the physical and manufacturing constraints faced by monolithic chips, enabling scalable integration of compute resources to meet the growing demands of AI workloads. However, efficient inter-chiplet communication remains some significant bottleneck, especially under the fine-grained and bursty DMA request patterns generated by processing elements in modern AI tasks. Existing communication models and simulators fail to capture these characteristics, which limits the accuracy of performance analysis and the effectiveness of optimization strategies. These limitations hinder the DMA-communication inefficiencies in chiplet-based AI systems and pose challenges for designing HPC architectures.

To address these challenges, we present the first comprehensive chiplet communication model that explicitly incorporates fine-grained DMA traffic observed in realistic AI workloads. Building on this model, we propose COMET, a novel framework that intelligently searches for optimal DMA request aggregation and memory address mapping strategies tailored to chiplet environments. COMET dynamically consolidates small DMA transfers to improve bandwidth utilization and reduce communication latency, while also adapting on-chip memory mapping to align with workload-specific dataflows. This mitigates synchronization overhead across diverse AI tasks. Compared with inference on conventional chiplet communication schemes, COMET achieves $1.7\times$–$2.5\times$ speedup and $1.5\times$–$4.4\times$ higher bandwidth utilization across different DNN and LLM workloads.

Tue 3 Feb

Displayed time zone: Hobart change

15:50 - 17:10
Accelerator Design and ModelingMain Conference at Collaroy
Chair(s): Leeor Peled Huawei
15:50
20m
Talk
NPUWattch: ML-based Power, Area, and Timing Modeling for Neural Accelerators
Main Conference
Sehyeon Kim Yonsei University, Minkwan Kim Yonsei University, Chanho Park Yonsei University, Hanmok Park Kyungpook National University, Seonghoon Kim Kyungpook National University, Taigon Song Kyungpook National University, William Song Yonsei University
16:10
20m
Talk
Area Bloating and the Future of Specialization
Main Conference
Qixuan Yu Princeton University, David Wentzlaff Princeton University
16:30
20m
Talk
Advancing Full-stack Acceleration for Schrödinger-Style Quantum Simulation
Main Conference
Shuang Liang Imperial College London, Yuncheng Lu Imperial College London, Ce Guo Imperial College London, Paul H J Kelly Imperial College London, Wayne Luk Imperial College London, Hongxiang Fan Imperial College London
16:50
20m
Talk
COMET: Communication and Memory Co-Design for Fine-Grained AI Inference in MCM Accelerators
Main Conference
Taishu Sheng College of Computer Science and Technology, National University of Defense Technology, Guangyu Sun Peking University, Dezun Dong NUDT