Scaling Graph Neural Network Training via Geometric Optimization (HPCA 2026 - Main Conference)

Who

Fangzhou Ye, Lingxiang Yin, Hao Zheng

Track

HPCA 2026 Main Conference

Time Zone

The program is currently displayed in (GMT+11:00) Hobart.

Use conference time zone: (GMT+11:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 4 Feb 2026 10:30 - 10:50 at Coogee - Graph Neural Networks and Retrieval Systems Chair(s): Amir Yazdanbakhsh

Abstract

Wafer-scale computing has emerged as an alternative solution to sustain performance scaling in the post-Moore era, driven by recent technology advancements such as chiplet integration. This enables considerable computing and storage capabilities on a single chip, making it capable of accommodating large machine learning models and datasets. Recent efforts have heralded the promise of wafer-scale architectures for deep learning inference and training. However, scaling the training of Graph Neural Networks in wafer-scale architecture remains a challenge and is relatively unexplored due to irregularities in gradient propagation as well as physical constraints from flat on-chip topologies.

In this paper, we propose Aster, a topology-aware framework designed to efficiently support GNN training on arbitrary wafer-scale architectures. The proposed framework, as opposed to the current application or topology-specific heuristics, can be generalized to support any network topology and irregular GNN datasets. Specifically, we mathematically formulate commonly-seen network topologies in their geometric representation and prioritize communication efficiency during GNN workload partitioning and mapping. Based on the geometric representation, we propose a quadratic assignment problem solver to efficiently map irregular dataflows to a flat topology with reduced communication distance. The simulation results show that Aster can achieve performance speedup by $2.91 \times$, $1.50 \times$, $1.84 \times$, and $1.58 \times$ in Mesh and speedup by $3.84 \times$, $1.56 \times$, $2.05 \times$, and $1.49 \times$ in Torus on average compared to Mini-cut, ScalaGraph, Chunk-V, and Chunk-E, respectively.

Fangzhou Ye

University of Central Florida

Lingxiang Yin

University of Central Florida

Hao Zheng

University of Central Florida

Time Zone

The program is currently displayed in (GMT+11:00) Hobart.

Use conference time zone: (GMT+11:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 4 Feb
Displayed time zone: Hobart change

09:50 - 11:10	Graph Neural Networks and Retrieval SystemsMain Conference at Coogee Chair(s): Amir Yazdanbakhsh Google Research, Brain Team

09:50 20m Talk		VeloxGNN: Accelerating Out-of-Core based GNN Training with Low Data Migration and High Accuracy via Delayed Gradient Propagation Main Conference Yi Li University of Texas at Dallas, Tsun-Yu Yang Center for Computational Evolutionary Intelligence, Electrical & Computer Engineering, Duke University, Zhaoyan Shen Shandong University, Ming-Chang Yang The Chinese University of Hong Kong (CUHK), Bingzhe Li University of Texas at Dallas
10:10 20m Talk		AutoGNN: End-to-End Hardware-Driven Graph Preprocessing for Enhanced GNN Performance Main Conference Seungkwan Kang KAIST, Seungjun Lee KAIST, Donghyun Gouk Panmnesia, Miryeong Kwon Panmnesia, Hyunkyu Choi Panmnesia, Junhyeok Jang Panmnesia, Sangwon Lee Panmnesia, Huiwon Choi KAIST, Jie Zhang Peking University, Wonil Choi Hanyang University, Mahmut Taylan Kandemir Pennsylvania State University, Myoungsoo Jung KAIST
10:30 20m Talk		Scaling Graph Neural Network Training via Geometric Optimization Main Conference Fangzhou Ye University of Central Florida, Lingxiang Yin University of Central Florida, Hao Zheng University of Central Florida
10:50 20m Talk		VectorLiteRAG: Latency-Aware and Fine-Grained Resource Partitioning for Efficient RAG Main Conference Junkyum Kim Georgia Institute of Technology, Divya Mahajan Georgia Institute of Technology