Conduit: Programmer-Transparent Near-Data Processing Using Multiple Compute-Capable Resources in SSDs
Near-data processing (NDP) mitigates the data movement bottleneck in modern computing systems by performing computation closer to where the data resides. Solid-state drives (SSDs) are well-suited for NDP for four key reasons: (1) they store large application datasets that exceed main memory capacity; (2) they inherently support three near-data processing paradigms: processing using controller cores, processing using DRAM in the SSD, and in-flash processing; (3) they offer high computation throughput via massive internal parallelism; (4) they ease the burden on the memory hierarchy by supporting in-place computations.
Leveraging NDP techniques in SSDs presents three key challenges: (1) limited applicability across diverse workloads, (2) underutilization of SSD computation resources, and (3) significant programmer effort to analyze the applications and system state to select and integrate a suitable NDP technique in the storage I/O stack. While no prior approach leverages multiple SSD computation resources in a general-purpose and application-agnostic manner, several studies partition computation between the host and near-memory processing units. However, adapting these techniques to SSDs offers limited benefits because they (1) ignore the heterogeneity of the SSD computation resources, and (2) rely on limited factors such as bandwidth utilization or data movement cost, ignoring key factors such as computation resource utilization.
We address these challenges with Conduit, a general-purpose programmer-transparent NDP framework for SSDs that dynamically offloads computations across multiple SSD resources. Conduit performs: (1) a compile-time preprocessing step that vectorizes application code segments for NDP, and (2) a runtime instruction-granularity offloading step that dynamically determines the best-fit SSD computation resource based on resource utilization, data dependencies and data movement costs. Conduit accelerates a broad range of workloads without programmer intervention. We evaluate Conduit and two prior NDP offloading techniques using an in-house event-driven SSD simulator on six data-intensive applications. Conduit outperforms the best-performing prior offloading policy by 80% and reduces energy consumption by 46%, with minimal storage overhead, and no additional hardware costs.
Mon 2 FebDisplayed time zone: Hobart change
11:30 - 12:50 | Near-Data Processing and StorageMain Conference at Coogee Chair(s): Jisung Park POSTECH (Pohang University of Science and Technology) | ||
11:30 20mTalk | PIMphony: Overcoming Bandwidth and Capacity Inefficiency in PIM-based Long-Context LLM Inference System Main Conference hyucksung kwon Hanyang University, Kyungmo Koo Hanyang University, Janghyeon Kim Hanyang University, Woongkyu Lee Hanyang University, Minjae Lee Hanyang University, Gyeonggeun Jung KAIST, Hyungdeok Lee Solution Advanced Technology, SK hynix, Yousub Jung Solution Advanced Technology, SK hynix, Jaehan Park Solution Advanced Technology, SK hynix, Yosub Song Solution Advanced Technology, SK hynix, Byeongsu Yang Solution Advanced Technology, SK hynix, Haerang Choi Solution Advanced Technology, SK hynix, Guhyun Kim Solution Advanced Technology, SK hynix, Jongsoon Won Solution Advanced Technology, SK hynix, Woojae Shin Solution Advanced Technology, SK hynix, Changhyun Kim Solution Advanced Technology, SK hynix, Shin Gyeongcheol Solution Advanced Technology, SK hynix, Yongkee Kwon Tenstorrent, Ilkon Kim Solution Advanced Technology, SK hynix, Euicheol Lim SK hynix, John Kim KAIST, Jungwook Choi Hanyang University | ||
11:50 20mTalk | Adaptive Draft Sequence Length: Enhancing Speculative Decoding Throughput on PIM-Enabled Systems Main Conference Runze Wang Huazhong University of Science and Technology, Qinggang Wang Huazhong University of Science and Technology, Haifeng Liu Huazhong University of Science and Technology, Long Zheng Huazhong University of Science and Technology, XIAOFEI LIAO Huazhong University of Science and Technology, Hai Jin Huazhong University of Science and Technology, Jingling Xue UNSW Sydney | ||
12:10 20mTalk | Conduit: Programmer-Transparent Near-Data Processing Using Multiple Compute-Capable Resources in SSDs Main Conference Rakesh Nadig ETH Zurich, Vamanan Arulchelvan ETH Zurich, Mayank Kabra ETH Zurich, Harshita Gupta ETH Zurich, Rahul Bera ETH Zurich, Nika Mansouri Ghiasi ETH Zurich, Nanditha Rao ETH Zurich, Qingcai Jiang ETH Zurich, Andreas Kosmas Kakolyris ETH Zurich, Yu Liang ETH Zurich, Mohammad Sadrosadati ETH Zürich, Onur Mutlu ETH Zurich | ||
12:30 20mTalk | N-DIPPER: A Distributed Inter-die Peak Power Management Network for NAND Systems Main Conference | ||