Main Program

ASPLOS 2024 PROCEEDINGS

Volume 1: https://dl.acm.org/doi/proceedings/10.1145/3617232

Volume 2: https://dl.acm.org/doi/proceedings/10.1145/3620665

Volume 3: https://dl.acm.org/doi/proceedings/10.1145/3620666

Sunday, 18:00 PDT – 21:00 PDT: Welcome Reception and Poster Session (Location: Grande C)

Floor Plan

Day 1: Monday, April 29

7:30 PDT – 8:30 PDT: Breakfast

8:30 PDT – 9:00 PDT: Opening Remarks (Location: Grande)

9:00 PDT – 10:00 PDT: Keynote 1 by Amin Vahdat (Google)

Societal infrastructure in the age of Artificial General Intelligence
Abstract
Today, we are at an inflection point in computing where emerging Generative AI services are placing unprecedented demand for compute while the existing architectural patterns for improving efficiency have stalled. In this talk, we will discuss the likely needs of the next generation of computing infrastructure and use recent examples at Google from networks to accelerators to servers to illustrate the challenges and opportunities ahead. Taken together, we chart a course where computing must be increasingly specialized and co-optimized with algorithms and software, all while fundamentally focusing on security and sustainability.
Bio
Amin Vahdat (Vice President Google — ML, Systems, and Cloud AI) is a Fellow and vice president of Engineering at Google, where his team is responsible for delivering industry-leading Machine Learning software and hardware that serves Alphabet, Google and the world, and Artificial Intelligence technologies that empower ML developers and solve customers’ most pressing business challenges. In the past, he was General Manager for Google’s compute, storage, and network hardware and software infrastructure. Until 2019, he was the Technical Lead for the Networking organization at Google. 

Before joining Google, Amin was the Science Applications International Corporation (SAIC) Professor of Computer Science and Engineering at UC San Diego (UCSD). He received his doctorate from the University of California Berkeley in computer science. He is a member of the National Academy of Engineering (NAE) and an ACM Fellow.

Amin has been recognized with a number of awards, including the National Science Foundation (NSF) CAREER award, the UC Berkeley Distinguished EECS Alumni Award, the Alfred P. Sloan Fellowship, the ACM SIGCOMM Networking Systems Award, and the Duke University David and Janet Vaughn Teaching Award. Most recently, Amin was awarded the SIGCOMM lifetime achievement award for his contributions to data center and wide area networks. Lastly, he was inducted into the National Academy of Engineering in September 2023 for his contributions to the design and implementation of datacenter and planet-scale networks that power cloud computer systems.

10:00 PDT – 10:30 PDT: Break

10:30 PDT – 11:45 PDT: Lightning Talk Session

Lightning A

(Location: Grande A/B)
Session Chair: Soroush Ghodrati (University of California, San Diego)
Papers from all A sessions.
Lightning B

(Location: Grande C)
Session Chair: Moumita Dey (AMD Research and Advanced Development)
Papers from all B sessions.
Lightning C

(Location: Grande D/E)
Session Chair: Nader Sehatbakhsh (University of California, Los Angeles)
Papers from all C sessions.
Lightning D

(Location: Scripps I/II)
Session Chair: Kazem Taram (Purdue University)
Papers from all D sessions.

11:45 PDT – 12:00 PDT: Break

12:00 PDT – 13:00 PDT: Session 1

1A: Synthesis for Architectures

(Location: Grande A/B)
Session Chair: Adrian Sampson (Cornell University)
Explainable Port Mapping Inference with Sparse Performance Counters for AMD’s Zen Architectures

Fabian Ritter and Sebastian Hack (Saarland University)

Paper . Abstract . Lightning Talk
Longnail: High-Level Synthesis of Portable Custom Instruction Set Extensions for RISC-V Processors from Descriptions in the Open-Source CoreDSL Language

Julian Oppermann, Brindusa Mihaela Damian-Kosterhon, Florian Meisel, and Tammo Mürmann (Technical University of Darmstadt); Eyck Jentzsch (MINRES Technologies GmbH); Andreas Koch (Technical University of Darmstadt)

Paper . Abstract . Lightning Talk
SEER: Super-Optimization Explorer for High-Level Synthesis using E-graph Rewriting

Jianyi Cheng (University of Cambridge and Intel); Samuel Coward (Imperial College London and Intel); Lorenzo Chelini, Rafael Barbalho, and Theo Drane (Intel)

Paper . Abstract . Lightning Talk
HIDA: A Hierarchical Dataflow Compiler for High-Level Synthesis

Hanchen Ye, Hyegang Jun, and Deming Chen (University of Illinois Urbana-Champaign)

Paper . Abstract . Lightning Talk
1B: Optimizing ML Communication

(Location: Grande C)
Session Chair: Roshan Dathathri (Microsoft Research)
TCCL: Discovering Better Communication Paths for PCIe GPU Clusters

Heehoon Kim, Junyeol Ryu, and Jaejin Lee (Seoul National University)

Paper . Abstract . Lightning Talk
[Best Paper] Centauri: Enabling Efficient Scheduling for Communication-Computation Overlap in Large Model Training via Communication Partitioning

Chang Chen, Xiuhong Li, and Qianchao Zhu (Peking University); Jiangfei Duan (Chinese University of Hong Kong); Peng Sun and Xingcheng Zhang (Shanghai AI Lab); Chao Yang (Peking University)

Paper . Abstract . Lightning Talk
T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives

Suchita Pati (University of Wisconsin-Madison and AMD); Shaizeen Aga, Mahzabeen Islam, and Nuwan Jayasena (AMD); Matthew D. Sinclair (University of Wisconsin-Madison and AMD)

Paper . Abstract . Lightning Talk
Two-Face: Combining Collective and One-Sided Communication for Efficient Distributed SpMM

Charles Block, Gerasimos Gerogiannis, and Charith Mendis (University of Illinois at Urbana-Champaign); Ariful Azad (Indiana University); Josep Torrellas (University of Illinois at Urbana-Champaign)

Paper . Abstract . Lightning Talk
1C: Case Studies and Experience

(Location: Grande D/E)
Session Chair: Akanksha Jain (Google)
A Quantitative Analysis and Guidelines of Data Streaming Accelerator in Modern Intel Xeon Scalable Processors

Reese Kuper and Ipoom Jeong (University of Illinois Urbana-Champaign); Yifan Yuan, Ren Wang, Narayan Ranganathan, and Nikhil Rao (Intel Labs); Jiayu Hu (Tencent); Sanjay Kumar and Philip Lantz (Intel Labs); Nam Sung Kim (University of Illinois Urbana-Champaign)

Paper . Abstract . Lightning Talk
Thesios: Synthesizing Accurate Counterfactual I/O Traces from I/O Samples

Phitchaya Mangpo Phothilimthana (Google DeepMind); Saurabh Kadekodi (Google); Soroush Ghodrati (University of California San Diego); Selene Moon (Google); Martin Maas (Google DeepMind)

Paper . Abstract . Lightning Talk
A Journey of a 1,000 Kernels Begins with a Single Step: A Retrospective of Deep Learning on GPUs

Michael Davies, Ian McDougall, Selvaraj Anandaraj, Deep Machchhar, Rithik Jain, and Karthikeyan Sankaralingam (University of Wisconsin-Madison)

Paper . Abstract . Lightning Talk
Expanding Datacenter Capacity with DVFS Boosting: A safe and scalable deployment experience

Leonardo Piga, Iyswarya Narayanan, Aditya Sundarrajan, Matt Skach, and Qingyuan Deng (Meta); Biswadip Maity (University of California Irvine); Manoj Chakkaravarthy, Alison Huang, Abhishek Dhanotia, and Parth Malani (Meta)

Paper . Abstract . Lightning Talk
1D: Attacks and Mitigations

(Location: Scripps I/II)
Session Chair: Moumita Dey (AMD Research and Advanced Development)
Rubix: Reducing the Overhead of Secure Rowhammer Mitigations via Randomized Line-to-Row Mapping

Anish Saxena, Saurav Mathur, and Moinuddin Qureshi (Georgia Tech)

Paper . Abstract . Lightning Talk
TAROT: A CXL SmartNIC-Based Defense Against Multi-bit Errors by Row-Hammer Attacks

Chihun Song (University of Illinois Urbana-Champaign); Michael Jaemin Kim (Seoul National University); Tianchen Wang, Houxiang Ji, Jinghan Huang, and Ipoom Jeong (University of Illinois Urbana-Champaign); Jaehyun Park, Hwayong Nam, Minbok Wi, and Jung Ho Ahn (Seoul National University); Nam Sung Kim (University of Illinois Urbana-Champaign)

Paper . Abstract . Lightning Talk
Pythia: Compiler-Guided Defense Against Non-Control Data Attacks

Sharjeel Khan, Bodhisatwa Chatterjee, and Santosh Pande (Georgia Tech)

Paper . Abstract . Lightning Talk
Everywhere All at Once: Co-Location Attacks on Public Cloud FaaS

Zirui Neil Zhao (University of Illinois Urbana-Champaign); Adam Morrison (Tel Aviv University); Christopher W. Fletcher and Josep Torrellas (University of Illinois Urbana-Champaign)

Paper . Abstract . Lightning Talk

13:00 PDT – 14:30 PDT: Lunch

14:30 PDT – 15:30 PDT: Session 2

2A: Binary Analysis

(Location: Grande A/B)
Session Chair: Tapti Palit (Purdue University)
Plankton: Reconciling Binary Code and Debug Information

Anshunkang Zhou and Chengfeng Ye (The Hong Kong University of Science and Technology); Heqing Huang (City University of Hong Kong); Yuandao Cai and Charles Zhang (The Hong Kong University of Science and Technology)

Paper . Abstract . Lightning Talk
What You Trace is What You Get: Dynamic Stack-Layout Recovery for Binary Recompilation

Fabian Parzefall, Chinmay Deshpande, Felicitas Hetzelt, and Michael Franz (University of California Irvine)

Paper . Abstract . Lightning Talk
Accurate Disassembly of Complex Binaries Without Use of Compiler Metadata

Soumyakant Priyadarshan, Huan Nguyen, and R. Sekar (Stony Brook University)

Paper . Abstract . Lightning Talk
FITS: Inferring Intermediate Taint Sources for Effective Vulnerability Analysis of IoT Device Firmware

Puzhuo Liu (Chinese Academy of Sciences); Yaowen Zheng (Nanyang Technological University); Chengnian Sun (University of Waterloo); Chuan Qin, Dongliang Fang, Mingdong Liu, and Limin Sun (Chinese Academy of Sciences)

Paper . Abstract . Lightning Talk
2B: Side Channels

(Location: Grande C)
Session Chair: Sadullah Canakci (Advanced Micro Devices)
Avoiding Instruction-Centric Microarchitectural Timing Channels Via Binary-Code Transformations

Michael Flanders, Reshabh K Sharma, Alexandra E. Michael, Dan Grossman, and David Kohlbrenner (University of Washington)

Paper . Abstract . Lightning Talk
Last-Level Cache Side-Channel Attacks Are Feasible in the Modern Public Cloud

Zirui Neil Zhao (University of Illinois Urbana-Champaign); Adam Morrison (Tel Aviv University); Christopher W. Fletcher and Josep Torrellas (University of Illinois Urbana-Champaign)

Paper . Abstract . Lightning Talk
Pathfinder: High-Resolution Control-Flow Attacks Exploiting the Conditional Branch Predictor

Hosein Yavarzadeh and Archit Agarwal (University of California San Diego); Max Christman (University of North Carolina at Chapel Hill); Christina Garman (Purdue University); Daniel Genkin (Georgia Tech); Andrew Kwong (University of North Carolina at Chapel Hill); Daniel Moghimi (Google); Deian Stefan (University of California San Diego); Kazem Taram (Purdue University); Dean Tullsen (University of California San Diego)

Paper . Abstract . Lightning Talk
Pentimento: Data Remanence in Cloud FPGAs

Colin Drewes (Stanford University); Olivia Weng and Andres Meza (University of California San Diego); Alric Althoff (ARM); David Kohlbrenner (University of Washington); Ryan Kastner (University of California San Diego); Dustin Richmond (University of California Santa Cruz)

Paper . Abstract . Lightning Talk
2C: Memory Optimizations

(Location: Grande D/E)
Session Chair: Christian Pinto (IBM Research Europe)
Kimbap: A Node-Property Map System for Distributed Graph Analytics

Hochan Lee (University of Texas at Austin); Roshan Dathathri (Microsoft Research); Keshav Pingali (University of Texas at Austin)

Paper . Abstract . Lightning Talk
TrackFM: Far-out Compiler Support for a Far Memory World

Brian R. Tauro (Illinois Institute of Technology); Brian Suchy, Simone Campanoni, and Peter Dinda (Northwestern University); Kyle C. Hale (Illinois Institute of Technology)

Paper . Abstract . Lightning Talk
Scaling Up Memory Disaggregated Applications with SMART (Recorded Talk)

Feng Ren, Mingxing Zhang, and Kang Chen (Tsinghua University); Huaxia Xia (Meituan); Zuoning Chen (Chinese Academy of Engineering); Yongwei Wu (Tsinghua University)

Paper . Abstract . Lightning Talk
CC-NIC: a Cache-Coherent Interface to the NIC

Henry N. Schuh and Arvind Krishnamurthy (Google and University of Washington); David Culler (Google); Henry M. Levy (Google and University of Washington); Luigi Rizzo (Google); Samira Khan (Google and University of Virginia); Brent E. Stephens (Google and University of Utah)

Paper . Abstract . Lightning Talk
2D: ML Inference Systems

(Location: Scripps I/II)
Session Chair: Charith Mendis (University of Illinois at Urbana-Champaign)
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification

Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, and Zeyu Wang (Carnegie Mellon University); Zhengxin Zhang (Tsinghua University); Rae Ying Yee Wong (Stanford University); Alan Zhu and Lijie Yang (Carnegie Mellon University); Xiaoxiang Shi (Shanghai Jiao Tong University); Chunan Shi (Peking University); Zhuoming Chen and Daiyaan Arfeen (Carnegie Mellon University); Reyna Abhyankar (University of California San Diego); Zhihao Jia (Carnegie Mellon University)

Paper . Abstract . Lightning Talk
ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference

Hyungjun Oh, Kihong Kim, Jaemin Kim, Sungkyun Kim, and Junyeol Lee (Hanyang University); Du-seong Chang (KT Corporation); Jiwon Seo (Hanyang University)

Paper . Abstract . Lightning Talk
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling

Sohaib Ahmad and Hui Guan (University of Massachusetts Amherst); Brian D. Friedman and Thomas Williams (Nokia Bell Labs); Ramesh K. Sitaraman (University of Massachusetts Amherst); Thomas Woo (Nokia Bell Labs)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] SpotServe: Serving Generative Large Language Models on Preemptible Instances

Xupeng Miao (Carnegie Mellon University); Chunan Shi (Peking University); Jiangfei Duan (The Chinese University of Hong Kong); Xiaoli Xi (Carnegie Mellon University); Dahua Lin (Chinese University of Hong Kong and Sensetime Research); Bin Cui (Peking University); Zhihao Jia (Carnegie Mellon University)

Paper . Abstract . Lightning Talk

15:30 PDT – 16:00 PDT: Break

16:00 PDT – 17:00 PDT: Session 3

3A: Dynamic Analysis and Instrumentation

(Location: Grande A/B)
Session Chair: Sangeeta Chowdhary (AMD Research)
Flexible Non-intrusive Dynamic Instrumentation for WebAssembly

Ben L. Titzer, Elizabeth Gilbert, Bradley Wei Jie Teo, Yash Anand, Kazuyuki Takayama, and Heather Miller (Carnegie Mellon University)

Paper . Abstract . Lightning Talk
ShapleyIQ: Influence Quantification by Shapley Values for Performance Debugging of Microservices

Ye Li, Jian Tan, Bin Wu, Xiao He, and Feifei Li (Alibaba)

Paper . Abstract . Lightning Talk
Loupe: Driving the Development of OS Compatibility Layers

Hugo Lefeuvre (University of Manchester); Gaulthier Gain (University of Liege); Vlad-Andrei Bădoiu and Daniel Dinca (University Politehnica of Bucharest); Vlad-Radu Schiller (University of Manchester); Costin Raiciu (University Politehnica of Bucharest); Felipe Huici (Unikraft.io); Pierre Olivier (University of Manchester)

Paper . Abstract . Lightning Talk
Amanda: Unified Instrumentation Framework for Deep Neural Networks

Yue Guan, Yuxian Qiu, and Jingwen Leng (Shanghai Jiao Tong University and Shanghai Qi Zhi Institute); Fan Yang (Microsoft Research); Shuo Yu (Shanghai Jiao Tong University); Yunxin Liu (Tsinghua University); Yu Feng and Yuhao Zhu (University of Rochester); Lidong Zhou (Microsoft Research); Yun Liang (Peking University); Chen Zhang, Chao Li, and Minyi Guo (Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
3B: Security

(Location: Grande C)
Session Chair: Tal Garfinkel (UC San Diego)
[Best Paper] GIANTSAN: Efficient Memory Sanitization with Segment Folding

Hao Ling (The Hong Kong University of Science and Technology); Heqing Huang (City University of Hong Kong); Chengpeng Wang, Yuandao Cai, and Charles Zhang (The Hong Kong University of Science and Technology)

Paper . Abstract . Lightning Talk
Enforcing C/C++ Type and Scope at Runtime for Control-Flow and Data-Flow Integrity

Mohannad Ismail and Christopher Jelesnianski (Virginia Tech); Yeongjin Jang (Samsung Research America); Changwoo Min (Igalia); Wenjie Xiong (Virginia Tech)

Paper . Abstract . Lightning Talk
Lightweight Fault Isolation: Practical, Efficient, and Secure Software Sandboxing

Zachary Yedidia (Stanford University)

Paper . Abstract . Lightning Talk
FreePart: Hardening Data Processing Software via Framework-based Partitioning and Isolation

Ali Ahad (University of Virginia); Gang Wang (University of Illinois at Urbana-Champaign); Chung Hwan Kim (University of Texas at Dallas); Suman Jana (Columbia University); Zhiqiang Lin (Ohio State University); Yonghwi Kwon (University of Virginia)

Paper . Abstract . Lightning Talk
3C: ML Cluster Scheduling

(Location: Grande D/E)
Session Chair: Jingwen Leng (Shanghai Jiao Tong University)
DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads

Seah Kim (University of California Berkeley); Hyoukjun Kwon (University of California Irvine); Jinook Song, Jihyuck Jo, Yu-Hsin Chen, Liangzhen Lai, and Vikas Chandra (Meta)

Paper . Abstract . Lightning Talk
SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers

Daliang Xu (Peking University); Mengwei Xu (State Key Laboratory of Networking and Switching Technology); Chiheng Lou (Peking University); Li Zhang (State Key Laboratory of Networking and Switching Technology); Gang Huang, Xin Jin, and Xuanzhe Liu (Peking University)

Paper . Abstract . Lightning Talk
Training Job Placement in Clusters with Statistical In-Network Aggregation

Bohan Zhao and Wei Xu (Tsinghua University); Shuo Liu, Yang Tian, and Qiaoling Wang (Huawei); Wenfei Wu (Peking University)

Paper . Abstract . Lightning Talk
RAP: Resource-aware Automated GPU Sharing for Multi-GPU Recommendation Model Training and Input Preprocessing

Zheng Wang (University of California San Diego); Yuke Wang and Jiaqi Deng (University of California Santa Barbara); Da Zheng (Amazon); Ang Li (Pacific Northwest National Laboratory); Yufei Ding (University of California San Diego)

Paper . Abstract . Lightning Talk
3D: ML Quantization and Memory Optimizations

(Location: Scripps I/II)
Session Chair: Kiwan Maeng (Pennsylvania State University)
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN

Renze Chen (Peking University); Zijian Ding (University of California Los Angeles); Size Zheng and Chengrui Zhang (Peking University); Jingwen Leng (Shanghai Jiao Tong University); Xuanzhe Liu and Yun Liang (Peking University)

Paper . Abstract . Lightning Talk
8-bit Transformer Inference and Fine-tuning for Edge Accelerators

Jeffrey Yu, Kartik Prabhu, Yonatan Urman, Robert M. Radway, Eric Han, and Priyanka Raina (Stanford University)

Paper . Abstract . Lightning Talk
Cocco: Hardware-Mapping Co-Exploration towards Memory Capacity-Communication Optimization

Zhanhong Tan, Zijian Zhu, and Kaisheng Ma (Tsinghua University)

Paper . Abstract . Lightning Talk
Atalanta: A Bit is Worth a “Thousand” Tensor Values

Alberto Delmas Lascorz and Mostafa Mahmoud (University of Toronto); Ali Hadi Zadeh (University of Toronto and 1QBit); Milos Nikolic, Kareem Ibrahim, and Christina Giannoula (University of Toronto); Ameer Abdelhadi (McMaster University); Andreas Moshovos (University of Toronto and Vector Institute)

Paper . Abstract . Lightning Talk

17:00 PDT – 18:00 PDT: Break

18:00 PDT – 18:50 PDT: Debate (Location: Grande)

Topic: Should everyone work on machine learning/AI?

AI has made incredible strides of late, thanks to foundational advances in both ML algorithms and hardware. As AI continues to revolutionize more of our lives, should everyone in the ASPLOS community switch to working on topics in AI? Why should we look at other problems when we could help AI improve so it can solve them instead?

Team Yes: Amir Yazdanbakhsh (Google DeepMind), Vijay Janapa Reddi (Harvard), and Charith Mendis (UIUC)
Team No: Tamar Eilam (IBM), Moin Qureshi (Georgia Tech), and Josep Torrellas (UIUC)

18:50 PDT – 19:00 PDT: Break

19:00 PDT – 19:50 PDT: WACI (Location: Grande)

Title and Speaker(s)

Computing Foundations: Knitting Together a WACI Purl
Zachary Tatlock (University of Washington)

BoltBleed – Security from an Aerial Perspective
Guy Wilks (UC Santa Barbara)

Apparate?: Evading Memory Hierarchy with GodSpeed Wireless on Chip
Nitesh Narayana Gondlyala Sathya and Abhijit Das (Universitat Politècnica de Catalunya Barcelona)

Bureaucracy in Systems
Michael Roitzsch (Barkhausen Institut)

BRAINSTORM: Supercharging Innovation with AI-Driven Ideation
Deniz Altınbüken, Martin Maas, and Phitchaya Mangpo Phothilimthana (Google DeepMind)

19:50 PDT – 20:00 PDT: Break

20:00 PDT – 21:00 PDT: Business Meeting (Location: Grande)

  • ASPLOS’24 Budget and Stats (Nael Abu-Ghazaleh & Rajiv Gupta)
  • ASPLOS’24 Program Recap and Stats (Madan Musuvathi & Dan Tsafrir)
  • CARES (Shan Lu)
  • Revival of TOCS (Shan Lu)
  • ASPLOS’25 Plans
      • Plans for the program (PC Chairs: Martha Kim, Chris Rossbach, Adrian Sampson)
      • Plans for the venue (GC Chair: Lieven Eeckhout)
  • ASPLOS’26 Looking for bids (Shan Lu)
  • ISCA’26 Looking for bids (Natalie Enright Jerger & Daniel Jiménez)

Day 2: Tuesday, April 30

7:30 PDT – 8:30 PDT: Breakfast

8:30 PDT – 9:30 PDT: Keynote 2 by Emmett Witchel (University of Texas at Austin) (Location: Grande)

Challenges and Opportunities for Systems Using CXL Memory
Abstract
We are at the start of the technology cycle for Compute Express Link (CXL) memory, which is a significant opportunity and challenge for architecture, operating systems, and programming languages. The CXL 3.0 specification allows multiple, physically attached hosts to dynamically share memory. We call such a configuration a CXL pod. Pods provide an intermediate hardware configuration between a network of machines, each with their private memory, and a shared memory multiprocessor with a unified memory, accessible to all machines.

This talk will discuss system support for single-node applications to attain scalable performance and high availability across a CXL pod as well as pointing out likely technical challenges for future systems.  Along with the technical content, the talk categorizes computer systems research using the model of storytelling with a beginning, a middle and an end.  We also examine the fascination popular culture has with personal aha moments and weigh their importance for a group working on an impending submission deadline.
Bio
Emmett Witchel is Professor of Computer Science at the University of Texas at Austin, where he has been on the faculty since 2004, after receiving his PhD at MIT.  His thesis won honorable mention for the ACM doctoral dissertation award.  Witchel’s research interests include operating systems, security, architecture, and concurrency. His recent work has been on system support for CXL memory, serverless computing, persistent memory, and trusted execution environments.  He co-chaired ASPLOS in 2019. His publishing recognition includes best paper awards at both SOSP and OSDI, as well as IEEE Micro top picks and research highlights in Communications of the ACM (CACM).  He is a fellow of the ACM.

9:30 PDT – 10:00 PDT: Break

10:00 PDT – 11:15 PDT: Session 4

4A: Accelerators

(Location: Grande C)
Session Chair: Minsoo Rhu (KAIST)
Harp: Leveraging Quasi-Sequential Characteristics to Accelerate Sequence-to-Graph Mapping of Long Reads

Yichi Zhang, Dibei Chen, Gang Zeng, and Jianfeng Zhu (Tsinghua University); Zhaoshi Li (MetaX Integrated Circuits); Longlong Chen, Shaojun Wei, and Leibo Liu (Tsinghua University)

Paper . Abstract . Lightning Talk
GSCore: Efficient Radiance Field Rendering via Architectural Support for 3D Gaussian Splatting

Junseo Lee, Seokwon Lee, Jungi Lee, Junyong Park, and Jaewoong Sim (Seoul National University)

Paper . Abstract . Lightning Talk
BeeZip: Towards An Organized and Scalable Architecture for Data Compression

Ruihao Gao, Zhichun Li, Guangming Tan, and Xueqi Li (Chinese Academy of Sciences)

Paper . Abstract . Lightning Talk
ACES: Accelerating Sparse Matrix Multiplication with Adaptive Execution Flow and Concurrency-Aware Cache Optimizations

Xiaoyang Lu (Illinois Institute of Technology); Boyu Long, Xiaoming Chen, and Yinhe Han (Chinese Academy of Sciences); Xian-He Sun (Illinois Institute of Technology)

Paper . Abstract . Lightning Talk
Explainable-DSE: An Agile and Explainable Exploration of Efficient HW/SW Codesigns of Deep Learning Accelerators Using Bottleneck Analysis

Shail Dave (Arizona State University); Tony Nowatzki (University of California Los Angeles); Aviral Shrivastava (Arizona State University)

Paper . Abstract . Lightning Talk
4B: Serverless Computing 1

(Location: Grande D/E)
Session Chair: Chris Rossbach (University of Texas at Austin and Katana Graph)
λFS: A Scalable and Elastic Distributed File System Metadata Service using Serverless Functions

Benjamin Carver (George Mason University); Runzhou Han (Iowa State University); Jingyuan Zhang (George Mason University); Mai Zheng (Iowa State University); Yue Cheng (University of Virginia)

Paper . Abstract . Lightning Talk
FaaSMem: Improving Memory Efficiency of Serverless Computing with Memory Pool Architecture

Chuhao Xu, Yiyu Liu, Zijun Li, Quan Chen, and Han Zhao (Shanghai Jiao Tong University); Deze Zeng (China University of Geosciences); Qian Peng, Xueqi Wu, Haifeng Zhao, and Senbo Fu (Huawei); Minyi Guo (Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
CodeCrunch: Improving Serverless Performance via Function Compression and Cost-Aware Warmup Location Optimization

Rohan Basu Roy (Northeastern University); Tirthak Patel (Rice University); Rohan Garg (Nutanix); Devesh Tiwari (Northeastern University)

Paper . Abstract . Lightning Talk
RainbowCake: Mitigating Cold-starts in Serverless with Layer-wise Container Caching and Sharing

Hanfei Yu (Louisiana State University); Rohan Basu Roy (Northeastern University); Christian Fontenot (Louisiana State University); Devesh Tiwari (Northeastern University); Jian Li (Stony Brook University); Hong Zhang (University of Waterloo); Hao Wang (Louisiana State University); Seung-Jong Park (Missouri University of Science and Technology)

Paper . Abstract . Lightning Talk
Flame: A Centralized Cache Controller for Serverless Computing (Recorded Talk)

Yanan Yang, Laiping Zhao, Yiming Li, and Shihao Wu (Tianjin University); Yuechan Hao and Yuchi Ma (Huawei); Keqiu Li (Tianjin University)

Paper . Abstract . Lightning Talk
4C: Power and Energy

(Location: Scripps I)
Session Chair: Christina Delimitrou (MIT)
Characterizing Power Management Opportunities for LLMs in the Cloud

Pratyush Patel (Microsoft Azure and University of Washington); Esha Choukse, Chaojie Zhang, Íñigo Goiri, Brijesh Warrier, Nithish Mahalingam, and Ricardo Bianchini (Microsoft Azure)

Paper . Abstract . Lightning Talk
Going Green for Less Green: Optimizing the Cost of Reducing Cloud Carbon Emissions

Walid A. Hanafy and Qianlin Liang (University of Massachusetts Amherst); Noman Bashir (MIT); Abel Souza, David Irwin, and Prashant Shenoy (University of Massachusetts Amherst)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] FOCAL: A First-Order Carbon Model to Assess Processor Sustainability (Recorded Talk)

Lieven Eeckhout (Ghent University)

Paper . Abstract . Lightning Talk
Predict; Don’t React for Enabling Efficient Fine-Grain DVFS in GPUs

Srikant Bharadwaj (Microsoft); Shomit Das (Qualcomm); Kaushik Mazumdar and Bradford M. Beckmann (AMD); Stephen Kosonocky (Uhnder)

Paper . Abstract . Lightning Talk
SUIT: Secure Undervolting with Instruction Traps

Jonas Juffinger (Graz University of Technology); Stepan Kalinin (North Carolina State University); Daniel Gruss (Graz University of Technology); Frank Mueller (North Carolina State University)

Paper . Abstract . Lightning Talk
4D: Static Analysis and Verification

(Location: Scripps II)
Session Chair: Shan Lu (Microsoft Research)
Kaleidoscope: Precise Invariant-Guided Pointer Analysis

Tapti Palit and Pedro Fonseca (Purdue University)

Paper . Abstract . Lightning Talk
Lifting Micro-Update Models from RTL for Formal Security Analysis

Adwait Godbole and Kevin Cheang (University of California Berkeley); Yatin A. Manerkar (University of Michigan); Sanjit A. Seshia (University of California Berkeley)

Paper . Abstract . Lightning Talk
Formal Mechanised Semantics of CHERI C: Capabilities, Undefined Behaviour, and Provenance

Vadim Zaliva and Kayvan Memarian (University of Cambridge); Ricardo Almeida (University of Edinburgh); Jessica Clarke (University of Cambridge); Brooks Davis (SRI International); Alexander Richardson (University of Cambridge); David Chisnall (Microsoft); Brian Campbell and Ian Stark (University of Edinburgh); Robert N. M. Watson and Peter Sewell (University of Cambridge)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] Lightweight, Modular Verification for WebAssembly-to-Native Instruction Selection

Alexa VanHattum (Wellesley College); Monica Pardeshi (Carnegie Mellon University); Chris Fallin (Fastly); Adrian Sampson (Cornell University); Fraser Brown (Carnegie Mellon University)

Paper . Abstract . Lightning Talk
Verifying Rust Implementation of Page Tables in a Software Enclave Hypervisor

Zhenyang Dai (Tsinghua University and Ant Group); Shuang Liu (Ant Group); Vilhelm Sjoberg (CertiK); Xupeng Li (Columbia University); Yu Chen (Tsinghua University); Wenhao Wang (Chinese Academy of Sciences); Yuekai Jia (Tsinghua University and Ant Group); Sean Noble Anderson (Portland State University); Laila Elbeheiry (Max Planck Institute for Software Systems); Shubham Sondhi (CertiK); Yu Zhang (Yale University); Zhaozhong Ni (CertiK); Shoumeng Yan (Ant Group); Ronghui Gu (Columbia University); Zhengyu He (Ant Group)

Paper . Abstract . Lightning Talk

11:15 PDT – 11:45 PDT: Break

11:45 PDT – 13:00 PDT: Session 5

5A: Compiler and Optimization Techniques

(Location: Grande C)
Session Chair: Gilbert Bernstein (University of Washington)
C4CAM: A Compiler for CAM-based In-memory Accelerators

Hamid Farzaneh and Joao Paulo Cardoso De Lima (TU Dresden); Mengyuan Li (University of Notre Dame); Asif Ali Khan (TU Dresden); Xiaobo Sharon Hu (University of Notre Dame); Jeronimo Castrillon (TU Dresden)

Paper . Abstract . Lightning Talk
BaCO: A Fast and Portable Bayesian Compiler Optimization Framework

Erik Orm Hellsten (Lund University); Artur Souza (Federal University of Minas Gerais); Johannes Lenfers (University of Münster); Rubens Lacouture and Olivia Hsu (Stanford University); Adel Ejjeh (University of Illinois at Urbana-Champaign); Fredrik Kjolstad (Stanford University); Michel Steuwer (University of Edinburgh); Kunle Olukotun (Stanford University); Luigi Nardi (Lund University and Stanford University)

Paper . Abstract . Lightning Talk
Merlin: Multi-tier Optimization of eBPF Code for Performance and Compactness

Jinsong Mao (University of Massachusetts Amherst); Hailun Ding (Rutgers University); Juan Zhai and Shiqing Ma (University of Massachusetts Amherst)

Paper . Abstract . Lightning Talk
[Best Paper] Automatic Generation of Vectorizing Compilers for Customizable Digital Signal Processors

Samuel Thomas and James Bornholt (University of Texas at Austin)

Paper . Abstract . Lightning Talk
Fast Instruction Selection for Fast Digital Signal Processing

Alexander J Root (Stanford University); Maaz Bin Safeer Ahmad (Adobe); Dillon Sharlet (Independent Researcher); Andrew Adams and Shoaib Kamil (Adobe); Jonathan Ragan-Kelley (MIT CSAIL)

Paper . Abstract . Lightning Talk
5B: Emerging and Non-Traditional Technologies

(Location: Grande D/E)
Session Chair: Sara Achour (Stanford University)
An Encoding Scheme to Enlarge Practical DNA Storage Capacity by Reducing Primer-Payload Collisions

Yixun Wei (University of Minnesota Twin Cities); Bingzhe Li (University of Texas at Dallas); David H.C. Du (University of Minnesota Twin Cities)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] Design of Novel Analog Compute Paradigms with Ark

Yu-Neng Wang (Stanford University); Glenn Cowan (Concordia University); Ulrich Rührmair (TU Berlin and University of Connecticut); Sara Achour (Stanford University)

Paper . Abstract . Lightning Talk
LightRidge: An End-to-end Agile Design Framework for Diffractive Optical Neural Networks

Yingjie Li (University of Utah and University of Maryland); Ruiyang Chen, Minhan Lou, Berardi Sensale-Rodriguez, and Weilu Gao (University of Utah); Cunxi Yu (University of Utah and University of Maryland)

Paper . Abstract . Lightning Talk
EagleEye: Nanosatellite constellation design for high-coverage, high-resolution sensing

Zhuo Cheng, Bradley Denby, Kyle McCleary, and Brandon Lucia (Carnegie Mellon University)

Paper . Abstract . Lightning Talk
Energy Efficient Convolutions with Temporal Arithmetic

Rhys Gretsch and Peiyang Song (University of California Santa Barbara); Advait Madhavan (University of Maryland College Park); Jeremy Lau and Timothy Sherwood (University of California Santa Barbara)

Paper . Abstract . Lightning Talk
5C: Memory: Allocation and Management

(Location: Scripps I)
Session Chair: Martin Maas (Google)
Getting a Handle on Unmanaged Memory

Nick Wanninger, Tommy McMichen, Simone Campanoni, and Peter Dinda (Northwestern University)

Paper . Abstract . Lightning Talk
Cornucopia Reloaded: Load Barriers for CHERI Heap Temporal Safety

Nathaniel Wesley Filardo (University of Cambridge and Microsoft); Brett F. Gutstein, Jonathan Woodruff, Jessica Clarke, and Peter Rugg (University of Cambridge); Brooks Davis (SRI International); Mark Johnston (University of Cambridge); Robert Norton (Microsoft); David Chisnall (SCI Semiconductor); Simon W. Moore (University of Cambridge); Peter G. Neumann (SRI International); Robert N. M. Watson (University of Cambridge)

Paper . Abstract . Lightning Talk
Characterizing a Memory Allocator at Warehouse Scale

Zhuangzhuang Zhou (Cornell University); Vaibhav Gogte, Nilay Vaish, Chris Kennelly, Patrick Xia, Svilen Kanev, and Tipp Moseley (Google); Christina Delimitrou (MIT); Parthasarathy Ranganathan (Google)

Paper . Abstract . Lightning Talk
More Apps, Faster Hot-Launch on Mobile Devices via Fore/Background-aware GC-Swap Co-design

Jiacheng Huang (City University of Hong Kong and Wuhan University); Yunmo Zhang and Junqiao Qiu (City University of Hong Kong); Yu Liang (ETH Zürich); Rachata Ausavarungnirun (King Mongkut’s University of Technology North Bangkok); Qingan Li (Wuhan University); Chun Jason Xue (Mohamed bin Zayed University of Artificial Intelligence)

Paper . Abstract . Lightning Talk
MiniMalloc: A Lightweight Memory Allocator for Hardware-Accelerated Machine Learning

Michael D. Moffitt (Google)

Paper . Abstract . Lightning Talk
5D: Quantum Architecture

(Location: Scripps II)
Session Chair: Gokul Ravi (University of Michigan, Ann Arbor)
Codesign of quantum error-correcting codes and modular chiplets in the presence of defects

Sophia Fuhui Lin and Joshua Viszlai (University of Chicago); Kaitlin N. Smith (Infleqtion); Gokul Subramanian Ravi (University of Chicago); Charles Yuan (MIT CSAIL); Frederic T. Chong (University of Chicago); Benjamin J. Brown (IBM T. J. Watson Research Center)

Paper . Abstract . Lightning Talk
MECH: Multi-Entry Communication Highway for Superconducting Quantum Chiplets

Hezi Zhang and Keyi Yin (University of California San Diego); Anbang Wu (University of California Santa Barbara); Hassan Shapourian and Alireza Shabani (Cisco Quantum Lab); Yufei Ding (University of California San Diego)

Paper . Abstract . Lightning Talk
QuFEM: Fast and Accurate Quantum Readout Calibration Using the Finite Element Method

Siwei Tan, Liqiang Lu, Hanyu Zhang, Jia Yu, Congliang Lang, Yongheng Shang, Xinkui Zhao, and Mingshuai Chen (Zhejiang University); Yun Liang (Peking University); Jianwei Yin (Zhejiang University)

Paper . Abstract . Lightning Talk
A Fault-Tolerant Million Qubit-Scale Distributed Quantum Computer

Junpyo Kim, Dongmoon Min, Jungmin Cho, Hyeonseong Jeong, Ilkwon Byun, Junhyuk Choi, Juwon Hong, and Jangwoo Kim (Seoul National University)

Paper . Abstract . Lightning Talk
Promatch: Extending the Reach of Real-Time Quantum Error Correction with Adaptive Predecoding

Narges Alavisamani, Suhas Vittal, and Ramin Ayanzadeh (Georgia Tech); Poulami Das (University of Texas at Austin); Moinuddin Qureshi (Georgia Tech)

Paper . Abstract . Lightning Talk

13:00 PDT – 14:30 PDT: Lunch

14:30 PDT – 15:30 PDT: Session 6

6A: Bug Finding and Testing

(Location: Grande C)
Session Chair: Sangeeta Chowdhary (AMD Research)
Greybox Fuzzing for Concurrency Testing

Dylan Wolff, Shi Zheng, Gregory J. Duck, Umang Mathur, and Abhik Roychoudhury (National University of Singapore)

Paper . Abstract . Lightning Talk
Multi-Dimensional and Message-Guided Fuzzing for Robotic Programs in Robot Operating System (Speaker Zhenyang Dai)

Jia-Ju Bai (Beihang University); Hao-Xuan Song and Shi-Min Hu (Tsinghua University)

Paper . Abstract . Lightning Talk
[Best Paper] CSSTs: A Dynamic Data Structure for Partial Orders in Concurrent Execution Analysis


Hünkar Can Tunç (Aarhus University); Ameya Prashant Deshmukh (Indian Institute of Technology Bombay); Berk Cirisci (Amazon Web Services); Constantin Enea (Ecole Polytechnique); Andreas Pavlogiannis (Aarhus University)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] UBFuzz: Finding Bugs in Sanitizer Implementations

Shaohua Li and Zhendong Su (ETH Zurich)

Paper . Abstract . Lightning Talk
6B: Processing-In-Memory (PIM) for ML

(Location: Grande D/E)
Session Chair: Christina Giannoula (University of Toronto)
AttAcc! Unleashing the Power of PIM for Batched Transformer-based Generative Model Inference

Jaehyun Park, Jaewan Choi, Kwanhee Kyung, Michael Jaemin Kim, and Yongsuk Kwon (Seoul National University); Nam Sung Kim (University of Illinois Urbana-Champaign); Jung Ho Ahn (Seoul National University)

Paper . Abstract . Lightning Talk
SpecPIM: Accelerating Speculative Inference on PIM-Enabled System via Architecture-Dataflow Co-Exploration

Cong Li, Zhe Zhou, Size Zheng, Jiaxi Zhang, Yun Liang, and Guangyu Sun (Peking University)

Paper . Abstract . Lightning Talk
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization

Cong Li and Zhe Zhou (Peking University); Yang Wang (Microsoft Research Asia); Fan Yang (Nankai University); Ting Cao and Mao Yang (Microsoft Research); Yun Liang and Guangyu Sun (Peking University)

Paper . Abstract . Lightning Talk
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing

Guseul Heo, Sangyeop Lee, Jaehong Cho, Hyunmin Choi, and Sanghyeon Lee (KAIST); Hyungkyu Ham and Gwangsun Kim (POSTECH); Divya Mahajan (Georgia Tech); Jongse Park (KAIST)

Paper . Abstract . Lightning Talk
6C: Optimization of Tensor Programs

(Location: Scripps I)
Session Chair: Mangpo Phothilimthana (Google DeepMind)
Felix: Optimizing Tensor Programs with Gradient Descent

Yifan Zhao, Hashim Sharif, Vikram Adve, and Sasa Misailovic (University of Illinois Urbana-Champaign)

Paper . Abstract . Lightning Talk
Optimizing Deep Learning Inference via Global Analysis and Tensor Expressions

Chunwei Xia, Jiacheng Zhao, and Qianqi Sun (Chinese Academy of Sciences); Zheng Wang (University of Leeds); Yuan Wen (University of Aberdeen); Teng Yu (Thewake Systems); Xiaobing Feng and Huimin Cui (Chinese Academy of Sciences)

Paper . Abstract . Lightning Talk
PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transformation and Graph Compilation

Jason Ansel, Edward Yang, and Horace He (Meta); Natalia Gimelshein (OpenAI); Animesh Jain, Michael Voznesensky, Bin Bao, David Berard, Geeta Chauhan, Anjali Chourdia, Will Constable, Alban Desmaison, Zachary DeVito, Elias Ellison, and Will Feng (Meta); Jiong Gong (Intel); Michael Gschwind, Brian Hirsh, Sherlock Huang, Laurent Kirsch, Michael Lazos, Yanbo Liang, Jason Liang, Yinghai Lu, CK Luk, and Bert Maher (Meta); Yunjie Pan (University of Michigan); Christian Puhrsch, Matthias Reso, Mark Saroufim, Helen Suk, and Michael Suo (Meta); Phil Tillet (OpenAI); Eikan Wang (Intel); Xiaodong Wang, William Wen, Shunting Zhang, and Xu Zhao (Meta); Keren Zhou (OpenAI); Richard Zou, Ajit Mathews, Gregory Chanan, Peng Wu, and Soumith Chintala (Meta)

Paper . Abstract . Lightning Talk
Optimal Kernel Orchestration for Tensor Programs with Korch

Muyan Hu (University of Illinois at Urbana-Champaign); Ashwin Venkatram (AMD); Shreyashri Biswas (Carnegie Mellon University); Balamurugan Marimuthu (Sambanova Systems); Bohan Hou and Gabriele Oliaro (Carnegie Mellon University); Haojie Wang and Liyan Zheng (Tsinghua University); Xupeng Miao (Carnegie Mellon University); Jidong Zhai (Tsinghua University); Zhihao Jia (Carnegie Mellon University)

Paper . Abstract . Lightning Talk
6D: Variational Quantum Computing

(Location: Scripps II)
Session Chair: Gushu Li (University of Pennsylvania)
Elivagar: Efficient Quantum Circuit Search for Classification

Sashwat Anagolum (Pennsylvania State University); Narges Alavisamani (Georgia Tech); Poulami Das (The University of Texas at Austin); Moinuddin Qureshi (Georgia Tech); Yunong Shi (Amazon Quantum Technologies)

Paper . Abstract . Lightning Talk
Red-QAOA: Efficient Variational Optimization through Circuit Reduction

Meng Wang (University of British Columbia and Pacific Northwest National Laboratory); Bo Fang and Ang Li (Pacific Northwest National Laboratory); Prashant J. Nair (University of British Columbia)

Paper . Abstract . Lightning Talk
VarSaw: Application-tailored Measurement Error Mitigation for Variational Quantum Algorithms

Siddharth Dangwal (University of Chicago); Gokul Subramanian Ravi (University of Michigan); Poulami Das (University of Texas at Austin); Kaitlin N. Smith (Infleqtion); Jonathan Mark Baker (University of Texas at Austin); Frederic T. Chong (University of Chicago)

Paper . Abstract . Lightning Talk
ProxiML: Building Machine Learning Classifiers for Photonic Quantum Computing

Aditya Ranjan (Northeastern University); Tirthak Patel (Rice University); Daniel Silver, Harshitta Gandhi, and Devesh Tiwari (Northeastern University)

Paper . Abstract . Lightning Talk

15:30 PDT – 16:00 PDT: Break

16:00 PDT – 17:00 PDT: Session 7

7A: Architecture Support for ML

(Location: Grande C)
Session Chair: Hyoukjun Kwon (University of California, Irvine)
FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning

Kai Zhong and Zhenhua Zhu (Tsinghua University); Guohao Dai (Shanghai Jiao Tong University and Infinigence-AI); Hongyi Wang, Xinhao Yang, and Haoyu Zhang (Tsinghua University); Jin Si (Beijing University of Posts and Telecommunications); Qiuli Mao (Tsinghua University); Shulin Zeng (Tsinghua University and Infinigence-AI); Ke Hong (Tsinghua University); Genghan Zhang (Stanford University); Huazhong Yang and Yu Wang (Tsinghua University)

Paper . Abstract . Lightning Talk
CMC: Video Transformer Acceleration via CODEC Assisted Matrix Condensing (Speaker Qingyuan Liu)

Zhuoran Song, Chunyu Qi, Fangxin Liu, Naifeng Jing, and Xiaoyao Liang (Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
Tandem Processor: Grappling with Emerging Operators in Neural Networks

Soroush Ghodrati, Sean Kinzer, Hanyang Xu, and Rohan Mahapatra (University of California San Diego); Yoonsung Kim (KAIST); Byung Hoon Ahn (University of California San Diego); Dong Kai Wang (University of Illinois Urbana-Champaign); Lavanya Karthikeyan (University of California San Diego); Amir Yazdanbakhsh (Google DeepMind); Jongse Park (KAIST); Nam Sung Kim (University of Illinois Urbana-Champaign); Hadi Esmaeilzadeh (University of California San Diego)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] Carat: Unlocking Value-Level Parallelism for Multiplier-Free GEMMs

Zhewen Pan and Joshua San Miguel (University of Wisconsin-Madison); Di Wu (University of Central Florida)

Paper . Abstract . Lightning Talk
7B: Program and Configuration Synthesis

(Location: Grande D/E)
Session Chair: Tamara Lehman (University of Colorado, Boulder)
NetRen: Service Migration-Driven Network Renascence with Synthesizing Updated Configuration (Recorded Talk)

Rongxin Han, Jingyu Wang, Qi Qi, Haifeng Sun, Chaowei Xu, Zhaoyang Wan, and Zirui Zhuang (Beijing University of Posts and Telecommunications); Yichuan Yu (Huawei); Jianxin Liao (Beijing University of Posts and Telecommunications)

Paper . Abstract . Lightning Talk
Hydride: A Retargetable and Extensible Synthesis-based Compiler for Modern Hardware Architectures

Akash Kothari, Abdul Rafae Noor, Muchen Xu, Hassam Uddin, Dhruv Baronia, Stefanos Baziotis, Vikram Adve, and Charith Mendis (University of Illinois at Urbana-Champaign); Sudipta Sengupta (Amazon Web Services)

Paper . Abstract . Lightning Talk
RTL-Repair: Fast Symbolic Repair of Hardware Design Code

Kevin Laeufer, Brandon Fajardo, Abhik Ahuja, Vighnesh Iyer, Borivoje Nikolić, and Koushik Sen (University of California Berkeley)

Paper . Abstract . Lightning Talk
SIRO: Empowering Version Compatibility in Intermediate Representations via Program Synthesis

Bowen Zhang and Wei Chen (The Hong Kong University of Science and Technology); Peisen Yao (Zhejiang University); Chengpeng Wang, Wensheng Tang, and Charles Zhang (The Hong Kong University of Science and Technology)

Paper . Abstract . Lightning Talk
7C: Storage Optimizations in Software

(Location: Scripps I)
Session Chair: Joseph Devietti (University of Pennsylvania)
MemSnap μCheckpoints: A Data Single Level Store for Fearless Persistence

Emil Tsalapatis, Ryan Hancock, Rakeeb Hossain, and Ali José Mashtizadeh (University of Waterloo)

Paper . Abstract . Lightning Talk
Grafu: Unleashing the Full Potential of Future Value Computation for Out-of-core Synchronous Graph Processing

Tsun-Yu Yang (The Chinese University of Hong Kong); Cale England (Oklahoma State University); Yi Li and Bingzhe Li (University of Texas at Dallas); Ming-Chang Yang (The Chinese University of Hong Kong)

Paper . Abstract . Lightning Talk
CrossPrefetch: Accelerating I/O Prefetching for Modern Storage

Shaleen Garg and Jian Zhang (Rutgers University); Rekha Pitchumani (Samsung); Manish Parashar (University of Utah); Bing Xie (Microsoft); Sudarsun Kannan (Rutgers University)

Paper . Abstract . Lightning Talk
Palantir: Hierarchical Similarity Detection for Post-Deduplication Delta Compression

Hongming Huang (City University of Hong Kong and Huawei); Peng Wang (Huawei); Qiang Su (City University of Hong Kong); Hong Xu (The Chinese University of Hong Kong); Chun Jason Xue (City University of Hong Kong and Mohamed bin Zayed University of Artificial Intelligence); André Brinkmann (Johannes Gutenberg University Mainz)

Paper . Abstract . Lightning Talk
7D: Graph Neural Networks

(Location: Scripps II)
Session Chair: Xulong Tang (University of Pittsburgh)
TGLite: A Lightweight Programming Framework for Continuous-Time Temporal Graph Neural Networks

Yufeng Wang and Charith Mendis (University of Illinois at Urbana-Champaign)

Paper . Abstract . Lightning Talk
MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training

Hongwu Peng and Xi Xie (University of Connecticut); Kaustubh Shivdikar (Northeastern University); Md Amit Hasan, Jiahui Zhao, Shaoyi Huang, and Omer Khan (University of Connecticut); David Kaeli (Northeastern University); Caiwen Ding (University of Connecticut)

Paper . Abstract . Lightning Talk
Hector: An Efficient Programming and Compilation Framework for Implementing Relational Graph Neural Networks in GPU Architectures

Kun Wu (University of Illinois at Urbana-Champaign); Mert Hidayetoğlu (Stanford University); Xiang Song (AWS AI); Sitao Huang (University of California Irvine); Da Zheng and Israt Nisa (AWS AI); Wen-Mei Hwu (Nvidia and University of Illinois at Urbana-Champaign)

Paper . Abstract . Lightning Talk
Sleuth: A Trace-Based Root Cause Analysis System for Large-Scale Microservices with Graph Neural Networks

Yu Gan, Guiyang Liu, Xin Zhang, Qi Zhou, Jiesheng Wu, and Jiangwei Jiang (Alibaba)

Paper . Abstract . Lightning Talk

17:00 PDT – 22:00 PDT: Excursion and Banquet at USS Midway Museum

The buses to the USS Midway Museum start departing from the entrance of the hotel at 5:15 pm and return from the banquet at 9:15 pm. The temperature will be 55°F; please dress accordingly.

Day 3: Wednesday, May 1

7:30 PDT – 8:30 PDT: Breakfast

8:30 PDT – 9:30 PDT: Keynote 3 by Tamar Eilam (IBM T. J. Watson Research Center) (Location: Grande)

Harnessing the Power of Specialization for Sustainable Computing
Abstract 
Computing is critical to address some of the most pressing needs of humanity today, including climate change mitigation and adaptation. However, it is also the source of a significant and steadily increasing carbon toll, attributed in part to the exponential growth in energy-demanding workloads, such as artificial intelligence (AI). Due to the demise of Dennard scaling, we can no longer count on exponentially improving energy efficiency of general-purpose processors. Therefore, today’s operational efficiency gains rely on specialized hardware.
In this talk I will discuss the promise and the perils of harnessing specialization on the road to sustainable computing.
Bio
Dr. Tamar Eilam is an IBM Fellow and Chief Scientist for Sustainable Computing at the IBM T. J. Watson Research Center, New York. Tamar is leading research aimed at drastically reducing the carbon footprint associated with computing across infrastructure, systems, software, data, and AI. Tamar completed a Ph.D. degree in Computer Science at the Technion, Israel, in 2000. She joined the IBM T. J. Watson Research Center in New York as a Research Staff Member that same year. She was recognized as an IBM Fellow in 2014.

9:30 PDT – 10:00 PDT: Break

10:00 PDT – 11:15 PDT: Session 8

8A: Caching and Prefetching

(Location: Grande C)
Session Chair: Akanksha Jain (Google)
[Best Paper] PDIP: Priority Directed Instruction Prefetching

Bhargav Reddy Godala (Princeton University); Sankara Prasad Ramesh (University of California San Diego); Gilles A. Pokam, Jared Stark, and Andre Seznec (Intel); Dean Tullsen (University of California San Diego); David I. August (Princeton University)

Paper . Abstract . Lightning Talk
Limoncello: Prefetchers for Scale

Akanksha Jain (Google); Hannah Lin (Google and University of Washington); Carlos Villavieja (Google); Baris Kasikci (Google and University of Washington); Chris Kennelly, Milad Hashemi, and Parthasarathy Ranganathan (Google)

Paper . Abstract . Lightning Talk
PATHFINDER: Practical Real-Time Learning for Data Prefetching

Lin Jia, James Patrick Mcmahon, Sumanth Gudaparthi, Shreyas Singh, and Rajeev Balasubramonian (University of Utah)

Paper . Abstract . Lightning Talk
Skip It: Take Control of Your Cache!

Shashank Anand and Michal Friedman (ETH Zurich); Michael Giardino (Huawei); Gustavo Alonso (ETH Zurich)

Paper . Abstract . Lightning Talk
RPG^2: Robust Profile-Guided Runtime Prefetch Generation

Yuxuan Zhang, Nathan Sobotka, and Soyoon Park (University of Pennsylvania); Saba Jamilan (University of California Santa Cruz); Tanvir Ahmed Khan (Columbia University); Baris Kasikci (University of Washington and Google); Gilles A Pokam (Intel); Heiner Litz (University of California Santa Cruz); Joseph Devietti (University of Pennsylvania)

Paper . Abstract . Lightning Talk
8B: Memory: Address Translation and Tiering

(Location: Grande D/E)
Session Chair: Jayneel Gandhi (Meta)
METAL: Caching Multi-level Indexes in Domain-Specific Architectures

Anagha Molakalmur Anil Kumar and Aditya Prasanna (Simon Fraser University); Jonathan Balkind (University of California Santa Barbara); Arrvindh Shriraman (Simon Fraser University)

Paper . Abstract . Lightning Talk
GMT: GPU Orchestrated Memory Tiering for the Big Data Era

Chia-Hao Chang, Jihoon Han, and Anand Sivasubramaniam (Pennsylvania State University); Vikram Sharma Mailthody, Zaid Qureshi, and Wen-Mei Hwu (NVIDIA Research)

Paper . Abstract . Lightning Talk
GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching

Cong Guo (Shanghai Jiao Tong University and Shanghai Qi Zhi Institute); Rui Zhang (Ant Group); Jiale Xu, Jingwen Leng, Zihan Liu, Ziyu Huang, and Minyi Guo (Shanghai Jiao Tong University and Shanghai Qi Zhi Institute); Hao Wu, Shouren Zhao, Junping Zhao, and Ke Zhang (Ant Group)

Paper . Abstract . Lightning Talk
Direct Memory Translation for Virtualized Clouds

Jiyuan Zhang (University of Illinois Urbana-Champaign); Weiwei Jia (University of Rhode Island); Siyuan Chai, Peizhe Liu, Jongyul Kim, and Tianyin Xu (University of Illinois Urbana-Champaign)

Paper . Abstract . Lightning Talk
WASP: Workload-Aware Self-Replicating Page-Tables for NUMA Servers

Hongliang Qu and Zhibin Yu (Chinese Academy of Sciences)

Paper . Abstract . Lightning Talk
8C: High Performance Systems

(Location: Scripps I/II)
Session Chair: Dongyoon Lee (Stony Brook University)
Supporting Descendants in SIMD-Accelerated JSONPath

Mateusz Gienieczko and Filip Murlak (University of Warsaw); Charles Paperman (CRIStAL, Université de Lille, INRIA)

Paper . Abstract . Lightning Talk
Boost Linear Algebra Computation Performance via Efficient VNNI Utilization (Recorded Talk)

Hao Zhou and Qiukun Han (Enflame Tech); Heng Shi (Enflame Tech and Shanghai Jiao Tong University); Yalin Zhang (Enflame Tech Inc.); Jianguo Yao (Enflame Tech and Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
A shared compilation stack for distributed-memory parallelism in stencil DSLs

George Bisbas (Imperial College London); Anton Lydike, Emilien Bauer, Nick Brown, and Mathieu Fehr (University of Edinburgh); Lawrence Mitchell (Unaffiliated); Gabriel Rodriguez-Canal and Maurice Jamieson (University of Edinburgh); Paul H J Kelly (Imperial College London); Michel Steuwer (Technische Universität Berlin); Tobias Grosser (University of Cambridge)

Paper . Abstract . Lightning Talk
SlimSLAM: An Adaptive Runtime for Visual-Inertial Simultaneous Localization and Mapping

Armand Behroozi and Yuxiang Chen (University of Michigan); Vlad Fruchter, Lavanya Subramanian, and Sriseshan Srikanth (Meta); Scott Mahlke (University of Michigan)

Paper . Abstract . Lightning Talk
Compiling Loop-Based Nested Parallelism for Irregular Workloads

Yian Su (Northwestern University); Mike Rainey (Carnegie Mellon University); Nick Wanninger, Nadharm Dhiantravan, and Jasper Liang (Northwestern University); Umut A. Acar (Carnegie Mellon University); Peter Dinda and Simone Campanoni (Northwestern University)

Paper . Abstract . Lightning Talk
8D: IoT and Embedded

(Location: Fairway I/IV)
Session Chair: Don Porter (University of North Carolina at Chapel Hill)
TinyForge: A Design Space Exploration to Advance Energy and Silicon Area Trade-offs in tinyML Compute Architectures with Custom Latch Arrays

Massimo Giordano, Rohan Doshi, and Qianyun Lu (Stanford University); Boris Murmann (University of Hawaii)

Paper . Abstract . Lightning Talk
MulBERRY: Enabling Bit-Error Robustness for Energy-Efficient Multi-Agent Autonomous Systems

Zishen Wan (Georgia Tech); Nandhini Chandramoorthy, Karthik Swaminathan, and Pin-Yu Chen (IBM Research); Kshitij Bhardwaj (Lawrence Livermore National Lab); Vijay Janapa Reddi (Harvard University); Arijit Raychowdhury (Georgia Tech)

Paper . Abstract . Lightning Talk
Exploiting Human Color Discrimination for Memory- and Energy-Efficient Image Encoding in Virtual Reality

Nisarg Ujjainkar and Ethan Shahan (University of Rochester); Kenneth Chen, Budmonde Duinkharjav, and Qi Sun (New York University); Yuhao Zhu (University of Rochester)

Paper . Abstract . Lightning Talk
MicroVSA: An Ultra-Lightweight Vector Symbolic Architecture-based Classifier Library for Always-On Inference on Tiny Microcontrollers

Nuntipat Narkthong and Shijin Duan (Northeastern University); Shaolei Ren (University of California Riverside); Xiaolin Xu (Northeastern University)

Paper . Abstract . Lightning Talk
Energy-Adaptive Buffering for Efficient, Responsive, and Persistent Batteryless Systems

Harrison Williams and Matthew Hicks (Virginia Tech)

Paper . Abstract . Lightning Talk

11:15 PDT – 11:45 PDT: Break

11:45 PDT – 13:00 PDT: Session 9

9A: Accelerated Applications

(Location: Grande C)
Session Chair: Arrvindh Shriraman (Simon Fraser University)
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping

Zihan Liu (Shanghai Jiao Tong University and Shanghai Qi Zhi Institute); Wentao Ni (Shanghai Jiao Tong University); Jingwen Leng (Shanghai Jiao Tong University and Shanghai Qi Zhi Institute); Yu Feng (University of Rochester); Cong Guo (Shanghai Jiao Tong University and Shanghai Qi Zhi Institute); Quan Chen (Shanghai Jiao Tong University); Chao Li and Minyi Guo (Shanghai Jiao Tong University and Shanghai Qi Zhi Institute); Yuhao Zhu (University of Rochester)

Paper . Abstract . Lightning Talk
[Best Paper] ngAP: Non-blocking Large-scale Automata Processing on GPUs

Tianao Ge (Hong Kong University of Science and Technology Guangzhou); Tong Zhang (Samsung); Hongyuan Liu (Hong Kong University of Science and Technology Guangzhou)

Paper . Abstract . Lightning Talk
Marple: Scalable Spike Sorting for Untethered Brain-Machine Interfacing

Eugene Sha, Andy Liu, Kareem Ibrahim, Mostafa Mahmoud, and Christina Giannoula (University of Toronto); Ameer Abdelhadi (McMaster University); Andreas Moshovos (University of Toronto and Vector Institute)

Paper . Abstract . Lightning Talk
Manticore: Hardware-Accelerated RTL Simulation with Static Bulk-Synchronous Parallelism

Mahyar Emami and Sahand Kashani (EPFL); Keisuke Kamahori (University of Tokyo); Mohammad Sepehr Pourghannad (Sharif University); Ritik Raj (Indian Institute of Technology Roorkee); James R. Larus (EPFL)

Paper . Abstract . Lightning Talk
ORIANNA: An Accelerator Generation Framework for Optimization-based Robotic Applications

Yuhui Hao (Tianjin University); Yiming Gan (Chinese Academy of Sciences); Bo Yu (Shenzhen Institute of Artificial Intelligence and Robotics for Society); Qiang Liu (Tianjin University); Yinhe Han (Chinese Academy of Sciences); Zishen Wan (Georgia Tech); Shaoshan Liu (Shenzhen Institute of Artificial Intelligence and Robotics for Society)

Paper . Abstract . Lightning Talk
9B: SSDs

(Location: Grande D/E)
Session Chair: Anand Sivasubramaniam (Penn State)
AERO: Adaptive Erase Operation for Improving Lifetime and Performance of Modern NAND Flash-Based SSDs

Sungjun Cho (POSTECH); Beomjun Kim (Kyungpook National University); Hyunuk Cho (POSTECH); Gyeongseob Seo (Kyungpook National University); Onur Mutlu (ETH Zürich); Myungsuk Kim (Kyungpook National University); Jisung Park (POSTECH)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] BypassD: Enabling fast userspace access to shared SSDs

Sujay Yadalam (University of Wisconsin-Madison); Chloe Alverti (National Technical University of Athens); Vasileios Karakostas (University of Athens); Jayneel Gandhi (Meta); Michael Swift (University of Wisconsin-Madison)

Paper . Abstract . Lightning Talk
Eliminating Storage Management Overhead of Deduplication over SSD Arrays Through a Hardware/Software Co-Design

Yuhong Wen, Xiaogang Zhao, and You Zhou (Huazhong University of Science and Technology); Tong Zhang (Rensselaer Polytechnic Institute and ScaleFlux); Shangjun Yang, Changsheng Xie, and Fei Wu (Huazhong University of Science and Technology)

Paper . Abstract . Lightning Talk
LazyBarrier: Reconstructing Android IO Stack for Barrier-Enabled Flash Storage (Recorded Talk)

Yuanyi Zhang, Heng Zhang, Wenbin Cao, Xing He, Daejun Park, Jinyoung Choi, and SungJun Park (Samsung Electronics)

Paper . Abstract . Lightning Talk
Achieving Near-Zero Read Retry for 3D NAND Flash Memory

Min Ye (City University of Hong Kong); Qiao Li (Xiamen University); Yina Lv (City University of Hong Kong); Jie Zhang (Peking University); Tianyu Ren (City University of Hong Kong); Daniel Wen (YEESTOR Microelectronics); Tei-Wei Kuo (National Taiwan University); Chun Jason Xue (City University of Hong Kong and Mohamed bin Zayed University of Artificial Intelligence)

Paper . Abstract . Lightning Talk
9C: ML Systems and Optimizations

(Location: Scripps I/II)
Session Chair: Sangeeta Chowdhary (AMD Research)
SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile

Wei Niu, Md Musfiqur Rahman Sanim, and Zhihao Shu (University of Georgia); Jiexiong Guan (William & Mary); Xipeng Shen (North Carolina State University); Miao Yin (University of Texas at Arlington); Gagan Agrawal (University of Georgia); Bin Ren (William & Mary)

Paper . Abstract . Lightning Talk
Dr. DNA: Combating Silent Data Corruptions in Deep Learning using Distribution of Neuron Activations

Dongning Ma (Villanova University); Fred Lin, Alban Desmaison, Joel Coburn, Daniel Moore, and Sriram Sankar (Meta); Xun Jiao (Villanova University and Meta)

Paper . Abstract . Lightning Talk
GPU-based Private Information Retrieval for On-Device Machine Learning Inference

Maximilian Lam (Harvard University); Jeff Johnson (Meta); Wenjie Xiong (Virginia Tech); Kiwan Maeng (Pennsylvania State University); Udit Gupta (Harvard University); Yang Li, Liangzhen Lai, and Ilias Leontiadis (Meta); Minsoo Rhu (KAIST and Meta); Hsien-Hsin S. Lee (Intel); Vijay Janapa Reddi, Gu-Yeon Wei, and David Brooks (Harvard University); Edward Suh (Meta and Cornell University)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] RECom: A Compiler Approach to Accelerate Recommendation Model Inference with Massive Embedding Columns

Zaifeng Pan (Renmin University of China); Zhen Zheng (Alibaba); Feng Zhang and Ruofan Wu (Renmin University of China); Hao Liang (Alibaba); Dalin Wang (Renmin University of China); Xiafei Qiu, Junjie Bai, and Wei Lin (Alibaba); Xiaoyong Du (Renmin University of China)

Paper . Abstract . Lightning Talk
NDPipe: Exploiting Near-data Processing for Scalable Inference and Continuous Training in Photo Storage

Jungwoo Kim and Seonggyun Oh (DGIST); Jaeha Kung (Korea University); Yeseong Kim and Sungjin Lee (DGIST)

Paper . Abstract . Lightning Talk
9D: Quantum Software

(Location: Fairway I/IV)
Session Chair: Yufei Ding (University of California San Diego)
Exploiting the Regular Structure of Modern Quantum Architectures for Compiling and Optimizing Programs with Permutable Operators

Yuwei Jin, Fei Hua, Yanhao Chen, and Ari Hayes (Rutgers University); Chi Zhang (University of Pittsburgh); Eddy Z. Zhang (Rutgers University)

Paper . Abstract . Lightning Talk
One Gate Scheme to Rule Them All: Introducing a Complex Yet Reduced Instruction Set for Quantum Computing

Jianxin Chen and Dawei Ding (DAMO Academy); Weiyuan Gong (Harvard University); Cupjin Huang (DAMO Academy); Qi Ye (DAMO Academy and Tsinghua University)

Paper . Abstract . Lightning Talk
MorphQPV: Exploiting Isomorphism in Quantum Programs to Facilitate Confident Verification

Siwei Tan, Debin Xiang, and Liqiang Lu (Zhejiang University); Junlin Lu (Peking University); Qiuping Jiang (Ningbo University); Mingshuai Chen and Jianwei Yin (Zhejiang University)

Paper . Abstract . Lightning Talk
Fermihedral: On the Optimal Compilation for Fermion-to-Qubit Encoding

Yuhao Liu, Shize Che, and Junyu Zhou (University of Pennsylvania); Yunong Shi (AWS Quantum Technologies); Gushu Li (University of Pennsylvania)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] OnePerc: A Randomness-aware Compiler for Photonic Quantum Computing

Hezi Zhang and Jixuan Ruan (University of California San Diego); Hassan Shapourian and Ramana Rao Kompella (Cisco Systems); Yufei Ding (University of California San Diego)

Paper . Abstract . Lightning Talk

13:00 PDT – 14:30 PDT: Lunch

14:30 PDT – 15:30 PDT: Keynote 4 by Nafea Bshara (Amazon) (Location: Grande)

AWS Trainium: The Journey for Designing and Optimization Full Stack ML Hardware
Abstract
Machine learning accelerators present a unique set of design challenges across chip architecture, instruction set, server design, compiler, and both inter- and intra-chip connectivity. With AWS Trainium, we’ve utilized AWS’s end-to-end ownership from chip to server, network, compilers, and runtime tools to collaboratively design and optimize across all layers, emphasizing simplicity and ease of use. This talk will illustrate the design principles, tradeoffs, and lessons learned during the development of three generations of AWS ML products, from conceptualization to placing systems in the hands of AWS customers.
Bio
Nafea Bshara, Vice President and Distinguished Engineer at Amazon Web Services (AWS), leads the strategy and architecture for AWS custom hardware, including Nitro, Nitro SSD, SRD, Graviton, Inferentia, Trainium, and Neuron AI SDK. Nafea began his career at Galileo Technology, where he held various leading roles in software and chip design for network, storage, and compute infrastructure, culminating in his position as Chief Architect. Following Galileo’s acquisition by Marvell Semiconductor in 2001, Nafea joined Marvell and served in several product definition roles. In 2011, he co-founded Annapurna Labs, a startup focused on designing cloud-optimized infrastructure chips and associated software. Amazon acquired Annapurna Labs in February 2015, after which Nafea and his team have led AWS’s custom silicon and hardware efforts. Nafea holds an M.Sc. degree in Electrical and Computer Engineering from the Technion – Israel Institute of Technology and has been granted over 350 US patents.

15:30 PDT – 16:00 PDT: Break

16:00 PDT – 17:00 PDT: Session 10

10A: FPGAs and Reconfigurable Hardware

(Location: Grande C)
Session Chair: Jonathan Balkind (UC Santa Barbara)
FPGA Technology Mapping Using Sketch-Guided Program Synthesis

Gus Henry Smith, Benjamin Kushigian, Vishal Canumalla, and Andrew Cheung (University of Washington); Steven Lyubomirsky (OctoAI); Sorawee Porncharoenwase, René Just, Gilbert Louis Bernstein, and Zachary Tatlock (University of Washington)

Paper . Abstract . Lightning Talk
TAPA-CS: Enabling Scalable Accelerator Design on Distributed HBM-FPGAs

Neha Prakriya, Yuze Chi, Suhail Basalama, Linghao Song, and Jason Cong (University of California Los Angeles)

Paper . Abstract . Lightning Talk
Zoomie: A Software-like Debugging Tool for FPGAs

Tianrui Wei and Kevin Laeufer (University of California Berkeley); Katie Lim (University of Washington); Jerry Zhao and Koushik Sen (University of California Berkeley); Jonathan Balkind (University of California Santa Barbara); Krste Asanovic (University of California Berkeley)

Paper . Abstract . Lightning Talk
HIR: An MLIR-based Intermediate Representation for Hardware Accelerator Description

Kingshuk Majumder and Uday Bondhugula (Indian Institute of Science)

Paper . Abstract . Lightning Talk
10B: Serverless Computing 2

(Location: Grande D/E)
Session Chair: Jian Huang (University of Illinois Urbana-Champaign)
FaaSGraph: Enabling Scalable, Efficient, and Cost-Effective Graph Processing with Serverless Computing

Yushi Liu, Shixuan Sun, Zijun Li, and Quan Chen (Shanghai Jiao Tong University); Sen Gao and Bingsheng He (National University of Singapore); Chao Li and Minyi Guo (Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
In-Storage Domain-Specific Acceleration for Serverless Computing

Rohan Mahapatra, Soroush Ghodrati, Byung Hoon Ahn, Sean Kinzer, Shu-Ting Wang, Hanyang Xu, and Lavanya Karthikeyan (University of California San Diego); Hardik Sharma (Google); Amir Yazdanbakhsh (Google DeepMind); Mohammad Alian (University of Kansas); Hadi Esmaeilzadeh (University of California San Diego)

Paper . Abstract . Lightning Talk
FUYAO: DPU-enabled Direct Data Transfer for Serverless Computing (Recorded Talk)

Guowei Liu, Laiping Zhao, Yiming Li, Zhaolin Duan, Sheng Chen, and Yitao Hu (Tianjin University); Zhiyuan Su (Inspur Electronic Information Industry); Wenyu Qu (Tianjin University)

Paper . Abstract . Lightning Talk
DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration

Zijun Li, Chuhao Xu, Quan Chen, Jieru Zhao, Chen Chen, and Minyi Guo (Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
10C: ML Sparsity and Dynamic Shapes

(Location: Scripps I/II)
Session Chair: Roshan Dathathri (Microsoft Research)
Fractal: Joint Multi-Level Sparse Pattern Tuning of Accuracy and Performance for DNN Pruning

Yue Guan, Changming Yu, Yangjie Zhou, Jingwen Leng, Chao Li, and Minyi Guo (Shanghai Jiao Tong University and Shanghai Qizhi Institute)

Paper . Abstract . Lightning Talk
Optimizing Dynamic-Shape Neural Networks on Accelerators via On-the-Fly Micro-Kernel Polymerization (Recorded Talk)

Feng Yu, Guangli Li, Jiacheng Zhao, Huimin Cui, and Xiaobing Feng (Chinese Academy of Sciences); Jingling Xue (University of New South Wales)

Paper . Abstract . Lightning Talk
DTC-SpMM: Bridging the Gap in Accelerating General Sparse Matrix Multiplication with Tensor Cores (Recorded Talk)

Ruibo Fan, Wei Wang, and Xiaowen Chu (Hong Kong University of Science and Technology)

Paper . Abstract . Lightning Talk
SoD2: Statically Optimizing Dynamic Deep Neural Network Execution

Wei Niu and Gagan Agrawal (University of Georgia); Bin Ren (William & Mary)

Paper . Abstract . Lightning Talk
10D: Trusted Computing

(Location: Fairway I/IV)
Session Chair: Dan Williams (Virginia Tech)
SEVeriFast: Minimizing the root of trust for fast startup of SEV microVMs

Benjamin Holmes (MIT and Vassar College); Jason Waterman (Vassar College); Dan Williams (Virginia Tech)

Paper . Abstract . Lightning Talk
sIOPMP: Scalable and Efficient I/O Protection for TEEs

Erhu Feng (Shanghai Jiao Tong University); Dahu Feng (Tsinghua University); Dong Du and Yubin Xia (Shanghai Jiao Tong University); Wenbin Zheng and Siqi Zhao (Alibaba DAMO Academy); Haibo Chen (Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
A Midsummer Night’s Tree: Efficient and High Performance Secure SCM

Samuel Thomas (Brown University); Kidus Workneh (University of Colorado Boulder); Jac McCarty (Bryn Mawr College); Joseph Izraelevitz and Tamara Lehman (University of Colorado Boulder); R. Iris Bahar (Colorado School of Mines)

Paper . Abstract . Lightning Talk
Veil: A Protected Services Framework for Confidential Virtual Machines

Adil Ahmad (Arizona State University); Botong Ou and Congyu Liu (Purdue University); Xiaokuan Zhang (George Mason University); Pedro Fonseca (Purdue University)

Paper . Abstract . Lightning Talk

17:00 PDT – 17:30 PDT: Break

17:30 PDT – 18:30 PDT: Session 11

11A: Cryptography and Privacy

(Location: Grande C; Ends 18:45 PDT)
Session Chair: Moumita Dey (AMD Research and Advanced Development)
Accelerating Multi-Scalar Multiplication for Efficient Zero Knowledge Proofs with Multi-GPU Systems

Zhuoran Ji and Zhiyuan Zhang (Shandong University); Jiming Xu (Ant Group); Lei Ju (Shandong University)

Paper . Abstract . Lightning Talk
LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models

Juntaek Lim, Youngeun Kwon, and Ranggi Hwang (KAIST); Kiwan Maeng (Pennsylvania State University); Edward Suh (FAIR at Meta and Cornell University); Minsoo Rhu (KAIST)

Paper . Abstract . Lightning Talk
BitPacker: Enabling High Arithmetic Efficiency in Fully Homomorphic Encryption Accelerators

Nikola Samardzic and Daniel Sanchez (MIT)

Paper . Abstract . Lightning Talk
ZENO: A Type-based Optimization Framework for Zero Knowledge Neural Network Inference

Boyuan Feng, Zheng Wang, Yuke Wang, Shu Yang, and Yufei Ding (University of California Santa Barbara)

Paper . Abstract . Lightning Talk
Performance-aware Scale Analysis with Reserve for Homomorphic Encryption

Yongwoo Lee, Seonyoung Cheon, and Dongkwan Kim (Yonsei University); Dongyoon Lee (Stony Brook University); Hanjun Kim (Yonsei University)

Paper . Abstract . Lightning Talk
11B: Scheduling

(Location: Grande D/E)
Session Chair: Martin Maas (Google)
Heet: Accelerating Elastic Training in Heterogeneous Deep Learning Clusters

Zizhao Mo, Huanle Xu, and Chengzhong Xu (University of Macau)

Paper . Abstract . Lightning Talk
Efficient Microsecond-scale Blind Scheduling with Tiny Quanta

Zhihong Luo, Sam Son, and Dev Bali (University of California Berkeley); Emmanuel Amaro (VMware Research); Amy Ousterhout (University of California San Diego); Sylvia Ratnasamy (University of California Berkeley); Scott Shenker (ICSI and University of California Berkeley)

Paper . Abstract . Lightning Talk
AUDIBLE: A Convolution-Based Resource Allocator for Oversubscribing Burstable Virtual Machines (Recorded Talk)

Seyedali Jokar Jandaghi and Kaveh Mahdaviani (University of Toronto); Amirhossein Mirhosseini (University of Michigan); Sameh Elnikety (Microsoft Research); Cristiana Amza and Bianca Schroeder (University of Toronto)

Paper . Abstract . Lightning Talk
CPS: A Cooperative Para-virtualized Scheduling Framework for Manycore Machines (Recorded Talk)

Yuxuan Liu, Tianqiang Xu, Zeyu Mi, Zhichao Hua, Binyu Zang, and Haibo Chen (Shanghai Jiao Tong University)

Paper . Abstract . Lightning Talk
11C: ML Training Optimizations

(Location: Scripps I/II)
Session Chair: Roshan Dathathri (Microsoft Research)
PrimePar: Efficient Spatial-temporal Tensor Partitioning for Large Transformer Model Training (Speaker Shixin Zhao)

Haoran Wang, Lei Wang, Haobo Xu, Ying Wang, Yuming Li, and Yinhe Han (Chinese Academy of Sciences)

Paper . Abstract . Lightning Talk
AdaPipe: Optimizing Pipeline Parallelism with Adaptive Recomputation and Partitioning

Zhenbo Sun, Huanqi Cao, Yuanwei Wang, Guanyu Feng, Shengqi Chen, Haojie Wang, and Wenguang Chen (Tsinghua University)

Paper . Abstract . Lightning Talk
[Distinguished Artifact Evaluation Award] EVT: Accelerating Deep Learning Training with Epilogue Visitor Tree

Zhaodong Chen (University of California Santa Barbara); Andrew Kerr, Richard Cai, Jack Kosaian, and Haicheng Wu (NVIDIA); Yufei Ding (University of California San Diego); Yuan Xie (The Hong Kong University of Science and Technology)

Paper . Abstract . Lightning Talk
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training

Hongzheng Chen (Cornell University); Cody Hao Yu and Shuai Zheng (Boson AI); Zhen Zhang (Amazon Web Services); Zhiru Zhang (Cornell University); Yida Wang (Amazon Web Services)

Paper . Abstract . Lightning Talk
11D: More Processing-In-Memory

(Location: Fairway I/IV)
Session Chair: Sara Achour (Stanford)
BVAP: Energy and Memory Efficient Automata Processing for Regular Expressions with Bounded Repetitions

Ziyuan Wen, Lingkun Kong, Alexis Le Glaunec, Konstantinos Mamouras, and Kaiyuan Yang (Rice University)

Paper . Abstract . Lightning Talk
IANUS: Integrated Accelerator based on NPU-PIM Unified Memory System

Minseok Seo and Xuan Truong Nguyen (Seoul National University and Inter-university Semiconductor Research Center); Seok Joong Hwang (SAPEON); Yongkee Kwon, Guhyun Kim, Chanwook Park, Ilkon Kim, Jaehan Park, Jeongbin Kim, Woojae Shin, Jongsoon Won, Haerang Choi, Kyuyoung Kim, Daehan Kwon, and Chunseok Jeong (SK hynix); Sangheon Lee, Yongseok Choi, Wooseok Byun, and Seungcheol Baek (SAPEON); Hyuk-Jae Lee (Seoul National University and Inter-university Semiconductor Research Center); John Kim (KAIST)

Paper . Abstract . Lightning Talk
PIM-STM: Software Transactional Memory for Processing-In-Memory Systems

André Lopes, Daniel Castro, and Paolo Romano (IST/INESC-ID)

Paper . Abstract . Lightning Talk
CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators

Songyun Qu and Shixin Zhao (Chinese Academy of Sciences); Bing Li (Capital Normal University); Yintao He, Xuyi Cai, Lei Zhang, and Ying Wang (Chinese Academy of Sciences)

Paper . Abstract . Lightning Talk