Accepted Papers

#TitleAuthorsAbstractDOI
#8PMFuzz: Test Case Generation for Persistent Memory ProgramsSihang Liu (University of Virginia); Suyash Mahar (University of California San Diego/Indian Institute of Technology Roorkee); Baishakhi Ray (Columbia University); Samira Khan (University of Virginia)PDF🔗DOI
#28Vegen: A Vectorizer Generator for SIMD and BeyondYishen Chen, Charith Mendis, Michael Carbin, Saman Amarasinghe (MIT CSAIL)PDF🔗DOI
#43Sinan: ML-Based & QoS-Aware Resource Management for Cloud MicroservicesYanqi Zhang, Weizhe Hua, Zhuangzhuang Zhou, Edward Suh, Christina Delimitrou (Cornell University)PDF🔗DOI
#44HIPPOCRATES: Healing Persistent Memory Bugs Without Doing Any HarmIan Neal, Andrew Quinn, Baris Kasikci (University of Michigan)PDF🔗DOI
#58Who's Debugging the Debuggers? Exposing Debug Information Bugs in Optimized BinariesGiuseppe Antonio Di Luna (Sapienza, University of Rome); Davide Italiano (Apple); Luca Massarelli (Sapienza University of Rome - Department of Computer, Control, and Management Engineering "Antonio Ruberti"); Sebastian Österlund, Cristiano Giuffrida (Vrije Universiteit Amsterdam); Leonardo Querzoni (Sapienza Università di Roma)PDF🔗DOI
#59Dagger: Efficient and Fast RPCs in Cloud Microservices with Near-Memory Reconfigurable NICsNikita Lazarev, Shaojie Xiang, Neil Adit (Cornell University); Zhiru Zhang (Cornell Univeristy); Christina Delimitrou (Cornell University)PDF🔗DOI
#62Statistical Robustness of Markov Chain Monte Carlo AcceleratorsXiangyu Zhang, Ramin Bashizade, Yicheng Wang, Sayan Mukherjee, Alvin R. Lebeck (Duke University)PDF🔗DOI
#75PMEM-Spec: Persistent Memory Speculation (Strict Persistency Can Trump Relaxed Persistency)Jungi Jeong, Changhee Jung (Purdue University)PDF🔗DOI
#79When Application-Specific ISA Meets FPGAs: A Multi-Layer Virtualization Framework for Heterogeneous Cloud FPGAsYue Zha, Jing Li (University of Pennsylvania)PDF🔗DOI
#85Sage: Practical & Scalable ML-Driven Performance Debugging in MicroservicesYu Gan, Mingyu Liang (Cornell University); Sundar Dev, David Lo (Google); Christina Delimitrou (Cornell University)PDF🔗DOI
#89Nightcore: Efficient and Scalable Serverless Computing for Latency-Sensitive, Interactive MicroservicesZhipeng Jia, Emmett Witchel (University of Texas at Austin)PDF🔗DOI
#95Exploiting Gustavson's Algorithm to Accelerate Sparse Matrix MultiplicationGuowei Zhang, Nithya Attaluri (MIT); Joel Emer (MIT & NVIDIA); Daniel Sanchez (MIT)PDF🔗DOI
#101DiAG: A Dataflow-inspired Architecture for General-purpose ProcessorsDong Kai Wang, Nam Sung Kim (UIUC)PDF🔗DOI
#111PTEMagnet: Fine-grained Physical Memory Reservation for Faster Page Walks in Public CloudsArtemiy Margaritov, Dmitrii Ustiugov, Amna Shahab, Boris Grot (University of Edinburgh)PDF🔗DOI
#113Scalable FSM Parallelization via Path Fusion and Higher-Order SpeculationJunqiao Qiu (Michigan Technological University); Xiaofan Sun, Amir Hossein Nodehi Sabet, Zhijia Zhao (University of California, Riverside)PDF🔗DOI
#129Time-Optimal Qubit MappingChi Zhang (University of Pittsburgh); Ari Hayes, Longfei Qiu, Yuwei Jin, Yanhao Chen, Eddy Z. Zhang (Rutgers University)PDF🔗DOI
#142Vectorization for Digital Signal Processors via Equality SaturationAlexa VanHattum, Rachit Nigam (Cornell University); Vincent T. Lee (Facebook Reality Labs); James Bornholt (University of Texas at Austin); Adrian Sampson (Cornell University)PDF🔗DOI
#148Speculative Interference Attacks: Breaking Invisible Speculation SchemesMohammad Behnia (University of Illinois at Urbana-Champaign); Prateek Sahu (University of Texas at Austin); Riccardo Paccagnella, Jiyong Yu, Zirui (Neil) Zhao (University of Illinois at Urbana-Champaign); Xiang (Chris) Zou (Intel Corporation); Thomas Unterluggauer (Intel); Josep Torrellas (University of Illinois at Urbana-Champaign); Carlos Rozas (Intel); Adam Morrison (Tel Aviv University); Frank Mckeen (Intel); Fangfei Liu (Intel Corporation); Ron Gabor (Toga Networks); Christopher W. Fletcher (University of Illinois--Urbana Champaign); Abhishek Basak (Intel Corporation); Alaa Alameldeen (Intel)PDF🔗DOI
#158Fast Local Page-Tables for Virtualized NUMA Servers with vMitosisAshish Panwar (Indian Institute of Science-Bangalore); Reto Achermann (ETH Zurich and University of British Columbia); Arkaprava Basu (Indian Institute of Science-Bangalore); Abhishek Bhattacharjee (Yale University); K. Gopinath (Indian Institute of Science-Bangalore); Jayneel Gandhi (VMware Research)PDF🔗DOI
#171Corundum: Statically-Enforced Persistent Memory SafetyMorteza Hoseinzadeh, Steven Swanson (UC San Diego)PDF🔗DOI
#172C11Tester: A Fuzzer for C/C++ AtomicsWeiyu Luo, Brian Demsky (University of California, Irvine)PDF🔗DOI
#206A Compiler Infrastructure for Accelerator GeneratorsSamuel Thomas, Rachit Nigam, Zhijing Li, Adrian Sampson (Cornell University)PDF🔗DOI
#210Rethinking Software Runtimes for Disaggregated MemoryIrina Calciu (VMware Research); M. Talha Imran (The Pennsylvania State University); Ivan Puddu (ETH Zurich); Sanidhya Kashyap (Georgia Institute of Technology); Hasan Al Maruf (University of Michigan); Onur Mutlu (ETH Zurich); Aasheesh Kolli (The Pennsylvania State University and Google)PDF🔗DOI
#212Benchmarking, Analysis, and Optimization of Serverless Function SnapshotsDmitrii Ustiugov, Plamen Petrov (University of Edinburgh); Marios Kogias, Edouard Bugnion (EPFL); Boris Grot (University of Edinburgh)PDF🔗DOI
#228Q-VR: System-Level Design for Mobile Collaborative Virtual Reality RenderingChenhao Xie (Pacific Northwest National Laboratory); Xie Li (University of Sydney); Yang Hu (The University of Texas at Dallas); Huwan Peng, Michael Taylor (University of Washington Seattle); Shuaiwen Leon Song (University of Sydney)PDF🔗DOI
#255Jamais Vu: Thwarting Microarchitectural Replay AttacksDimitrios Skarlatos, Zirui Neil Zhao, Riccardo Paccagnella, Christopher Fletcher, Josep Torrellas (University of Illinois at Urbana-Champaign)PDF🔗DOI
#265MERCI: Efficient Embedding Reduction on Commodity Hardware via Sub-Query MemoizationYejin Lee, Seong Hoon Seo, Hyunji Choi, Hyoung Uk Sul, Soosung Kim, Jae W. Lee, Tae Jun Ham (Seoul National University)PDF🔗DOI
#276Orchestrated Trios: Compiling for Efficient Communication in Quantum Programs with 3-Qubit GatesCasey Duckering, Jonathan Baker, Andrew Litteken, Fred Chong (University of Chicago)PDF🔗DOI
#278Reducing Solid-state Drive Read Latency by Optimizing Read-retryJisung Park (ETH Zürich); Myungsuk Kim, Myoungjun Chun (Seoul National University); Lois Orosa (ETH Zurich); Jihong Kim (Seoul National University); Onur Mutlu (ETH Zürich)PDF🔗DOI
#315Effective simulation and debugging for a high-level hardware language using software compilersClément Pit-Claudel, Thomas Bourgeat, Stella Lau, Arvind, Adam Chlipala (MIT CSAIL)PDF🔗DOI
#321Quantifying the Design-Space Tradeoffs in Autonomous DronesRamyad Hadidi, Bahar Asgari, Sam Jijina, Adriana Amyette, Nima Shoghi, Hyesoon Kim (Georgia Institute of Technology)PDF🔗DOI
#379BCD Deduplication: Effective Memory Compression Using Partial Cache-Line DeduplicationSungbo Park (Cornell University); Ingab Kang, Yaebin Moon, Jung Ho Ahn (Seoul National University); Edward Suh (Cornell University)PDF🔗DOI
#388Warehouse-scale Video Acceleration: Co-design and Deployment in the WildParthasarathy Ranganathan, Daniel Stodolsky, Jeff Calow, Jeremy Dorfman, Marisabel Guevara, Clinton Wills Smullen IV, Raghu Balasubramanian, Sandeep Bhatia, Prakash Chauhan, Anna Cheung, In Suk Chong, Niranjani Dasharathi, Jia Feng, Brian Fosco, Samuel Foss, Ben Gelb, Sara J. Gwin, Yoshiaki Hase, Da-ke He, C. Richard Ho, Roy W. Huffman Jr., Elisha Indupalli, Indira Jayaram, Poonacha Kongetira, Aki Kuusela, Cho Mon Kyaw, Aaron Laursen, Yuan Li, Fong Lou, Kyle A. Lucke, JP Maaninen, Ramon Macias, Maire Mahony, David Alexander Munday, Srikanth Muroor, Narayana Penukonda, Eric Perkins-Argueta, Devin Persaud, Alex Ramirez, Ville-Mikko Rautio, Yolanda Ripley, Amir Salek, Sathish Sekar, Sergey N. Sokolov, Rob Springer, Don Stark, Mercedes Tan, Mark S. Wachsler, Andrew C. Walton, David A. Wickeraad, Alvin Wijaya, Hon Kwan Wu (Google Inc.)PDF🔗DOI
#393PacketMill: Toward per-core 100-Gbps NetworkingAlireza Farshin, Tom Barbette (KTH Royal Institute of Technology); Amir Roozbeh (KTH Royal Institute of Technology / Ericsson Research); Gerald Q. Maguire Jr., Dejan Kostić (KTH Royal Institute of Technology)PDF🔗DOI
#402LifeStream: A High-performance Stream Processing Engine for Waveform DataAnand Jayarajan, Kimberly Hau (University of Toronto); Andrew Goodwin (The Hospital for Sick Children, University of Sydney); Gennady Pekhimenko (University of Toronto)PDF🔗DOI
#415NOREBA: A Compiler-Informed Non-Speculative Out-of-Order Commit ProcessorAli Hajiabadi (National University of Singapore); Andreas Diavastos (Universitat Politècnica de Catalunya); Trevor E. Carlson (National University of Singapore)PDF🔗DOI
#442KARD: Lightweight Data Race Detection with Per-Thread Memory ProtectionAdil Ahmad (Purdue University); Sangho Lee (Microsoft Research Redmond); Pedro Fonseca (Purdue University); Byoungyoung Lee (Seoul National University)PDF🔗DOI
#459Enclosure: language-based restriction of untrusted librariesAdrien Ghosn, Marios Kogias, Mathias Payer, James R. Larus, Edouard Bugnion (EPFL)PDF🔗DOI
#460Computing with Time: Microarchitectural Weird MachinesThomas S. Benjamin, Jeffrey Eitel, Jesse Elwell (Perspecta Labs); Dmitry Evtyushkin (William & Mary); Abhrajit Ghosh, Angelo Sapello (Perspecta Labs)🔗DOI
#476Clobber-NVM: Log Less, Re-execute MoreYi Xu (UC San Diego); Joseph Izraelevitz (University of Colorado, Boulder); Steven Swanson (UC San Diego)PDF🔗DOI
#486CubicleOS: A Library OS with Software Componentisation for Practical IsolationVasily A. Sartakov, Lluís Vilanova, Peter Pietzuch (Imperial College London)PDF🔗DOI
#499Autonomous NIC OffloadsBoris Pismenny, Haggai Eran (Technion); Aviad Yehezkel, Liran Liss (Mellanox); Adam Morrison (Tel Aviv University); Dan Tsafrir (Technion & VMware Research)PDF🔗DOI
#500Prolonging 3D NAND SSD Lifetime via Read Latency RelaxationChun-Yi Liu, Yunju Lee (The Pennsylvania State University); Myoungsoo Jung (KAIST); Mahmut Taylan Kandemir, Wonil Choi (The Pennsylvania State University)PDF🔗DOI
#515Judging a Type by its Pointer: Optimizing Virtual Function Calls on GPUsMengchi Zhang, Ahmad Alawneh, Timothy G. Rogers (Purdue UniversityPDF🔗DOI
#544Jaaru: Efficiently Model Checking Persistent Memory ProgramsHamed Gorjiara (University of California, Irvine); Guoqing Harry Xu (UCLA); Brian Demsky (University of California, Irvine)PDF🔗DOI
#548HerQules: Securing Programs via Hardware-Enforced Message QueuesDaming Chen (Carnegie Mellon University); Wen Shih Lim (unaffiliated); Mohammad Bakhshalipour (Carnegie Mellon University (CMU)); Phillip Gibbons, James C. Hoe, Bryan Parno (Carnegie Mellon University)PDF🔗DOI
#588Rhythmic Pixel Regions: Visual sensing architecture for flexible spatiotemporal resolution towards high-precision visual computing at low powerVenkatesh Kodukula, Alexander Shearer, Van Nguyen, Srinivas Lingutla, Yifei Liu, Robert LiKamWa (Arizona State University)PDF🔗DOI
#593NeuroEngine: A Hardware-based Event-driven Simulation System for Advanced Brain-inspired ComputingHunjun Lee, Chanmyeong Kim, Yujin Chung, Jangwoo Kim (Seoul National University)PDF🔗DOI
#648BayesPerf: Minimizing Performance Monitoring Errors Using Bayesian StatisticsSubho S. Banerjee, Saurabh Jha, Zbigniew Kalbarczyk, Ravishankar K. Iyer (University of Illinois at Urbana-Champaign)PDF🔗DOI
#701PIBE: Practical Kernel Control-flow Hardening with Profile-guided Indirect Branch EliminationVictor Duta, Erik van der Kouwe, Herbert Bos, Cristiano Giuffrida (Vrije Universiteit Amsterdam)PDF🔗DOI
#731Field-Configurable Multi-resolution Inference: Rethinking QuantizationSai Qian Zhang (Harvard University); Bradley McDanel (Franklin & Marshall College); HT Kung, Xin Dong (Harvard University)PDF🔗DOI
#744Streamline: A Fast, Flushless Cache Covert-Channel Attack by Enabling Asynchronous CollusionGururaj Saileshwar (Georgia Institute of Technology); Christopher Fletcher (University of Illinois-Urbana Champaign); Moinuddin Qureshi (Georgia Tech)PDF🔗DOI
#772QRAFT: Reverse Your Quantum Circuit and Know the Correct Program OutputTirthak Patel, Devesh Tiwari (Northeastern University)PDF🔗DOI
#827Fast, Flexible and Comprehensive Bug Detection for Persistent Memory ProgramsBang Di (Hunan University); Jiawen Liu (University of California, Merced); Hao Chen (Hunan University); Dong Li (University of California, Merced)PDF🔗DOI
#844KLOCs: Kernel-Level Object Contexts for Heterogeneous Memory SystemSudarsun Kannan, Yujie Ren (Rutgers University); Abhishek Bhattacharjee (Yale University)PDF🔗DOI
#857Robomorphic Computing: A Design Methodology for Domain-Specific Accelerators Parameterized by Robot MorphologySabrina M. Neuman (MIT); Brian Plancher (Harvard); Thomas Bourgeat (MIT); Thierry Tambe (Harvard); Srinivas Devadas (MIT); Vijay Janapa Reddi (Harvard/UT Austin/Google)PDF🔗DOI
#898Defensive Approximation: Securing CNNs using Approximate ComputingAmira Guesmi (ENIS, University of Sfax, Tunisia); Ihsen Alouani (IEMN Lab, Electronics Department, Polytechnic University Hauts-de-France, France); Khaled N. Khasawneh (George Mason University); Mouna Baklouti, Tarek Frikha, Mohamed Abid (ENIS, University of Sfax, Tunisia); Nael Abu-Ghazaleh (University of California, Riverside)PDF🔗DOI
#922VSync: Push-Button Verification and Optimization for Synchronization Primitives on Weak Memory ModelsJonas Oberhauser, Rafael Lourenco de Lima Chehab, Diogo Behrens, Ming Fu, Antonio Paolillo, Lilith Oberhauser, Koustubha Bhat (Huawei Dresden Research Center & Huawei OS Kernel Lab); Yuzhong Wen (Huawei OS Kernel Lab); Haibo Chen (Huawei OS Kernel Lab & Shanghai Jiao Tong University); Jaeho Kim (Huawei Dresden Research Center & Huawei OS Kernel Lab); Viktor Vafeiadis (Max Planck Institute for Software Systems)PDF🔗DOI
#926SIMDRAM: A Framework for Bit-Serial SIMD Processing using DRAMNastaran Hajinazar (Simon Fraser University); Geraldo Francisco de Oliveira Junior (ETH Zurich); Sven Gregorio, João Ferreira (ETH Zürich); Nika Mansouri Ghiasi (ETH Zurich); Minesh Patel (ETH Zürich); Mohammed Alser (ETH Zurich); Saugata Ghose (University of Illinois at Urbana-Champaign / Carnegie Mellon University); Juan Gómez Luna, Onur Mutlu (ETH Zurich)PDF🔗DOI
#936Noisy Variational Quantum Algorithm Simulation via Knowledge Compilation for Repeated InferenceYipeng Huang (Rutgers University); Steven Holtzen, Todd Millstein, Guy Van den Broeck (UCLA); Margaret R. Martonosi (Princeton University)PDF🔗DOI
#949Language-Parametric Compiler Validation with Application to LLVMTheodoros Kasampalis (University of Illinois at Urbana-Champaign); Daejun Park (Runtime Verification, Inc.); Zhengyao Lin (University of Illinois at Urbana Champaign); Vikram Adve (University of Illinois at Urbana-Champaign); Grigore Rosu (University of Illinois at Urbana–Champaign)PDF🔗DOI
#963A Hierarchical Neural Model of Data PrefetchingZhan Shi, Akanksha Jain (University of Texas at Austin); Kevin Swersky, Milad Hashemi, Parthasarathy Ranganathan (Google); Calvin Lin (University of Texas at Austin)PDF🔗DOI
#969Neural Architecture Search as Program Transformation ExplorationJack Turner, Elliot J Crowley, Michael F.P. O'Boyle (University of Edinburgh)PDF🔗DOI
#974SherLock: Unsupervised Synchronization-Operation InferenceGuangpu Li (University of Chicago); Dongjie Chen (Nanjing University); Shan Lu (University of Chicago); Madanlal Musuvathi, Suman Nath (Microsoft Research)PDF🔗DOI
#990Compiler-Driven FPGA Virtualization with SYNERGYJoshua Landgraf, Tiffany Yang, Will Lin (UT Austin); Christopher J. Rossbach (UT Austin and VMware Research and Katana Graph); Eric Schkufza (Amazon)PDF🔗DOI
#1170Automatically Detecting and Fixing Concurrency Bugs in Go Software SystemsZiheng Liu, Shuofei Zhu (Pennsylvania State University); Boqin Qin (BUPT); Hao Chen (University of California, Davis); Linhai Song (Pennsylvania State University)PDF🔗DOI
#1225FaasCache: Keeping Serverless Computing Alive With Greedy-Dual CachingAlexander Fuerst, Prateek Sharma (Indiana University)PDF🔗DOI
#1241CutQC: Using Small Quantum Computers for Large Quantum Circuit EvaluationsWei Tang, Teague Tomesh (Princeton University); Martin Suchara, Jeffrey Larson (Argonne National Laboratory); Margaret Martonosi (Princeton University)PDF🔗DOI
#1367Analytical characterization and design space exploration for optimization of CNNsRui Li, Yufan Xu (University of Utah); Aravind Sukumaran-Rajam (Washington State University); Atanas Rountev (Ohio State University); P. Sadayappan (University of Utah)PDF🔗DOI
#1375Switches for HIRE: Resource Scheduling for Data Center In-Network ComputingMarcel Blöcher (Technische Universität Darmstadt); Lin Wang (VU Amsterdam & TU Darmstadt); Patrick Eugster (Università della Svizzera italiana, Switzerland; TU Darmstadt, Germany; Purdue University, USA); Max Schmidt (Technische Universität Darmstadt)PDF🔗DOI
#1439In-Fat Pointer: Hardware-Assisted Tagged-Pointer Spatial Memory Safety Defense with Subobject Bound Granularity ProtectionShengjie Xu, Wei Huang, David Lie (University of Toronto)PDF🔗DOI
#1450Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space SearchKartik Hegde (UIUC); Po-An Tsai (NVIDIA); Sitao Huang (University of Illinois at Urbana–Champaign); Vikas Chandra (Facebook); Angshuman Parashar (NVIDIA); Christopher Fletcher (University of Illinois--Urbana Champaign)PDF🔗DOI
#1455RecSSD: Near Data Processing for Solid State Drive Based Recommendation InferenceMark Wilkening, Udit Gupta, Samuel Hsia (Harvard University); Caroline Trippel (Facebook); Carole-Jean Wu (Facebook/ASU); David Brooks, Gu-Yeon Wei (Harvard University)PDF🔗DOI
#1594Probabilistic Profiling of Stateful Data Planes for Adversarial TestingQiao Kang, Jiarong Xing, Yiming Qiu, Ang Chen (Rice University)PDF🔗DOI
#1687Incremental CFG Patching for Binary RewritingXiaozhu Meng (Rice University); Weijie Liu (Indiana University Bloomington)PDF🔗DOI