Volume 33, Number 9, September 2022
CoFilter: High-Performance Switch-Accelerated Stateful Packet Filter for Bare-Metal Servers.

Jiamin Cao Ying Liu Yu Zhou Lin He Chen Sun Yangyang Wang Mingwei Xu

Flexible Performant GEMM Kernels on GPUs.

Thomas Faingnaert Tim Besard Bjorn De Sutter

NetSHa: In-Network Acceleration of LSH-Based Distributed Search.

Penghao Zhang Heng Pan Zhenyu Li Penglai Cui Ru Jia Peng He Zhibin Zhang Gareth Tyson Gaogang Xie

Cost-Efficient Server Configuration and Placement for Mobile Edge Computing.

Zhenli He Kenli Li Keqin Li

Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud.

Huifang Li Danjing Wang MengChu Zhou Yushun Fan Yuanqing Xia

UFC2: User-Friendly Collaborative Cloud.

Minghao Zhao Zhenhua Li Wei Liu Jian Chen Xingyao Li

Performant, Multi-Objective Scheduling of Highly Interleaved Task Graphs on Heterogeneous System on Chip Devices.

Joshua Mack Samet E. Arda Ümit Y. Ogras Ali Akoglu

Exploiting Concurrency in Sharded Parallel State Machine Replication.

Aldenio Burgos Eduardo Alchieri Fernando Luís Dotti Fernando Pedone

DePo: Dynamically Offload Expensive Event Processing to the Edge of Cyber-Physical Systems.

Meng Ma Jingbin Zhang Ping Wang

Addressing the Read-Performance Impact of Reconfigurations in Replicated Key-Value Stores.

Antonis Papaioannou Kostas Magoutis

Cooperative Edge Caching Based on Temporal Convolutional Networks.

Xu Zhang Zhengnan Qi Geyong Min Wang Miao Qilin Fan Zhan Ma

Cost-Efficient Workflow Scheduling Algorithm for Applications With Deadline Constraint on Heterogeneous Clouds.

Xiaoyong Tang Wenbiao Cao Huiya Tang Tan Deng Jing Mei Yi Liu Cheng Shi Meng Xia Zeng Zeng

PPOAccel: A High-Throughput Acceleration Framework for Proximal Policy Optimization.

Yuan Meng Sanmukh R. Kuppannagari Rajgopal Kannan Viktor K. Prasanna

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From the University of Texas at Austin.

Brock Davis Juan Paez Jack Gaither Joe A. Garcia

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Nanyang Technological University.

Shenggui Li Bu-Sung Lee

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Clemson University.

Griffin Dube Cavender Holt John Hollowell Sarah Placke Sansriti Ranjan Nikolas Heitzig Jon Calhoun

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Tsinghua University.

Runxin Zhong Jiajie Chen Chen Zhang Mingshu Zhai Zeyu Song Yutian Wang Wentao Han Lin Gan Jidong Zhai

Reproducibility: Performance Evaluation of MemXCT on Azure CycleCloud Platform.

Yuchen Liu Yixuan Meng Kaiyuan Xu Zijun Xu Tianyuan Wu Yiwei Yang Shu Yin

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From University of California San Diego.

Xiaochen Li Maximilian Apodaca Arunav Gupta Zihao Kong Hongyi Pan Hongyu Zhou Mary Thomas Martin Kandes Zhaoyi Li Mahidhar Tatineni Lewis Carroll

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From ETH Zürich.

Jan Kleine Rahul Steiger Simon Wachter Emir Isman Simon Jacob Dario Romaniello

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Georgia Tech.

Nicole Prindle Ali Kazmi Aman Jain Albert Chen Marissa Sorkin Sudhanshu Agarwal Richard W. Vuduc Vijay Thakkar

Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Peking University.

Zejia Fan Yuchen Gu Zhewen Hao Yueyang Pan Pengcheng Xu Yuxuan Yan Fangyuan Yang Zhenxin Fu Yun Liang

MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging.

Mert Hidayetoglu Tekin Biçer Simon Garcia de Gonzalo Bin Ren Doga Gürsoy Rajkumar Kettimuthu Ian T. Foster Wen-Mei W. Hwu

Advancing Adoption of Reproducibility in HPC: A Preface to the Special Section.

Stephen Lien Harrell Scott Michael Carlos Maltzahn

EiC Editorial - Advancing Reproducibility in Parallel and Distributed Systems Research.

Manish Parashar


Volume 33, Number 8, August 2022
AUCTION: Automated and Quality-Aware Client Selection Framework for Efficient Federated Learning.

Yongheng Deng Feng Lyu Ju Ren Huaqing Wu Yuezhi Zhou Yaoxue Zhang Xuemin Shen

Cost-Effective Web Application Replication and Deployment in Multi-Cloud Environment.

Tao Shi Hui Ma Gang Chen Sven Hartmann

TensorOpt: Exploring the Tradeoffs in Distributed DNN Training With Auto-Parallelism.

Zhenkun Cai Xiao Yan Kaihao Ma Yidi Wu Yuzhen Huang James Cheng Teng Su Fan Yu

TridentKV: A Read-Optimized LSM-Tree Based KV Store via Adaptive Indexing and Space-Efficient Partitioning.

Kai Lu Nannan Zhao Jiguang Wan Changhong Fei Wei Zhao Tongliang Deng

Completely Independent Spanning Trees on BCCC Data Center Networks With an Application to Fault-Tolerant Routing.

Xiao-Yan Li Wanling Lin Ximeng Liu Cheng-Kuan Lin Kung-Jui Pai Jou-Ming Chang

A GPU-Oriented Application Programming Interface for Digital Audio Workstations.

Daniele Bianchi Federico Avanzini Adriano Baratè Luca A. Ludovico Giorgio Presti

Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning.

Zheyi Chen Jia Hu Geyong Min Chunbo Luo Tarek A. El-Ghazawi

Construction of Dual-CISTs on an Infinite Class of Networks.

Xiao-Wen Qin Rong-Xia Hao Jie Wu

Scaling Poisson Solvers on Many Cores via MMEwald.

Mingchuan Wu Yangjun Wu Honghui Shang Ying Liu Huimin Cui Fang Li Xiaohui Duan Yunquan Zhang Xiaobing Feng

CSEdge: Enabling Collaborative Edge Storage for Multi-Access Edge Computing Based on Blockchain.

Liang Yuan Qiang He Feifei Chen Jun Zhang Lianyong Qi Xiaolong Xu Yang Xiang Yun Yang

Evaluating Data Redistribution in PaRSEC.

Qinglei Cao George Bosilca Nuria Losada Wei Wu Dong Zhong Jack J. Dongarra

Online Learning for Distributed Computation Offloading in Wireless Powered Mobile Edge Computing Networks.

Xiaojie Wang Zhaolong Ning Lei Guo Song Guo Xinbo Gao Guoyin Wang

Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum.

Kaihua Fu Wei Zhang Quan Chen Deze Zeng Minyi Guo

Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment.

Bingting Jiang Zhuo Tang Xiong Xiao Jing Yao Ronghui Cao Kenli Li

LoomIO: Object-Level Coordination in Distributed File Systems.

Yusheng Hua Xuanhua Shi Kang He Hai Jin Wei Xie Ligang He Yong Chen

A Bifactor Approximation Algorithm for Cloudlet Placement in Edge Computing.

Dixit Bhatta Lena Mashayekhy

FedGraph: Federated Graph Learning With Intelligent Sampling.

Fahao Chen Peng Li Toshiaki Miyazaki Celimuge Wu

iBalancer: Load-Aware in-Server Flow Scheduling for Sub-Millisecond Tail Latency.

Qi Zhang Yi Liu Tao Liu


Volume 33, Number 7, July 2022
Hamiltonian Paths of $k$-ary $n$-cubes Avoiding Faulty Links and Passing Through Prescribed Linear Forests.

Yuxing Yang Lingling Zhang

FFNLFD: Fault Diagnosis of Multiprocessor Systems at Local Node With Fault-Free Neighbors Under PMC Model and MM* Model.

Limei Lin Yanze Huang Yuhang Lin Sun-Yuan Hsieh Li Xu

FARNN: FPGA-GPU Hybrid Acceleration Platform for Recurrent Neural Networks.

Hyungmin Cho Jeesoo Lee Jaejin Lee

A Highly-Available Move Operation for Replicated Trees.

Martin Kleppmann Dominic P. Mulligan Victor B. F. Gomes Alastair R. Beresford

Performance and Cost-Efficient Spark Job Scheduling Based on Deep Reinforcement Learning in Cloud Computing Environments.

Muhammed Tawfiqul Islam Shanika Karunasekera Rajkumar Buyya

TherMa-MiCs: Thermal-Aware Scheduling for Fault-Tolerant Mixed-Criticality Systems.

Sepideh Safari Heba Khdr Pourya Gohari-Nazari Mohsen Ansari Shaahin Hessabi Jörg Henkel

Design and Simulation of Content-Aware Hybrid DRAM-PCM Memory System.

Yinjin Fu Yutong Lu Zhiguang Chen Yang Wu Nong Xiao

TODG: Distributed Task Offloading With Delay Guarantees for Edge Computing.

Sheng Yue Ju Ren Nan Qiao Yongmin Zhang Hongbo Jiang Yaoxue Zhang Yuanyuan Yang

Pistis: Issuing Trusted and Authorized Certificates With Distributed Ledger and TEE.

Zecheng Li Haotian Wu Ricky Lap-Hou Lao Songtao Guo Yuanyuan Yang Bin Xiao

Wukong+G: Fast and Concurrent RDF Query Processing Using RDMA-Assisted GPU Graph Exploration.

Zihang Yao Rong Chen Binyu Zang Haibo Chen

Cooperative Scheduling Schemes for Explainable DNN Acceleration in Satellite Image Analysis and Retraining.

Woojoong Kim Chan-Hyun Youn

A Fast $f(r, k+1)/k$-Diagnosis for Interconnection Networks Under MM* Model.

Yanze Huang Limei Lin Sun-Yuan Hsieh

LOCUS: User-Perceived Delay-Aware Service Placement and User Allocation in MEC Environment.

Yu Chen Sheng Zhang Yibo Jin Zhuzhong Qian Mingjun Xiao Jidong Ge Sanglu Lu

SaPus: Self-Adaptive Parameter Update Strategy for DNN Training on Multi-GPU Clusters.

Zhaorui Zhang Cho-Li Wang

Exploring Data Analytics Without Decompression on Embedded GPU Systems.

Zaifeng Pan Feng Zhang Yanliang Zhou Jidong Zhai Xipeng Shen Onur Mutlu Xiaoyong Du

Topology-Aware Neural Model for Highly Accurate QoS Prediction.

Jiahui Li Hao Wu Jiapei Chen Qiang He Ching-Hsien Hsu

Necessary Feasibility Analysis for Mixed-Criticality Real-Time Embedded Systems.

Hoon Sung Chwa Hyeongboo Baek Jinkyu Lee


Volume 33, Number 6, June 2022
A Potential Game Theoretic Approach to Computation Offloading Strategy Optimization in End-Edge-Cloud Computing.

Yan Ding Kenli Li Chubo Liu Keqin Li

Deep Reinforcement Learning for Load-Balancing Aware Network Control in IoT Edge Systems.

Qingzhi Liu Tiancong Xia Long Cheng Merijn van Eijk Tanir Ozcelebi Ying Mao

Coarse Grained FPGA Overlay for Rapid Just-In-Time Accelerator Compilation.

Abhishek Kumar Jain Douglas L. Maskell Suhaib A. Fahmy

EnosLib: A Library for Experiment-Driven Research in Distributed Computing.

Ronan-Alexandre Cherrueau Marie Delavergne Alexandre van Kempen Adrien Lebre Dimitri Pertin Javier Rojas Balderrama Anthony Simonet Matthieu Simonin

A Survey of GPU Multitasking Methods Supported by Hardware Architecture.

Chen Zhao Wu Gao Feiping Nie Huiyang Zhou

Modeling Speedup in Multi-OS Environments.

Brian R. Tauro Conghao Liu Kyle C. Hale

HSA-Net: Hidden-State-Aware Networks for High-Precision QoS Prediction.

Ziliang Wang Xiaohong Zhang Meng Yan Ling Xu Dan Yang

Network Cost-Aware Geo-Distributed Data Analytics System.

Kwangsung Oh Minmin Zhang Abhishek Chandra Jon B. Weissman

VQL: Efficient and Verifiable Cloud Query Services for Blockchain Systems.

Haotian Wu Zhe Peng Songtao Guo Yuanyuan Yang Bin Xiao

Customer Adaptive Resource Provisioning for Long-Term Cloud Profit Maximization under Constrained Budget.

Peijin Cong Zhixing Zhang Junlong Zhou Xin Liu Yao Liu Tongquan Wei

Benchmarking 50-Photon Gaussian Boson Sampling on the Sunway TaihuLight.

Yuxuan Li Lin Gan Mingcheng Chen Yaojian Chen Haitian Lu Chao-Yang Lu Jian-Wei Pan Haohuan Fu Guangwen Yang

On Mixing Eventual and Strong Consistency: Acute Cloud Types.

Maciej Kokocinski Tadeusz Kobus Pawel T. Wojciechowski

Distributed Graph Realizations.

John Augustine Keerti Choudhary Avi Cohen David Peleg Sumathi Sivasubramaniam Suman Sourav

Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System.

Tsung-Wei Huang Dian-Lun Lin Chun-Xun Lin Yibo Lin

DS-ADMM++: A Novel Distributed Quantized ADMM to Speed up Differentially Private Matrix Factorization.

Feng Zhang Erkang Xue Ruixin Guo Guangzhi Qu Gansen Zhao Albert Y. Zomaya

Power Log'n'Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols.

Kiril Dichev Daniele De Sensi Dimitrios S. Nikolopoulos Kirk W. Cameron Ivor T. A. Spence


Volume 33, Number 5, May 2022
DH-SVRF: A Reconfigurable Unicast/Multicast Forwarding for High-Performance Packet Forwarding Engines.

Zhu Jin Wen-Kang Jia

A Practical Framework for Secure Document Retrieval in Encrypted Cloud File Systems.

Junsong Fu Na Wang Baojiang Cui Bharat K. Bhargava

Workload Balancing via Graph Reordering on Multicore Systems.

YuAng Chen Yeh-Ching Chung

Mapping-Aware Kernel Partitioning Method for CGRAs Assisted by Deep Learning.

Takuya Kojima Ayaka Ohwada Hideharu Amano

Maximizing User Service Satisfaction for Delay-Sensitive IoT Applications in Edge Computing.

Jing Li Weifa Liang Wenzheng Xu Zichuan Xu Xiaohua Jia Wanlei Zhou Jin Zhao

Towards Revenue-Driven Multi-User Online Task Offloading in Edge Computing.

Zhi Ma Sheng Zhang Zhiqi Chen Tao Han Zhuzhong Qian Mingjun Xiao Ning Chen Jie Wu Sanglu Lu

DIESEL+: Accelerating Distributed Deep Learning Tasks on Image Datasets.

Lipeng Wang Qiong Luo Shengen Yan

Online Reconfiguration of IoT Applications in the Fog: The Information-Coordination Trade-Off.

Bruno Donassolo Arnaud Legrand Panayotis Mertikopoulos Ilhem Fajjari

Data, User and Power Allocations for Caching in Multi-Access Edge Computing.

Xiaoyu Xia Feifei Chen Qiang He Guangming Cui John C. Grundy Mohamed Almorsy Abdelrazek Xiaolong Xu Hai Jin

Elastic Parameter Server: Accelerating ML Training With Scalable Resource Scheduling.

Shaoqi Wang Aidi Pi Xiaobo Zhou

Addictive Incentive Mechanism in Crowdsensing From the Perspective of Behavioral Economics.

Jiaqi Liu Shiyue Huang Deng Li Sheng Wen Hui Liu

Parallel and Asynchronous Smart Contract Execution.

Jian Liu Peilun Li Raymond Cheng N. Asokan Dawn Song

Parallel and Distributed Structured SVM Training.

Jiantong Jiang Zeyi Wen Ze-ke Wang Bingsheng He Jian Chen

Efficient and Accurate Flow Record Collection With HashFlow.

Zongyi Zhao Xingang Shi Zhiliang Wang Qing Li Han Zhang Xia Yin

Exploring the Galaxyfly Family to Build Flexible-Scale Interconnection Networks.

Fei Lei Dezun Dong Xiangke Liao

A Low-Power Transprecision Floating-Point Cluster for Efficient Near-Sensor Data Analytics.

Fabio Montagna Stefan Mach Simone Benatti Angelo Garofalo Gianmarco Ottavi Luca Benini Davide Rossi Giuseppe Tagliavini


Volume 33, Number 4, April 2022
Accelerating Restarted GMRES With Mixed Precision Arithmetic.

Neil Lindquist Piotr Luszczek Jack J. Dongarra

gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs.

Anil Gaihre Xiaoye Sherry Li Hang Liu

Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication.

Gordon Euhyun Moon Hyoukjun Kwon Geonhwa Jeong Prasanth Chatarasi Sivasankaran Rajamanickam Tushar Krishna

Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems.

Ariful Azad Oguz Selvitopi Md Taufique Hussain John R. Gilbert Aydin Buluç

libEnsemble: A Library to Coordinate the Concurrent Evaluation of Dynamic Ensembles of Calculations.

Stephen Hudson Jeffrey Larson John-Luke Navarro Stefan M. Wild

Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC.

Sameh Abdulah Qinglei Cao Yu Pei George Bosilca Jack J. Dongarra Marc G. Genton David E. Keyes Hatem Ltaief Ying Sun

VPIC 2.0: Next Generation Particle-in-Cell Simulations.

Robert F. Bird Nigel Tan Scott V. Luedtke Stephen Lien Harrell Michela Taufer Brian J. Albright

TianheGraph: Customizing Graph Search for Graph500 on Tianhe Supercomputer.

Xinbiao Gan Yiming Zhang Ruibo Wang Tiejun Li Tiaojie Xiao Ruigeng Zeng Jie Liu Kai Lu

A Parallel Algorithm Template for Updating Single-Source Shortest Paths in Large-Scale Dynamic Networks.

Arindam Khanda Sriram Srinivasan Sanjukta Bhowmick Boyana Norris Sajal K. Das

Characterizing Performance of Graph Neighborhood Communication Patterns.

Sayan Ghosh Nathan R. Tallent Mahantesh Halappanavar

Accelerating HDF5 I/O for Exascale Using DAOS.

Jérome Soumagne Jordan Henderson Mohamad Chaarawi Neil Fortner M. Scot Breitenfeld Songyu Lu Dana Robinson Elena Pourmal Johann Lombardi

Transparent Asynchronous Parallel I/O Using Background Threads.

Houjun Tang Quincey Koziol John Ravi Suren Byna

Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization.

Lipeng Wan Axel Huebl Junmin Gu Franz Poeschel Ana Gainaru Ruonan Wang Jieyang Chen Xin Liang Dmitry Ganyushin Todd S. Munson Ian T. Foster Jean-Luc Vay Norbert Podhorszki Kesheng Wu Scott Klasky

Enabling Scalable and Extensible Memory-Mapped Datastores in Userspace.

Ivy Bo Peng Maya B. Gokhale Karim Youssef Keita Iwabuchi Roger Pearce

An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications.

Keren Zhou Xiaozhu Meng Ryuichi Sai Dejan Grubisic John M. Mellor-Crummey

The PetscSF Scalable Communication Layer.

Junchao Zhang Jed Brown Satish Balay Jacob Faibussowitsch Matthew G. Knepley Oana Marin Richard Tran Mills Todd S. Munson Barry F. Smith Stefano Zampini

LB4OMP: A Dynamic Load Balancing Library for Multithreaded Applications.

Jonas H. Müller Korndörfer Ahmed Eleliemy Ali Mohammed Florina M. Ciorba

Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms.

André Merzky Matteo Turilli Mikhail Titov Aymen Al-Saadi Shantenu Jha

Kokkos 3: Programming Model Extensions for the Exascale Era.

Christian R. Trott Damien Lebrun-Grandié Daniel Arndt Jan Ciesko Vinh Q. Dang Nathan D. Ellingwood Rahulkumar Gayatri Evan Harvey Daisy S. Hollman Dan Ibanez Nevin Liber Jonathan R. Madsen Jeff Miles David Poliakoff Amy Powell Sivasankaran Rajamanickam Mikael Simberg Dan Sunderland Bruno Turcksin Jeremiah Wilke

EXA2PRO: A Framework for High Development Productivity on Heterogeneous Computing Systems.

Lazaros Papadopoulos Dimitrios Soudris Christoph W. Kessler August Ernstsson Johan Ahlqvist Nikos Vasilas Athanasios I. Papadopoulos Panos Seferlis Charles Prouveur Matthieu Haefele Samuel Thibault Athanasios Salamanis Theodoros Ioakimidis Dionysios D. Kehagias

Compiler-Assisted Compaction/Restoration of SIMD Instructions.

Juan M. Cebrian Thibaud Balem Adrián Barredo Marc Casas Miquel Moretó Alberto Ros Alexandra Jimborean

Near-Zero Downtime Recovery From Transient-Error-Induced Crashes.

Chao Chen Greg Eisenhauer Santosh Pande

Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach.

Yiming Wang Weizhe Zhang Meng Hao Zheng Wang

Anomaly Detection and Anticipation in High Performance Computing Systems.

Andrea Borghesi Martin Molan Michela Milano Andrea Bartolini

IEEE Special Issue on Innovative R&D Toward the Exascale Era.

Sadaf R. Alam Lois Curfman McInnes Kengo Nakajima


Volume 33, Number 3, March 2022
ViTrack: Efficient Tracking on the Edge for Commodity Video Surveillance Systems.

Linsong Cheng Jiliang Wang Yinghui Li

Efficient, Dynamic Multi-Task Execution on FPGA-Based Computing Systems.

Umar Ibrahim Minhas Roger F. Woods Dimitrios S. Nikolopoulos Georgios Karakonstantis

Timed Loops for Distributed Storage in Wireless Networks.

Anandarup Mukherjee Pallav Kumar Deb Sudip Misra

Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments.

Xing Chen Jianshan Zhang Bing Lin Zheyi Chen Katinka Wolter Geyong Min

Mechanisms for Resource Allocation and Pricing in Mobile Edge Computing Systems.

Tayebeh Bahreini Hossein Badri Daniel Grosu

On the Analysis of Cache Invalidation With LRU Replacement.

Quan Zheng Tao Yang Yuanzhi Kan Xiaobin Tan Jian Yang Xiaofeng Jiang

Propagation Pattern for Moment Representation of the Lattice Boltzmann Method.

John Gounley Madhurima Vardhan Erik W. Draeger Pedro Valero-Lara Shirley V. Moore Amanda Randles

Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing.

Jed Mills Jia Hu Geyong Min

Harnessing the Potential of Function-Reuse in Multimedia Cloud Systems.

Chavit Denninnart Mohsen Amini Salehi

Fast and Portable Concurrent FIFO Queues With Deterministic Memory Reclamation.

Oliver Giersch Jörg Nolte

Resilient Real-Valued Consensus in Spite of Mobile Malicious Agents on Directed Graphs.

Yuan Wang Hideaki Ishii François Bonnet Xavier Défago

Online Pricing and Trading of Private Data in Correlated Queries.

Hui Cai Fan Ye Yuanyuan Yang Yanmin Zhu Jie Li Fu Xiao

cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope.

Yiwen Gao Jia Xu Hongbing Wang

Decentralized Edge Intelligence: A Dynamic Resource Allocation Framework for Hierarchical Federated Learning.

Wei Yang Bryan Lim Jer Shyuan Ng Zehui Xiong Jiangming Jin Yang Zhang Dusit Niyato Cyril Leung Chunyan Miao

Work-Stealing Prefix Scan: Addressing Load Imbalance in Large-Scale Image Registration.

Marcin Copik Tobias Grosser Torsten Hoefler Paolo Bientinesi Benjamin Berkels

Optimal Checkpointing Strategies for Iterative Applications.

Yishu Du Loris Marchal Guillaume Pallez Yves Robert

vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training.

Shixiong Zhao Fanxin Li Xusheng Chen Xiuxian Guan Jianyu Jiang Dong Huang Yuhao Qing Sen Wang Peng Wang Gong Zhang Cheng Li Ping Luo Heming Cui


Volume 33, Number 2, February 2022
A Practical and Efficient Bidirectional Access Control Scheme for Cloud-Edge Data Sharing.

Jie Cui Bei Li Hong Zhong Geyong Min Yan Xu Lu Liu

POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression.

Feng Zhang Jidong Zhai Xipeng Shen Onur Mutlu Xiaoyong Du

A Block-Based Triangle Counting Algorithm on Heterogeneous Environments.

Abdurrahman Yasar Sivasankaran Rajamanickam Jonathan W. Berry Ümit V. Çatalyürek

Tensorox: Accelerating GPU Applications via Neural Approximation on Unused Tensor Cores.

Nhut-Minh Ho Weng-Fai Wong

A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity.

Limei Lin Yanze Huang Li Xu Sun-Yuan Hsieh

Optimizing Network Transfers for Data Analytic Jobs Across Geo-Distributed Datacenters.

Li Chen Shuhao Liu Baochun Li

Repurposing GPU Microarchitectures with Light-Weight Out-Of-Order Execution.

Konstantinos Iliakis Sotirios Xydis Dimitrios Soudris

PostMan: Rapidly Mitigating Bursty Traffic via On-Demand Offloading of Packet Processing.

Yipei Niu Panpan Jin Jian Guo Yikai Xiao Rong Shi Fangming Liu Chen Qian Yang Wang

Optimization of Reactive Force Field Simulation: Refactor, Parallelization, and Vectorization for Interactions.

Ping Gao Xiaohui Duan Bertil Schmidt Wusheng Zhang Lin Gan Haohuan Fu Wei Xue Weiguo Liu Guangwen Yang

EdgeDR: An Online Mechanism Design for Demand Response in Edge Clouds.

Shutong Chen Lei Jiao Fangming Liu Lin Wang


Volume 33, Number 1, January 2022
Taming System Dynamics on Resource Optimization for Data Processing Workflows: A Probabilistic Approach.

Amelie Chi Zhou Weilin Xue Yao Xiao Bingsheng He Shadi Ibrahim Reynold Cheng

PLVER: Joint Stable Allocation and Content Replication for Edge-Assisted Live Video Delivery.

Huan Wang Guoming Tang Kui Wu Jianping Wang

Energy-Efficient Cache-Aware Scheduling on Heterogeneous Multicore Systems.

Saad Zia Sheikh Muhammad Adeel Pasha

Communication-Efficient Federated Learning With Compensated Overlap-FedAvg.

Yuhao Zhou Qing Ye Jiancheng Lv

Scalable, Confidential and Survivable Software Updates.

Federico Magnanini Luca Ferretti Michele Colajanni

A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures.

Zhen Xie Guangming Tan Weifeng Liu Ninghui Sun

Elastic Deep Learning in Multi-Tenant GPU Clusters.

Yidi Wu Kaihao Ma Xiao Yan Zhi Liu Zhenkun Cai Yuzhen Huang James Cheng Han Yuan Fan Yu

Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs.

Tongfeng Weng Xu Zhou Kenli Li Peng Peng Keqin Li

FlitZip: Effective Packet Compression for NoC in MultiProcessor System-on-Chip.

Dipika Deb Rohith M. K. John Jose

COSCO: Container Orchestration Using Co-Simulation and Gradient Based Optimization for Fog Computing Environments.

Shreshth Tuli Shivananda R. Poojara Satish Narayana Srirama Giuliano Casale Nicholas R. Jennings

Horus: Interference-Aware and Prediction-Based Scheduling in Deep Learning Systems.

Gingfung Yeung Damian Borowiec Renyu Yang Adrian Friday Richard Harper Peter Garraghan

Optimizing Depthwise Separable Convolution Operations on GPUs.

Gangzhao Lu Weizhe Zhang Zheng Wang

Optimal Repair-Scaling Trade-off in Locally Repairable Codes: Analysis and Evaluation.

Si Wu Zhirong Shen Patrick P. C. Lee Yinlong Xu

runData: Re-Distributing Data via Piggybacking for Geo-Distributed Data Analytics Over Edges.

Yibo Jin Zhuzhong Qian Song Guo Sheng Zhang Lei Jiao Sanglu Lu

Capelin: Data-Driven Compute Capacity Procurement for Cloud Datacenters Using Portfolios of Scenarios.

Georgios Andreadis Fabian Mastenbroek Vincent van Beek Alexandru Iosup

Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment.

Haozhao Wang Song Guo Zhihao Qu Ruixuan Li Ziming Liu

DeTraS: Delaying Stores for Friendly-Fire Mitigation in Hardware Transactional Memory.

J. Rubén Titos Gil Ricardo Fernández Pascual Alberto Ros Manuel E. Acacio