TPDS (IEEE Transactions on Parallel and Distributed Systems) - volume 33 - 2022 paper list
Jiamin Cao Ying Liu Yu Zhou Lin He Chen Sun Yangyang Wang Mingwei Xu
Flexible Performant GEMM Kernels on GPUs.Thomas Faingnaert Tim Besard Bjorn De Sutter
NetSHa: In-Network Acceleration of LSH-Based Distributed Search.Penghao Zhang Heng Pan Zhenyu Li Penglai Cui Ru Jia Peng He Zhibin Zhang Gareth Tyson Gaogang Xie
Cost-Efficient Server Configuration and Placement for Mobile Edge Computing.
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud.Huifang Li Danjing Wang MengChu Zhou Yushun Fan Yuanqing Xia
UFC2: User-Friendly Collaborative Cloud.Minghao Zhao Zhenhua Li Wei Liu Jian Chen Xingyao Li
Performant, Multi-Objective Scheduling of Highly Interleaved Task Graphs on Heterogeneous System on Chip Devices.Joshua Mack Samet E. Arda Ümit Y. Ogras Ali Akoglu
Exploiting Concurrency in Sharded Parallel State Machine Replication.Aldenio Burgos Eduardo Alchieri Fernando Luís Dotti Fernando Pedone
DePo: Dynamically Offload Expensive Event Processing to the Edge of Cyber-Physical Systems.Meng Ma Jingbin Zhang Ping Wang
Addressing the Read-Performance Impact of Reconfigurations in Replicated Key-Value Stores.Antonis Papaioannou Kostas Magoutis
Cooperative Edge Caching Based on Temporal Convolutional Networks.Xu Zhang Zhengnan Qi Geyong Min Wang Miao Qilin Fan Zhan Ma
Cost-Efficient Workflow Scheduling Algorithm for Applications With Deadline Constraint on Heterogeneous Clouds.Xiaoyong Tang Wenbiao Cao Huiya Tang Tan Deng Jing Mei Yi Liu Cheng Shi Meng Xia Zeng Zeng
PPOAccel: A High-Throughput Acceleration Framework for Proximal Policy Optimization.Yuan Meng Sanmukh R. Kuppannagari Rajgopal Kannan Viktor K. Prasanna
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From the University of Texas at Austin.Brock Davis Juan Paez Jack Gaither Joe A. Garcia
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Nanyang Technological University.
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Clemson University.Griffin Dube Cavender Holt John Hollowell Sarah Placke Sansriti Ranjan Nikolas Heitzig Jon Calhoun
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Tsinghua University.Runxin Zhong Jiajie Chen Chen Zhang Mingshu Zhai Zeyu Song Yutian Wang Wentao Han Lin Gan Jidong Zhai
Reproducibility: Performance Evaluation of MemXCT on Azure CycleCloud Platform.Yuchen Liu Yixuan Meng Kaiyuan Xu Zijun Xu Tianyuan Wu Yiwei Yang Shu Yin
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From University of California San Diego.Xiaochen Li Maximilian Apodaca Arunav Gupta Zihao Kong Hongyi Pan Hongyu Zhou Mary Thomas Martin Kandes Zhaoyi Li Mahidhar Tatineni Lewis Carroll
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From ETH Zürich.Jan Kleine Rahul Steiger Simon Wachter Emir Isman Simon Jacob Dario Romaniello
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Georgia Tech.Nicole Prindle Ali Kazmi Aman Jain Albert Chen Marissa Sorkin Sudhanshu Agarwal Richard W. Vuduc Vijay Thakkar
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Peking University.Zejia Fan Yuchen Gu Zhewen Hao Yueyang Pan Pengcheng Xu Yuxuan Yan Fangyuan Yang Zhenxin Fu Yun Liang
MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging.Mert Hidayetoglu Tekin Biçer Simon Garcia de Gonzalo Bin Ren Doga Gürsoy Rajkumar Kettimuthu Ian T. Foster Wen-Mei W. Hwu
Advancing Adoption of Reproducibility in HPC: A Preface to the Special Section.
EiC Editorial - Advancing Reproducibility in Parallel and Distributed Systems Research.Yongheng Deng Feng Lyu Ju Ren Huaqing Wu Yuezhi Zhou Yaoxue Zhang Xuemin Shen
Cost-Effective Web Application Replication and Deployment in Multi-Cloud Environment.Tao Shi Hui Ma Gang Chen Sven Hartmann
TensorOpt: Exploring the Tradeoffs in Distributed DNN Training With Auto-Parallelism.Zhenkun Cai Xiao Yan Kaihao Ma Yidi Wu Yuzhen Huang James Cheng Teng Su Fan Yu
TridentKV: A Read-Optimized LSM-Tree Based KV Store via Adaptive Indexing and Space-Efficient Partitioning.Kai Lu Nannan Zhao Jiguang Wan Changhong Fei Wei Zhao Tongliang Deng
Completely Independent Spanning Trees on BCCC Data Center Networks With an Application to Fault-Tolerant Routing.Xiao-Yan Li Wanling Lin Ximeng Liu Cheng-Kuan Lin Kung-Jui Pai Jou-Ming Chang
A GPU-Oriented Application Programming Interface for Digital Audio Workstations.Daniele Bianchi Federico Avanzini Adriano Baratè Luca A. Ludovico Giorgio Presti
Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning.Zheyi Chen Jia Hu Geyong Min Chunbo Luo Tarek A. El-Ghazawi
Construction of Dual-CISTs on an Infinite Class of Networks.Xiao-Wen Qin Rong-Xia Hao Jie Wu
Scaling Poisson Solvers on Many Cores via MMEwald.Mingchuan Wu Yangjun Wu Honghui Shang Ying Liu Huimin Cui Fang Li Xiaohui Duan Yunquan Zhang Xiaobing Feng
CSEdge: Enabling Collaborative Edge Storage for Multi-Access Edge Computing Based on Blockchain.Liang Yuan Qiang He Feifei Chen Jun Zhang Lianyong Qi Xiaolong Xu Yang Xiang Yun Yang
Evaluating Data Redistribution in PaRSEC.Qinglei Cao George Bosilca Nuria Losada Wei Wu Dong Zhong Jack J. Dongarra
Online Learning for Distributed Computation Offloading in Wireless Powered Mobile Edge Computing Networks.Xiaojie Wang Zhaolong Ning Lei Guo Song Guo Xinbo Gao Guoyin Wang
Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum.Kaihua Fu Wei Zhang Quan Chen Deze Zeng Minyi Guo
Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment.Bingting Jiang Zhuo Tang Xiong Xiao Jing Yao Ronghui Cao Kenli Li
LoomIO: Object-Level Coordination in Distributed File Systems.Yusheng Hua Xuanhua Shi Kang He Hai Jin Wei Xie Ligang He Yong Chen
A Bifactor Approximation Algorithm for Cloudlet Placement in Edge Computing.
FedGraph: Federated Graph Learning With Intelligent Sampling.
iBalancer: Load-Aware in-Server Flow Scheduling for Sub-Millisecond Tail Latency.Limei Lin Yanze Huang Yuhang Lin Sun-Yuan Hsieh Li Xu
FARNN: FPGA-GPU Hybrid Acceleration Platform for Recurrent Neural Networks.Hyungmin Cho Jeesoo Lee Jaejin Lee
A Highly-Available Move Operation for Replicated Trees.Martin Kleppmann Dominic P. Mulligan Victor B. F. Gomes Alastair R. Beresford
Performance and Cost-Efficient Spark Job Scheduling Based on Deep Reinforcement Learning in Cloud Computing Environments.Muhammed Tawfiqul Islam Shanika Karunasekera Rajkumar Buyya
TherMa-MiCs: Thermal-Aware Scheduling for Fault-Tolerant Mixed-Criticality Systems.Sepideh Safari Heba Khdr Pourya Gohari-Nazari Mohsen Ansari Shaahin Hessabi Jörg Henkel
Design and Simulation of Content-Aware Hybrid DRAM-PCM Memory System.Yinjin Fu Yutong Lu Zhiguang Chen Yang Wu Nong Xiao
TODG: Distributed Task Offloading With Delay Guarantees for Edge Computing.Sheng Yue Ju Ren Nan Qiao Yongmin Zhang Hongbo Jiang Yaoxue Zhang Yuanyuan Yang
Pistis: Issuing Trusted and Authorized Certificates With Distributed Ledger and TEE.Zecheng Li Haotian Wu Ricky Lap-Hou Lao Songtao Guo Yuanyuan Yang Bin Xiao
Wukong+G: Fast and Concurrent RDF Query Processing Using RDMA-Assisted GPU Graph Exploration.Zihang Yao Rong Chen Binyu Zang Haibo Chen
Cooperative Scheduling Schemes for Explainable DNN Acceleration in Satellite Image Analysis and Retraining.
A Fast $f(r, k+1)/k$-Diagnosis for Interconnection Networks Under MM* Model.Yanze Huang Limei Lin Sun-Yuan Hsieh
LOCUS: User-Perceived Delay-Aware Service Placement and User Allocation in MEC Environment.Yu Chen Sheng Zhang Yibo Jin Zhuzhong Qian Mingjun Xiao Jidong Ge Sanglu Lu
SaPus: Self-Adaptive Parameter Update Strategy for DNN Training on Multi-GPU Clusters.
Exploring Data Analytics Without Decompression on Embedded GPU Systems.Zaifeng Pan Feng Zhang Yanliang Zhou Jidong Zhai Xipeng Shen Onur Mutlu Xiaoyong Du
Topology-Aware Neural Model for Highly Accurate QoS Prediction.
Necessary Feasibility Analysis for Mixed-Criticality Real-Time Embedded Systems.Yan Ding Kenli Li Chubo Liu Keqin Li
Deep Reinforcement Learning for Load-Balancing Aware Network Control in IoT Edge Systems.Qingzhi Liu Tiancong Xia Long Cheng Merijn van Eijk Tanir Ozcelebi Ying Mao
Coarse Grained FPGA Overlay for Rapid Just-In-Time Accelerator Compilation.Abhishek Kumar Jain Douglas L. Maskell Suhaib A. Fahmy
EnosLib: A Library for Experiment-Driven Research in Distributed Computing.Ronan-Alexandre Cherrueau Marie Delavergne Alexandre van Kempen Adrien Lebre Dimitri Pertin Javier Rojas Balderrama Anthony Simonet Matthieu Simonin
A Survey of GPU Multitasking Methods Supported by Hardware Architecture.Chen Zhao Wu Gao Feiping Nie Huiyang Zhou
Modeling Speedup in Multi-OS Environments.Brian R. Tauro Conghao Liu Kyle C. Hale
HSA-Net: Hidden-State-Aware Networks for High-Precision QoS Prediction.Ziliang Wang Xiaohong Zhang Meng Yan Ling Xu Dan Yang
Network Cost-Aware Geo-Distributed Data Analytics System.Kwangsung Oh Minmin Zhang Abhishek Chandra Jon B. Weissman
VQL: Efficient and Verifiable Cloud Query Services for Blockchain Systems.Haotian Wu Zhe Peng Songtao Guo Yuanyuan Yang Bin Xiao
Customer Adaptive Resource Provisioning for Long-Term Cloud Profit Maximization under Constrained Budget.Peijin Cong Zhixing Zhang Junlong Zhou Xin Liu Yao Liu Tongquan Wei
Benchmarking 50-Photon Gaussian Boson Sampling on the Sunway TaihuLight.Yuxuan Li Lin Gan Mingcheng Chen Yaojian Chen Haitian Lu Chao-Yang Lu Jian-Wei Pan Haohuan Fu Guangwen Yang
On Mixing Eventual and Strong Consistency: Acute Cloud Types.Maciej Kokocinski Tadeusz Kobus Pawel T. Wojciechowski
Distributed Graph Realizations.John Augustine Keerti Choudhary Avi Cohen David Peleg Sumathi Sivasubramaniam Suman Sourav
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System.Tsung-Wei Huang Dian-Lun Lin Chun-Xun Lin Yibo Lin
DS-ADMM++: A Novel Distributed Quantized ADMM to Speed up Differentially Private Matrix Factorization.Feng Zhang Erkang Xue Ruixin Guo Guangzhi Qu Gansen Zhao Albert Y. Zomaya
Power Log'n'Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols.Kiril Dichev Daniele De Sensi Dimitrios S. Nikolopoulos Kirk W. Cameron Ivor T. A. Spence
Junsong Fu Na Wang Baojiang Cui Bharat K. Bhargava
Workload Balancing via Graph Reordering on Multicore Systems.
Mapping-Aware Kernel Partitioning Method for CGRAs Assisted by Deep Learning.Takuya Kojima Ayaka Ohwada Hideharu Amano
Maximizing User Service Satisfaction for Delay-Sensitive IoT Applications in Edge Computing.Jing Li Weifa Liang Wenzheng Xu Zichuan Xu Xiaohua Jia Wanlei Zhou Jin Zhao
Towards Revenue-Driven Multi-User Online Task Offloading in Edge Computing.Zhi Ma Sheng Zhang Zhiqi Chen Tao Han Zhuzhong Qian Mingjun Xiao Ning Chen Jie Wu Sanglu Lu
DIESEL+: Accelerating Distributed Deep Learning Tasks on Image Datasets.Lipeng Wang Qiong Luo Shengen Yan
Online Reconfiguration of IoT Applications in the Fog: The Information-Coordination Trade-Off.Bruno Donassolo Arnaud Legrand Panayotis Mertikopoulos Ilhem Fajjari
Data, User and Power Allocations for Caching in Multi-Access Edge Computing.Xiaoyu Xia Feifei Chen Qiang He Guangming Cui John C. Grundy Mohamed Almorsy Abdelrazek Xiaolong Xu Hai Jin
Elastic Parameter Server: Accelerating ML Training With Scalable Resource Scheduling.Shaoqi Wang Aidi Pi Xiaobo Zhou
Addictive Incentive Mechanism in Crowdsensing From the Perspective of Behavioral Economics.Jiaqi Liu Shiyue Huang Deng Li Sheng Wen Hui Liu
Parallel and Asynchronous Smart Contract Execution.Jian Liu Peilun Li Raymond Cheng N. Asokan Dawn Song
Parallel and Distributed Structured SVM Training.Jiantong Jiang Zeyi Wen Ze-ke Wang Bingsheng He Jian Chen
Efficient and Accurate Flow Record Collection With HashFlow.Zongyi Zhao Xingang Shi Zhiliang Wang Qing Li Han Zhang Xia Yin
Exploring the Galaxyfly Family to Build Flexible-Scale Interconnection Networks.Fei Lei Dezun Dong Xiangke Liao
A Low-Power Transprecision Floating-Point Cluster for Efficient Near-Sensor Data Analytics.Fabio Montagna Stefan Mach Simone Benatti Angelo Garofalo Gianmarco Ottavi Luca Benini Davide Rossi Giuseppe Tagliavini
Neil Lindquist Piotr Luszczek Jack J. Dongarra
gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs.Anil Gaihre Xiaoye Sherry Li Hang Liu
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication.Gordon Euhyun Moon Hyoukjun Kwon Geonhwa Jeong Prasanth Chatarasi Sivasankaran Rajamanickam Tushar Krishna
Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems.Ariful Azad Oguz Selvitopi Md Taufique Hussain John R. Gilbert Aydin Buluç
libEnsemble: A Library to Coordinate the Concurrent Evaluation of Dynamic Ensembles of Calculations.Stephen Hudson Jeffrey Larson John-Luke Navarro Stefan M. Wild
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC.Sameh Abdulah Qinglei Cao Yu Pei George Bosilca Jack J. Dongarra Marc G. Genton David E. Keyes Hatem Ltaief Ying Sun
VPIC 2.0: Next Generation Particle-in-Cell Simulations.Robert F. Bird Nigel Tan Scott V. Luedtke Stephen Lien Harrell Michela Taufer Brian J. Albright
TianheGraph: Customizing Graph Search for Graph500 on Tianhe Supercomputer.Xinbiao Gan Yiming Zhang Ruibo Wang Tiejun Li Tiaojie Xiao Ruigeng Zeng Jie Liu Kai Lu
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in Large-Scale Dynamic Networks.Arindam Khanda Sriram Srinivasan Sanjukta Bhowmick Boyana Norris Sajal K. Das
Characterizing Performance of Graph Neighborhood Communication Patterns.Sayan Ghosh Nathan R. Tallent Mahantesh Halappanavar
Accelerating HDF5 I/O for Exascale Using DAOS.Jérome Soumagne Jordan Henderson Mohamad Chaarawi Neil Fortner M. Scot Breitenfeld Songyu Lu Dana Robinson Elena Pourmal Johann Lombardi
Transparent Asynchronous Parallel I/O Using Background Threads.Houjun Tang Quincey Koziol John Ravi Suren Byna
Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization.Lipeng Wan Axel Huebl Junmin Gu Franz Poeschel Ana Gainaru Ruonan Wang Jieyang Chen Xin Liang Dmitry Ganyushin Todd S. Munson Ian T. Foster Jean-Luc Vay Norbert Podhorszki Kesheng Wu Scott Klasky
Enabling Scalable and Extensible Memory-Mapped Datastores in Userspace.Ivy Bo Peng Maya B. Gokhale Karim Youssef Keita Iwabuchi Roger Pearce
An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications.Keren Zhou Xiaozhu Meng Ryuichi Sai Dejan Grubisic John M. Mellor-Crummey
The PetscSF Scalable Communication Layer.Junchao Zhang Jed Brown Satish Balay Jacob Faibussowitsch Matthew G. Knepley Oana Marin Richard Tran Mills Todd S. Munson Barry F. Smith Stefano Zampini
LB4OMP: A Dynamic Load Balancing Library for Multithreaded Applications.Jonas H. Müller Korndörfer Ahmed Eleliemy Ali Mohammed Florina M. Ciorba
Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms.André Merzky Matteo Turilli Mikhail Titov Aymen Al-Saadi Shantenu Jha
Kokkos 3: Programming Model Extensions for the Exascale Era.Christian R. Trott Damien Lebrun-Grandié Daniel Arndt Jan Ciesko Vinh Q. Dang Nathan D. Ellingwood Rahulkumar Gayatri Evan Harvey Daisy S. Hollman Dan Ibanez Nevin Liber Jonathan R. Madsen Jeff Miles David Poliakoff Amy Powell Sivasankaran Rajamanickam Mikael Simberg Dan Sunderland Bruno Turcksin Jeremiah Wilke
EXA2PRO: A Framework for High Development Productivity on Heterogeneous Computing Systems.Lazaros Papadopoulos Dimitrios Soudris Christoph W. Kessler August Ernstsson Johan Ahlqvist Nikos Vasilas Athanasios I. Papadopoulos Panos Seferlis Charles Prouveur Matthieu Haefele Samuel Thibault Athanasios Salamanis Theodoros Ioakimidis Dionysios D. Kehagias
Compiler-Assisted Compaction/Restoration of SIMD Instructions.Juan M. Cebrian Thibaud Balem Adrián Barredo Marc Casas Miquel Moretó Alberto Ros Alexandra Jimborean
Near-Zero Downtime Recovery From Transient-Error-Induced Crashes.Chao Chen Greg Eisenhauer Santosh Pande
Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach.Yiming Wang Weizhe Zhang Meng Hao Zheng Wang
Anomaly Detection and Anticipation in High Performance Computing Systems.Andrea Borghesi Martin Molan Michela Milano Andrea Bartolini
IEEE Special Issue on Innovative R&D Toward the Exascale Era.Linsong Cheng Jiliang Wang Yinghui Li
Efficient, Dynamic Multi-Task Execution on FPGA-Based Computing Systems.Umar Ibrahim Minhas Roger F. Woods Dimitrios S. Nikolopoulos Georgios Karakonstantis
Timed Loops for Distributed Storage in Wireless Networks.Anandarup Mukherjee Pallav Kumar Deb Sudip Misra
Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments.Xing Chen Jianshan Zhang Bing Lin Zheyi Chen Katinka Wolter Geyong Min
Mechanisms for Resource Allocation and Pricing in Mobile Edge Computing Systems.Tayebeh Bahreini Hossein Badri Daniel Grosu
On the Analysis of Cache Invalidation With LRU Replacement.Quan Zheng Tao Yang Yuanzhi Kan Xiaobin Tan Jian Yang Xiaofeng Jiang
Propagation Pattern for Moment Representation of the Lattice Boltzmann Method.John Gounley Madhurima Vardhan Erik W. Draeger Pedro Valero-Lara Shirley V. Moore Amanda Randles
Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing.
Harnessing the Potential of Function-Reuse in Multimedia Cloud Systems.Chavit Denninnart Mohsen Amini Salehi
Fast and Portable Concurrent FIFO Queues With Deterministic Memory Reclamation.
Resilient Real-Valued Consensus in Spite of Mobile Malicious Agents on Directed Graphs.Yuan Wang Hideaki Ishii François Bonnet Xavier Défago
Online Pricing and Trading of Private Data in Correlated Queries.Hui Cai Fan Ye Yuanyuan Yang Yanmin Zhu Jie Li Fu Xiao
cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope.Yiwen Gao Jia Xu Hongbing Wang
Decentralized Edge Intelligence: A Dynamic Resource Allocation Framework for Hierarchical Federated Learning.Wei Yang Bryan Lim Jer Shyuan Ng Zehui Xiong Jiangming Jin Yang Zhang Dusit Niyato Cyril Leung Chunyan Miao
Work-Stealing Prefix Scan: Addressing Load Imbalance in Large-Scale Image Registration.Marcin Copik Tobias Grosser Torsten Hoefler Paolo Bientinesi Benjamin Berkels
Optimal Checkpointing Strategies for Iterative Applications.Yishu Du Loris Marchal Guillaume Pallez Yves Robert
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training.Shixiong Zhao Fanxin Li Xusheng Chen Xiuxian Guan Jianyu Jiang Dong Huang Yuhao Qing Sen Wang Peng Wang Gong Zhang Cheng Li Ping Luo Heming Cui
Jie Cui Bei Li Hong Zhong Geyong Min Yan Xu Lu Liu
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression.Feng Zhang Jidong Zhai Xipeng Shen Onur Mutlu Xiaoyong Du
A Block-Based Triangle Counting Algorithm on Heterogeneous Environments.Abdurrahman Yasar Sivasankaran Rajamanickam Jonathan W. Berry Ümit V. Çatalyürek
Tensorox: Accelerating GPU Applications via Neural Approximation on Unused Tensor Cores.
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity.Limei Lin Yanze Huang Li Xu Sun-Yuan Hsieh
Optimizing Network Transfers for Data Analytic Jobs Across Geo-Distributed Datacenters.
Repurposing GPU Microarchitectures with Light-Weight Out-Of-Order Execution.Konstantinos Iliakis Sotirios Xydis Dimitrios Soudris
PostMan: Rapidly Mitigating Bursty Traffic via On-Demand Offloading of Packet Processing.Yipei Niu Panpan Jin Jian Guo Yikai Xiao Rong Shi Fangming Liu Chen Qian Yang Wang
Optimization of Reactive Force Field Simulation: Refactor, Parallelization, and Vectorization for Interactions.Ping Gao Xiaohui Duan Bertil Schmidt Wusheng Zhang Lin Gan Haohuan Fu Wei Xue Weiguo Liu Guangwen Yang
EdgeDR: An Online Mechanism Design for Demand Response in Edge Clouds.Amelie Chi Zhou Weilin Xue Yao Xiao Bingsheng He Shadi Ibrahim Reynold Cheng
PLVER: Joint Stable Allocation and Content Replication for Edge-Assisted Live Video Delivery.Huan Wang Guoming Tang Kui Wu Jianping Wang
Energy-Efficient Cache-Aware Scheduling on Heterogeneous Multicore Systems.Saad Zia Sheikh Muhammad Adeel Pasha
Communication-Efficient Federated Learning With Compensated Overlap-FedAvg.Yuhao Zhou Qing Ye Jiancheng Lv
Scalable, Confidential and Survivable Software Updates.Federico Magnanini Luca Ferretti Michele Colajanni
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures.Zhen Xie Guangming Tan Weifeng Liu Ninghui Sun
Elastic Deep Learning in Multi-Tenant GPU Clusters.Yidi Wu Kaihao Ma Xiao Yan Zhi Liu Zhenkun Cai Yuzhen Huang James Cheng Han Yuan Fan Yu
Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs.Tongfeng Weng Xu Zhou Kenli Li Peng Peng Keqin Li
FlitZip: Effective Packet Compression for NoC in MultiProcessor System-on-Chip.Dipika Deb Rohith M. K. John Jose
COSCO: Container Orchestration Using Co-Simulation and Gradient Based Optimization for Fog Computing Environments.Shreshth Tuli Shivananda R. Poojara Satish Narayana Srirama Giuliano Casale Nicholas R. Jennings
Horus: Interference-Aware and Prediction-Based Scheduling in Deep Learning Systems.Gingfung Yeung Damian Borowiec Renyu Yang Adrian Friday Richard Harper Peter Garraghan
Optimizing Depthwise Separable Convolution Operations on GPUs.Gangzhao Lu Weizhe Zhang Zheng Wang
Optimal Repair-Scaling Trade-off in Locally Repairable Codes: Analysis and Evaluation.Si Wu Zhirong Shen Patrick P. C. Lee Yinlong Xu
runData: Re-Distributing Data via Piggybacking for Geo-Distributed Data Analytics Over Edges.Yibo Jin Zhuzhong Qian Song Guo Sheng Zhang Lei Jiao Sanglu Lu
Capelin: Data-Driven Compute Capacity Procurement for Cloud Datacenters Using Portfolios of Scenarios.Georgios Andreadis Fabian Mastenbroek Vincent van Beek Alexandru Iosup
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment.Haozhao Wang Song Guo Zhihao Qu Ruixuan Li Ziming Liu
DeTraS: Delaying Stores for Friendly-Fire Mitigation in Hardware Transactional Memory.J. Rubén Titos Gil Ricardo Fernández Pascual Alberto Ros Manuel E. Acacio