DeepComp: towards a balanced system design for high performance computer systems
作者:Mingfa Zhu, Limin Xiao, Li Ruan, Qinfen Hao
摘要
Today, cluster-based computing is the mainstream architecture for high end computer systems. Balanced system design is critical for large scale cluster systems to achieve high efficiency. This paper addresses the practice on DeepComp high end computer systems toward a balanced system design. Methodologies of designing balanced large scale cluster systems are given. A method for balancing central processing unit (CPU) and memory hierarchy is addressed. For balancing computing nodes and I/O systems, two approaches are given: maximum bandwidth criterion and maximum number of computing nodes which can concurrently access I/O systems. Experiences of Lenovo high end cluster systems show that above methods are effective. Lenovo strategies toward a balanced system design for both peta and 10 peta scale high productivity computing systems (HPCSs).
论文关键词:high performance computer systems (HPCs), high productivity computing systems (HPCSs), cluster, balanced system design
论文评审过程:
论文官网地址:https://doi.org/10.1007/s11704-010-0150-z