Publication result detail

Use of Multiple GPUs on Shared Memory Multiprocessors for Ultrasound Propagation Simulations

JAROŠ, J.; TREEBY, B.; RENDELL, A.

Original Title

Use of Multiple GPUs on Shared Memory Multiprocessors for Ultrasound Propagation Simulations

English Title

Use of Multiple GPUs on Shared Memory Multiprocessors for Ultrasound Propagation Simulations

Type

Paper in proceedings (conference paper)

Original Abstract

This paper outlines our effort to migrate a compute intensive application of ultrasound propagation being developed in Matlab to a cluster computer where each node has seven GPUs. Our goal is to perform realistic simulations in hours and minutes instead of weeks and days. In order to reach this goal we investigate architecture characteristics of the target system focusing on the PCI-Express subsystem and new features proposed in CUDA version 4.0, especially simultaneous host to device, device to host and peer-to-peer transfers that the application is going to highly benefit from. We also present the results from a CPU based implementation and discuss future directions to exploit multiple GPUs.

English abstract

This paper outlines our effort to migrate a compute intensive application of ultrasound propagation being developed in Matlab to a cluster computer where each node has seven GPUs. Our goal is to perform realistic simulations in hours and minutes instead of weeks and days. In order to reach this goal we investigate architecture characteristics of the target system focusing on the PCI-Express subsystem and new features proposed in CUDA version 4.0, especially simultaneous host to device, device to host and peer-to-peer transfers that the application is going to highly benefit from. We also present the results from a CPU based implementation and discuss future directions to exploit multiple GPUs.

Keywords

Ultrasound simulation, 7-GPU system, CUDA, Matlab, FFT, PCI-Express, bandwidth, multi-core

Key words in English

Ultrasound simulation, 7-GPU system, CUDA, Matlab, FFT, PCI-Express, bandwidth, multi-core

Authors

JAROŠ, J.; TREEBY, B.; RENDELL, A.

Released

01.02.2012

Location

Melbourne

ISBN

978-1-921770-08-1

Book

Australasian Symposium on Parallel and Distributed Computing (AusPDC 2012)

Pages from

43

Pages to

52

Pages count

10

URL

BibTex

@inproceedings{BUT91284,
  author="Jiří {Jaroš} and Bradley {Treeby} and Alistair {Rendell}",
  title="Use of Multiple GPUs on Shared Memory Multiprocessors for Ultrasound Propagation Simulations",
  booktitle="Australasian Symposium on Parallel and Distributed Computing (AusPDC 2012)",
  year="2012",
  pages="43--52",
  address="Melbourne",
  isbn="978-1-921770-08-1",
  url="http://crpit.com/confpapers/CRPITV127Jaros.pdf"
}

Documents