Skip to content

Benchmark Migrate performance

Sam Reeve edited this page Oct 16, 2023 · 3 revisions

The plots below show performance of particle communication on the ORNL Frontier supercomputer. Both CPU and GPU performance are compared as a function of total particle count per rank (with 8 MPI ranks). These are the only core library benchmarks intended to run with multiple MPI ranks.

Create refers to building the communication steering vectors (construction of the Distributor) and migrate refers to executing the communication (sending particles to new ranks). Each point represents a single fraction of particles communicated.

Frontier

Device-Device

Device-Host

Host-Host

Implementation

Default parameters with the commandline "large" setting were used for these results.

Clone this wiki locally