Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | |||
technical:whitepaper:darwin_ucx_openmpi [2021-02-12 17:09] – [Chunked Broadcast] frey | technical:whitepaper:darwin_ucx_openmpi [2021-02-12 17:15] (current) – frey | ||
---|---|---|---|
Line 244: | Line 244: | ||
Using this module, the previous code is transformed to: | Using this module, the previous code is transformed to: | ||
<code fortran> | <code fortran> | ||
+ | Use mpi_utils | ||
+ | : | ||
+ | Allocate(Rmat(2, | ||
+ | : | ||
+ | Call BroadcastR(Rmat, | ||
+ | </ | ||
+ | The data is now broadcast as a 268435456-element chunk followed by a 20084484-element chunk, both of which are well below the 1.0 GiB limit associated with the OFI MTL. The efficiency of MTL should be well in excess of the overhead involved in the " |