cuda graph utilization #4356

indra098124 · 2025-02-26T21:28:40Z

indra098124
Feb 26, 2025

May I ask how cuda graphs are used in AMReX?

Mar 11, 2025

In our old communication functions, there were a lot of smaller kernels. So we used cudaGraph to reduce the kernel launch overhead. But later, we found that manually fusing the small kernels was faster than cudaGraph for our cases. So we no longer use cudaGraph in communication unless one forces it by setting cudaGraph region.

View full answer

WeiqunZhang · 2025-03-11T00:35:44Z

WeiqunZhang
Mar 11, 2025
Maintainer

In our old communication functions, there were a lot of smaller kernels. So we used cudaGraph to reduce the kernel launch overhead. But later, we found that manually fusing the small kernels was faster than cudaGraph for our cases. So we no longer use cudaGraph in communication unless one forces it by setting cudaGraph region.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda graph utilization #4356

{{title}}

Replies: 1 comment

{{title}}

Select a reply

cuda graph utilization #4356

indra098124 Feb 26, 2025

Replies: 1 comment

WeiqunZhang Mar 11, 2025 Maintainer

indra098124
Feb 26, 2025

WeiqunZhang
Mar 11, 2025
Maintainer