Dear WRF user,
I am currently facing an issue related to the task distribution of the WRF model grid in my simulation based on WRF v4.3.3. I intend to run WRF in high-resolution (3x3 and 1.5x1.5) for a larger domain and to achieve faster simulation times, I want to increase the number of processors. The supercomputer I am using is NPAD-UFRN BRAZIL, which has 40 nodes with 60 tasks per node. To determine the maximum number of nodes and processors I can utilize, I employed the Python script (number_of_procs.py) recommended by kwerner.
However, I encountered a problem when attempting to increase the number of nodes and tasks beyond a certain limit. Currently, I am able to use only 3 nodes with a total of 169 tasks. Whenever I try to increase the number, such as using 4 nodes and 225 tasks, the WRF distribution appears to proceed normally, but the model eventually hangs or crashes without displaying any error message. I have exhaustively attempted various combinations, following the square rule to equally divide the domain.
As a newcomer to this field, I am uncertain whether this issue stems from a problem with the model itself or if it is a computational limitation. I have attached the WRF namelist.input and the rsl.out.0000 files for reference.
Thanks!
Nícolas
I am currently facing an issue related to the task distribution of the WRF model grid in my simulation based on WRF v4.3.3. I intend to run WRF in high-resolution (3x3 and 1.5x1.5) for a larger domain and to achieve faster simulation times, I want to increase the number of processors. The supercomputer I am using is NPAD-UFRN BRAZIL, which has 40 nodes with 60 tasks per node. To determine the maximum number of nodes and processors I can utilize, I employed the Python script (number_of_procs.py) recommended by kwerner.
However, I encountered a problem when attempting to increase the number of nodes and tasks beyond a certain limit. Currently, I am able to use only 3 nodes with a total of 169 tasks. Whenever I try to increase the number, such as using 4 nodes and 225 tasks, the WRF distribution appears to proceed normally, but the model eventually hangs or crashes without displaying any error message. I have exhaustively attempted various combinations, following the square rule to equally divide the domain.
As a newcomer to this field, I am uncertain whether this issue stems from a problem with the model itself or if it is a computational limitation. I have attached the WRF namelist.input and the rsl.out.0000 files for reference.
Thanks!
Nícolas