I am now conducting the real.exe with the WRF ARW V4.0, and the details can be seem in the attached file namelist.input. It is at 15.0km grids space with 345X310 grids.
Now I have to conducet a longterm simulation (about 30 years), and I need to processe a laege number of time steps in a single lateral boundary file (the issue of limited 10000-maximum number of time periods have been adress, see my former post thread. Thanks WRF&MPAS-Aforum and kwerner again).
When I run the ./real.exe (mpirun -np 24 ./real.exe &), I found the output speed is much lower than I expected. It need almost 3.0 seconds to deal with each loop:
...
d01 1993-03-23_06:00:00 Timing for processing 0 s.
d01 1993-03-23_06:00:00 Timing for output 0 s.
d01 1993-03-23_06:00:00 Timing for loop # 4710 = 3 s.
...
I have tried to use mpi and with more nodes, more tasks and more cpus (> 120) to run the real.exe, but it seems that it still use 3-seconds for each loop.
I also tried to modify the &namelist_quilt options here:
&namelist_quilt
nio_tasks_per_group = 1,
nio_groups = 6,
but it didn't work.
So I hope you could give some advice to improve the speed of running real.exe except adding more cpus ?
Thanks a lot.
Now I have to conducet a longterm simulation (about 30 years), and I need to processe a laege number of time steps in a single lateral boundary file (the issue of limited 10000-maximum number of time periods have been adress, see my former post thread. Thanks WRF&MPAS-Aforum and kwerner again).
When I run the ./real.exe (mpirun -np 24 ./real.exe &), I found the output speed is much lower than I expected. It need almost 3.0 seconds to deal with each loop:
...
d01 1993-03-23_06:00:00 Timing for processing 0 s.
d01 1993-03-23_06:00:00 Timing for output 0 s.
d01 1993-03-23_06:00:00 Timing for loop # 4710 = 3 s.
...
I have tried to use mpi and with more nodes, more tasks and more cpus (> 120) to run the real.exe, but it seems that it still use 3-seconds for each loop.
I also tried to modify the &namelist_quilt options here:
&namelist_quilt
nio_tasks_per_group = 1,
nio_groups = 6,
but it didn't work.
So I hope you could give some advice to improve the speed of running real.exe except adding more cpus ?
Thanks a lot.