Dear WRF users,
I have installed WRF v4.2 with dmpar in a cluster which has three nodes, 8 CPUs each.
I test run this cluster with CAMx model (air modeling), the speed is 3 times higher than the case using only 1 processor (i.e. no useing mpiexe)
However, when I test with WRF (set -np 9), the speed is almost the same with the case of 1 processor.
Please advise me how can I speed up the wrf simulation?
Let me explain my case:
When I set run command as:
The running shows:
Then 9 rsl* files were generated (rsl.out.0000, rsl.out.0001, ..., rsl.out.0008)
Reading these 9 files, I found that that only time for reading inputs are divided between 9 cores, while "time for main" were only in 1 processor. So I guest this is the reason why the speed of simulation was not increase. How can we divide the task of "timing for main"
This is the endling lines of rsl.out.0000 file showing "timing for main" running in this core
And this is the rsl.out.0001, all other rsl* file have the same, only working for inputs, then waiting to complete. "timing for main" were not run in these cores
I attached here the namelist.input, rsl.out.0000 and rsl.out.0001.
Please help!
Thanks
Ha Chi
I have installed WRF v4.2 with dmpar in a cluster which has three nodes, 8 CPUs each.
I test run this cluster with CAMx model (air modeling), the speed is 3 times higher than the case using only 1 processor (i.e. no useing mpiexe)
However, when I test with WRF (set -np 9), the speed is almost the same with the case of 1 processor.
Please advise me how can I speed up the wrf simulation?
Let me explain my case:
When I set run command as:
Code:
mpiexe --machinefile machinefile.txt -np 9 ./wrf.exe
Code:
starting wrf task 0 of 9
starting wrf task 1 of 9
starting wrf task 2 of 9
starting wrf task 3 of 9
starting wrf task 4 of 9
starting wrf task 5 of 9
starting wrf task 6 of 9
starting wrf task 7 of 9
starting wrf task 8 of 9
Reading these 9 files, I found that that only time for reading inputs are divided between 9 cores, while "time for main" were only in 1 processor. So I guest this is the reason why the speed of simulation was not increase. How can we divide the task of "timing for main"
This is the endling lines of rsl.out.0000 file showing "timing for main" running in this core
Code:
Timing for main: time 2017-12-25_05:59:00 on domain 2: 9.60069 elapsed seconds
Timing for main: time 2017-12-25_05:59:10 on domain 3: 1.89823 elapsed seconds
Timing for main: time 2017-12-25_05:59:20 on domain 3: 1.86424 elapsed seconds
Timing for main: time 2017-12-25_05:59:30 on domain 3: 1.76740 elapsed seconds
Timing for main: time 2017-12-25_05:59:30 on domain 2: 9.53581 elapsed seconds
Timing for main: time 2017-12-25_05:59:40 on domain 3: 1.91286 elapsed seconds
Timing for main: time 2017-12-25_05:59:50 on domain 3: 1.82460 elapsed seconds
Timing for main: time 2017-12-25_06:00:00 on domain 3: 1.78574 elapsed seconds
Timing for Writing wrfout_d03_2017-12-25_06:00:00 for domain 3: 2.84209 elapsed seconds
Timing for main: time 2017-12-25_06:00:00 on domain 2: 12.28063 elapsed seconds
Timing for Writing wrfout_d02_2017-12-25_06:00:00 for domain 2: 1.90340 elapsed seconds
Timing for main: time 2017-12-25_06:00:00 on domain 1: 35.79001 elapsed seconds
Timing for Writing wrfout_d01_2017-12-25_06:00:00 for domain 1: 0.96697 elapsed seconds
d01 2017-12-25_06:00:00 wrf: SUCCESS COMPLETE WRF
And this is the rsl.out.0001, all other rsl* file have the same, only working for inputs, then waiting to complete. "timing for main" were not run in these cores
Code:
INPUT LandUse = "MODIFIED_IGBP_MODIS_NOAH"
LANDUSE TYPE = "MODIFIED_IGBP_MODIS_NOAH" FOUND 33 CATEGORIES 2 SEASONS WATER CATEGORY = 17 SNOW CATEGORY = 15
INITIALIZE THREE Noah LSM RELATED TABLES
INPUT LandUse = "MODIFIED_IGBP_MODIS_NOAH"
LANDUSE TYPE = "MODIFIED_IGBP_MODIS_NOAH" FOUND 33 CATEGORIES 2 SEASONS WATER CATEGORY = 17 SNOW CATEGORY = 15
INITIALIZE THREE Noah LSM RELATED TABLES
INPUT LandUse = "MODIFIED_IGBP_MODIS_NOAH"
LANDUSE TYPE = "MODIFIED_IGBP_MODIS_NOAH" FOUND 33 CATEGORIES 2 SEASONS WATER CATEGORY = 17 SNOW CATEGORY = 15
INITIALIZE THREE Noah LSM RELATED TABLES
Tile Strategy is not specified. Assuming 1D-Y
WRF TILE 1 IS 28 IE 54 JS 1 JE 21
WRF NUMBER OF TILES = 1
Tile Strategy is not specified. Assuming 1D-Y
WRF TILE 1 IS 34 IE 66 JS 1 JE 24
WRF NUMBER OF TILES = 1
d01 2017-12-25_06:00:00 wrf: SUCCESS COMPLETE WRF
I attached here the namelist.input, rsl.out.0000 and rsl.out.0001.
Please help!
Thanks
Ha Chi