Chris Thomas
New member
I am running an ensemble with various choices of physics options. I find that for a lot of combinations wrf.exe hangs after some time. I am using 504 processors, and when the hang occurs a traceback using padb shows that 503 processors are waiting either in rsl_lite_exch_x_ or rsl_lite_exch_y_ while one processor is in module_sf_sfclayrev_mp_zolri2_. This is true every time the hang occurs. The problem is reproducible and for a fixed set of physics combinations always occurs at the same time step. In one particular case I have produced a restart file at a time step shortly before the issue occurs, and I can reproduce the issue from a restart at this time step. This was so that I could track the problem down with a slow executable compiled with -d option, however it does not occur in this case!! This probably says something about the nature of the problem. I have attached namelist.input so that you know what physics options were used in this case, but the problem occurs for a number of different combinations. I could supply you with the wrfrst etc files if you wish. The restart files (d01, d02) are about 1.3G each.