Andrea-ARPAPUGLIA
Member
Hi all,
when i ran the WRF model on 3 domains, respectively 15, 5 and 1km “wrf.exe” exits with error:
[hpc-16-01:18706] *** Process received signal ***
[hpc-16-01:18706] Signal: Segmentation fault (11)
[hpc-16-01:18706] Signal code: Address not mapped (1)
[hpc-16-01:18706] Failing at address: 0xfffffffe03a5bac0
[hpc-16-01:18706] [ 0] /lib64/libpthread.so.0(+0xf630)[0x7f8ffbf08630]
[hpc-16-01:18706] [ 1] ./wrf.exe[0x155f1c8]
[hpc-16-01:18706] [ 2] ./wrf.exe[0x1568da0]
[hpc-16-01:18706] [ 3] ./wrf.exe[0x156a46a]
[hpc-16-01:18706] [ 4] ./wrf.exe[0x156d1ed]
[hpc-16-01:18706] [ 5] ./wrf.exe[0x1116827]
[hpc-16-01:18706] [ 6] ./wrf.exe[0x1216267]
[hpc-16-01:18706] [ 7] ./wrf.exe[0xcb7375]
[hpc-16-01:18706] [ 8] ./wrf.exe[0xbc17a6]
[hpc-16-01:18706] [ 9] ./wrf.exe[0x463d23]
[hpc-16-01:18706] [10] ./wrf.exe[0x46416b]
[hpc-16-01:18706] [11] ./wrf.exe[0x46416b]
[hpc-16-01:18706] [12] ./wrf.exe[0x4056e4]
[hpc-16-01:18706] [13] ./wrf.exe[0x404f3c]
[hpc-16-01:18706] [14] ./wrf.exe[0x244818a]
[hpc-16-01:18706] [15] /lib64/libc.so.6(__libc_start_main+0xf5)[0x7f8ffbb4d555]
[hpc-16-01:18706] [16] ./wrf.exe[0x404e39]
[hpc-16-01:18706] *** End of error message ***
and the wrfoutput files have only first hour.
When I ran the same setup on only 2 domine I haven’t any problem.
In both cases I used 40 CPU.
Looking in the forum, errors of this type occur when the “time_step” parameter does not respect the rule “6 x dX”. In my settings it is respected!
I attach the namelist.input and one of the 40 rsl.error files (for the second I changed the name).
I thank in advance anyone who wants to take an interest in my problem.
Andrea
when i ran the WRF model on 3 domains, respectively 15, 5 and 1km “wrf.exe” exits with error:
[hpc-16-01:18706] *** Process received signal ***
[hpc-16-01:18706] Signal: Segmentation fault (11)
[hpc-16-01:18706] Signal code: Address not mapped (1)
[hpc-16-01:18706] Failing at address: 0xfffffffe03a5bac0
[hpc-16-01:18706] [ 0] /lib64/libpthread.so.0(+0xf630)[0x7f8ffbf08630]
[hpc-16-01:18706] [ 1] ./wrf.exe[0x155f1c8]
[hpc-16-01:18706] [ 2] ./wrf.exe[0x1568da0]
[hpc-16-01:18706] [ 3] ./wrf.exe[0x156a46a]
[hpc-16-01:18706] [ 4] ./wrf.exe[0x156d1ed]
[hpc-16-01:18706] [ 5] ./wrf.exe[0x1116827]
[hpc-16-01:18706] [ 6] ./wrf.exe[0x1216267]
[hpc-16-01:18706] [ 7] ./wrf.exe[0xcb7375]
[hpc-16-01:18706] [ 8] ./wrf.exe[0xbc17a6]
[hpc-16-01:18706] [ 9] ./wrf.exe[0x463d23]
[hpc-16-01:18706] [10] ./wrf.exe[0x46416b]
[hpc-16-01:18706] [11] ./wrf.exe[0x46416b]
[hpc-16-01:18706] [12] ./wrf.exe[0x4056e4]
[hpc-16-01:18706] [13] ./wrf.exe[0x404f3c]
[hpc-16-01:18706] [14] ./wrf.exe[0x244818a]
[hpc-16-01:18706] [15] /lib64/libc.so.6(__libc_start_main+0xf5)[0x7f8ffbb4d555]
[hpc-16-01:18706] [16] ./wrf.exe[0x404e39]
[hpc-16-01:18706] *** End of error message ***
and the wrfoutput files have only first hour.
When I ran the same setup on only 2 domine I haven’t any problem.
In both cases I used 40 CPU.
Looking in the forum, errors of this type occur when the “time_step” parameter does not respect the rule “6 x dX”. In my settings it is respected!
I attach the namelist.input and one of the 40 rsl.error files (for the second I changed the name).
I thank in advance anyone who wants to take an interest in my problem.
Andrea