Scheduled Downtime
On Friday 21 April 2023 @ 5pm MT, this website will be down for maintenance and expected to return online the morning of 24 April 2023 at the latest

After running wrf.exe for half an hour, the process was forcibly interrupted

ghuan

New member
I have confirmed that the issue is not related to memory limitations. I will attach my namelist.input file and would greatly appreciate your help in resolving this matter. Thank you very much for your assistance.
1. wrf returned the following:
(base) [ghuan@localhost em_real]$ mpirun -n 8 ./wrf.exe
starting wrf task 2 of 8
starting wrf task 5 of 8
starting wrf task 3 of 8
starting wrf task 4 of 8
starting wrf task 7 of 8
starting wrf task 6 of 8
starting wrf task 1 of 8
starting wrf task 0 of 8

===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 1 PID 6564 RUNNING AT localhost.localdomain
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================

===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 2 PID 6565 RUNNING AT localhost.localdomain
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================

===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 3 PID 6566 RUNNING AT localhost.localdomain
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================

===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 4 PID 6567 RUNNING AT localhost.localdomain
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================

===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 5 PID 6568 RUNNING AT localhost.localdomain
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================

===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 6 PID 6569 RUNNING AT localhost.localdomain
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================

===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 7 PID 6570 RUNNING AT localhost.localdomain
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================
2.rsl.error.0000 returned the following:
Timing for main: time 2020-07-07_04:40:48 on domain 2: 3.18213 elapsed seconds
Timing for main: time 2020-07-07_04:40:48 on domain 1: 10.09962 elapsed seconds
d01 2020-07-07_04:40:48 13 points exceeded v_cfl = 2 in domain d01 at time 2020-07-07_04:40:48 hours
d01 2020-07-07_04:40:48 Max W: 9 9 26 W: 12.96 w-cfl: 3.39 dETA: 0.02
d01 2020-07-07_04:40:48 10 points exceeded v_cfl = 2 in domain d01 at time 2020-07-07_04:40:48 hours
d01 2020-07-07_04:40:48 Max W: 9 9 23 W: 3.82 w-cfl: 3.94 dETA: 0.03
d01 2020-07-07_04:40:48 13 points exceeded v_cfl = 2 in domain d01 at time 2020-07-07_04:40:48 hours
d01 2020-07-07_04:40:48 Max W: 9 9 24 W: 16.24 w-cfl: 3.75 dETA: 0.03
forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image PC Routine Line Source
wrf.exe 000000000316D96A for__signal_handl Unknown Unknown
libpthread-2.17.s 00002AB9558B1630 Unknown Unknown Unknown
wrf.exe 0000000001FB0A4F Unknown Unknown Unknown
wrf.exe 0000000001F9E2BE Unknown Unknown Unknown
wrf.exe 0000000001CF1D27 Unknown Unknown Unknown
wrf.exe 00000000016BF2B7 Unknown Unknown Unknown
wrf.exe 00000000014F3F38 Unknown Unknown Unknown
wrf.exe 00000000005B86CB Unknown Unknown Unknown
wrf.exe 00000000004172D1 Unknown Unknown Unknown
wrf.exe 000000000041728F Unknown Unknown Unknown
wrf.exe 0000000000417222 Unknown Unknown Unknown
libc-2.17.so 00002AB955DE2555 __libc_start_main Unknown Unknown
wrf.exe 0000000000417129 Unknown Unknown Unknown
 

Attachments

  • namelist.input
    4 KB · Views: 1
The messages in your rsl files show CFL errors:

Code:
d01 2020-07-07_04:40:48 13 points exceeded v_cfl = 2 in domain d01 at time 2020-07-07_04:40:48 hours
d01 2020-07-07_04:40:48 Max W: 9 9 26 W: 12.96 w-cfl: 3.39 dETA: 0.02
d01 2020-07-07_04:40:48 10 points exceeded v_cfl = 2 in domain d01 at time 2020-07-07_04:40:48 hours
d01 2020-07-07_04:40:48 Max W: 9 9 23 W: 3.82 w-cfl: 3.94 dETA: 0.03
d01 2020-07-07_04:40:48 13 points exceeded v_cfl = 2 in domain d01 at time 2020-07-07_04:40:48 hours
d01 2020-07-07_04:40:48 Max W: 9 9 24 W: 16.24 w-cfl: 3.75 dETA: 0.03

This indicates the model has become unstable. See Segmentation Faults and CFL Errors for suggestions.
 
Top