Hello,
I have been running some projected years in WRF, everything was working fine until the simulation stopped at a specific month and I am getting the following error:
zlev_tq and zlev_u received from GCM are not equal
zlev_tq and zlev_u received from GCM are not equal
srun: Job step aborted: Waiting up to 62 seconds for job step to finish.
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
Stack trace terminated abnormally.
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
libifcoremt.so.5 000014E64EE01555 for__signal_handl Unknown Unknown
I already checked the metfiles and apparently they are fine. When comparing the metfiles of the month with the previous months that have been working fine, there is no difference.
Also, I have this error in the Script_Output:
========================================================================
EXECUTION of : /usr/bin/time srun --multi-prog ./run_file
slurmstepd: error: *** JOB 611622 ON r3i6n8 CANCELLED AT 2023-10-24T20:26:59 DUE TO TIME LIMIT ***
Could you please help me? I don't know how to fix this error and continue the simulation. Something important is that my variables are not called "tq" and "u" so I have not been able to find where this error comes from.
Thank you very much for your help,
Jhoana
I have been running some projected years in WRF, everything was working fine until the simulation stopped at a specific month and I am getting the following error:
zlev_tq and zlev_u received from GCM are not equal
zlev_tq and zlev_u received from GCM are not equal
srun: Job step aborted: Waiting up to 62 seconds for job step to finish.
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
Stack trace terminated abnormally.
forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
libifcoremt.so.5 000014E64EE01555 for__signal_handl Unknown Unknown
I already checked the metfiles and apparently they are fine. When comparing the metfiles of the month with the previous months that have been working fine, there is no difference.
Also, I have this error in the Script_Output:
========================================================================
EXECUTION of : /usr/bin/time srun --multi-prog ./run_file
slurmstepd: error: *** JOB 611622 ON r3i6n8 CANCELLED AT 2023-10-24T20:26:59 DUE TO TIME LIMIT ***
Could you please help me? I don't know how to fix this error and continue the simulation. Something important is that my variables are not called "tq" and "u" so I have not been able to find where this error comes from.
Thank you very much for your help,
Jhoana