junius.Wang
Member
I am encountering an issue where WRF crashes at a specific time. I would appreciate any assistance you can provide.
I have installed the latest WRF v4.6.0 model on my Linux server to utilize new functions for wind farms. After successfully running an earlier case, I read literature and planned sensitivity tests for Typhoon 'Infa', modifying some options and adjusting area ranges.
I created the initial and lateral data from fnl data (ds083.3) and ran WRF. However, it consistently crashes at a certain point without clear error messages. Upon investigation, I discovered NAN values in the final calculation stage, and I currently don’t know how to resolve this issue.
Here is the error part:
I changed the time step from 90 to 60.
Although the situation improved somewhat, NaN values appeared again after running the program for more than 2 hours, causing the program to interrupt.
Should I continue to narrow down the time step?
I have attached my namelist files, including those for both successful and failed runs, along with the rsl.error files.
Additionally, I have uploaded the shell script files for processing the WPS and WRF components, which may be helpful.
Thank you in advance.
I have installed the latest WRF v4.6.0 model on my Linux server to utilize new functions for wind farms. After successfully running an earlier case, I read literature and planned sensitivity tests for Typhoon 'Infa', modifying some options and adjusting area ranges.
I created the initial and lateral data from fnl data (ds083.3) and ran WRF. However, it consistently crashes at a certain point without clear error messages. Upon investigation, I discovered NAN values in the final calculation stage, and I currently don’t know how to resolve this issue.
Here is the error part:
Code:
Timing for main: time 2021-07-22_18:44:30 on domain 2: 0.73602 elapsed seconds
d02 Domain average of dpsdt, dmudt (mb/3h): 404.5000 8.397835 3.262286
d02 Max mu change time step: 110 2 3.2546468E-02
d02 Domain average of dardt, drcdt, drndt (mm/sec): 404.5000 1.7035587E-04 3.0214911E-05 1.4014094E-04
d02 Domain average of rt_sum, rc_sum, rnc_sum (mm): 404.5000 2.770452 0.6746788 2.095775
d02 Max Accum Resolved Precip, I,J (mm): 165.6131 211 186
d02 Max Accum Convective Precip, I,J (mm): 16.00648 200 202
d02 Domain average of sfcevp, hfx, lh: 404.5000 1.401203 13.21041 143.0962
Timing for main: time 2021-07-22_18:45:00 on domain 2: 0.73642 elapsed seconds
Timing for main: time 2021-07-22_18:45:00 on domain 1: 2.69181 elapsed seconds
d01 Domain average of dpsdt, dmudt (mb/3h): 405.0000 NaN NaN
d01 Max mu change time step: 143 104 7.5008301E-04
d01 Domain average of dardt, drcdt, drndt (mm/sec): 405.0000 1.1270608E-04 4.6179335E-05 6.6526765E-05
d01 Domain average of rt_sum, rc_sum, rnc_sum (mm): 405.0000 1.703290 0.6249699 1.078320
d01 Max Accum Resolved Precip, I,J (mm): 156.9319 146 98
d01 Max Accum Convective Precip, I,J (mm): 15.58068 224 74
d01 Domain average of sfcevp, hfx, lh: 405.0000 1.006149 10.36697 102.8982
d02 Domain average of dpsdt, dmudt (mb/3h): 405.0000 NaN NaN
d02 Max mu change time step: 217 118 5.8851804E-04
d02 Domain average of dardt, drcdt, drndt (mm/sec): 405.0000 1.6426136E-04 3.0271098E-05 1.3399025E-04
d02 Domain average of rt_sum, rc_sum, rnc_sum (mm): 405.0000 2.775380 0.6755869 2.099794
d02 Max Accum Resolved Precip, I,J (mm): 165.7204 211 186
d02 Max Accum Convective Precip, I,J (mm): 16.02392 200 202
d02 Domain average of sfcevp, hfx, lh: 405.0000 1.402918 13.21963 143.1080
Timing for main: time 2021-07-22_18:45:30 on domain 2: 1.07063 elapsed seconds
I changed the time step from 90 to 60.
Although the situation improved somewhat, NaN values appeared again after running the program for more than 2 hours, causing the program to interrupt.
Should I continue to narrow down the time step?
Code:
Timing for main: time 2021-07-24_08:30:00 on domain 2: 0.74259 elapsed seconds
Timing for main: time 2021-07-24_08:30:00 on domain 1: 2.70604 elapsed seconds
d01 Domain average of dpsdt, dmudt (mb/3h): 2670.000 NaN NaN
d01 Max mu change time step: 51 96 69.82448
d01 Domain average of dardt, drcdt, drndt (mm/sec): 2670.000 1.0824289E-04 4.7659465E-05 6.0583436E-05
d01 Domain average of rt_sum, rc_sum, rnc_sum (mm): 2670.000 15.55473 4.840905 10.71383
d01 Max Accum Resolved Precip, I,J (mm): 674.7068 141 109
d01 Max Accum Convective Precip, I,J (mm): 117.8609 254 64
d01 Domain average of sfcevp, hfx, lh: 2670.000 8.766058 27.33279 158.9171
d02 Domain average of dpsdt, dmudt (mb/3h): 2670.000 10.43634 2.705757
d02 Max mu change time step: 377 127 4.8635740E-04
d02 Domain average of dardt, drcdt, drndt (mm/sec): 2670.000 1.3764304E-04 2.0445495E-05 1.1719757E-04
d02 Domain average of rt_sum, rc_sum, rnc_sum (mm): 2670.000 24.52558 4.057384 20.46820
d02 Max Accum Resolved Precip, I,J (mm): 730.6641 204 217
d02 Max Accum Convective Precip, I,J (mm): 72.22220 194 212
d02 Domain average of sfcevp, hfx, lh: 2670.000 10.14799 14.81878 151.7243
Timing for main: time 2021-07-24_08:30:20 on domain 2: 3.16795 elapsed seconds
d02 Domain average of dpsdt, dmudt (mb/3h): 2670.333 10.42248 2.691759
d02 Max mu change time step: 377 127 4.6444396E-04
d02 Domain average of dardt, drcdt, drndt (mm/sec): 2670.333 1.3754175E-04 2.0391140E-05 1.1715063E-04
d02 Domain average of rt_sum, rc_sum, rnc_sum (mm): 2670.333 24.52834 4.057792 20.47054
d02 Max Accum Resolved Precip, I,J (mm): 730.6641 204 217
d02 Max Accum Convective Precip, I,J (mm): 72.22220 194 212
d02 Domain average of sfcevp, hfx, lh: 2670.333 10.14920 14.78852 151.6871
Timing for main: time 2021-07-24_08:30:40 on domain 2: 0.73734 elapsed seconds
d02 Domain average of dpsdt, dmudt (mb/3h): 2670.667 10.43617 2.691706
d02 Max mu change time step: 377 127 4.7271789E-04
d02 Domain average of dardt, drcdt, drndt (mm/sec): 2670.667 1.3748354E-04 2.0381407E-05 1.1710214E-04
d02 Domain average of rt_sum, rc_sum, rnc_sum (mm): 2670.667 24.53109 4.058199 20.47288
d02 Max Accum Resolved Precip, I,J (mm): 730.6641 204 217
d02 Max Accum Convective Precip, I,J (mm): 72.22220 194 212
d02 Domain average of sfcevp, hfx, lh: 2670.667 10.15042 14.79035 151.6866
Timing for main: time 2021-07-24_08:31:00 on domain 2: 0.73843 elapsed seconds
Timing for main: time 2021-07-24_08:31:00 on domain 1: 5.78849 elapsed seconds
I have attached my namelist files, including those for both successful and failed runs, along with the rsl.error files.
Additionally, I have uploaded the shell script files for processing the WPS and WRF components, which may be helpful.
Thank you in advance.
Attachments
-
namelist-without-crush.input6 KB · Views: 2
-
wps.sh.txt455 bytes · Views: 1
-
wrf-pbl2_mp6_cu11_fdda0_isftcflx2-namelist.input5.9 KB · Views: 1
-
wrf-pbl2_mp6_cu11_fdda0_isftcflx2-rsl.error.0000800.8 KB · Views: 1
-
wrf-pbl2_mp24_cu11_fdda0_isftcflx2.sh.txt613 bytes · Views: 2
-
wrf-pbl2_mp24_cu11_fdda0_isftcflx2-namelist.input5.9 KB · Views: 2
-
wrf-pbl2_mp24_cu11_fdda0_isftcflx2-rsl.error.0000821.3 KB · Views: 2