
Segmentation Error

Hello All

I hope you're doing well

I am running a simulation with WRF 4.3.2, but wrf.exe fails. The run is set up for 30 hours (6 hours of spin-up time), and I get the following error:

Program received signal SIGSEGV: Segmentation fault - invalid memory reference.

Backtrace for this error:
#0 0x7F94BC9346F7
#1 0x7F94BC934D3E
#2 0x7F94BBE3926F
#3 0x2DCFE8D in __module_sf_sfclayrev_MOD_psim_stable
#4 0x2DD44DC in __module_sf_sfclayrev_MOD_sfclayrev1d
#5 0x2DD93BD in __module_sf_sfclayrev_MOD_sfclayrev
#6 0x23C586A in __module_surface_driver_MOD_sfclayrev_seaice_wrapper
#7 0x2407AA2 in __module_surface_driver_MOD_surface_driver
#8 0x1B044D0 in __module_first_rk_step_part1_MOD_first_rk_step_part1
#9 0x144B8C7 in solve_em_
#10 0x12C32C0 in solve_interface_
#11 0x480C73 in __module_integrate_MOD_integrate
#12 0x481254 in __module_integrate_MOD_integrate
#13 0x481254 in __module_integrate_MOD_integrate
#14 0x408323 in __module_wrf_top_MOD_wrf_run

My domains are 9, 3, and 1 km. I tried different time steps (90, 50, and 45), but the model only ran for up to 4 minutes.

Kindly help in this regard and suggest a solution.

Please have a look at the namelist:

&time_control
run_days = 0,
run_hours = 30,
run_minutes = 0,
run_seconds = 0,
start_year = 2022, 2022, 2022,
start_month = 07, 07, 07,
start_day = 29, 29, 29,
start_hour = 21, 21, 21,
end_year = 2022, 2022, 2022,
end_month = 07, 07, 07,
end_day = 31, 31, 31,
end_hour = 03, 03, 03,
interval_seconds = 3600
input_from_file = .true.,.true.,.true.,
history_interval = 60, 60, 60,
frames_per_outfile = 1000, 1000, 1000,
restart = .false.,
restart_interval = 5000,
io_form_history = 2
io_form_restart = 2
io_form_input = 2
io_form_boundary = 2
debug_level = 0
auxinput4_inname = "wrflowinp_d<domain>"
auxinput4_interval = 360
io_form_auxinput4 = 2
/

&domains
time_step = 45,
time_step_fract_num = 0,
time_step_fract_den = 1,
max_dom = 3,
e_we = 325, 271, 325,
e_sn = 325, 271, 325,
e_vert = 45, 45, 45,
dzstretch_s = 1.1
p_top_requested = 5000,
num_metgrid_levels = 34,
num_metgrid_soil_levels = 4,
dx = 9000, 3000, 1000,
dy = 9000, 3000, 1000,
grid_id = 1, 2, 3,
parent_id = 0, 1, 2,
i_parent_start = 1, 117, 81,
j_parent_start = 1, 117, 81,
parent_grid_ratio = 1, 3, 3,
parent_time_step_ratio = 1, 3, 3,
feedback = 0,
smooth_cg_topo = .true.,
smooth_option = 2
eta_levels = 1.0000, 0.9946, 0.9875, 0.9789, 0.9685,
0.9562, 0.9413, 0.9238, 0.9037, 0.8813,
0.8514, 0.8210, 0.7906, 0.7602, 0.7298,
0.6812, 0.6290, 0.5796, 0.5333, 0.4901,
0.4493, 0.4109, 0.3746, 0.3412, 0.3098,
0.2802, 0.2524, 0.2267, 0.2028, 0.1803,
0.1593, 0.1398, 0.1219, 0.1054, 0.0904,
0.0766, 0.0645, 0.0534, 0.0433, 0.0341,
0.0259, 0.0185, 0.0118, 0.0056, 0.0000
/

&physics
physics_suite = 'CONUS'
mp_physics = 6, 6, 6,
cu_physics = 1, 1, 0,
ra_lw_physics = 1, 1, 1,
ra_sw_physics = 1, 1, 1,
bl_pbl_physics = 8, 8, 8,
sf_sfclay_physics = 1, 1, 1,
sf_surface_physics = 2, 2, 2,
radt = 30, 30, 30,
bldt = 0, 0, 0,
cudt = 5, 5, 0,
icloud = 1,
num_land_cat = 21,
sf_urban_physics = 1, 1, 1,
fractional_seaice = 1,
/

&fdda
/

&dynamics
hybrid_opt = 2,
w_damping = 1,
diff_opt = 2, 2, 2,
km_opt = 4, 4, 4,
diff_6th_opt = 0, 0, 0,
diff_6th_factor = 0.12, 0.12, 0.12,
base_temp = 290.
damp_opt = 3,
zdamp = 5000., 5000., 5000.,
dampcoef = 0.2, 0.2, 0.2,
khdif = 0, 0, 0,
kvdif = 0, 0, 0,
non_hydrostatic = .true., .true., .true.,
moist_adv_opt = 1, 1, 1,
scalar_adv_opt = 1, 1, 1,
gwd_opt = 1, 0, 0,
damp_opt = 0,
/

&bdy_control
spec_bdy_width = 5,
spec_zone = 1,
relax_zone = 4,
specified = .true., .false.,.false.,
nested = .false., .true., .true.,
/

&grib2
/

&namelist_quilt
nio_tasks_per_group = 0,
nio_groups = 1,
/

I look forward to your reply.

Thanks and Regards
Nagaraju Gaddam
 
Hi,
Can you please package all of your error files (e.g., rsl.error.*) into a single .TAR file and attach that so we can take a look? Thanks!
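
For anyone who would rather script the packaging step, here is a minimal Python sketch using only the standard library. The function name `package_rsl_files`, its parameters, and the default archive name are illustrative assumptions and are not part of WRF; it simply collects files matching rsl.error.* from the run directory into one compressed archive.

```python
import glob
import os
import tarfile


def package_rsl_files(run_dir=".", archive_name="rsl_errors.tar.gz",
                      pattern="rsl.error.*"):
    """Bundle every file in run_dir matching pattern into one .tar.gz.

    Hypothetical helper: the name and defaults are assumptions for
    illustration. Returns the list of file names that were archived.
    """
    paths = sorted(glob.glob(os.path.join(run_dir, pattern)))
    with tarfile.open(archive_name, "w:gz") as archive:
        for path in paths:
            # Store files without the leading directory components.
            archive.add(path, arcname=os.path.basename(path))
    return [os.path.basename(path) for path in paths]


if __name__ == "__main__":
    packed = package_rsl_files()
    print(f"Archived {len(packed)} file(s)")
```

Running it from the directory that contains wrf.exe should leave a single archive that can be attached to a post.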
 
Thank you so much for the reply

Kindly have a look at the attached TAR file and namelist.input. I am waiting for your solution.

Thanks and Regards
Nagaraju Gaddam
 

Attachments

  • rsl_errors.tar.gz (10.4 KB)
When I search your rsl files, I'm seeing some CFL errors:
Code:
rsl.error.0000:d03 2022-07-29_21:06:05            2  points exceeded v_cfl = 2 in domain d03 at time 2022-07-29_21:06:05 hours
rsl.error.0000:d03 2022-07-29_21:06:05 Max   W:     85     12      3 W:  -78.42  w-cfl:    2.25  dETA:    0.01
rsl.error.0000:d03 2022-07-29_21:06:10           10  points exceeded v_cfl = 2 in domain d03 at time 2022-07-29_21:06:10 hours
rsl.error.0000:d03 2022-07-29_21:06:10 Max   W:     84     12      3 W:   46.42  w-cfl:    3.59  dETA:    0.01
rsl.error.0000:d03 2022-07-29_21:06:10            6  points exceeded v_cfl = 2 in domain d03 at time 2022-07-29_21:06:10 hours
rsl.error.0000:d03 2022-07-29_21:06:10 Max   W:     85     12      3 W: -217.80  w-cfl:    4.90  dETA:    0.01
rsl.error.0000:d03 2022-07-29_21:06:10            8  points exceeded v_cfl = 2 in domain d03 at time 2022-07-29_21:06:10 hours
rsl.error.0000:d03 2022-07-29_21:06:10 Max   W:     86     12      3 W:  -21.18  w-cfl:    3.80  dETA:    0.01
rsl.error.0000:d03 2022-07-29_21:06:15            8  points exceeded v_cfl = 2 in domain d03 at time 2022-07-29_21:06:15 hours
rsl.error.0000:d03 2022-07-29_21:06:15 Max   W:     85     12      3 W:    5.34  w-cfl:    3.58  dETA:    0.01
rsl.error.0000:d03 2022-07-29_21:06:15           36  points exceeded v_cfl = 2 in domain d03 at time 2022-07-29_21:06:15 hours
rsl.error.0000:d03 2022-07-29_21:06:15 Max   W:     85     12      3 W: -268.22  w-cfl:   12.24  dETA:    0.01
rsl.error.0000:d03 2022-07-29_21:06:15          167  points exceeded v_cfl = 2 in domain d03 at time 2022-07-29_21:06:15 hours
rsl.error.0000:d03 2022-07-29_21:06:15 Max   W:     84     13     20 W: *******  w-cfl: 8929.85  dETA:    0.04

This indicates that the model is becoming unstable, typically due to complex terrain. I see that you're already using a lower time_step than the recommended maximum (6 × dx, with dx in km, which is 54 s for your 9 km parent domain), but can you try decreasing it a bit more and see if that gets you past this point? Try setting it to 30.
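
To make the arithmetic behind that suggestion concrete, here is a small Python sketch. It assumes the 6 × dx (km) guideline mentioned above and that each nest's step is its parent's step divided by parent_time_step_ratio; the function names are hypothetical, and the input values are taken from the posted namelist.

```python
def max_recommended_time_step(dx_m):
    """Upper guideline for time_step in seconds: 6 x dx, with dx in km."""
    return 6.0 * dx_m / 1000.0


def domain_time_steps(time_step, parent_id, parent_time_step_ratio):
    """Return the effective time step (s) of each domain.

    Assumes domains are listed parent-first, as in the posted namelist,
    and that parent_id is 0 for the outermost domain.
    """
    steps = []
    for pid, ratio in zip(parent_id, parent_time_step_ratio):
        if pid == 0:
            steps.append(float(time_step))
        else:
            # A nest runs at its parent's step divided by the ratio.
            steps.append(steps[pid - 1] / ratio)
    return steps


if __name__ == "__main__":
    parent_id = [0, 1, 2]
    ratios = [1, 3, 3]
    print("Guideline for dx = 9000 m:", max_recommended_time_step(9000))
    for time_step in (45, 30):
        print(time_step, domain_time_steps(time_step, parent_id, ratios))
```

With time_step = 45, the sketch gives 5 s for d03, which matches the 5-second spacing of the d03 timestamps in the CFL messages (21:06:05, 21:06:10, 21:06:15). With time_step = 30, d03 would step at roughly 3.3 s.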

If it fails again, do a search in your rsl files to see if the CFL errors are still there.
Code:
grep cfl rsl*
If they are, then take a look at this FAQ that has some suggestions for getting past the errors.
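
If the grep output is long, a short script can condense it. The Python sketch below is one possible approach, not an official WRF tool: it scans lines for the "Max   W:" pattern shown earlier and keeps the largest w-cfl per domain. The three integers after "Max   W:" are assumed to be the i, j, k grid indices, and a non-numeric w-cfl field (like the asterisks in the W column) is treated as infinite.

```python
import glob
import re

# Matches lines such as:
# d03 2022-07-29_21:06:15 Max   W:     85     12      3 W: -268.22  w-cfl:   12.24
MAX_W_LINE = re.compile(
    r"(?P<domain>d\d+)\s+(?P<time>\S+)\s+Max\s+W:\s+"
    r"(?P<i>\d+)\s+(?P<j>\d+)\s+(?P<k>\d+)\s+W:\s+\S+\s+"
    r"w-cfl:\s+(?P<cfl>\S+)"
)


def worst_cfl_by_domain(lines):
    """Return {domain: record} holding the largest w-cfl seen per domain."""
    worst = {}
    for line in lines:
        match = MAX_W_LINE.search(line)
        if match is None:
            continue
        try:
            cfl = float(match.group("cfl"))
        except ValueError:
            # Overflowed fields are printed as asterisks.
            cfl = float("inf")
        domain = match.group("domain")
        if domain not in worst or cfl > worst[domain]["w_cfl"]:
            worst[domain] = {
                "w_cfl": cfl,
                "time": match.group("time"),
                # Assumed to be (i, j, k) grid indices.
                "ijk": tuple(int(match.group(n)) for n in ("i", "j", "k")),
            }
    return worst


if __name__ == "__main__":
    lines = []
    for path in sorted(glob.glob("rsl.error.*")):
        with open(path, errors="replace") as handle:
            lines.extend(handle)
    for domain, record in sorted(worst_cfl_by_domain(lines).items()):
        print(domain, record)
```

Knowing which domain and grid point has the worst value can help when working through the suggestions in the FAQ, for example to check whether the point sits over steep terrain or near a nest boundary.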
 