Scheduled Downtime
On Friday 21 April 2023 @ 5pm MT, this website will be down for maintenance and expected to return online the morning of 24 April 2023 at the latest

Segmentation fault

papayasalad

New member
Hello, during the simulation tests in the wrf/test/em_real folder, the simulation runs well for the first few hours but then suddenly interrupts, and I can't find the source of the problem. The simulation area uses a 4-level nesting, with the outermost layer having a resolution of 9 km. I have tried setting the time step to 50, 30, and 20, but the simulations all get interrupted.

The error in the task file is as follows:
Primary job terminated normally, but 1 process returned a non-zero exit code. Per user-direction, the job has been aborted.
mpirun noticed that process rank 18 with PID 890711 on node sd01n01 exited on signal 11 (Segmentation fault).

Where could the problem lie, and how can I resolve this issue? For your reference, I have attached my `namelist.input`, `rsl.error.0000`, and job submission error files.

Thank you for your help!

Best regards,
 

Attachments

  • namelist.input
    4.6 KB · Views: 2
  • rsl.error(1).0000
    634.5 KB · Views: 1
  • rsl.out (1).0000
    633.4 KB · Views: 1
  • QQ20240926-195250.png
    QQ20240926-195250.png
    44.5 KB · Views: 5
In your run folder where all the rsl files are try these commands:

Bash:
grep -i FATAL rsl.*


grep -i error rsl.*


grep -i SIGSEGV rsl.*


grep -i cfl rsl.*

they will show you which rsl file has the error
 
There are a few issues in your namelist.input,

(1) please set
dx = 12000, 4000, 1333
dy = 12000, 4000, 1333

(2) with spectral nudging, please set
xwavenum = 4
ywavenum = 4

Can you modify these options and try again?

Also, if you turn off spectral nudging, can your case be done successfully?
 
In your run folder where all the rsl files are try these commands:

Bash:
grep -i FATAL rsl.*


grep -i error rsl.*


grep -i SIGSEGV rsl.*


grep -i cfl rsl.*

they will show you which rsl file has the error
Thank you for your reply. I tried these commands according to your suggestion and found something wrong in rsl.0000, but I still couldn't understand the specific problem. I hope you can help me.

Best regards,
 
Can you upload your rsl.error.0018 and rsl.out.0018 for me to take a look? Thanks.
Thank you for your reply. I attached the file below.

Looking forward to your reply.

I do not know why the file is not displayed after uploading, I can only take a screenshot to show you.
 

Attachments

  • QQ20240927-122018.png
    QQ20240927-122018.png
    40.4 KB · Views: 4
Last edited:
您的 namelist.input 中存在一些问题,

(1) 请设置
dx = 12000、4000、1333
dy = 12000、4000、1333

(2) 带光谱微移,请设置
xwavenum = 4
ywavenum = 4

您能否修改这些选项并重试?

此外,如果您关闭频谱微移,您的案例能否成功完成?
I will get back to you after I try and get the result, thank you for your help!
 
Thank you for your reply. I tried these commands according to your suggestion and found something wrong in rsl.0000, but I still couldn't understand the specific problem. I hope you can help me.

Best regards,
I do not know why the file is not displayed after uploading, I can only take a screenshot to show you
 

Attachments

  • QQ20240927-122018.png
    QQ20240927-122018.png
    40.4 KB · Views: 4
Please upload your rsl file that contains the error message for me to take a look. The screen shot doesn't really tell me anything.
 
Top