wrf.exe failure on cluster

This post was from a previous version of the WRF&MPAS-A Support Forum. New replies have been disabled and if you have follow up questions related to this post, then please start a new thread from the forum home page.

sekluzia

Member
Dear Colleagues,

My run was terminated on the cluster. Please find my rsl.error file attached.

Kind regards,
Artur
 

Attachments

  • rsl.error.txt
    4.3 KB
This case failed immediately, which often indicates either a data issue or a machine issue.
The rsl.error file reports "Failed RDMA write request (status 12 : transport retry counter exceeded). Connection broken!", which looks more like a machine issue. Please ask your system administrator to confirm that you have write permission and enough disk space for the output data, and that communication between the processors is working properly.
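
Before contacting the administrator, a quick local check of the points above might look like the following Python sketch. It is only an illustration: the function names, the 10 GB free-space threshold, and the list of error strings are assumptions made for this example and are not part of WRF. It can show whether the run directory is writable and which rsl.error files contain the RDMA message, but it cannot diagnose the interconnect itself.

```python
import glob
import os
import shutil

# Substrings that point to an interconnect problem rather than a data problem.
# They are taken from the message quoted in this thread; the list is illustrative.
FABRIC_HINTS = (
    "Failed RDMA write request",
    "transport retry counter exceeded",
    "Connection broken",
)


def scan_rsl_files(run_dir):
    """Return {file name: [matching lines]} for rsl.error* files in run_dir."""
    hits = {}
    for path in sorted(glob.glob(os.path.join(run_dir, "rsl.error*"))):
        with open(path, errors="replace") as fh:
            matched = [
                line.rstrip("\n")
                for line in fh
                if any(hint in line for hint in FABRIC_HINTS)
            ]
        if matched:
            hits[os.path.basename(path)] = matched
    return hits


def check_run_dir(run_dir, min_free_gb=10.0):
    """Report whether run_dir is writable and has at least min_free_gb free.

    min_free_gb is an arbitrary placeholder; choose it from your expected output size.
    """
    free_gb = shutil.disk_usage(run_dir).free / 1024**3
    return {
        "writable": os.access(run_dir, os.W_OK),
        "free_gb": free_gb,
        "enough_space": free_gb >= min_free_gb,
    }


if __name__ == "__main__":
    run_dir = "."  # assumed: the directory where wrf.exe was launched
    print(check_run_dir(run_dir))
    for name, lines in scan_rsl_files(run_dir).items():
        print(f"{name}: {len(lines)} matching line(s)")
```

If the directory is writable and has space, yet several rsl.error files show the RDMA message, that supports the reading that the problem is on the machine side and should go to the cluster administrators.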