Hi,
I am running a WRF model with horizontal resolution of ~9 km, model grid is 859*859. I want to activate the asynchronized I/O option in my model. Now I am using 256 cores for computing, my nproc_x = 16, proc_y = 16, what is an appropriate option for my nio_groups and nio_tasks_per_group?
I have tried some combinations, for example nio_groups = 1, nio_tasks_per_group = 4, however I got the following message; I also tried nio_groups = 2, nio_tasks_per_group = 16, and got the same message. Is this caused by not enough memory?
===================================================================================
659 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
660 = RANK 256 PID 457264 RUNNING AT m3ca0705
661 = KILLED BY SIGNAL: 9 (Killed)
662 ===================================================================================
663
664 ===================================================================================
665 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
666 = RANK 257 PID 457265 RUNNING AT m3ca0705
667 = KILLED BY SIGNAL: 11 (Segmentation fault)
668 ===================================================================================
669
670 ===================================================================================
671 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
672 = RANK 258 PID 457266 RUNNING AT m3ca0705
673 = KILLED BY SIGNAL: 9 (Killed)
674 ===================================================================================
675
676 ===================================================================================
677 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
678 = RANK 259 PID 457267 RUNNING AT m3ca0705
679 = KILLED BY SIGNAL: 11 (Segmentation fault)
680 ===================================================================================
681
682 ===================================================================================
683 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
684 = RANK 260 PID 457268 RUNNING AT m3ca0705
685 = KILLED BY SIGNAL: 11 (Segmentation fault)
686 ===================================================================================
687
688 ===================================================================================
689 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
690 = RANK 261 PID 457269 RUNNING AT m3ca0705
691 = KILLED BY SIGNAL: 9 (Killed)
692 ===================================================================================
I am running a WRF model with horizontal resolution of ~9 km, model grid is 859*859. I want to activate the asynchronized I/O option in my model. Now I am using 256 cores for computing, my nproc_x = 16, proc_y = 16, what is an appropriate option for my nio_groups and nio_tasks_per_group?
I have tried some combinations, for example nio_groups = 1, nio_tasks_per_group = 4, however I got the following message; I also tried nio_groups = 2, nio_tasks_per_group = 16, and got the same message. Is this caused by not enough memory?
===================================================================================
659 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
660 = RANK 256 PID 457264 RUNNING AT m3ca0705
661 = KILLED BY SIGNAL: 9 (Killed)
662 ===================================================================================
663
664 ===================================================================================
665 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
666 = RANK 257 PID 457265 RUNNING AT m3ca0705
667 = KILLED BY SIGNAL: 11 (Segmentation fault)
668 ===================================================================================
669
670 ===================================================================================
671 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
672 = RANK 258 PID 457266 RUNNING AT m3ca0705
673 = KILLED BY SIGNAL: 9 (Killed)
674 ===================================================================================
675
676 ===================================================================================
677 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
678 = RANK 259 PID 457267 RUNNING AT m3ca0705
679 = KILLED BY SIGNAL: 11 (Segmentation fault)
680 ===================================================================================
681
682 ===================================================================================
683 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
684 = RANK 260 PID 457268 RUNNING AT m3ca0705
685 = KILLED BY SIGNAL: 11 (Segmentation fault)
686 ===================================================================================
687
688 ===================================================================================
689 = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
690 = RANK 261 PID 457269 RUNNING AT m3ca0705
691 = KILLED BY SIGNAL: 9 (Killed)
692 ===================================================================================