robson_passos
New member
Dear, I am preparing a simulation whose domains have the following numbers of grid points:
I have two computational nodes available with 64 cpus each, I wonder if the most correct is to use 128 cpus, which decomposes into 8 NTASKS_X and 16 NTASKS_Y or 120 cpus that decomposes into 10 NTASKS_X and 12 NTASKS_Y. The last option more closely approximates a square. On the other hand the first uses more cpus. I wonder if there is a performance difference given this difference in decomposition.
I did some tests and both start the simulations without returning errors. But I didn't have the opportunity to run the two options to compare the speed, as it is necessary to save on resources.
Thank you very much in advance,
Robson
e_we = 211, 280, 532,
e_sn = 198, 226, 358,
I have two computational nodes available with 64 cpus each, I wonder if the most correct is to use 128 cpus, which decomposes into 8 NTASKS_X and 16 NTASKS_Y or 120 cpus that decomposes into 10 NTASKS_X and 12 NTASKS_Y. The last option more closely approximates a square. On the other hand the first uses more cpus. I wonder if there is a performance difference given this difference in decomposition.
I did some tests and both start the simulations without returning errors. But I didn't have the opportunity to run the two options to compare the speed, as it is necessary to save on resources.
Thank you very much in advance,
Robson