Hi everyone
I'm very noob in WRF and i want to make a question for get the best perfomance in my WRF runs. I compiled with DMPAR option and i want to run a real case with 46 x 46 cells (a little mesh), with 5 nest in similar sizes. i have a HPC with 96 cores (AMD) and i run the case with parallelization mpich using 16 cores (4X, 4Y), using mpirun -np 16 ./wrf.exe, obviously with that i get the best division (near 10 cells per processors), so with mpich i can't get a fastest runs.
Do you know how can i get a shorter time the runs, with the same size of domains?
PD1: The objective it's get a time series of wind in one point for a very long time, so the time of runs it's important.
PD2: i already tried use a WRF compilated DMPAR+SMPAR, to use more processors in the run with mpich more some preocessors with OPENMP (TILES), but i don't get a better performance.
I will be very greatfull for any informations
Greatings
I'm very noob in WRF and i want to make a question for get the best perfomance in my WRF runs. I compiled with DMPAR option and i want to run a real case with 46 x 46 cells (a little mesh), with 5 nest in similar sizes. i have a HPC with 96 cores (AMD) and i run the case with parallelization mpich using 16 cores (4X, 4Y), using mpirun -np 16 ./wrf.exe, obviously with that i get the best division (near 10 cells per processors), so with mpich i can't get a fastest runs.
Do you know how can i get a shorter time the runs, with the same size of domains?
PD1: The objective it's get a time series of wind in one point for a very long time, so the time of runs it's important.
PD2: i already tried use a WRF compilated DMPAR+SMPAR, to use more processors in the run with mpich more some preocessors with OPENMP (TILES), but i don't get a better performance.
I will be very greatfull for any informations
Greatings