next up previous
Next: Individual cluster performance Up: Test case: AGARD swept Previous: Test case: AGARD swept

Globus versus non-Globus performance

Shown in Table 4 is a comparison between non-Globus and Globus performance for the AGARD swept wing test case using the INRIA-nina cluster. Table 4 shows the Globus performance to be slightly better than the non-Globus performance. The non-Globus MPI uses the MPICH p4_ch device whereas the Globus MPI uses the globus2 device. The small differences in performance are due to slightly different configure options. Global inter-communication occurs, for example, when the maximum, minimum, or sum of the values of a variable are computed over all the processors. Local inter-communication occurs when messages are passed between two processors. The total computation time includes the inter/intra-communication times but not setup times (reading data, meshes, initialization, ...etc). The times shown in all tables are in seconds and the maximum values for all the processors. Times to save intermediate solutions are not taken into account. The Communication/Work ratio is the sum of the local and global communication time divided by the total computational time - communication time (Work)1. The minimum and average Communication/Work ratios are much smaller.

The non-Globus and Globus computational times are approximately the same on the nina cluster. Without testing each individual cluster, we hypothesize that the non-Globus and Globus times will be approximately the same on the other clusters as well.


Table 4: Globus versus non-Globus performance implicit performance
Run type non-Globus Globus
Name of cluster INRIA nina INRIA nina
Processor speed 2 GHz 2 GHz
LAN speed 1 Gbps 1 Gbps
cache 512K 512K
RAM/CPU 1/2 GB 1/2 GB
Executable size 236 MB 236 MB
Number of processors 8 8
Total computational time 103.6 96.7
Local inter-comm. time 15.5 13.2
Global inter-comm. time 16.0 8.7
Communication/Work 0.4 0.3




Table 5: Globus performances on the individual clusters
Run type Globus Globus Globus Globus
Name of cluster INRIA-nina IUSTI CEMEF INRIA-pf
Processor speed 2 GHz 2 GHz 1 GHz 1 GHz
LAN speed 1.00 Gbps 100 Mbps 100 Mbps 100 Mbps
cache 512K 512K 256K 256K
RAM/CPU 1/2 GB 1 GB 1/4 GB 1/4 GB
Executable size 236 MB 236 MB 236 MB 236 MB
Number of processors 8 8 8 8
Total computational time 87.6 148.0 200.2 264.1
Local inter-comm. time 11.3 48.8 49.4 56.6
Global inter-comm. time 1.5 3.3 7.9 8.0
Computation ratio 1.0 1.5 2.3 3.0
Communication/Work 0.4 0.5 0.4 0.3



next up previous
Next: Individual cluster performance Up: Test case: AGARD swept Previous: Test case: AGARD swept
Stephen Wornom 2004-09-10