size | b_eff [MByte/s] | b_eff/size [MByte/s] | bandwidth per PE at Lmax [MByte/s] | PingPong latency [microsec] | PingPong bandwidth [MByte/s] | maximal message length Lmax [MByte] | #nodes * #PEs | summary | full protocol
explicitly allocated PEs, i.e. contiguous ranks on each node:
24 | 1805.675 | 75.236 | 400.133 | 11.728 | 954.936 | 8.000 | 3 * 8 | result_3.3_SR8000_1GB_003nodes_024PEs_c.shrt | result_3.3_SR8000_1GB_003nodes_024PEs_c.gz
18 | 1565.703 | 86.983 | 427.860 | 11.525 | 1202.586 | 8.000 | 3 * 6 | result_3.3_SR8000_1GB_003nodes_018PEs_c.shrt | result_3.3_SR8000_1GB_003nodes_018PEs_c.gz
12 | 1257.728 | 104.811 | 489.445 | 11.475 | 1204.480 | 8.000 | 3 * 4 | result_3.3_SR8000_1GB_003nodes_012PEs_c.shrt | result_3.3_SR8000_1GB_003nodes_012PEs_c.gz
6 | 758.508 | 126.418 | 477.788 | 11.437 | 1224.976 | 8.000 | 3 * 2 | result_3.3_SR8000_1GB_003nodes_006PEs_c.shrt | result_3.3_SR8000_1GB_003nodes_006PEs_c.gz
3 | 396.829 | 132.276 | 447.107 | 23.307 | 791.866 | 8.000 | 3 * 1 | result_3.3_SR8000_1GB_003nodes_003PEs_c.shrt | result_3.3_SR8000_1GB_003nodes_003PEs_c.gz
16 | 1530.664 | 95.667 | 411.060 | 11.811 | 969.781 | 8.000 | 2 * 8 | result_3.3_SR8000_1GB_002nodes_016PEs_c.shrt | result_3.3_SR8000_1GB_002nodes_016PEs_c.gz
12 | 1287.352 | 107.279 | 439.721 | 11.527 | 1208.742 | 8.000 | 2 * 6 | result_3.3_SR8000_1GB_002nodes_012PEs_c.shrt | result_3.3_SR8000_1GB_002nodes_012PEs_c.gz
8 | 989.464 | 123.683 | 504.567 | 11.521 | 1213.191 | 8.000 | 2 * 4 | result_3.3_SR8000_1GB_002nodes_008PEs_c.shrt | result_3.3_SR8000_1GB_002nodes_008PEs_c.gz
6 | 766.605 | 127.768 | 499.667 | 11.555 | 1222.560 | 8.000 | 2 * 3 | result_3.3_SR8000_1GB_002nodes_006PEs_c.shrt | result_3.3_SR8000_1GB_002nodes_006PEs_c.gz
4 | 574.523 | 143.631 | 519.596 | 11.484 | 1226.043 | 8.000 | 2 * 2 | result_3.3_SR8000_1GB_002nodes_004PEs_c.shrt | result_3.3_SR8000_1GB_002nodes_004PEs_c.gz
2 | 306.570 | 153.285 | 521.074 | 22.923 | 799.677 | 8.000 | 2 * 1 | result_3.3_SR8000_1GB_002nodes_002PEs_c.shrt | result_3.3_SR8000_1GB_002nodes_002PEs_c.gz
8 | 1218.994 | 152.374 | 455.575 | 11.570 | 916.839 | 8.000 | 1 * 8 | result_3.3_SR8000_1GB_001nodes_008PEs_c.shrt | result_3.3_SR8000_1GB_001nodes_008PEs_c.gz
7 | 1118.625 | 159.804 | 488.660 | 11.528 | 1207.508 | 8.000 | 1 * 7 | result_3.3_SR8000_1GB_001nodes_007PEs_c.shrt | result_3.3_SR8000_1GB_001nodes_007PEs_c.gz
6 | 974.033 | 162.339 | 506.776 | 11.361 | 1211.698 | 8.000 | 1 * 6 | result_3.3_SR8000_1GB_001nodes_006PEs_c.shrt | result_3.3_SR8000_1GB_001nodes_006PEs_c.gz
5 | 848.999 | 169.800 | 515.719 | 11.506 | 1211.176 | 8.000 | 1 * 5 | result_3.3_SR8000_1GB_001nodes_005PEs_c.shrt | result_3.3_SR8000_1GB_001nodes_005PEs_c.gz
4 | 714.477 | 178.619 | 527.187 | 11.321 | 1216.537 | 8.000 | 1 * 4 | result_3.3_SR8000_1GB_001nodes_004PEs_c.shrt | result_3.3_SR8000_1GB_001nodes_004PEs_c.gz
3 | 541.446 | 180.482 | 537.551 | 11.390 | 1222.115 | 8.000 | 1 * 3 | result_3.3_SR8000_1GB_001nodes_003PEs_c.shrt | result_3.3_SR8000_1GB_001nodes_003PEs_c.gz
2 | 410.553 | 205.276 | 552.597 | 11.462 | 1230.266 | 8.000 | 1 * 2 | result_3.3_SR8000_1GB_001nodes_002PEs_c.shrt | result_3.3_SR8000_1GB_001nodes_002PEs_c.gz
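The third column appears to be the second column divided by the number of PEs (size). The following minimal Python sketch checks that relationship on three rows copied from the table above. The function name `beff_per_pe` and the rounding to three decimals are assumptions made for illustration, not part of the benchmark.

```python
def beff_per_pe(beff_mbyte_s: float, size: int) -> float:
    """Divide the accumulated bandwidth by the number of PEs.

    Rounded to three decimals to match the precision printed in the table
    (the rounding rule is an assumption).
    """
    return round(beff_mbyte_s / size, 3)


# (size, second column in MByte/s, third column as listed in the table)
rows = [
    (24, 1805.675, 75.236),
    (8, 1218.994, 152.374),
    (4, 714.477, 178.619),
]

for size, beff, listed in rows:
    computed = beff_per_pe(beff, size)
    # Allow a small tolerance in case the table was rounded differently.
    matches = abs(computed - listed) < 0.002
    print(f"size={size:2d} computed={computed:.3f} listed={listed:.3f} ok={matches}")
```

The per-PE value falls as the number of PEs grows, which is the trend visible down each group of rows.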
default round-robin order, i.e. ranks 0,3,6,... are on node 0, ranks 1,4,7,... on node 1, ranks 2,5,8,... on node 2:
24 | 915.478 | 38.145 | 110.275 | 23.077 | 741.535 | 8.000 | 3 * 8 | result_3.3_SR8000_1GB_003nodes_024PEs.shrt | result_3.3_SR8000_1GB_003nodes_024PEs.gz
24 | 922.392 | 38.433 | 110.291 | 23.302 | 741.305 | 8.000 | 3 * 8 | result_3.3_SR8000_1GB_003nodes_024PEs_b.shrt | result_3.3_SR8000_1GB_003nodes_024PEs_b.gz
18 | 895.539 | 49.752 | 138.199 | 23.185 | 752.172 | 8.000 | 3 * 6 | result_3.3_SR8000_1GB_003nodes_018PEs.shrt | result_3.3_SR8000_1GB_003nodes_018PEs.gz
12 | 819.624 | 68.302 | 221.940 | 23.075 | 773.075 | 8.000 | 3 * 4 | result_3.3_SR8000_1GB_003nodes_012PEs.shrt | result_3.3_SR8000_1GB_003nodes_012PEs.gz
6 | 618.331 | 103.055 | 361.906 | 23.158 | 785.927 | 8.000 | 3 * 2 | result_3.3_SR8000_1GB_003nodes_006PEs.shrt | result_3.3_SR8000_1GB_003nodes_006PEs.gz
3 | 429.108 | 143.036 | 464.218 | 22.883 | 797.131 | 8.000 | 3 * 1 | result_3.3_SR8000_1GB_003nodes_003PEs.shrt | result_3.3_SR8000_1GB_003nodes_003PEs.gz
16 | 775.710 | 48.482 | 115.840 | 36.781 | 103.655 | 8.000 | 2 * 8 | result_3.3_SR8000_1GB_002nodes_016PEs.shrt | result_3.3_SR8000_1GB_002nodes_016PEs.gz
16 | 768.112 | 48.007 | 115.286 | 29.816 | 128.435 | 8.000 | 2 * 8 | result_3.3_SR8000_1GB_002nodes_016PEs_b.shrt | result_3.3_SR8000_1GB_002nodes_016PEs_b.gz
12 | 768.140 | 64.012 | 158.576 | 23.544 | 770.873 | 8.000 | 2 * 6 | result_3.3_SR8000_1GB_002nodes_012PEs.shrt | result_3.3_SR8000_1GB_002nodes_012PEs.gz
8 | 659.282 | 82.410 | 230.119 | 23.220 | 784.569 | 8.000 | 2 * 4 | result_3.3_SR8000_1GB_002nodes_008PEs.shrt | result_3.3_SR8000_1GB_002nodes_008PEs.gz
8 | 680.633 | 85.079 | 230.015 | 23.430 | 775.896 | 8.000 | 2 * 4 | result_3.3_SR8000_1GB_002nodes_008PEs_b.shrt | result_3.3_SR8000_1GB_002nodes_008PEs_b.gz
6 | 583.527 | 97.254 | 278.203 | 23.182 | 789.365 | 8.000 | 2 * 3 | result_3.3_SR8000_1GB_002nodes_006PEs.shrt | result_3.3_SR8000_1GB_002nodes_006PEs.gz
4 | 495.397 | 123.849 | 390.279 | 23.160 | 792.499 | 8.000 | 2 * 2 | result_3.3_SR8000_1GB_002nodes_004PEs.shrt | result_3.3_SR8000_1GB_002nodes_004PEs.gz
2 | 306.623 | 153.311 | 522.781 | 23.004 | 799.908 | 8.000 | 2 * 1 | result_3.3_SR8000_1GB_002nodes_002PEs.shrt | result_3.3_SR8000_1GB_002nodes_002PEs.gz
8 | 1245.136 | 155.642 | 470.941 | 11.650 | 970.791 | 8.000 | 1 * 8 | result_3.3_SR8000_1GB_001nodes_008PEs.shrt | result_3.3_SR8000_1GB_001nodes_008PEs.gz
6 | 971.215 | 161.869 | 505.246 | 11.563 | 1213.715 | 8.000 | 1 * 6 | result_3.3_SR8000_1GB_001nodes_006PEs.shrt | result_3.3_SR8000_1GB_001nodes_006PEs.gz
4 | 706.801 | 176.700 | 526.974 | 11.521 | 1205.780 | 8.000 | 1 * 4 | result_3.3_SR8000_1GB_001nodes_004PEs.shrt | result_3.3_SR8000_1GB_001nodes_004PEs.gz
2 | 410.471 | 205.236 | 552.259 | 11.549 | 1226.577 | 8.000 | 1 * 2 | result_3.3_SR8000_1GB_001nodes_002PEs.shrt | result_3.3_SR8000_1GB_001nodes_002PEs.gz
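The two placements described above (contiguous ranks on each node versus round-robin order) differ only in how a rank maps to a node. The sketch below spells out both mappings for the 3 * 8 case. The function names are hypothetical, and it assumes every node holds the same number of PEs. Counting ring neighbours (r, r+1) that sit on different nodes is only a rough, assumed proxy for inter-node traffic; the benchmark's actual communication patterns are not described in this table.

```python
def node_contiguous(rank: int, pes_per_node: int) -> int:
    """Contiguous placement: ranks 0..pes_per_node-1 on node 0, and so on."""
    return rank // pes_per_node


def node_round_robin(rank: int, num_nodes: int) -> int:
    """Round-robin placement: ranks 0,3,6,... on node 0 when num_nodes is 3."""
    return rank % num_nodes


def cross_node_ring_neighbours(node_of_rank: list[int]) -> int:
    """Count ring neighbours (r, r+1, with wrap-around) on different nodes."""
    n = len(node_of_rank)
    return sum(node_of_rank[r] != node_of_rank[(r + 1) % n] for r in range(n))


num_nodes, pes_per_node = 3, 8
ranks = range(num_nodes * pes_per_node)

contiguous = [node_contiguous(r, pes_per_node) for r in ranks]
round_robin = [node_round_robin(r, num_nodes) for r in ranks]

print("contiguous :", contiguous)
print("round-robin:", round_robin)
print("cross-node ring neighbours:",
      cross_node_ring_neighbours(contiguous),
      "vs",
      cross_node_ring_neighbours(round_robin))
```

Under these assumptions, round-robin placement puts every pair of neighbouring ranks on different nodes, while contiguous placement does so only at the node boundaries.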
explicitly allocated PEs, but using special additional options (the eighth column lists the options instead of #nodes * #PEs):
24 | 1805.675 | 75.236 | 400.133 | 11.728 | 954.936 | 8.000 | --- | result_3.3_SR8000_1GB_003nodes_024PEs_c.shrt | result_3.3_SR8000_1GB_003nodes_024PEs_c.gz
24 | 1806.033 | 75.251 | 381.057 | 12.003 | 1014.339 | 8.000 | SS | result_3.3_SR8000_1GB_003nodes_024PEs_with_SS.shrt | result_3.3_SR8000_1GB_003nodes_024PEs_with_SS.gz
24 | 1280.068 | 53.336 | 353.586 | 29.933 | 161.925 | 8.000 | SS, 64 | result_3.3_SR8000_1GB_003nodes_024PEs_with_SS_lp64.shrt | result_3.3_SR8000_1GB_003nodes_024PEs_with_SS_lp64.gz
24 | 1225.848 | 51.077 | 295.441 | 27.604 | 144.019 | 8.000 | 64 | result_3.3_SR8000_1GB_003nodes_024PEs_with_lp64.shrt | result_3.3_SR8000_1GB_003nodes_024PEs_with_lp64.gz
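To compare the option variants at a glance, the second-column values can be expressed relative to the run without options ("---"). This is a small Python sketch over the four rows above; the dictionary layout and the name `relative_to_baseline` are assumptions for illustration, and the option labels are copied as-is without interpreting what they do.

```python
# Second-column values in MByte/s for the 24-PE runs, keyed by the options label.
beff_by_options = {
    "---": 1805.675,
    "SS": 1806.033,
    "SS, 64": 1280.068,
    "64": 1225.848,
}


def relative_to_baseline(values: dict[str, float], baseline: str = "---") -> dict[str, float]:
    """Return each value divided by the baseline value, rounded to 3 decimals."""
    base = values[baseline]
    return {label: round(value / base, 3) for label, value in values.items()}


for label, ratio in relative_to_baseline(beff_by_options).items():
    print(f"{label:7s} {ratio:.3f}")
```

The same ratio could be taken for the PingPong columns, where the rows with the 64 option differ most from the baseline.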