
**URL:** https://computing.llnl.gov/projects/co-design/amg2013

- **Team:**
+ **Team:** HPCNepal

## Compilation

@@ -20,23 +20,163 @@ Pull request for Spack recipe changes:

- #### Compiler 1
+ #### gcc 10.3.0

```
- spack install <app>%<compiler1>
+ spack install amg2013%gcc@10.3.0
```

```
- $ spack spec -Il <app>%<compiler1>
+ $ spack spec -Il amg2013%gcc@10.3.0
+
+ ```
+ ```
+ Input spec
+ --------------------------------
+
+
+ Concretized
+ --------------------------------
+ [+] twfpbbe amg2013@master%gcc@10.3.0 ~assumedpartition~int64+openmp arch=linux-amzn2-graviton2
+ [+] zvamksn ^[email protected] %[email protected] ~atomics~cuda~cxx~cxx_exceptions+gpfs~internal-hwloc~java~legacylaunchers~lustre~memchecker+pmi~singularity~sqlite3+static~thread_multiple+vt+wrapper-rpath fabrics=ofi patches=60ce20bc14d98c572ef7883b9fcd254c3f232c2f3a13377480f96466169ac4c8 schedulers=slurm arch=linux-amzn2-graviton2
+ [+] cukmqbg ^[email protected] %[email protected] ~cairo~cuda~gl~libudev+libxml2~netloc~nvml+pci+shared arch=linux-amzn2-graviton2
+ [+] asgtk6a ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] z2uysov ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] ebhjpix ^[email protected] %[email protected] +sigsegv patches=3877ab548f88597ab2327a2230ee048d2d07ace1062efe81fc92e91b7f39cd00,fc9b61654a3ba1a8d6cd78ce087e7c96366c290bc8d2c299f09828d793b853c8 arch=linux-amzn2-graviton2
+ [+] ltbv6bk ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] s4pw7zm ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] 4xr3hhh ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] iyhm3wi ^[email protected] %[email protected] ~python arch=linux-amzn2-graviton2
+ [+] y5ei3cm ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] ye3kcvv ^[email protected] %[email protected] ~pic libs=shared,static arch=linux-amzn2-graviton2
+ [+] qepjcvj ^[email protected] %[email protected] +optimize+pic+shared arch=linux-amzn2-graviton2
+ [+] iwzirqc ^[email protected] %[email protected] ~symlinks+termlib abi=none arch=linux-amzn2-graviton2
+ [+] tadxrfp ^[email protected] %[email protected] +openssl arch=linux-amzn2-graviton2
+ [+] 5i3lgfb ^[email protected] %[email protected] ~docs+systemcerts arch=linux-amzn2-graviton2
+ [+] 4m7exgb ^[email protected] %[email protected] +cpanm+shared+threads arch=linux-amzn2-graviton2
+ [+] y42m6yr ^[email protected] %[email protected] +cxx~docs+stl patches=b231fcc4d5cff05e5c3a4814f6a5af0e9a966428dc2176540d2c05aff41de522 arch=linux-amzn2-graviton2
+ [+] rqrpmap ^[email protected] %[email protected] ~debug~pic+shared arch=linux-amzn2-graviton2
+ [+] 2w7bert ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] wjwqncx ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] 3zy7kxk ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] 72f5gvk ^[email protected] %[email protected] ~debug~kdreg fabrics=sockets,tcp,udp arch=linux-amzn2-graviton2
+ [+] mhav5gn ^[email protected] %[email protected] patches=4e1d78cbbb85de625bad28705e748856033eaafab92a66dffd383a3d7e00cc94,62fc8a8bf7665a60e8f4c93ebbd535647cebf74198f7afafec4c085a8825c006 arch=linux-amzn2-graviton2
+ [+] jkuhz64 ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] xb2w5nc ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] wturp6c ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] ivotdt7 ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] wqpuvmh ^slurm@20-02-4-1%gcc@10.3.0 ~gtk~hdf5~hwloc~mariadb~pmix+readline~restd sysconfdir=PREFIX/etc arch=linux-amzn2-graviton2
+
+ ```
+
+
+ #### arm 21.0.0.879
+
+ ```
+ spack install amg2013%arm@21.0.0.879
+ ```
+
+ ```
+ $ spack spec -Il amg2013%arm@21.0.0.879
+
+ ```
+
+ ```
+ Input spec
+ --------------------------------
+
+
+ Concretized
+ --------------------------------
+ ==> Warning: arm@21.0.0.879 cannot build optimized binaries for "graviton2". Using best target possible: "aarch64"
+ [+] q6x6lzj amg2013@master%arm@21.0.0.879 ~assumedpartition~int64+openmp arch=linux-amzn2-aarch64
+ [+] lmaoy5t ^[email protected] %[email protected] ~atomics~cuda~cxx~cxx_exceptions+gpfs~internal-hwloc~java~legacylaunchers~lustre~memchecker+pmi~singularity~sqlite3+static~thread_multiple+vt+wrapper-rpath fabrics=ofi patches=60ce20bc14d98c572ef7883b9fcd254c3f232c2f3a13377480f96466169ac4c8 schedulers=slurm arch=linux-amzn2-aarch64
+ [+] xl6anaa ^[email protected] %[email protected] ~cairo~cuda~gl~libudev+libxml2~netloc~nvml+pci+shared arch=linux-amzn2-aarch64
+ [+] jueqz7p ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] e4ssqx6 ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] i2jmeo4 ^[email protected] %[email protected] +sigsegv patches=3877ab548f88597ab2327a2230ee048d2d07ace1062efe81fc92e91b7f39cd00,fc9b61654a3ba1a8d6cd78ce087e7c96366c290bc8d2c299f09828d793b853c8 arch=linux-amzn2-aarch64
+ [+] 6jhzlul ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] zpuzm23 ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] uwcxkin ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] dypqz2i ^[email protected] %[email protected] ~python arch=linux-amzn2-aarch64
+ [+] 7vnthzn ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] zqsab4f ^[email protected] %[email protected] ~pic libs=shared,static arch=linux-amzn2-aarch64
+ [+] puuxvg2 ^[email protected] %[email protected] +optimize+pic+shared arch=linux-amzn2-aarch64
+ [+] uhtqtlb ^[email protected] %[email protected] ~symlinks+termlib abi=none arch=linux-amzn2-aarch64
+ [+] gonqskn ^[email protected] %[email protected] +openssl arch=linux-amzn2-aarch64
+ [+] vc3waha ^[email protected] %[email protected] ~docs+systemcerts arch=linux-amzn2-aarch64
+ [+] vv6txro ^[email protected] %[email protected] +cpanm+shared+threads arch=linux-amzn2-aarch64
+ [+] 33wiajj ^[email protected] %[email protected] +cxx~docs+stl patches=b231fcc4d5cff05e5c3a4814f6a5af0e9a966428dc2176540d2c05aff41de522 arch=linux-amzn2-aarch64
+ [+] z4ybgri ^[email protected] %[email protected] ~debug~pic+shared arch=linux-amzn2-aarch64
+ [+] adtc6yc ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] 645q4qj ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] 3haw5gt ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] qdn27nh ^[email protected] %[email protected] ~debug~kdreg fabrics=sockets,tcp,udp arch=linux-amzn2-aarch64
+ [+] mv2g7r5 ^[email protected] %[email protected] patches=4e1d78cbbb85de625bad28705e748856033eaafab92a66dffd383a3d7e00cc94,62fc8a8bf7665a60e8f4c93ebbd535647cebf74198f7afafec4c085a8825c006 arch=linux-amzn2-aarch64
+ [+] dcs645r ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] edezkz3 ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] 6vvthuo ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] xe4evc4 ^[email protected] %[email protected] arch=linux-amzn2-aarch64
+ [+] x5xehti ^slurm@20-02-4-1%arm@21.0.0.879 ~gtk~hdf5~hwloc~mariadb~pmix+readline~restd sysconfdir=PREFIX/etc arch=linux-amzn2-aarch64
+
+ ```
+
+
+ #### nvhpc 21.2
+
+ ```
+ spack install amg2013%nvhpc@21.2
+ ```
+
+ ```
+ $ spack spec -Il amg2013%nvhpc@21.2
+
+ ```
+ ```
+ Input spec
+ --------------------------------
+
+
+ Concretized
+ --------------------------------
+ [+] eiqymzo amg2013@master%nvhpc@21.2 ~assumedpartition~int64+openmp arch=linux-amzn2-graviton2
+ [+] krxyvbc ^[email protected] %[email protected] ~atomics~cuda~cxx~cxx_exceptions+gpfs~internal-hwloc~java~legacylaunchers~lustre~memchecker+pmi~singularity~sqlite3+static~thread_multiple+vt+wrapper-rpath fabrics=ofi patches=60ce20bc14d98c572ef7883b9fcd254c3f232c2f3a13377480f96466169ac4c8,fba0d3a784a9723338722b48024a22bb32f6a951db841a4e9f08930a93f41d7a schedulers=slurm arch=linux-amzn2-graviton2
+ [+] jroqews ^[email protected] %[email protected] ~cairo~cuda~gl~libudev+libxml2~netloc~nvml+pci+shared arch=linux-amzn2-graviton2
+ [+] e4m4ued ^[email protected] %[email protected] patches=6e08dc445ece06e9e8b1344397f2d3f169005703ddc0f2ae24f366cde78c7377 arch=linux-amzn2-graviton2
+ [+] kk4ax3i ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] 6c4kz5g ^[email protected] %[email protected] +sigsegv patches=3877ab548f88597ab2327a2230ee048d2d07ace1062efe81fc92e91b7f39cd00,5746cf51f45b405661c3edae7a78c33d41e54d83f635d16e2bf1f956dbfbf635,fc9b61654a3ba1a8d6cd78ce087e7c96366c290bc8d2c299f09828d793b853c8 arch=linux-amzn2-graviton2
+ [+] pa6wm5j ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] vtiml6g ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] 4imdwuy ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] wo4l72s ^[email protected] %[email protected] ~python patches=05ff238cf435825ef835c7ae39376b52dc83d8caf19e962f0766c841386a305a,10a88ad47f9797cf7cf2d7d07241f665a3b6d1f31fa026728c8c2ae93e1664e9 arch=linux-amzn2-graviton2
+ [+] r7mmkdp ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] br733tn ^[email protected] %[email protected] ~pic libs=shared,static arch=linux-amzn2-graviton2
+ [+] 4js6ect ^[email protected] %[email protected] +optimize+pic+shared arch=linux-amzn2-graviton2
+ [+] asgm7mt ^[email protected] %[email protected] ~symlinks+termlib abi=none arch=linux-amzn2-graviton2
+ [+] uttaumr ^[email protected] %[email protected] +openssl arch=linux-amzn2-graviton2
+ [+] j2qhi7h ^[email protected] %[email protected] ~docs+systemcerts arch=linux-amzn2-graviton2
+ [+] gn4fgp5 ^[email protected] %[email protected] +cpanm+shared+threads patches=21cf6a73cec16760f8de2e8895ace1299aff2d8e92dc581cd18f1d95a4503048 arch=linux-amzn2-graviton2
+ [+] 5uyf3k4 ^[email protected] %[email protected] +cxx~docs+stl patches=b231fcc4d5cff05e5c3a4814f6a5af0e9a966428dc2176540d2c05aff41de522 arch=linux-amzn2-graviton2
+ [+] wsi7g3j ^[email protected] %[email protected] ~debug~pic+shared arch=linux-amzn2-graviton2
+ [+] s4mb5no ^[email protected] %[email protected] patches=6e42dc243f17aab29fd167f060f5bc1f08813e03368eb301b43c95d4b1386681 arch=linux-amzn2-graviton2
+ [+] m2wdbeo ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] zori3wf ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] xl6zavq ^[email protected] %[email protected] ~debug~kdreg fabrics=sockets,tcp,udp arch=linux-amzn2-graviton2
+ [+] 5yq4tpw ^[email protected] %[email protected] patches=4e1d78cbbb85de625bad28705e748856033eaafab92a66dffd383a3d7e00cc94,62fc8a8bf7665a60e8f4c93ebbd535647cebf74198f7afafec4c085a8825c006 arch=linux-amzn2-graviton2
+ [+] fo57byt ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] gmd4264 ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] cl3ohqo ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] yvqpq74 ^[email protected] %[email protected] arch=linux-amzn2-graviton2
+ [+] zehhooy ^slurm@20-02-4-1%nvhpc@21.2 ~gtk~hdf5~hwloc~mariadb~pmix+readline~restd sysconfdir=PREFIX/etc arch=linux-amzn2-graviton2

```

- ## Test Case 1
+ ## Test Case

[ReFrame Benchmark 1](#)

```
- ../bin/reframe -c benchmark.py -r --performance-report
+ ../bin/reframe -c amg2013_short.py -r --performance-report
```

### Validation
@@ -50,19 +190,128 @@ Details of the validation for `Test Case 1`.
==============================================================================
PERFORMANCE REPORT
------------------------------------------------------------------------------
- ****
+ amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_1
+ - aws:c6gn
+ - builtin
+ * num_tasks: 1
+ * Total Time: 1848879.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_2
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2640431.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_4
+ - builtin
+ * num_tasks: 1
+ * Total Time: 3499481.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_8
+ - builtin
+ * num_tasks: 1
+ * Total Time: 3551646.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_16
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2584852.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_32
+ - builtin
+ * num_tasks: 1
+ * Total Time: 1513061.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_64
+ - builtin
+ * num_tasks: 1
+ * Total Time: 830065.2 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__arm_21_0_0_879_N_1_MPI_1_OMP_1
+ - builtin
+ * num_tasks: 1
+ * Total Time: 1702493.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__arm_21_0_0_879_N_1_MPI_1_OMP_2
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2134396.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__arm_21_0_0_879_N_1_MPI_1_OMP_4
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2906109.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__arm_21_0_0_879_N_1_MPI_1_OMP_8
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2793329.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__arm_21_0_0_879_N_1_MPI_1_OMP_16
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2597610.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__arm_21_0_0_879_N_1_MPI_1_OMP_32
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2092261.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__arm_21_0_0_879_N_1_MPI_1_OMP_64
+ - builtin
+ * num_tasks: 1
+ * Total Time: 1628985.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_1
+ - builtin
+ * num_tasks: 1
+ * Total Time: 1765272.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_2
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2285842.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_4
+ - builtin
+ * num_tasks: 1
+ * Total Time: 3051910.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_8
+ - builtin
+ * num_tasks: 1
+ * Total Time: 3160689.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_16
+ - builtin
+ * num_tasks: 1
+ * Total Time: 2496395.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_32
+ - builtin
+ * num_tasks: 1
+ * Total Time: 1239280.0 s
+ ------------------------------------------------------------------------------
+ amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_64
+ - builtin
+ * num_tasks: 1
+ * Total Time: 487107.5 s
------------------------------------------------------------------------------
```
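
The report above lists one `Total Time` per compiler and OpenMP thread count, so it is easier to compare once collected into a lookup table. The following is a minimal sketch of one way to do that, not part of the submission: `parse_report` and the regular expressions are hypothetical helpers, and they assume the test names keep the `amg2013__<compiler>_<version>_N_<n>_MPI_<m>_OMP_<t>` pattern shown in the report.

```python
import re

# Assumed name layout, taken from the report above, e.g.
# amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_16
NAME_RE = re.compile(
    r"amg2013__(?P<compiler>[a-z]+)_(?P<version>[0-9_]+)"
    r"_N_(?P<nodes>\d+)_MPI_(?P<mpi>\d+)_OMP_(?P<omp>\d+)"
)
TIME_RE = re.compile(r"\*\s*Total Time:\s*([0-9.]+)")


def parse_report(text):
    """Return {(compiler, omp_threads): total_time} from report text."""
    results = {}
    current = None
    for line in text.splitlines():
        name = NAME_RE.search(line)
        if name:
            # Remember which test the following "Total Time" line belongs to.
            current = (name.group("compiler"), int(name.group("omp")))
            continue
        time = TIME_RE.search(line)
        if time and current is not None:
            results[current] = float(time.group(1))
            current = None
    return results


# Two entries copied from the report above.
sample = """\
amg2013_amg2013_0_amg2013__gcc_10_3_0_N_1_MPI_1_OMP_1
- builtin
* num_tasks: 1
* Total Time: 1848879.0 s
------------------------------------------------------------------------------
amg2013_amg2013_0_amg2013__nvhpc_21_2_N_1_MPI_1_OMP_64
- builtin
* num_tasks: 1
* Total Time: 487107.5 s
"""

times = parse_report(sample)
for (compiler, threads), total in sorted(times.items()):
    print(f"{compiler:6s} OMP={threads:<3d} {total}")
```

Keying the dictionary on `(compiler, omp_threads)` keeps the three compilers separate while still allowing a per-thread-count comparison across them.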

### On-node Compiler Comparison

Performance comparison of three compilers (GCC, ARM, NVHPC).
-
- | Cores | Compiler 1 | Compiler 2 |
- | -------| ------------| ------------|
- | | | |
-
-
+ ```
+ | Cores (OMP threads) | GCC     | ARM     | NVHPC   |
+ |---------------------|---------|---------|---------|
+ | 1                   | 10.295  | 7.300   | 19.736  |
+ | 2                   | 9.551   | 26.67   | 18.709  |
+ | 4                   | 8.805   | 25.239  | 16.558  |
+ | 8                   | 8.450   | 22.822  | 17.390  |
+ | 16                  | 9.473   | 7.812   | 12.429  |
+ | 32                  | 6.992   | 279.186 | 276.317 |
+ | 64                  | 281.773 | 9.072   | 12.124  |
+ ```
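
A comparison table like the one above is easy to get wrong by hand, for example by giving the separator row fewer columns than the header. The sketch below shows one way to generate the rows programmatically so that every row has the same column count. `render_table` is a hypothetical helper, not something provided by ReFrame or Spack, and the sample values are copied from the first two rows of the table as strings so their formatting is preserved.

```python
def render_table(columns, rows):
    """Render a Markdown table; `rows` maps a row label to its cell values."""
    header = "| " + " | ".join(columns) + " |"
    # One separator cell per column keeps header and separator in step.
    separator = "|" + "|".join("---" for _ in columns) + "|"
    lines = [header, separator]
    for label, values in rows.items():
        if len(values) != len(columns) - 1:
            raise ValueError(
                f"row {label!r} has {len(values)} values, "
                f"expected {len(columns) - 1}"
            )
        cells = [str(label)] + [str(v) for v in values]
        lines.append("| " + " | ".join(cells) + " |")
    return "\n".join(lines)


columns = ["Cores (OMP threads)", "GCC", "ARM", "NVHPC"]
rows = {
    1: ["10.295", "7.300", "19.736"],
    2: ["9.551", "26.67", "18.709"],
}
table = render_table(columns, rows)
print(table)
```

Rejecting rows with the wrong number of cells catches a missing compiler column before the table is pasted into the report.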
### Serial Hot-spot Profile

List of top-10 functions / code locations from a serial profile.