cscs-ci run
AMDGPU.jl Benchmarks
| Benchmark suite | Current: 4fee520 | Previous: 756602c | Ratio |
|---|---|---|---|
| amdgpu/synchronization/context/device | 590 ns | 600 ns | 0.98 |
| amdgpu/synchronization/stream/blocking | 240 ns | 240 ns | 1 |
| amdgpu/synchronization/stream/nonblocking | 330 ns | 340 ns | 0.97 |
| array/accumulate/Float32/1d | 86491 ns | 86251 ns | 1.00 |
| array/accumulate/Float32/dims=1 | 400106 ns | 393845 ns | 1.02 |
| array/accumulate/Float32/dims=1L | 135732 ns | 131681 ns | 1.03 |
| array/accumulate/Float32/dims=2 | 130462 ns | 103022 ns | 1.27 |
| array/accumulate/Float32/dims=2L | 2811009 ns | 2827930 ns | 0.99 |
| array/accumulate/Int64/1d | 102171 ns | 96412 ns | 1.06 |
| array/accumulate/Int64/dims=1 | 431536 ns | 285244 ns | 1.51 |
| array/accumulate/Int64/dims=1L | 167673 ns | 160812 ns | 1.04 |
| array/accumulate/Int64/dims=2 | 129382 ns | 120772 ns | 1.07 |
| array/accumulate/Int64/dims=2L | 2984871 ns | 3014433 ns | 0.99 |
| array/broadcast | 111712 ns | 128932 ns | 0.87 |
| array/construct | 1680 ns | 1680 ns | 1 |
| array/copy | 40031 ns | 39371 ns | 1.02 |
| array/copyto!/cpu_to_gpu | 183943 ns | 114832 ns | 1.60 |
| array/copyto!/gpu_to_cpu | 169653 ns | 152432 ns | 1.11 |
| array/copyto!/gpu_to_gpu | 120802 ns | 88321 ns | 1.37 |
| array/iteration/findall/bool | 182062 ns | 181912 ns | 1.00 |
| array/iteration/findall/int | 197023 ns | 190933 ns | 1.03 |
| array/iteration/findfirst/bool | 117111 ns | 114451 ns | 1.02 |
| array/iteration/findfirst/int | 115101 ns | 116331 ns | 0.99 |
| array/iteration/findmin/1d | 170102 ns | 166203 ns | 1.02 |
| array/iteration/findmin/2d | 156552 ns | 156173 ns | 1.00 |
| array/iteration/logical | 352414 ns | 346025 ns | 1.02 |
| array/iteration/scalar | 288194 ns | 289864 ns | 0.99 |
| array/permutedims/2d | 74701 ns | 64761 ns | 1.15 |
| array/permutedims/3d | 74861 ns | 73791 ns | 1.01 |
| array/permutedims/4d | 77321 ns | 76481 ns | 1.01 |
| array/random/rand/Float32 | 52610 ns | 51540 ns | 1.02 |
| array/random/rand/Int64 | 58371 ns | 56210 ns | 1.04 |
| array/random/rand!/Float32 | 149162 ns | 142162 ns | 1.05 |
| array/random/rand!/Int64 | 108062 ns | 141832 ns | 0.76 |
| array/random/randn/Float32 | 100721 ns | 86921 ns | 1.16 |
| array/random/randn!/Float32 | 102141 ns | 152202 ns | 0.67 |
| array/reductions/mapreduce/Float32/1d | 133301 ns | 132902 ns | 1.00 |
| array/reductions/mapreduce/Float32/dims=1 | 95011 ns | 95052 ns | 1.00 |
| array/reductions/mapreduce/Float32/dims=1L | 773831 ns | 777081 ns | 1.00 |
| array/reductions/mapreduce/Float32/dims=2 | 97251 ns | 96731 ns | 1.01 |
| array/reductions/mapreduce/Float32/dims=2L | 298504 ns | 299584 ns | 1.00 |
| array/reductions/mapreduce/Int64/1d | 131302 ns | 133322 ns | 0.98 |
| array/reductions/mapreduce/Int64/dims=1 | 95531 ns | 78081 ns | 1.22 |
| array/reductions/mapreduce/Int64/dims=1L | 783190 ns | 783471 ns | 1.00 |
| array/reductions/mapreduce/Int64/dims=2 | 97622 ns | 96252 ns | 1.01 |
| array/reductions/mapreduce/Int64/dims=2L | 298654 ns | 308254 ns | 0.97 |
| array/reductions/reduce/Float32/1d | 133011 ns | 132802 ns | 1.00 |
| array/reductions/reduce/Float32/dims=1 | 95412 ns | 94832 ns | 1.01 |
| array/reductions/reduce/Float32/dims=1L | 774820 ns | 774621 ns | 1.00 |
| array/reductions/reduce/Float32/dims=2 | 97531 ns | 96802 ns | 1.01 |
| array/reductions/reduce/Float32/dims=2L | 292744 ns | 307245 ns | 0.95 |
| array/reductions/reduce/Int64/1d | 133932 ns | 129672 ns | 1.03 |
| array/reductions/reduce/Int64/dims=1 | 95362 ns | 78151 ns | 1.22 |
| array/reductions/reduce/Int64/dims=1L | 779961 ns | 781931 ns | 1.00 |
| array/reductions/reduce/Int64/dims=2 | 96801 ns | 96192 ns | 1.01 |
| array/reductions/reduce/Int64/dims=2L | 299884 ns | 298414 ns | 1.00 |
| array/reverse/1d | 43830 ns | 44380 ns | 0.99 |
| array/reverse/1dL | 75591 ns | 74131 ns | 1.02 |
| array/reverse/1dL_inplace | 115191 ns | 108282 ns | 1.06 |
| array/reverse/1d_inplace | 76321 ns | 86471 ns | 0.88 |
| array/reverse/2d | 52401 ns | 50661 ns | 1.03 |
| array/reverse/2dL | 102471 ns | 100341 ns | 1.02 |
| array/reverse/2dL_inplace | 127432 ns | 117622 ns | 1.08 |
| array/reverse/2d_inplace | 86751 ns | 95391 ns | 0.91 |
| array/sorting/1d | 394695 ns | 341945 ns | 1.15 |
| integration/byval/reference | 39261 ns | 38830 ns | 1.01 |
| integration/byval/slices=1 | 40301 ns | 40880 ns | 0.99 |
| integration/byval/slices=2 | 149502 ns | 158462 ns | 0.94 |
| integration/byval/slices=3 | 240744 ns | 238013 ns | 1.01 |
| integration/volumerhs | 5038840 ns | 4942659 ns | 1.02 |
| kernel/indexing | 60811 ns | 43630 ns | 1.39 |
| kernel/indexing_checked | 122032 ns | 128022 ns | 0.95 |
| kernel/launch | 1280 ns | 1290 ns | 0.99 |
| kernel/rand | 123381 ns | 106671 ns | 1.16 |
| latency/import | 1496379632 ns | 1501349912 ns | 1.00 |
| latency/precompile | 11972833506 ns | 12041117438 ns | 0.99 |
| latency/ttfp | 10899717433 ns | 10491950084 ns | 1.04 |
This comment was automatically generated by workflow using github-action-benchmark.
Co-authored-by: Simeon David Schaub <simeon@schaub.rocks>
This PR adds an `io` stream argument to `AMDGPU.versioninfo()` so that it is compatible with KA's interface, and implements `KA.versioninfo`.

x-ref JuliaGPU/KernelAbstractions.jl#617 (comment) and JuliaGPU/KernelAbstractions.jl#712
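
For readers unfamiliar with the pattern, here is a minimal sketch of what taking an `io` stream in a `versioninfo`-style function can look like. It is not the code from this PR: `ToyBackend` and its output lines are hypothetical stand-ins, and the only real API assumed is Julia's standard `IO`/`IOBuffer` machinery. The signature mirrors Julia's own `InteractiveUtils.versioninfo(io::IO=stdout)`.

```julia
# Hypothetical stand-in for a GPU backend module; all names are illustrative only.
module ToyBackend

# Taking `io` as an optional first positional argument (defaulting to stdout)
# lets callers redirect the report instead of always printing to the terminal.
function versioninfo(io::IO=stdout)
    println(io, "ToyBackend version info")
    println(io, "  Julia: ", VERSION)
    return nothing
end

end # module

# Default call: prints to stdout, as before.
ToyBackend.versioninfo()

# Explicit stream: capture the report as a String, e.g. for logs or bug reports.
buf = IOBuffer()
ToyBackend.versioninfo(buf)
report = String(take!(buf))
```

Keeping `stdout` as the default means existing zero-argument calls keep working, while generic code that passes a stream through can call the function uniformly.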