Re-implement SYCL backend parallel_for
to improve bandwidth utilization
#451
Job | Run time |
---|---|
17s | |
20s | |
37s |
parallel_for
to improve bandwidth utilization
#451
Job | Run time |
---|---|
17s | |
20s | |
37s |