SYCL: Implement few same quantized type copy kernels #13739
qnixsynapse
marked this pull request as draft 1 year ago
qnixsynapse
marked this pull request as ready for review 1 year ago
qnixsynapse
marked this pull request as draft 1 year ago
qnixsynapse
force pushed
from
3cdc64b9
to
c8c22786
1 year ago
qnixsynapse
marked this pull request as ready for review 1 year ago
SYCL: Implement few same quantized type copy kernels
c26934dd
Use memcpy for copying contiguous tensors
608e8811
feat(sycl): add contiguous tensor copy support and device checks
faeb7f34
refactor: replace specific block copy functions with template
b36c550d
Exclude BF16 support for COPY tensors for now
b6db0056
perf: adjust SYCL copy kernel block sizes for efficiency
4aa261af
Rbiessy
approved these changes
on 2025-06-06
qnixsynapse
deleted the sycl/same_q_cpy branch 360 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub