Reduce amount of compiled CUDA device code #6118
Move CudaKernel definition into a separate header file.
df838a61
Replace include of cuda_common.h with cuda_kernel.h in header files c…
f5eeaf20
Fix build issues for onnxruntime_providers_cuda.
530690a1
Fix build.
e7cfa771
Include cuda_fwd.h instead of cuda_kernel.h in cuda_execution_provide…
3de23731
Add cuda_kernel.h to amd_hipify.py exclusion list.
239be5e2
Add rocm_kernel.h, updates.
bb2c9edc
Fix ROCM build.
cb33f6cb
Exclude fallback_cpu_capability.h/cc from minimal build.
8b5ab2d9
Fix typo.
274a7232
Remove redundant includes of op_kernel.h.
a1f6878e
Remove another redundant include.
ac73c1fa
HectorSVC approved these changes on 2020-12-14
edgchen1 merged 9810b9e0 into master 5 years ago
edgchen1 deleted the edgchen1/cuda_kernel_h branch 5 years ago