onnxruntime
[JS/WebGPU] Add GatherBlockQuantized op support
#21734
Merged

[JS/WebGPU] Add GatherBlockQuantized op support #21734

satyajandhyala
satyajandhyala Added GatherBlockQuantized operator.
3e2e54c0
satyajandhyala Remove templatization
6c349321
satyajandhyala Added int4x2 and uint4x2
c9d465c8
satyajandhyala Revert "Remove templatization"
23219602
satyajandhyala Fixed script to look for ONNX_OPERATOR_TWO_TYPED_KERNEL_CLASS_NAME
4db3348f
satyajandhyala Updated the doc
5c3f33e2
satyajandhyala Added test cases.
cb63d14b
satyajandhyala Added more GatherBlockQunntized op functionality.
6c270028
satyajandhyala Calculate zero-point array index.
3677d60d
satyajandhyala Split signed and unsigned test cases, not group.
07ceae59
satyajandhyala Reapply "Remove templatization"
fe802128
satyajandhyala satyajandhyala added ep:WebGPU
github-advanced-security
github-advanced-security commented on 2024-08-14
satyajandhyala lint
b29d9b20
satyajandhyala satyajandhyala marked this pull request as ready for review 1 year ago
satyajandhyala Add missing semicolon
dd6f95a4
satyajandhyala trim error message
253d4098
satyajandhyala Inserted missing line space.
8da8bd93
satyajandhyala Test using indices input with dims > 1
1ea1f982
satyajandhyala
satyajandhyala commented on 2024-08-14
satyajandhyala updated tensor_helper.cc
1b6accd1
satyajandhyala Updated hint
f75dcbd2
satyajandhyala Merge branch 'main' of github.com:microsoft/onnxruntime into sajandhy…
a0355a86
satyajandhyala format
4f016ca1
satyajandhyala Replaced ternary operator with if-else
219e2b0f
satyajandhyala satyajandhyala requested a review from fs-eire fs-eire 1 year ago
satyajandhyala Use vec instead of array to unpack data and use built-in function unp…
56554da4
satyajandhyala Merge branch 'main' of github.com:microsoft/onnxruntime into sajandhy…
9e9d15ee
satyajandhyala Add code to verify that the indices input is valid.
31a62e2e
satyajandhyala Added (u)int4 in tensor-impl.ts
bf3abaad
satyajandhyala Commented out indices validation code.
963814a3
satyajandhyala Added (u)int4.
5fbb497d
satyajandhyala test related changes
d065cea1
satyajandhyala fixed unused variable.
6a57c346
satyajandhyala everted changes tensor_helper.cc
163866ac
satyajandhyala Updated expected output to match that of wasm.
a519237b
satyajandhyala Use indicesGet/indicesSet to access index out of indices
b93ca99d
satyajandhyala typo
60d6ba80
satyajandhyala renamed dequantize-linear_int4.jsonc as dequantize-linear-int4.jsonc
0a7387d5
satyajandhyala Indices should be normalized before indexing. Added a test case.
9b5eac43
satyajandhyala format JSONC
d62058a3
satyajandhyala Merge branch 'main' of github.com:microsoft/onnxruntime into sajandhy…
2162f92b
satyajandhyala Avoid producing presentKey/presentValue outputs if pastKey/pastValue …
18a39066
satyajandhyala Don't treat empty inputs as undefined in MHA. Let Attention deal with…
9e225640
satyajandhyala Feed pastKey/pastValue inputs down to the functions that generate sha…
c01f721c
satyajandhyala Added a test case with zero-size pastKey/pastValue inputs that requir…
c8b187ff
satyajandhyala Added back the assumption comment.
d0b0627b
satyajandhyala satyajandhyala requested a review from guschmue guschmue 1 year ago
satyajandhyala ShapeUtis.size should return 0 if the tensor dims is empty instead of 1.
d8aeee17
satyajandhyala Revert "ShapeUtis.size should return 0 if the tensor dims is empty in…
7520dc24
satyajandhyala Merge branch 'sajandhy/wepu_fix_attention_output' of github.com:micro…
5b706fe3
guschmue
guschmue requested changes on 2024-08-19
satyajandhyala Skip Indices shape from the cache key.
52e2821c
satyajandhyala format
62e26654
satyajandhyala satyajandhyala requested a review from guschmue guschmue 1 year ago
guschmue
guschmue approved these changes on 2024-08-26
satyajandhyala Merge branch 'main' of github.com:microsoft/onnxruntime into sajandhy…
c5620994
satyajandhyala satyajandhyala merged af18824f into main 1 year ago
satyajandhyala satyajandhyala deleted the sajandhy/webgpu_add_block_quantized_gather branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone