ZCode FastFormers changes #5827
Add FBGEMM submodule
6ea91f12
Add fbgemm based per-channel quantization
7461ea3c
Add missing logic for pre-layernorm transformer model fusion
641b2dbd
add support for structured pruning architecture -fastformers
33ab92e7
Fix windows build
81f22d1d
Add a default behavior when head_size is not present for the backward…
1f9234db
Remove FBGEMM and default to tensor-wise quantization, column-wise qu…
6accff94
Fixed some unit test errors
55eb2e18
Fix windows compile error and unit test errors
828d6213
Merge branch 'master' into youki/pr-revise
30686f8c
delete the option removed from the upstream
49a86943
ykim362
marked this pull request as ready for review 5 years ago
ykim362
requested a review
5 years ago
Addresses review comments and fixes a merge error
4cf7ee4b
Remove commented out code
38abb2eb
Merge branch 'master' into youki/pr-revise
77fa341a
add non-zero zp support
335e3514
Merge branch 'master' into yufeng/matmul_non_zp
bb515c93
support A and B scale with any dimensions
8b395219
fix build breaks
49f6edab
fix warning in MSVC
745c6371
Fix bug for not checking original float value names when treat it as …
264531b9
Merge commit 'a647da3e1acb5ecb750c19df6b1b85af4572237d' into youki/pr…
35c10d8a
Clean up head size
e3dc7f0d
Clean up python tools
7f8f57fd
Enable per column quantization
1d40fa8d
Merge branch 'yufeng/matmul_non_zp' into youki/percol-test
ae81e733
Merge branch 'zhalei/checking_origin_fpvalue_before_dequantize' into …
e8d11aa7
fix quant weight cleanup bug
bc9e8491
Merge branch 'yufeng/quant_tool_weight' into youki/percol-test
b624e617
A few code clean up
21c86619
Merge branch 'master' into youki/percol-test
6b85a9e3
Some code clean-up
e9003026
Merge branch 'master' into youki/fastformers-pr
71b44d46
Some code clean-up
c1f128c2
Merge branch 'master' into youki/fastformers-pr
03785f13
Change option name
950b9c3a
update default value
2895a273
Rename option and parameter names
d4494600
Missing argument name change
b29d03d5
Add tests for quantization options for attention and matmul
5232d33c
yufenglee
approved these changes
on 2021-05-17
yufenglee
merged
e9057d2e
into master 4 years ago
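For context on the commits above that move between tensor-wise and per-column quantization: the following is a minimal sketch of the difference, assuming symmetric int8 quantization with a zero point of 0. It is not code from this PR; the function names (`quantize_per_tensor`, `quantize_per_column`, `scale_for`, `quantize_symmetric`) are hypothetical and chosen only for illustration.

```python
# Hypothetical illustration only; not taken from the changes in this PR.
# Assumes symmetric int8 quantization (zero point 0, range [-127, 127]).

def scale_for(values):
    """Return a symmetric int8 scale covering the largest magnitude."""
    peak = max(abs(v) for v in values)
    return peak / 127 if peak > 0 else 1.0


def quantize_symmetric(values, scale):
    """Map floats to int8 values, clamping to [-127, 127]."""
    return [max(-127, min(127, round(v / scale))) for v in values]


def quantize_per_tensor(weight):
    """Tensor-wise: a single scale shared by every element of the matrix."""
    scale = scale_for([v for row in weight for v in row])
    return [quantize_symmetric(row, scale) for row in weight], scale


def quantize_per_column(weight):
    """Per-column: each column of the matrix gets its own scale."""
    columns = list(zip(*weight))
    scales = [scale_for(col) for col in columns]
    q_columns = [quantize_symmetric(col, s) for col, s in zip(columns, scales)]
    # Transpose back to row-major layout.
    return [list(row) for row in zip(*q_columns)], scales


if __name__ == "__main__":
    # The second column has much smaller magnitudes than the first.
    weight = [[1.0, 0.005], [-0.25, 0.02]]
    print(quantize_per_tensor(weight))
    print(quantize_per_column(weight))
```

With a shared scale, a column of small values collapses onto a few integer levels; with one scale per column, each column uses the full int8 range, at the cost of storing one scale per column.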
Assignees
No one assigned