Megatron-DeepSpeed
Checking we use fused kernels to compute scaled masked softmax on prefix lm
#213
Open

Checking we use fused kernels to compute scaled masked softmax on prefix lm #213

thomasw21 wants to merge 6 commits into main from thomas/improve_test_to_test_custom_kernel
thomasw21
thomasw21 WIP
46d5c334
thomasw21 Turns out there's no issue with the way we build prefix lm
e7a12e73
thomasw21 Lint
16ed6211
thomasw21 Revert "Lint"
071be146
thomasw21 First fix
2aec997f
thomasw21 Tests do not have independent env
e1955f3c

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone