vllm
0f94cbef - Fix FP8 padding: use torch.where instead of masked_fill_

Commit

39 days ago

Fix FP8 padding: use torch.where instead of masked_fill_ masked_fill_ and indexing assignment are not implemented for float8_e4m3fn. Use torch.where which supports FP8 tensors. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>

References

#41183 - [WideEP] Integrate DeepEP v2

Author

tlrmchlsmth

Parents

ed29278a

vllm 0f94cbef - Fix FP8 padding: use torch.where instead of masked_fill_

vllm
0f94cbef - Fix FP8 padding: use torch.where instead of masked_fill_