SemanticDiff pytorch
c757647d - [Better Transformer] make is_causal a hint and force attn_mask to be set on `is_causal=True` in F.MHA (#97214)

Loading