Fix `Qwen2AudioForConditionalGeneration.forward()` and `test_flash_attn_kernels_inference_equivalence` #39503
Add missing cache_position argument.
69a6e4b9
gante
commented
on 2025-07-18
Pass cache_position to language model.
73af1077
Overwrite prepare_inputs_for_generation.
d46f20c6
ebezzam
changed the title from "Add missing `cache_position` to `Qwen2AudioForConditionalGeneration.forward()`" to "Fix `Qwen2AudioForConditionalGeneration.forward()`" 317 days ago
Merge branch 'main' into qwen2audio
dc5c33ff
Merge branch 'main' into qwen2audio
dfe5dd8c
gante
approved these changes
on 2025-07-24
Merge branch 'main' into qwen2audio
42fef6b2
Set model to half precision for Flash Attention test.
3a3a470e
Merge branch 'main' into qwen2audio
ee0da864
Merge branch 'main' into qwen2audio
a7f1c7ed
Cast model to bfloat16.
e06ba17a
Merge branch 'qwen2audio' of github.com:ebezzam/transformers into qwe…
3270b436
ebezzam
changed the title from "Fix `Qwen2AudioForConditionalGeneration.forward()`" to "Fix `Qwen2AudioForConditionalGeneration.forward()` and `test_flash_attn_kernels_inference_equivalence`" 311 days ago
ebezzam
merged
7623aa3e
into main 311 days ago
ebezzam
deleted the qwen2audio branch 311 days ago
Assignees
No one assigned