text-generation-inference
Bug Fix: Sliding Window Attention
#3112
Merged

Bug Fix: Sliding Window Attention #3112

Narsil merged 11 commits into main from sw_fix
mht-sharma
mht-sharma (fix) sliding window attention
ff82f0f8
mht-sharma (fix) flashinfer
69e0a87d
HuggingFaceDocBuilderDev
mht-sharma (typo) collection link
eaf18c1c
mht-sharma mht-sharma requested a review from drbh drbh 1 year ago
mht-sharma mht-sharma requested a review from Narsil Narsil 1 year ago
mht-sharma Add window_size_left param ipex rocm
b30cdabf
mht-sharma Update window size rocm flash decoding
170a12f3
drbh fix: bump snapshots and improve exceed window test case
e5ec176b
drbh feat: add tests for image types and remove alpha from png
659ce4f3
Narsil Upgrading `from_env` to get token from file when necessary + fix
e5dfd41e
drbh fix: add pillow dependency and bump lock+requirements
2c2fc654
drbh fix: bump org name in gemma3 test
febc488e
Narsil
Narsil dismissed these changes on 2025-03-18
Narsil Fix qwen2.
07808428
Narsil Narsil dismissed their stale review via 07808428 1 year ago
Narsil
Narsil approved these changes on 2025-03-18
Narsil Narsil merged a35fbdb9 into main 1 year ago
Narsil Narsil deleted the sw_fix branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone