text-generation-inference
Add basic FP8 KV cache support
#2603
Merged

danieldk merged 2 commits into main from feature/fp8-kv-cache
danieldk
HuggingFaceDocBuilderDev
danieldk force pushed from 801cf3b1 to 6264d97c 1 year ago
danieldk force pushed from 2628268e to 37df2ff1 1 year ago
Narsil
Narsil dismissed these changes on 2024-10-04
danieldk dismissed their stale review via 4cc54057 1 year ago
danieldk force pushed from 37df2ff1 to 4cc54057 1 year ago
danieldk Add basic FP8 KV cache support
78d6c27d
danieldk Fix Cargo.toml
ed5c2fb1
danieldk force pushed from 4cc54057 to ed5c2fb1 1 year ago
drbh
drbh commented on 2024-10-04
drbh approved these changes on 2024-10-04
danieldk merged 2358c2bb into main 1 year ago
danieldk deleted the feature/fp8-kv-cache branch 1 year ago
