text-generation-inference
fix: prefer inplace softmax to avoid copy
#2661
Merged

drbh fix: prefer inplace softmax to avoid copy
8d7448de
Narsil dismissed these changes on 2024-10-17
Narsil commented on 2024-10-17
drbh dismissed their stale review via 3e0a82d5 1 year ago
drbh Update server/text_generation_server/models/flash_causal_lm.py
3e0a82d5
drbh merged 5f32dea1 into main 1 year ago
drbh deleted the prefer-inplace-softmax-for-prefill-logprobs branch 1 year ago

