abetlen
changed the title from "Add attention and final logit soft-capping to Gemma2" to "Add attention and final logit soft-capping, custom scaling factor to Gemma2" 1 year ago
Remove custom pre attention scaling and use computed value instead.
51f0bd50
abetlen
changed the title from "Add attention and final logit soft-capping, custom scaling factor to Gemma2" to "Add attention and final logit soft-capping, update scaling factor to Gemma2" 1 year ago