llama.cpp
Commit 51f0bd50
1 year ago
Remove custom pre attention scaling and use computed value instead.
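The "computed value" here means deriving the query's pre-attention scale from the model's hyperparameters rather than hardcoding a per-model constant. A minimal sketch of that idea, assuming llama.cpp-style hyperparameter names (n_embd, n_head); the exact derivation shown is an assumption, not the commit's diff:

    #include <cmath>

    struct hparams {
        int n_embd; // embedding width
        int n_head; // number of attention heads
    };

    // Pre-attention query scale computed from the hyperparameters
    // (assumed form: 1/sqrt(n_embd / n_head)) instead of a hardcoded
    // model-specific constant.
    float compute_q_scale(const hparams & hp) {
        return 1.0f / std::sqrt((float)(hp.n_embd / hp.n_head));
    }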
References
add-gemma2-soft-capping
#8197 - Add attention and final logit soft-capping, update scaling factor to Gemma2
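Soft-capping bounds a logit by squashing it through tanh, staying roughly linear near zero while smoothly saturating at the cap. A sketch of the formula the PR title refers to; the function name and its placement are illustrative, not taken from the diff:

    #include <cmath>

    // Soft-cap a logit into (-cap, +cap): cap * tanh(logit / cap).
    float soft_cap(float logit, float cap) {
        return cap * std::tanh(logit / cap);
    }

Per the PR title, this is applied both to attention logits (before softmax) and to the model's final output logits.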
Author
abetlen
Parents
a8942790