iq3_s: PPL improvement

Commit

1 year ago

E.g., for a context of 4096, LLaMA-v2-7B perplexity (PPL) goes from 5.1653 to 5.1340.
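
To put the numbers above in proportion, here is a minimal sketch of how perplexity is conventionally defined (the exponential of the mean negative log-likelihood per token) and of the relative change between the two reported values. The helper names `perplexity` and `relative_change` are hypothetical and are not taken from the commit or from the project's code.

```python
import math


def perplexity(token_logprobs):
    """Perplexity as exp of the mean negative natural-log probability per token."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def relative_change(old, new):
    """Fractional change from old to new; negative means PPL went down (better)."""
    return (new - old) / old


# Values reported in the commit message (LLaMA-v2-7B, context 4096).
ppl_before = 5.1653
ppl_after = 5.1340

delta = relative_change(ppl_before, ppl_after)
print(f"absolute change: {ppl_after - ppl_before:+.4f}")
print(f"relative change: {delta:+.2%}")
```

Lower perplexity is better, so a negative change here is an improvement.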