auto-round
b698213d - enable llava int4 inference with autoround format (#237)

Commit
1 year ago
enable llava int4 inference with autoround format (#237) Signed-off-by: Zhang, Weiwei1 <weiwei1.zhang@intel.com> Co-authored-by: wenhuach21 <wenhua.cheng@intel.com>
Author
Parents
Loading