enable static cache on TP model (#39164)
* enable static cache on TP model
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* check tp size before init kv cache
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix docstring
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* add tp tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix comment
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix other cache head size
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>