llama.cpp
db978373 - vulkan: perf_logger improvements (#17672)

Commit
13 days ago
vulkan: perf_logger improvements (#17672) * vulkan: perf_logger improvements - Move perf_logger from device to ctx. - Add an env var to control the frequency we dump the stats. If you set a very large value, it just dumps when the ctx is destroyed. - Add a fusion info string to the tracking, only log one item per fused op. - Fix MUL_MAT_ID flops calculation. * fix vector sizes
Author
Parents
Loading