avoid copying the entire vector - SemanticDiff

Commit

2 years ago

avoid copying the entire vector

References

#6187 - llama_model_loader: support multiple split/shard GGUFs

#4 - Hp/split/load model (test CI)

Author

phymbert

phymbert

Parents

Loading