llama.cpp
Private batch API (for AI review)
#14
Open

Private batch API (for AI review) #14

ngxson wants to merge 61 commits into master from xsn/private_batch_api
ngxson
ngxson first proposal for private llama_batch
4ed4fe75
ngxson rework, targeting llama-server
f2e59a8e
ngxson move to llama_batch_ext
17d3658b
ngxson server : use llama_batch_ext
85ef80cb
ngxson fix server
aed4a8e9
ngxson llama_decode_ext
4bf7ca39
ngxson Merge branch 'master' into xsn/private_batch_api
a1b1dea3
ngxson adapt common
f0ffd811
ngxson Merge branch 'master' into xsn/private_batch_api
9e75c49d
ngxson correct llama_decode_ext
40989f41
ngxson llama_batch_ext_add_text
1170135d
ngxson remove token_info API
1d6ba977
ngxson apply various in places
46596caf
ngxson Merge branch 'master' into xsn/private_batch_api
17f954c8
ngxson fix merge errors
86973cb1
ngxson return output ID from llama_batch_ext_add/set
4aabf4e8
ngxson apply to the rest
47086fa8
ngxson fix common_batch missing seq_id
9fb2d81e
ngxson compile ok
65f01845
ngxson fix llama_batch_ext_init_from_text
c3dd7900
ngxson rm redundant llama_batch_ext_set_output_last
04f86418
coderabbitai
github-actions github-actions added examples
github-actions github-actions added server
coderabbitai
coderabbitai commented on 2025-03-13
ngxson
coderabbitai
ngxson
coderabbitai
ngxson
coderabbitai
ghost ghost deleted a comment from coderabbitai on 2025-03-13
ngxson
coderabbitai
ngxson correct comment
54566ad9
coderabbitai
coderabbitai commented on 2025-03-13
ngxson bring back mistakenly deleted llama_batch_init/free
bfdddbc1
ngxson
coderabbitai
coderabbitai
coderabbitai commented on 2025-03-13
ngxson
coderabbitai
ngxson
coderabbitai
ngxson fix llama-run n_past
5e6a6d4e
ngxson fix gemma3-cli
32940369
ngxson fix missing n_past in various places
07d84fa3
ngxson
coderabbitai
coderabbitai
coderabbitai commented on 2025-03-14
ngxson fix llama_batch_ext_init_from_embd
ba793696
ngxson qwen2vl: use llama_batch_ext_set_pos
a363251f
ngxson fix compile
8e7714fa
coderabbitai
coderabbitai commented on 2025-03-14
coderabbitai
coderabbitai commented on 2025-03-14
ngxson llama_batch_ext_ptr::from_text/embd
eaffba0f
coderabbitai
coderabbitai commented on 2025-03-14
ngxson rename to init_from_text
116b9a16
ghost ghost deleted a comment from coderabbitai on 2025-03-14
ghost ghost deleted a comment from coderabbitai on 2025-03-14
ngxson
coderabbitai
ngxson fix compile
624a683c
coderabbitai
coderabbitai commented on 2025-03-14
ngxson Update examples/tts/tts.cpp
de788e07
ngxson Apply suggestions from code review
eab5606d
coderabbitai
coderabbitai commented on 2025-03-17
ngxson Merge branch 'master' into xsn/private_batch_api
dc4bb642
coderabbitai
coderabbitai commented on 2025-03-18
ggerganov speculative : adapt to new llama API
7a3c178d
ngxson Merge pull request #15 from ggml-org/xsn/private_batch_api
23d74073
ggerganov android : adapt to new API
b0db7fc2
github-actions github-actions added android
coderabbitai
coderabbitai commented on 2025-03-19
ggerganov swift : adapt to new API
96ca6e8d
coderabbitai
coderabbitai commented on 2025-03-19
ngxson android : fix permission
32c2c41d
coderabbitai
coderabbitai commented on 2025-03-19
ggerganov retrieval : avoid common_batch
6f54ee66
coderabbitai
coderabbitai commented on 2025-03-19
ggerganov embedding : avoid common_batch
8b80d683
ggerganov perplexity : avoid common_batch
76fd7d6f
ggerganov server : avoid common_batch
8a23b4a5
ggerganov server : remove old commented code [no ci]
b8b17327
ngxson Merge pull request #16 from ggml-org/xsn/private_batch_api_pooling_none
bd51d63b
github-actions github-actions added python
ngxson remove C API llama_batch_ext_init_from_text
30f1db99
coderabbitai
coderabbitai commented on 2025-03-20
ngxson Merge branch 'master' into xsn/private_batch_api
c5a01763
ngxson add cpp batch.add_text wrapper
2134cabf
coderabbitai
coderabbitai commented on 2025-03-21
ngxson move various places to batch.add_text
2cec1cff
coderabbitai
coderabbitai commented on 2025-03-21
ngxson add batch.clear() and batch.n_tokens()
3802ff2a
coderabbitai
coderabbitai commented on 2025-03-21
ngxson Merge branch 'master' into xsn/private_batch_api
e8827a6f
ngxson qwen2vl: fix mrope position
a9efdbbc
ngxson Merge branch 'master' into xsn/private_batch_api
1434c2c9
ngxson llama_batch_ext_init with ctx
d18a79ed
coderabbitai
coderabbitai commented on 2025-03-25
ngxson fix qwzn2vl mrope position input
c4fea7fe
ngxson fix build
42062cc2
coderabbitai
coderabbitai commented on 2025-03-25
ngxson fix server
56e82d02
coderabbitai
coderabbitai commented on 2025-03-25
ngxson server: fix batch_spec
50fb3963
ngxson fix embeddings and retrieval
8ec0ff9b
ngxson correct output_id for llama-cpp header
c1f4a78f
coderabbitai
coderabbitai commented on 2025-03-27
coderabbitai
coderabbitai commented on 2025-03-27

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone