llama.cpp
Min P sampler implementation [alternative to Top P/Top K]
#3841
Merged

Min P sampler implementation [alternative to Top P/Top K] #3841

Green-Sky merged 25 commits into ggml-org:master from kalomaze:min-p-sampling
kalomaze
ggerganov cuda : prints wip
59d1232e
ggerganov cuda : new cublas gemm branch for multi-batch quantized src0
52af7826
ggerganov cuda : add F32 sgemm branch
16b60dd7
ggerganov cuda : fine-tune >= VOLTA params + use MMQ only for small batches
a3c28439
ggerganov cuda : remove duplicated cuBLAS GEMM code
4c6744b5
ggerganov cuda : add CUDA_USE_TENSOR_CORES and GGML_CUDA_FORCE_MMQ macros
a4e15a36
ggerganov build : add compile option to force use of MMQ kernels
49af767f
kalomaze Super hacky starting implementation of Min P
a9e2b74f
kalomaze
cebtenzzre cebtenzzre marked this pull request as draft 1 year ago
KerfuffleV2
KerfuffleV2 commented on 2023-10-28
KerfuffleV2
KerfuffleV2 commented on 2023-10-28
KerfuffleV2
KerfuffleV2 commented on 2023-10-28
KerfuffleV2
KerfuffleV2 commented on 2023-10-28
KerfuffleV2
KerfuffleV2 commented on 2023-10-28
kalomaze Transform Min P into a proper CLI option
a235a0d2
KerfuffleV2
KerfuffleV2 commented on 2023-10-29
kalomaze Min P disabled if set to 1.0 or 0, otherwise Top P
838d58dc
kalomaze
kalomaze commented on 2023-10-29
kalomaze Debugging print statements removed
69ef4ca8
kalomaze erring on the side of caution; disable by default
833637b7
kalomaze
kalomaze commented on 2023-10-29
kalomaze
kalomaze Remove accidentally kept prints + min_keep support
62fc7715
kalomaze Standardize 0.0 disabling min_p upon feedback
49b68e82
KerfuffleV2
KerfuffleV2 commented on 2023-10-29
KerfuffleV2
KerfuffleV2 commented on 2023-10-29
kalomaze Simplified counter by checking candidates size
6f7cdec3
kalomaze minor whitespace fix
cb233584
KerfuffleV2
KerfuffleV2 commented on 2023-10-29
kalomaze Even formatting + exclusively 0.0f to disable now
fcbbfc16
kalomaze kalomaze marked this pull request as ready for review 1 year ago
cebtenzzre
KerfuffleV2
KerfuffleV2 commented on 2023-10-29
kalomaze
cebtenzzre cleanup
69e638e5
cebtenzzre
cebtenzzre commented on 2023-10-29
ivanstepanovftw
cebtenzzre permit simultaneous use of top_p and min_p
3ddfd67d
ivanstepanovftw
ivanstepanovftw commented on 2023-10-29
ivanstepanovftw
ivanstepanovftw commented on 2023-10-29
kalomaze Merge remote-tracking branch 'original/cuda-quantum-batch' into min-p…
18c0aa7c
kalomaze Merge branch 'min-p-sampling' of https://github.com/kalomaze/koboldcp…
87adfad2
kalomaze
ivanstepanovftw
ivanstepanovftw approved these changes on 2023-10-30
Green-Sky
Green-Sky
AlpinDale
KerfuffleV2
Green-Sky
Green-Sky requested changes on 2023-10-30
Green-Sky
kalomaze
KerfuffleV2
cebtenzzre
ivanstepanovftw
Green-Sky
LostRuins
kalomaze Update README & set 0.05 default
9248325f
kalomaze added a bit more context to the README
512cac63
kalomaze
cebtenzzre
cebtenzzre commented on 2023-10-31
Green-Sky
Green-Sky commented on 2023-10-31
cebtenzzre
cebtenzzre commented on 2023-10-31
ggerganov
ggerganov approved these changes on 2023-10-31
kalomaze Update README for consistency
974640ac
kalomaze forgot one small thing!
3b58af26
kalomaze kalomaze requested a review from Green-Sky Green-Sky 1 year ago
Mihaiii
kalomaze
Green-Sky
Green-Sky approved these changes on 2023-10-31
kalomaze
Green-Sky Green-Sky merged 238657db into master 1 year ago
Mihaiii
cebtenzzre
ivanstepanovftw
ivanstepanovftw
ivanstepanovftw commented on 2023-11-01
ggerganov
ggerganov
m18coppola
staviq
LostRuins
ggerganov
cebtenzzre
LostRuins
pacmanincarnate
LostRuins
ivanstepanovftw
cebtenzzre
kalomaze
cebtenzzre
ivanstepanovftw
ZoomRmc
ivanstepanovftw
Arcitec
ZoomRmc
Arcitec

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone