llama.cpp
Add Command R Plus support
#6491
Merged

ggerganov merged 15 commits into ggml-org:master from RefractAI:master
RefractAI
Add Command R Plus GGUF
2efcd87b
Add Command R Plus GGUF
fbab9849
bartowski1182
N8python
bartowski1182
Noeda
bartowski1182
candre23
Noeda
N8python
sammcj
Loading works up to LayerNorm2D
e4b2e2d3
RefractAI
sammcj
N8python
github-actions
slaren
N8python
Noeda
Export new tensors in 1D so they are not quantized.
c354db75
RefractAI
RefractAI marked this pull request as draft 2 years ago
Noeda
Noeda
dranger003
Noeda
Noeda
N8python
Noeda
Noeda
sammcj
Fix embedding layer based on Noeda's example
553b09ba
Whitespace
f3532ff8
RefractAI marked this pull request as ready for review 2 years ago
RefractAI
dranger003
Add line
ec613b85
dranger003
slaren
dranger003
slaren
sjug
dranger003
slaren
dranger003
dranger003
Noeda
dranger003
Noeda
dranger003
Noeda
candre23
dranger003
sjug
dranger003
Noeda
Noeda
Noeda
pmysl
Fix unexpected tokens on MPS. Re-add F16 fix. (Noeda)
c2658c3a
RefractAI
dranger003
dranger003: Fix block index overflow in CUDA dequantizing.
6745ea7a
slaren
slaren commented on 2024-04-06
Noeda
dranger003
Reverted blocked multiplication code as it still has issues and could…
26e8f23b
dranger003
slaren
Noeda
dranger003
Noeda
dranger003
Noeda
slaren export norms as f32
ce9413d8
slaren fix overflow issues during quant and other cleanup
78819c07
slaren
slaren
RefractAI Merge pull request #1 from slaren/cmrp-fixes
8b6577bd
RefractAI
slaren
zsqdx
zsqdx approved these changes on 2024-04-07
Noeda
slaren
slaren
slaren approved these changes on 2024-04-07
slaren requested a review from ggerganov 2 years ago
ggerganov
ggerganov approved these changes on 2024-04-07
RefractAI Type convention
d2924073
Noeda
slaren
Noeda
dranger003
sammcj
sammcj approved these changes on 2024-04-08
sammcj
ggerganov
araleza
pmysl
dranger003
ggerganov
teis-e
RefractAI
Noeda
dranger003
teis-e
dranger003
teis-e
candre23
dranger003
candre23
teis-e
dranger003
dranger003: Fix more int overflow during quant.
ea1aeba4
ghchris2021
phymbert
ggerganov
ggerganov merged 5dc9dd71 into master 2 years ago
phymbert
phymbert commented on 2024-04-09
phymbert
phymbert commented on 2024-04-09
kalomaze
dranger003
kalomaze
yamikumo-DSD
christianwengert
yamikumo-DSD
4cecoder
Sintayew4
