llama.cpp
[SYCL] fix mul_mat fault in CI/unit-test
#5862
Merged

NeoZhangJianyu
NeoZhangJianyu fix mul_mat fault in cpy_f32_f16
99b9edab
NeoZhangJianyu rm unused function
ddc12494
NeoZhangJianyu requested a review from ggerganov 1 year ago
NeoZhangJianyu
LiangtaoJin add wait() for memcpy
0dce40a7
abhilash1910 commented on 2024-03-04
ggerganov requested changes on 2024-03-04
NeoZhangJianyu restore ci/run.sh, rename struct defination, fix bug in ggml_sycl_op_…
89524c2f
NeoZhangJianyu fix format issue
32bf3df0
abhilash1910 commented on 2024-03-05
abhilash1910 commented on 2024-03-05
abhilash1910 commented on 2024-03-05
abhilash1910 approved these changes on 2024-03-05
compilade llama : fix segfault from unknown model arch name (#5820)
8899bdb6
ngxson llama : refactor internal quantization functions (#5830)
9758243a
ggerganov scripts : add pod-llama.sh
dcf09d3c
ikawrakow ggml : IQ3_S improvements (#5829)
d0c9a891
cebtenzzre convert-hf : make model class definitions self-contained (#5825)
9285e714
cebtenzzre convert : automatically fall back to HfVocab if tokenizer.model doesn…
0867b91a
ggerganov ggml : fix IQ3_S AVX implementation (#5834)
1a5ed7a2
Xarbirus llama : add abort_callback to interrupt computation (#5409)
506177de
phymbert server: tests: passkey challenge / self-extend with context shift de…
8479e7d4
ggerganov flake.lock: Update (#5842)
756a4ac7
phymbert server : init http requests thread pool with --parallel if set (#5836)
f72df318
phymbert ci : schedule slow server tests only on Release or on demand (#5839)
23a6275f
compilade llama : fix llama_copy_state_data with fragmented KV cache (#5840)
524864d3
Nindaleth gguf-dump : support i-quants (#5841)
8bb872de
iamlemec llama : allow for user specified embedding pooling type (#5849)
e55ee8a2
ggerganov readme : add API changes section
fd4a186d
slaren cuda : fix data race in soft max (#5853)
22dd02a6
dranger003 main : support special tokens as reverse/anti prompt (#5847)
f3a6dd6c
dranger003 common : use LLAMA_DEFAULT_SEED (#5855)
edabfadc
leejet add some new ops, fix some operators and add batch operations to cert…
9e4d115d
ggerganov sync : ggml
b15e7533
ngxson add alias for chat template (#5858)
3ae5525a
mscheong01 speculative : implement stochastic speculative sampling (#5625)
e245d6c8
danemadsen cmake : handle cases where git index is not found in .git (#5844)
465e411f
Xarbirus ggml : introduce ggml_status (ggml/750)
d87093e9
ggerganov sync : ggml
3a44f13b
ggerganov ggml : fix unknown status (#0)
86e4a3bd
ggerganov flake : fix
dabfd53d
ggerganov llama : fix embeddings (#5796)
2e4e9c00
hutli nix: static build (#5814)
6aac3d42
jquesnelle fix speculative decoding build on windows (#5874)
49a84772
NeoZhangJianyu rebase and rm tailing space
96b9179a
NeoZhangJianyu Merge branch 'master' into fix_mul_mat
fa30cc86
abhilash1910 approved these changes on 2024-03-05
abhilash1910 requested a review from ggerganov 1 year ago
ggerganov approved these changes on 2024-03-05
abhilash1910 merged 21b08674 into master 1 year ago