llama.cpp
Feature: Integrate with unified SYCL backend for Intel GPUs
#2690
Merged

Feature: Integrate with unified SYCL backend for Intel GPUs #2690

ggerganov merged 92 commits into ggml-org:master from abhilash1910:sycl
abhilash1910
abhilash1910 abhilash1910 marked this pull request as draft 2 years ago
ggerganov
Jacoby1218
abhilash1910
unbrice
abhilash1910
itlackey
JohnnyOpcode
ggerganov
AlexFierro9
netrunnereve
itlackey
netrunnereve
abhilash1910 abhilash1910 marked this pull request as ready for review 2 years ago
abhilash1910
koech-v
mgolub2
AidanBeltonS
AidanBeltonS commented on 2024-01-22
AidanBeltonS
AidanBeltonS commented on 2024-01-22
AidanBeltonS
AidanBeltonS commented on 2024-01-22
Green-Sky
Green-Sky commented on 2024-01-22
AidanBeltonS
AidanBeltonS commented on 2024-01-22
AidanBeltonS
AidanBeltonS commented on 2024-01-22
Jacoby1218
NeoZhangJianyu
NeoZhangJianyu first update for migration
7a4343df
NeoZhangJianyu update init_cublas
23387693
NeoZhangJianyu add debug functio, commit all help code
0c00b4f6
NeoZhangJianyu step 1
ff837110
NeoZhangJianyu step 2
02dffb68
NeoZhangJianyu step3 add fp16, slower 31->28
43f2c358
NeoZhangJianyu add GGML_LIST_DEVICE function
da752eda
NeoZhangJianyu step 5 format device and print
6dd32789
NeoZhangJianyu step6, enhance error check, remove CUDA macro, enhance device id to f…
3a9d2c54
NeoZhangJianyu support main device is non-zero
65f895d4
NeoZhangJianyu step7 add debug for code path, rm log
3b1a743e
NeoZhangJianyu step 8, rename all macro & func from cuda by sycl
c2ef7a9c
NeoZhangJianyu fix error of select non-zero device, format device list
69d76c8b
NeoZhangJianyu ren ggml-sycl.hpp -> ggml-sycl.h
c709c3cb
NeoZhangJianyu clear CMAKE to rm unused lib and options
fa3a5860
NeoZhangJianyu correct queue: rm dtct:get_queue
3645f25d
NeoZhangJianyu add print tensor function to debug
bd38129a
NeoZhangJianyu fix error: wrong result in 658746bb26702e50f2c59c0e4ada8e9da6010481
5b538994
NeoZhangJianyu summary dpct definition in one header file to replace folder:dpct
a47f5ec4
NeoZhangJianyu refactor device log
c67c2ab2
NeoZhangJianyu mv dpct definition from folder dpct to ggml-sycl.h
c3c5b20a
NeoZhangJianyu update readme, refactor build script
ca2cb698
NeoZhangJianyu fix build with sycl
95daece9
NeoZhangJianyu set nthread=1 when sycl, increase performance
a8936f49
NeoZhangJianyu add run script, comment debug code
79d30d77
NeoZhangJianyu add ls-sycl-device tool
0d6e7219
NeoZhangJianyu add ls-sycl-device, rm unused files
7350fd48
NeoZhangJianyu rm rear space
09b5619d
NeoZhangJianyu dos2unix
d80dd65f
NeoZhangJianyu Update README_sycl.md
593ce001
luoyu-intel fix return type
57e9fbad
luoyu-intel remove sycl version from include path
d5f7d364
NeoZhangJianyu restore rm code to fix hang issue
35a0daaa
NeoZhangJianyu add syc and link for sycl readme
ae941b1b
NeoZhangJianyu rm original sycl code before refactor
e3481faa
luoyu-intel fix code err
623d8031
NeoZhangJianyu add know issue for pvc hang issue
f396a3b6
luoyu-intel enable SYCL_F16 support
f008cc7b
abhilash1910
airMeng align pr4766
67e6b3cb
airMeng airMeng force pushed from 991e9cba to 67e6b3cb 1 year ago
airMeng
Jacoby1218
NeoZhangJianyu check for sycl blas, better performance
533c647d
abhilash1910 cleanup 1
dd7f1396
abhilash1910 abhilash1910 changed the title [Draft] Feature: Integrate with SYCL backend for Intel GPUs Feature: Integrate with unified SYCL backend for Intel GPUs , Nvidia and AMD 1 year ago
airMeng remove extra endif
b4037842
NeoZhangJianyu add build&run script, clean CMakefile, update guide by review comments
a0a1304b
NeoZhangJianyu Merge branch 'sycl' of https://github.com/abhilash1910/llama.cpp into…
27c08c04
NeoZhangJianyu rename macro to intel hardware
97cbe18d
abhilash1910 abhilash1910 changed the title Feature: Integrate with unified SYCL backend for Intel GPUs , Nvidia and AMD Feature: Integrate with unified SYCL backend for Intel GPUs 1 year ago
abhilash1910
abhilash1910 editor config format
1ddaf44c
abhilash1910 format fixes
bd716b25
abhilash1910 format fixes
be31379e
abhilash1910
NeoZhangJianyu
abhilash1910 editor format fix
d097e2a4
abhilash1910 Remove unused headers
88f64b7d
NeoZhangJianyu skip build sycl tool for other code path
756c4acc
abhilash1910
NeoZhangJianyu replace tab by space
b42a32d3
abhilash1910 fix blas matmul function
5f83a123
ggerganov
abhilash1910
ggerganov
ggerganov commented on 2024-01-23
ggerganov ggerganov added need feedback
ggerganov
abhilash1910 fix mac build
d6fc1a03
abhilash1910 restore hip dependency
c7e745e6
NeoZhangJianyu fix conflict
3bfb846d
NeoZhangJianyu ren as review comments
498121b1
NeoZhangJianyu mv internal function to .cpp file
91b14610
NeoZhangJianyu export funciton print_sycl_devices(), mv class dpct definition to sou…
816f480e
NeoZhangJianyu
NeoZhangJianyu update CI/action for sycl code, fix CI error of repeat/dup
7a44a95b
NeoZhangJianyu fix action ID format issue
7babd769
NeoZhangJianyu rm unused strategy
04a46c46
airMeng enable llama_f16 in ci
799af056
airMeng airMeng force pushed 1 year ago
airMeng airMeng force pushed to 799af056 1 year ago
airMeng airMeng force pushed from 799af056 1 year ago
airMeng airMeng force pushed to 799af056 1 year ago
NeoZhangJianyu fix conflict
ec5c8bc0
NeoZhangJianyu fix build break on MacOS, due to CI of MacOS depend on external ggml,…
22e1b45c
koech-v
koech-v approved these changes on 2024-01-24
abhilash1910 Merge branch 'master' into sycl
238ec31a
slaren
slaren requested changes on 2024-01-24
slaren
NeoZhangJianyu fix ci cases for unsupported data type
67de350e
airMeng revert unrelated changed in cuda cmake
fb15de38
airMeng revert hip cmake changes
96186a74
abhilash1910
airMeng fix indent
d07a88d4
NeoZhangJianyu add prefix in func name
8dd1b60a
airMeng revert no mmq
3aabd8a2
abhilash1910 rm cpu blas duplicate
18742f7a
NeoZhangJianyu
airMeng fix no_new_line
0e235fb8
luoyu-intel fix src1->type==F16 bug.
56001182
luoyu-intel pass batch offset for F16 src1
eef5faae
luoyu-intel fix batch error
5bb93d41
luoyu-intel fix wrong code
0635f844
abhilash1910 abhilash1910 requested a review from slaren slaren 1 year ago
abhilash1910
characharm
Jacoby1218
NeoZhangJianyu
abhilash1910 abhilash1910 requested a review from ggerganov ggerganov 1 year ago
NeoZhangJianyu
slaren
slaren commented on 2024-01-25
airMeng revert sycl checking in test-sampling
f1bab501
airMeng pass void as arguments of ggml_backend_sycl_print_sycl_devices
66e24c24
slaren
slaren commented on 2024-01-25
airMeng remove extra blank line in test-sampling
b06dca67
airMeng revert setting n_threads in sycl
05b7f9be
slaren
slaren
slaren approved these changes on 2024-01-25
NeoZhangJianyu
airMeng
koech-v
koech-v approved these changes on 2024-01-26
Jacoby1218
ggerganov ggerganov added high priority
NeoZhangJianyu
luoyu-intel implement std::isinf for icpx with fast math.
d6a65050
luoyu-intel
ggerganov
ggerganov approved these changes on 2024-01-26
abhilash1910 Update ci/run.sh
174c9a0e
abhilash1910 Update examples/sycl/run-llama2.sh
c08fec2a
abhilash1910 Update examples/sycl/run-llama2.sh
2cba564b
abhilash1910 Update CMakeLists.txt
f7070510
abhilash1910 Update CMakeLists.txt
45b06180
abhilash1910 Update CMakeLists.txt
55317540
abhilash1910 Update CMakeLists.txt
b9ffaab1
NeoZhangJianyu add copyright and MIT license declare
2ab97157
NeoZhangJianyu
NeoZhangJianyu
ggerganov
ngxson
ngxson commented on 2024-01-27
ngxson
NeoZhangJianyu update the cmd example
d394ca7f
NeoZhangJianyu
ggerganov ggerganov merged 0f648573 into master 1 year ago
sorasoras
abhilash1910
NeoZhangJianyu
sorasoras
NeoZhangJianyu
mudler
airMeng
mudler
airMeng
ElliottDyson
Jacoby1218

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone