[Kernel][LoRA]Punica prefill kernels fusion #11234
Init
ec3590d9
Sync main
9474fb01
Fix bug
8c2ac4ca
Merge branch 'vllm-project:main' into punica-kernel-fusion
2897d053
Merge branch 'vllm-project:main' into punica-kernel-fusion
35aebea9
Merge branch 'vllm-project:main' into punica-kernel-fusion
628a5670
Back up
d04121ca
shrink_sgmv Done
a306f424
Merge branch 'vllm-project:main' into punica-kernel-fusion
f6bccc78
Merge branch 'vllm-project:main' into punica-kernel-fusion
e5cb72e2
Optimize ptr compute
b6013db4
Merge commit 'b6013db4' into punica-kernel-fusion
7f088ec5
Merge branch 'vllm-project:main' into punica-kernel-fusion
32c52792
Increase the tile size
8d3742ba
Clean up triton interface
9564b33d
Sync main
3eb3ac3e
Backup
40124669
Optimize one sclice kernel
18bbadf1
jeejeelee
marked this pull request as draft 1 year ago
Delete unused code
43aae702
Refactor expand
482de154
format
259d382f
Merge branch 'vllm-project:main' into punica-kernel-fusion
00f19046
Optimize logic
a0197e3e
Add comments
38ba4f1c
Fix bug
3c372265
Fix expand bug
45180c13
Backup
2e52d2c4
revert expand tile size
2146141b
Merge branch 'vllm-project:main' into punica-kernel-fusion
d724891b
Clean up code
9719617c
Optimize expand tile size
5d2c557a
Merge branch 'vllm-project:main' into punica-kernel-fusion
958500d5
Merge branch 'vllm-project:main' into punica-kernel-fusion
5c88ec4d
improve expand (#3)
3460308f
Merge branch 'vllm-project:main' into punica-kernel-fusion
24e893c9
Lora expand (#4)
c9747c63
Lora expand (#5)
f3ecfc64
Fix K size
5859da77
Merge branch 'vllm-project:main' into punica-kernel-fusion
b3ea6fc0
Merge branch 'vllm-project:main' into punica-kernel-fusion
eb010892
revert (#6)
ebc9519b
Merge branch 'vllm-project:main' into punica-kernel-fusion
a4f46b6e
Merge branch 'vllm-project:main' into punica-kernel-fusion
2cdf4593
Add unit test
ba2c4442
Merge branch 'vllm-project:main' into punica-kernel-fusion
394886d9
Merge branch 'vllm-project:main' into punica-kernel-fusion
36fbeac2
Optimize unit test
0f7897b6
Optimize unit test
3edb696b
Fix comment
49c6c21d
Merge branch 'vllm-project:main' into punica-kernel-fusion
bf3b9ca9
jeejeelee
marked this pull request as ready for review 1 year ago
Merge branch 'vllm-project:main' into punica-kernel-fusion
fe24a41d
Merge branch 'vllm-project:main' into punica-kernel-fusion
9d89f47f
mgoin
commented
on 2024-12-27
Optimize code
489eca1e
Add lock for unit test
04ae0dd7
Merge branch 'vllm-project:main' into punica-kernel-fusion
fa489f2f
Merge branch 'vllm-project:main' into punica-kernel-fusion
ea19a7d0
Optimize arg
65d0f2f2
Merge branch 'vllm-project:main' into punica-kernel-fusion
797ae774
Merge branch 'vllm-project:main' into punica-kernel-fusion
2b9f928e
Merge branch 'vllm-project:main' into punica-kernel-fusion
09fb9a93
Merge branch 'vllm-project:main' into punica-kernel-fusion
f4464548
Merge branch 'vllm-project:main' into punica-kernel-fusion
767b233e
Fix expand bug
421382e0
mgoin
approved these changes
on 2025-01-03
Merge branch 'vllm-project:main' into punica-kernel-fusion
90a91178
Isotr0py
approved these changes
on 2025-01-03
Reduce memory
2c792952
Modify minicpmv test
7e8d3bd3
jeejeelee
force pushed
to
7e8d3bd3
1 year ago
Merge branch 'vllm-project:main' into punica-kernel-fusion
02b1d805
Merge branch 'vllm-project:main' into punica-kernel-fusion
bd8cc450
Merge branch 'vllm-project:main' into punica-kernel-fusion
7ffd15ec
Merge branch 'vllm-project:main' into punica-kernel-fusion
c1c5b4b7
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub