vllm
[Kernel] Update Cutlass fp8 configs
#5144
Merged
robertgshaw2-redhat merged 17 commits into vllm-project:main from neuralmagic:cutlass-fp8-configs
Add tile and cluster shape dispatch for fp8
77cadb72
Fix tile/cluster shape dispatch
288c0f60
add sm90_fp8 configs# Please enter the commit message for your change…
06577796
add cutlass benchmark functions
2a3913f1
format
4a8e136e
format
ce1c8975
add noqa
7038d3cd
comaniac approved these changes on 2024-05-31
add noqa
317cb72b
refactor weight shsapes
5548622c
format
707c3f33
isort fix
6ac1961a
fix weight shapes
221f31e1
mgoin commented on 2024-05-31
review comments
e11ac840
Use a better next pow 2
d0dd5be9
format
316b7eb3
fix compile error
f1b35ebf
varun-sundar-rabindranath requested a review from mgoin 1 year ago
Update weight_shapes.py
6319848e
tlrmchlsmth approved these changes on 2024-06-01
robertgshaw2-redhat enabled auto-merge (squash) 1 year ago
robertgshaw2-redhat approved these changes on 2024-06-01
robertgshaw2-redhat merged f081c3ce into main 1 year ago
robertgshaw2-redhat deleted the cutlass-fp8-configs branch 1 year ago
Reviewers
robertgshaw2-redhat
tlrmchlsmth
comaniac
mgoin
Assignees
No one assigned
Labels
None yet
Milestone
No milestone