vllm
c715fb19
- [V1] TPU support
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Comment Changes
Previous Change (CTRL+↑)
Next Change (CTRL+↓)
Expand Context Lines
Collapse Context Lines
Hide Minimap (CTRL+M)
Commit
196 days ago
[V1] TPU support Signed-off-by: Alexander Matveev <amatveev@redhat.com>
Author
alexm-redhat
Committer
alexm-redhat
Parents
24b0205f
Files
16
.pre-commit-config.yaml
examples/offline_inference
basic.py
tests/entrypoints/openai
test_accuracy.py
tools
mypy.sh
vllm
platforms
cuda.py
interface.py
tpu.py
v1
attention/backends
pallas.py
core
scheduler.py
worker
gpu_input_batch.py
gpu_model_runner.py
gpu_worker.py
model_runner_base.py
tpu_model_runner.py
tpu_worker.py
worker_base.py
Loading