vllm
c715fb19 - [V1] TPU support

Commit

196 days ago

[V1] TPU support Signed-off-by: Alexander Matveev <amatveev@redhat.com>

Author

alexm-redhat

alexm-redhat

Committer

alexm-redhat

alexm-redhat

Parents

Files16

.pre-commit-config.yaml
examples/offline_inference
- basic.py
tests/entrypoints/openai
- test_accuracy.py
tools
- mypy.sh
vllm
- platforms
  - cuda.py
  - interface.py
  - tpu.py
- v1
  - attention/backends
    - pallas.py
  - core
    - scheduler.py
  - worker
    - gpu_input_batch.py
    - gpu_model_runner.py
    - gpu_worker.py
    - model_runner_base.py
    - tpu_model_runner.py
    - tpu_worker.py
    - worker_base.py