vllm
c715fb19 - [V1] TPU support

Commit
196 days ago
[V1] TPU support

Signed-off-by: Alexander Matveev <amatveev@redhat.com>
  • .pre-commit-config.yaml
  • examples/offline_inference
    • basic.py
  • tests/entrypoints/openai
    • test_accuracy.py
  • tools
    • mypy.sh
  • vllm
    • platforms
      • cuda.py
      • interface.py
      • tpu.py
    • v1
      • attention/backends
        • pallas.py
      • core
        • scheduler.py
      • worker
        • gpu_input_batch.py
        • gpu_model_runner.py
        • gpu_worker.py
        • model_runner_base.py
        • tpu_model_runner.py
        • tpu_worker.py
        • worker_base.py