text-generation-inference
TensorRT-LLM backend bump to latest version + misc fixes
#2791
Merged

mfuntowicz merged 30 commits into main from trtllm/cancellation
mfuntowicz requested a review from Hugoch 1 year ago
mfuntowicz misc(cmake) update dependencies
0f17415d
mfuntowicz feat(hardware) enable new hardware.hpp and unittests
7a81040d
mfuntowicz test(ctest) enable address sanitizer
1830fe88
mfuntowicz feat(backend): initial rewrite of the backend for simplicity
3a2698fb
mfuntowicz feat(backend): remove all the logs from hardware.hpp
6d356575
mfuntowicz feat(backend): added some logging
9bb63097
mfuntowicz feat(backend): enable compiler warning if support for RVO not applying
87272ffe
mfuntowicz feat(backend): missing return statement
702dc9cd
mfuntowicz feat(backend): introduce backend_workspace_t to store precomputed inf…
25c6bbe1
mfuntowicz feat(backend): delete previous backend impl
df99164d
mfuntowicz feat(backend): more impl
fd7e2b5b
mfuntowicz feat(backend): use latest trtllm main version to have g++ >= 13 compa…
71e700a6
mfuntowicz feat(backend): allow overriding which Python to use
879e1a41
mfuntowicz feat(backend): fix backend_exception_t -> backend_error_t naming
a7bad25c
mfuntowicz feat(backend): impl missing generation_step_t as return value of pull…
2f8634ec
mfuntowicz feat(backend): make backend_workspace_t::engines_folder constexpr
874bc28d
mfuntowicz force-pushed from 54769479 to 874bc28d 1 year ago
mfuntowicz feat(backend): fix main.rs retrieving the tokenizer
16ba2f5a
mfuntowicz feat(backend): add guard to multiple header definitions
c94b9de4
mfuntowicz test(backend): add more unittest
ad3ed0d1
mfuntowicz feat(backend): remove constexpr from par
881527a5
mfuntowicz feat(backend): remove constexpig
62530649
mfuntowicz test(backend): more test coverage
cc6bc339
mfuntowicz chore(trtllm): update dependency towards 0.15.0
b6dbf605
mfuntowicz effectively cancel the request on the executor
460f290d
mfuntowicz feat(backend) fix moving backend when pulling
300f6c6f
mfuntowicz feat(backend): make sure we can easily cancel request on the executor
b3cd5ea0
mfuntowicz feat(backend): fix missing "0" field access
049f4acd
mfuntowicz misc(backend): fix reborrowing Pin<&mut T> as described in the doc ht…
f0cd4742
Hugoch chore: Add doc and CI for TRTLLM (#2799)
ab6591e7
mfuntowicz marked this pull request as ready for review 1 year ago
Hugoch dismissed these changes on 2024-12-02
mfuntowicz misc(backend): indent
1640da7c
mfuntowicz dismissed their stale review via 1640da7c 1 year ago
mfuntowicz merged ea7f4082 into main 1 year ago
mfuntowicz deleted the trtllm/cancellation branch 1 year ago
