text-generation-inference
Disable `decoder_input_details` on OpenAI-compatible chat streaming, pass temp and top-k from API
#1470
Merged

Commits
  • Disable decoder_input_details for streaming requests
    EndlessReform committed 2 years ago
  • Transparently pass through temp and top_p
    EndlessReform committed 2 years ago
Loading