transformers
8a0a508f - Aligning modling code for GPT2 to work with vLLM (fallback) (#36934)

Commit

305 days ago

Aligning modling code for GPT2 to work with vLLM (fallback) (#36934)

* aligning for vllm
* using input shape rather than attn outputs
* remove demo
* revert Conv1D
* style
* style
* Update src/transformers/models/gpt2/modeling_gpt2.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix copies
* Apply suggestions from code review

Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* adding docs about vllm
* chore: style

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

References

#36934 - Aligning modling code for GPT2 to work with vLLM (fallback)

Author

ariG23498

Parents

e94a4807
