transformers
e3fc90ae
- Cleaner Cache `dtype` and `device` extraction for CUDA graph generation for quantizers compatibility (#29079)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
Cleaner Cache `dtype` and `device` extraction for CUDA graph generation for quantizers compatibility (#29079) * input_layernorm as the beacon of hope * cleaner dtype extraction * AQLM + CUDA graph test * is available check * shorter text test
References
#29079 - Cleaner Cache `dtype` and `device` extraction for CUDA graph generation for quantizers compatibility
#29969 - [SigLIP] Add fast tokenizer
#32831 - [Docs] Update resources
#33111 - [Backbone] Remove out_features everywhere
#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs
#39821 - Support MetaCLIP 2
#59 - Fix attention mask handling in EoMT-DINOv3 converter
#41212 - Add EoMT with DINOv3 backbone
#62 - Add initial DEIMv2 model implementation
Author
BlackSamorez
Parents
a3f9221a
Loading