test tensor parallel: make tests for dense model more robust (#41968)
* make test forward and backward more robust
* refactor compile part of test tensor parallel
* linting
* pass rank around instead of calling it over and over
* Run slow v2 (#41914)
* Super
* Super
* Super
* Super
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
* Fix `detectron2` installation in docker files (#41975)
* detectron2 - part 1
* detectron2 - part 2
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
* Fix `autoawq[kernels]` installation in quantization docker file (#41978)
fix autoawq[kernels]
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
* add support for saving encoder only so any parakeet model can be loaded for inference (#41969)
* add support for saving encoder only so any decoder model can be loaded
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* use convolution_bias
* convert modular
* convolution_bias in conversion script
---------
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Eustache Le Bihan <eulebihan@gmail.com>
Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com>
---------
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Eustache Le Bihan <eulebihan@gmail.com>
Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com>