DeepSpeed
DeepSpeed-Triton for Inference
#3748
Merged


jeffra merged 31 commits into master from staging-triton-bert-v1
stephen-youn
jeffra [squash] styoun/triton fp16 transformer (#530)
6d291dbc
stephen-youn requested a review from RezaYazdaniAminabadi 2 years ago
stephen-youn requested a review from jeffra 2 years ago
stephen-youn requested a review from mrwyattii 2 years ago
stephen-youn requested a review from awan-10 2 years ago
stephen-youn requested a review from cmikeh2 2 years ago
stephen-youn requested a review from arashb 2 years ago
stephen-youn requested a review from tjruwase 2 years ago
stephen-youn requested a review from loadams 2 years ago
stephen-youn Triton kernels and BERT inference using triton in float16 (#459)
b978eae4
styoun readme for blog
f41e279c
styoun typo in readme
2d499bf1
styoun readme
42b8de41
styoun readme
d16aaa20
styoun readme
b4fee896
styoun readme
468d6983
styoun plots
d5fff4fe
styoun readme
643a2fcc
styoun readme
e6d44c98
styoun readme
fdb87060
styoun readme
b29157b9
styoun readme
8078e04d
styoun typo in readme
f73bd90a
styoun readme revision after the feedbacks
c2bd7dd1
styoun typo
19d4231c
styoun refined the writing in readme
314c5eef
styoun readme
4da70a41
styoun readme
bad841c5
styoun removed obsolete comments from matmul_ext.py
47928a21
styoun typo
db16fa0d
jeffra Merge branch 'master' into staging-triton-bert-v1
39cbe478
stephen-youn changed the title from "[squash] styoun/triton fp16 transformer (#530)" to "DeepSpeed-Triton for Inference" 2 years ago
jeffra commented on 2023-06-22
jeffra commented on 2023-06-22
jeffra approved these changes on 2023-06-22
cmikeh2 approved these changes on 2023-06-22
styoun readme change from pr comments
219a1b8c
styoun Merge branch 'staging-triton-bert-v1' of https://github.com/microsoft…
072fe374
jeffra Merge branch 'master' into staging-triton-bert-v1
7f7b76d2
RezaYazdaniAminabadi commented on 2023-06-22
styoun removed obsolete codes and comments
223ad1fc
awan-10 commented on 2023-06-22
styoun Merge branch 'staging-triton-bert-v1' of https://github.com/microsoft…
afc34fa5
awan-10 approved these changes on 2023-06-22
styoun readme
1eadebdf
stephen-youn Merge branch 'master' into staging-triton-bert-v1
4cb8b371
jeffra added merge-queue
jeffra Merge branch 'master' into staging-triton-bert-v1
2835e0da
jeffra merged 4dc65f7b into master 2 years ago
jeffra deleted the staging-triton-bert-v1 branch 2 years ago
mpjlu
