Precompute flash attention padding info (#880)
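Precomputing the flash attention padding metadata once per forward pass avoids re-deriving it inside every attention layer. As a minimal sketch of what such precomputation can look like (assuming an HF-style `attention_mask` of shape `(batch, seq_len)`; the helper name and return layout here are illustrative, not necessarily the exact code in this PR):

```python
import torch
import torch.nn.functional as F

def gen_flash_attn_padding_info(attention_mask: torch.Tensor):
    """Illustrative helper: derive flash-attn varlen metadata from a
    (batch, seq_len) mask where 1 marks real tokens and 0 marks padding."""
    seqlens = attention_mask.sum(dim=-1, dtype=torch.int32)   # tokens per sequence
    max_seqlen = int(seqlens.max())                           # longest unpadded sequence
    # Cumulative sequence lengths, shape (batch + 1,), starting at 0,
    # in the form flash-attn's varlen kernels expect.
    cu_seqlens = F.pad(torch.cumsum(seqlens, dim=0, dtype=torch.int32), (1, 0))
    # Flat indices of the non-pad tokens, used to unpad/re-pad activations.
    indices = torch.nonzero(attention_mask.flatten(), as_tuple=False).flatten()
    return {
        'indices': indices,
        'cu_seqlens': cu_seqlens,
        'max_seqlen': max_seqlen,
    }
```

With this shape, the model's forward can compute the padding info once and pass it down to each attention block, rather than each block recomputing it from the mask.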
* Update llmfoundry/models/mpt/modeling_mpt.py
Co-authored-by: Vitaliy Chiley <6439018+vchiley@users.noreply.github.com>
* dummy data
* undoing last commit
* Update llmfoundry/models/mpt/modeling_mpt.py
Co-authored-by: Vitaliy Chiley <6439018+vchiley@users.noreply.github.com>
---------
Co-authored-by: Vitaliy Chiley <6439018+vchiley@users.noreply.github.com>