llama.cpp
Inference support for T5 and FLAN-T5 model families
#8141
Merged

fairydreaming
sszymczy llama : add inference support and model types for T5 and FLAN-T5 mode…
45681a57
fairydreaming Merge branch 'ggerganov:master' into t5-clean-3
1c8d37a2
sszymczy llama : updated llm_build_ffn() calls to new API in build_t5()
bad0cafe
sszymczy llama : make pos_bias contiguous for CUDA
c4ded1a8
fairydreaming assigned fairydreaming 1 year ago
github-actions added examples
github-actions added python
sszymczy Merge remote-tracking branch 'upstream/master' into t5-clean-3
7293243d
sszymczy llama : whitespace formatting
7d7fff46
mofosyne added Review Complexity : Medium
vladfaust
vladfaust commented on 2024-06-28
fairydreaming requested a review from ggerganov 1 year ago
fairydreaming
ggerganov
ggerganov commented on 2024-06-29
sszymczy llama : quantization-related fixes for T5
6dc9eb40
ggerganov
ggerganov
ggerganov commented on 2024-07-02
sszymczy llama : add early return in Unigram tokenizer when normalized input i…
78675f35
fairydreaming
sszymczy llama : remove obsolete code
1d1cb01b
ggerganov
fairydreaming
ggerganov add t5 tokenizer tests
7c610faf
sszymczy Merge remote-tracking branch 'upstream/master' into t5-clean-3
b01ce7df
sszymczy llama : move JAIS after T5 everywhere for easier merging later
d40c9a1d
ggerganov llama : change naming to prefer "_enc" suffix
03ab5dd6
ggerganov llama : simplify llama_encode_internal
88270a36
ggerganov llama-batched : add encoder support
ded682d4
ggerganov llama : minor
01cd5a66
ggerganov
ggerganov approved these changes on 2024-07-04
sszymczy llama : silence compiler warnings
8b560e63
fairydreaming Merge branch 'ggerganov:master' into t5-clean-3
9bcecf1d
fairydreaming merged 807b0c49 into master 1 year ago