Add LLaDA 8b Diffusion model #14771
am17an
force pushed
197 days ago
am17an
force pushed
196 days ago
am17an
force pushed
196 days ago
am17an
force pushed
196 days ago
am17an
force pushed
196 days ago
CISC
commented
on 2025-07-19
am17an
force pushed
196 days ago
CISC
commented
on 2025-07-20
am17an
force pushed
194 days ago
am17an
force pushed
190 days ago
am17an
force pushed
190 days ago
am17an
force pushed
188 days ago
am17an
force pushed
188 days ago
am17an
force pushed
188 days ago
am17an
force pushed
188 days ago
CISC
commented
on 2025-07-28
Add support for Llada-8b: diffusion model
bef6c2d0
Add README
0fa8b866
Fix README and convert_hf_to_gguf
267a09df
convert_hf_to_gguf.py: address review comments
812bc383
Make everything in a single example
6bb00936
am17an
force pushed
186 days ago
am17an
force pushed
186 days ago
Remove model-specific sampling
3e7efcba
am17an
force pushed
to
3e7efcba
186 days ago
Remove unused argmax
a50547c9
ggerganov
approved these changes
on 2025-07-31
Remove braced initializers, improve README.md a bit
e864a496
am17an
force pushed
to
e864a496
185 days ago
CISC
approved these changes
on 2025-07-31
Add diffusion specific gguf params in set_vocab, remove setting rope_…
9691f4ed
am17an
force pushed
to
9691f4ed
185 days ago
CISC
commented
on 2025-07-31
CISC
commented
on 2025-07-31
Remove adding the mask token
57201ccb
CISC
commented
on 2025-07-31
Move add_add_bos_token to set_vocab
a326b130
CISC
commented
on 2025-07-31
use add_bool in gguf_writer.py
ac3f91fe
am17an
force pushed
to
ac3f91fe
184 days ago
am17an
merged
8a4a8562
into master 184 days ago
am17an
deleted the add_llada_8b branch 184 days ago