Add LLaDA 8b Diffusion model #14771
am17an
force pushed
329 days ago
am17an
force pushed
329 days ago
am17an
force pushed
329 days ago
am17an
force pushed
329 days ago
am17an
force pushed
329 days ago
CISC
commented
on 2025-07-19
am17an
force pushed
328 days ago
CISC
commented
on 2025-07-20
am17an
force pushed
326 days ago
am17an
force pushed
322 days ago
am17an
force pushed
322 days ago
am17an
force pushed
320 days ago
am17an
force pushed
320 days ago
am17an
force pushed
320 days ago
am17an
force pushed
320 days ago
CISC
commented
on 2025-07-28
Add support for Llada-8b: diffusion model
bef6c2d0
Add README
0fa8b866
Fix README and convert_hf_to_gguf
267a09df
convert_hf_to_gguf.py: address review comments
812bc383
Make everything in a single example
6bb00936
am17an
force pushed
318 days ago
am17an
force pushed
318 days ago
Remove model-specific sampling
3e7efcba
am17an
force pushed
to
3e7efcba
318 days ago
Remove unused argmax
a50547c9
ggerganov
approved these changes
on 2025-07-31
Remove braced initializers, improve README.md a bit
e864a496
am17an
force pushed
to
e864a496
317 days ago
CISC
approved these changes
on 2025-07-31
Add diffusion specific gguf params in set_vocab, remove setting rope_…
9691f4ed
am17an
force pushed
to
9691f4ed
317 days ago
CISC
commented
on 2025-07-31
CISC
commented
on 2025-07-31
Remove adding the mask token
57201ccb
CISC
commented
on 2025-07-31
Move add_add_bos_token to set_vocab
a326b130
CISC
commented
on 2025-07-31
use add_bool in gguf_writer.py
ac3f91fe
am17an
force pushed
to
ac3f91fe
317 days ago
am17an
merged
8a4a8562
into master 317 days ago
am17an
deleted the add_llada_8b branch 316 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub