Add LLaDA 8b Diffusion model (#14771)

Commit

180 days ago

Add LLaDA 8b Diffusion model (#14771)

* Add support for Llada-8b: diffusion model
* Add README
* Fix README and convert_hf_to_gguf
* convert_hf_to_gguf.py: address review comments
* Make everything in a single example
* Remove model-specific sampling
* Remove unused argmax
* Remove braced initializers, improve README.md a bit
* Add diffusion specific gguf params in set_vocab, remove setting rope_theta and rms_norm_eps
* Remove adding the mask token
* Move add_add_bos_token to set_vocab
* use add_bool in gguf_writer.py

References

#14771 - Add LLaDA 8b Diffusion model

Author

am17an

Parents

11490b36

llama.cpp 8a4a8562 - Add LLaDA 8b Diffusion model (#14771)
