transformers
25a91051 - Fix KeyError in convert_to_native_format for dict vocab (#44452)

Commit

74 days ago

Fix KeyError in convert_to_native_format for dict vocab (#44452) When loading tokenizers like vesteinn/ScandiBERT whose tokenizer_config specifies XLMRobertaTokenizer (model=Unigram) but whose tokenizer.json contains a dict-type vocab, the expression vocab[0] raises KeyError because dict keys are strings, not integers. Add an isinstance(vocab, list) guard so the list-to-tuple conversion is only attempted on list vocabs.

References

#44452 - Fix KeyError in convert_to_native_format for dict vocab

Author

weiguangli-io

Parents

70e454c9

transformers 25a91051 - Fix KeyError in convert_to_native_format for dict vocab (#44452)

transformers
25a91051 - Fix KeyError in convert_to_native_format for dict vocab (#44452)