langchain
641efcf4 - community: add runtime kwargs to HuggingFacePipeline (#17005)

Commit

2 years ago

community: add runtime kwargs to HuggingFacePipeline (#17005)

This PR enables changing the behaviour of the Hugging Face pipeline between different calls. For example, before this PR there was no way to change the maximum generation length between different invocations of the chain. This is desirable, for instance, when we want to scale the maximum output size depending on a dynamic prompt size.

Usage example:

```python
from langchain_community.llms.huggingface_pipeline import HuggingFacePipeline
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_id = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
hf = HuggingFacePipeline(pipeline=pipe)

# pipeline_kwargs passed at call time apply to this invocation only
hf("Say foo:", pipeline_kwargs={"max_new_tokens": 42})
```

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
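The dynamic-prompt case mentioned in the message could be handled roughly as follows. This is a minimal sketch, not code from the PR: the helper `pipeline_kwargs_for` and its `cap` and `floor` parameters are hypothetical, and the default `context_window` of 1024 assumes GPT-2's context length. It only computes the `pipeline_kwargs` dict, so it runs without loading a model.

```python
def pipeline_kwargs_for(
    prompt_tokens: int,
    context_window: int = 1024,  # assumption: GPT-2 context length
    cap: int = 256,              # hypothetical upper bound on output length
    floor: int = 1,              # always allow at least one new token
) -> dict:
    """Build per-call pipeline kwargs so that prompt + output fit the window."""
    remaining = context_window - prompt_tokens
    # Clamp the remaining budget into [floor, cap].
    return {"max_new_tokens": max(floor, min(cap, remaining))}


# Intended use with the objects from the usage example (left as a comment
# so that this sketch stays runnable without downloading a model):
#   n = len(tokenizer.encode(prompt))
#   hf(prompt, pipeline_kwargs=pipeline_kwargs_for(n))
```

Short prompts get the full capped budget, while prompts that nearly fill the context window get a correspondingly smaller `max_new_tokens`.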

References

#17005 - community: add runtime kwargs to HuggingFacePipeline

Author

ab-10

Parents

a32798ab
