Hugging Face Inference Model node
Use the Hugging Face Inference Model node to generate completions with models hosted on Hugging Face.
On this page, you'll find the node parameters for the Hugging Face Inference Model node, and links to more resources.
This node lacks tools support, so it won't work with the AI Agent node. Instead, connect it to the Basic LLM Chain node, as in the sketch below.
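As a point of reference, n8n's AI nodes are built on LangChain.js, and connecting this node to a Basic LLM Chain roughly corresponds to piping a prompt template into a Hugging Face LLM. The sketch below assumes the `HuggingFaceInference` class from `@langchain/community`; the model ID, prompt, and environment variable name are illustrative placeholders, not values the node itself uses.

```typescript
import { HuggingFaceInference } from "@langchain/community/llms/hf";
import { PromptTemplate } from "@langchain/core/prompts";

// Rough LangChain.js equivalent of Basic LLM Chain -> Hugging Face Inference Model.
// The model ID and API key source are placeholders for illustration.
const llm = new HuggingFaceInference({
  model: "mistralai/Mistral-7B-Instruct-v0.2",
  apiKey: process.env.HUGGINGFACEHUB_API_KEY,
});

const prompt = PromptTemplate.fromTemplate(
  "Answer in one sentence: {question}"
);

// The chain fills in the template, sends the text to the model, and
// returns the completion as a string.
const chain = prompt.pipe(llm);
const answer = await chain.invoke({ question: "What is nucleus sampling?" });
console.log(answer);
```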
Credentials: You can find authentication information for this node here.
Node parameters
- Model: Select the model to use to generate the completion.
Node options
- Custom Inference Endpoint: Enter a custom inference endpoint URL.
- Frequency Penalty: Use this option to control how strongly the model penalizes tokens it has already generated. Higher values reduce the chance of the model repeating itself.
- Maximum Number of Tokens: Enter the maximum number of tokens to generate, which caps the length of the completion.
- Presence Penalty: Use this option to encourage the model to move on to new topics by penalizing tokens that have already appeared. Higher values increase the chance of the model introducing new topics.
- Sampling Temperature: Use this option to control the randomness of the sampling process. A higher temperature creates more diverse sampling, but increases the risk of hallucinations.
- Top K: Enter the number of token choices the model uses to generate the next token.
- Top P: Use this option to set the nucleus-sampling cutoff: the model samples only from the smallest set of tokens whose cumulative probability exceeds this value. Use a lower value to ignore less probable options.
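For orientation, the options above correspond approximately to constructor fields on the same LangChain.js class used in the earlier sketch. The mapping below is a best-guess sketch, not a definitive reference: the field names are assumed from `@langchain/community`'s `HuggingFaceInference` input, all values are illustrative, and Presence Penalty is omitted because its exact field name in that class is uncertain.

```typescript
import { HuggingFaceInference } from "@langchain/community/llms/hf";

// Hedged mapping of the node options to assumed constructor fields.
// Presence Penalty is omitted: its field name in this class is uncertain.
const llm = new HuggingFaceInference({
  model: "mistralai/Mistral-7B-Instruct-v0.2", // Model
  apiKey: process.env.HUGGINGFACEHUB_API_KEY,
  // Custom Inference Endpoint: uncomment to target a dedicated endpoint
  // instead of the public inference API (placeholder URL).
  // endpointUrl: "https://<your-endpoint>.endpoints.huggingface.cloud",
  frequencyPenalty: 1.1, // Frequency Penalty: higher values discourage repetition
  maxTokens: 256,        // Maximum Number of Tokens: caps completion length
  temperature: 0.7,      // Sampling Temperature: higher values = more diverse output
  topK: 50,              // Top K: number of candidate tokens considered per step
  topP: 0.9,             // Top P: nucleus-sampling probability cutoff
});

const text = await llm.invoke("Explain why sampling temperature matters.");
console.log(text);
```

The endpoint URL is left commented out because setting a custom endpoint typically overrides model selection by ID.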
Related resources
Refer to LangChain's Hugging Face Inference Model documentation for more information about the service.