CLIPTextEncodeHiDream - ComfyUI Built-in Node Documentation

This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub

The CLIPTextEncodeHiDream node processes four separate text inputs using different language models (CLIP-L, CLIP-G, T5-XXL, and LLaMA) and combines them into a single conditioning output. It tokenizes each text input with its corresponding model and encodes them together using a scheduled encoding approach, enabling more sophisticated text conditioning by leveraging multiple language models simultaneously.

Inputs

Parameter	Data Type	Required	Range	Description
`clip`	CLIP	Yes	-	The CLIP model used for tokenization and encoding
`clip_l`	STRING	Yes	-	Text input for CLIP-L model processing. Supports multiline text and dynamic prompts.
`clip_g`	STRING	Yes	-	Text input for CLIP-G model processing. Supports multiline text and dynamic prompts.
`t5xxl`	STRING	Yes	-	Text input for T5-XXL model processing. Supports multiline text and dynamic prompts.
`llama`	STRING	Yes	-	Text input for LLaMA model processing. Supports multiline text and dynamic prompts.

Note: All four text inputs (clip_l, clip_g, t5xxl, and llama) are required for proper functioning, as each contributes to the final conditioning output through the scheduled encoding process.

Outputs

Output Name	Data Type	Description
`CONDITIONING`	CONDITIONING	The combined conditioning output from all processed text inputs, encoded using the scheduled encoding method

Source fingerprint (SHA-256): 4d0febcfc7fa8035d817a6b36ec80b9bdf176ede8446b49f291efd96b9d1371c

Documentation Index

​Inputs

​Outputs

Inputs

Outputs