Documentation Index
Fetch the complete documentation index at: https://docs.comfy.org/llms.txt
Use this file to discover all available pages before exploring further.
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHubThe CLIPLoader node loads a text encoder model (CLIP, T5, or similar) from a file, making it available for use in other nodes that need to convert text prompts into numerical representations. It supports a wide variety of model architectures, each requiring a specific encoder type.
Inputs
| Parameter | Data Type | Required | Range | Description |
|---|---|---|---|---|
clip_name | STRING | Yes | List of files found in the text_encoders folder | The filename of the text encoder model to load. This must be a file located in the ComfyUI/models/text_encoders/ or ComfyUI/models/clip/ directory. |
type | STRING | Yes | "stable_diffusion""stable_cascade""sd3""stable_audio""mochi""ltxv""pixart""cosmos""lumina2""wan""hidream""chroma""ace""omnigen2""qwen_image""hunyuan_image""flux2""ovis""longcat_image""cogvideox" | The architecture type of the model being loaded. This determines which specific encoder variant to use. The default is "stable_diffusion". |
device | STRING | No | "default""cpu" | The device to load the model onto. "default" uses the GPU if available, while "cpu" forces CPU loading. This is an advanced option (default: "default"). |
Supported Type-to-Encoder Mappings
Thetype parameter selects the correct encoder for a given model architecture. The following are common mappings:
| Type | Encoder |
|---|---|
| stable_diffusion | clip-l |
| stable_cascade | clip-g |
| sd3 | t5 xxl / clip-g / clip-l |
| stable_audio | t5 base |
| mochi | t5 xxl |
| cogvideox | t5 xxl (226-token padding) |
| cosmos | old t5 xxl |
| lumina2 | gemma 2 2B |
| wan | umt5 xxl |
| hidream | llama-3.1 (recommended) or t5 |
| omnigen2 | qwen vl 2.5 3B |
Outputs
| Output Name | Data Type | Description |
|---|---|---|
clip | CLIP | The loaded text encoder model, ready to be connected to other nodes for text encoding and conditioning. |
Source fingerprint (SHA-256):
1051bfe5570dff81719682cb09938bae4c03e94e0e72f7a2be84867cccb48017