The `CLIPTextEncodeHunyuanDiT` node converts text descriptions into a format that the HunyuanDiT model can understand. It is an advanced conditioning node designed for the dual text encoder architecture of HunyuanDiT, processing two separate text inputs through different tokenizers.

## Inputs

| Parameter | Description | Data Type | Required | Range |
| --- | --- | --- | --- | --- |
| `clip` | A CLIP model instance used for text tokenization and encoding, which is core to generating conditions. | CLIP | Yes | - |
| `bert` | Text input for encoding via the BERT tokenizer. Prefers phrases and keywords. Supports multiline and dynamic prompts. | STRING | Yes | - |
| `mt5xl` | Text input for encoding via the mT5-XL tokenizer. Supports multiline and dynamic prompts (multilingual). Can use complete sentences and complex descriptions. | STRING | Yes | - |

## Outputs

| Output Name | Description | Data Type |
| --- | --- | --- |
| `CONDITIONING` | The encoded conditioning output, combining both BERT and mT5-XL tokenized text, used for further processing in generation tasks. | CONDITIONING |

> This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! [Edit on GitHub](https://github.com/Comfy-Org/embedded-docs/blob/main/comfyui_embedded_docs/docs/CLIPTextEncodeHunyuanDiT/en.md)

---
**Source fingerprint (SHA-256):** `bde7c884f72829491090965bd9af34ad59ec326f96e88bb7cdb9ddc47592137e`