The TextEncodeQwenImageEdit node processes text prompts and optional images to generate conditioning data for image generation or editing. It uses a CLIP model to tokenize the input and can optionally encode reference images using a VAE to create reference latents. When an image is provided, it automatically resizes the image to maintain consistent processing dimensions.

## Inputs

| Parameter | Description | Data Type | Required | Range |
| --- | --- | --- | --- | --- |
| `clip` | The CLIP model used for text and image tokenization | CLIP | Yes | - |
| `prompt` | Text prompt for conditioning generation, supports multiline input and dynamic prompts | STRING | Yes | - |
| `vae` | Optional VAE model for encoding reference images into latents | VAE | No | - |
| `image` | Optional input image for reference or editing purposes | IMAGE | No | - |

**Note:** When both `image` and `vae` are provided, the node encodes the image into reference latents and attaches them to the conditioning output. The image is automatically resized to maintain a consistent processing scale of approximately 1024x1024 pixels.

## Outputs

| Output Name | Description | Data Type |
| --- | --- | --- |
| `CONDITIONING` | Conditioning data containing text tokens and optional reference latents for image generation | CONDITIONING |

> This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! [Edit on GitHub](https://github.com/Comfy-Org/embedded-docs/blob/main/comfyui_embedded_docs/docs/TextEncodeQwenImageEdit/en.md)

---
**Source fingerprint (SHA-256):** `7d1f82174d92ee86ca35e14a364e1a703fb9fc1ac9979611bd894a0a639c58a2`