The Kling Image(First Frame) to Video with Audio node uses the Kling AI model to generate a short video from a single starting image and a text prompt. It creates a video sequence that begins with the provided image and can optionally include AI-generated audio to accompany the visuals.

## Inputs

| Parameter | Description | Data Type | Required | Range |
| --- | --- | --- | --- | --- |
| `model_name` | The specific version of the Kling AI model to use for video generation. | COMBO | Yes | `"kling-v2-6"` |
| `start_frame` | The image that will serve as the first frame of the generated video. The image must be at least 300x300 pixels and have an aspect ratio between 1:2.5 and 2.5:1. | IMAGE | Yes | - |
| `prompt` | Positive text prompt. This describes the video content you want to generate. The prompt must be between 1 and 2500 characters long. | STRING | Yes | - |
| `mode` | The operational mode for the video generation. | COMBO | Yes | `"pro"` |
| `duration` | The length of the video to generate, in seconds. | COMBO | Yes | `5`<br>`10` |
| `generate_audio` | When enabled, the node will generate audio to accompany the video. When disabled, the video will be silent. (default: True) | BOOLEAN | No | - |

## Outputs

| Output Name | Description | Data Type |
| --- | --- | --- |
| `video` | The generated video file, which may include audio depending on the `generate_audio` input. | VIDEO |

> This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! [Edit on GitHub](https://github.com/Comfy-Org/embedded-docs/blob/main/comfyui_embedded_docs/docs/KlingImageToVideoWithAudio/en.md)

---
**Source fingerprint (SHA-256):** `c994748fdfa6d79f81aaf864ec174a6b608c4f05943a7e23edd345608122d9a1`